Segmenting words from natural speech: subsegmental variation in segmental cues.

J Child Lang

University of Maryland Center for Advanced Study of Language (CASL) and Department of Linguistics, the Ohio State University.

Published: June 2010

Most computational models of word segmentation are trained and tested on transcripts of speech, rather than the speech itself, and assume that speech is converted into a sequence of symbols prior to word segmentation. We present a way of representing speech corpora that avoids this assumption, and preserves acoustic variation present in speech. We use this new representation to re-evaluate a key computational model of word segmentation. One finding is that high levels of phonetic variability degrade the model's performance. While robustness to phonetic variability may be intrinsically valuable, this finding needs to be complemented by parallel studies of the actual abilities of children to segment phonetically variable speech.

Download full-text PDF

Source
http://dx.doi.org/10.1017/S0305000910000085DOI Listing

Publication Analysis

Top Keywords

word segmentation
12
phonetic variability
8
speech
7
segmenting natural
4
natural speech
4
speech subsegmental
4
subsegmental variation
4
variation segmental
4
segmental cues
4
cues computational
4

Similar Publications

Background: Computed tomography pulmonary angiography (CTPA) is frequently performed in patients with pulmonary hypertension (PH) and may aid non-invasive estimation of pulmonary hemodynamics. We, therefore, investigated automated volumetry of intrapulmonary vasculature on CTPA, separated into core and peel fractions of the lung volume and its potential to differentially reflect pulmonary hemodynamics in patients with pre- and postcapillary PH.

Methods: A retrospective case-control study of 72 consecutive patients with PH according to the 2022 joint guidelines of the European Society of Cardiology and the European Respiratory Society who underwent right heart catheterization (RHC) and CTPA within 7 days between August 2013 and February 2016 at Thoraxklinik at Heidelberg University Hospital (Heidelberg, Germany) was conducted.

View Article and Find Full Text PDF

Artificial Intelligence-Powered Training Database for Clinical Thinking: App Development Study.

JMIR Form Res

January 2025

Centre for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China.

Background: With the development of artificial intelligence (AI), medicine has entered the era of intelligent medicine, and various aspects, such as medical education and talent cultivation, are also being redefined. The cultivation of clinical thinking abilities poses a formidable challenge even for seasoned clinical educators, as offline training modalities often fall short in bridging the divide between current practice and the desired ideal. Consequently, there arises an imperative need for the expeditious development of a web-based database, tailored to empower physicians in their quest to learn and hone their clinical reasoning skills.

View Article and Find Full Text PDF

TExCNN: Leveraging Pre-Trained Models to Predict Gene Expression from Genomic Sequences.

Genes (Basel)

December 2024

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China.

Background/objectives: Understanding the relationship between DNA sequences and gene expression levels is of significant biological importance. Recent advancements have demonstrated the ability of deep learning to predict gene expression levels directly from genomic data. However, traditional methods are limited by basic word encoding techniques, which fail to capture the inherent features and patterns of DNA sequences.

View Article and Find Full Text PDF

Simulating Early Phonetic and Word Learning Without Linguistic Categories.

Dev Sci

March 2025

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Études Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.

Before they even talk, infants become sensitive to the speech sounds of their native language and recognize the auditory form of an increasing number of words. Traditionally, these early perceptual changes are attributed to an emerging knowledge of linguistic categories such as phonemes or words. However, there is growing skepticism surrounding this interpretation due to limited evidence of category knowledge in infants.

View Article and Find Full Text PDF

Cardiac magnetic resonance (CMR) imaging enables a one-stop assessment of heart structure and function. Artificial intelligence (AI) can simplify and automate work flows and improve image post-processing speed and diagnostic accuracy; thus, it greatly affects many aspects of CMR. This review highlights the application of AI for left heart analysis in CMR, including quality control, image segmentation, and global and regional functional assessment.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!