Visual speech segmentation: using facial cues to locate word boundaries in continuous speech.

Lang Cogn Process

Department of Psychology and Program in Linguistics, The Pennsylvania State University, 643 Moore Building, University Park, PA 16802, USA.

Published: January 2014

Speech is typically a multimodal phenomenon, yet few studies have focused on the exclusive contributions of visual cues to language acquisition. To address this gap, we investigated whether visual prosodic information can facilitate speech segmentation. Previous research has demonstrated that language learners can use lexical stress and pitch cues to segment speech and that learners can extract this information from talking faces. Thus, we created an artificial speech stream that contained minimal segmentation cues and paired it with two synchronous facial displays in which visual prosody was either informative or uninformative for identifying word boundaries. Across three familiarisation conditions (audio stream alone, facial streams alone, and paired audiovisual), learning occurred only when the facial displays were informative to word boundaries, suggesting that facial cues can help learners solve the early challenges of language acquisition.

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4091796
DOI: http://dx.doi.org/10.1080/01690965.2013.791703

Keywords: word boundaries, speech segmentation, facial cues, language acquisition, facial displays, facial, cues, speech, visual, visual speech
