Previous studies have shown that infant-directed speech ('motherese') exhibits overemphasized acoustic properties which may facilitate the acquisition of phonetic categories by infant learners. It has been suggested that the use of infant-directed data for training automatic speech recognition systems might also enhance the automatic learning and discrimination of phonetic categories. This study investigates the properties of infant-directed vs. adult-directed speech from the point of view of the statistical pattern recognition paradigm underlying automatic speech recognition. Isolated-word speech recognizers were trained on adult-directed vs. infant-directed data sets and were tested on both matched and mismatched data. Results show that recognizers trained on infant-directed speech did not always exhibit better recognition performance; however, their relative loss in performance on mismatched data was significantly less severe than that of recognizers trained on adult-directed speech and presented with infant-directed test data. An analysis of the statistical distributions of a subset of phonetic classes in both data sets showed that this pattern is caused by larger class overlaps in infant-directed speech. This finding has implications for both automatic speech recognition and theories of infant speech perception.
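The two quantities the abstract relies on, the relative loss in performance on mismatched data and the overlap between phonetic class distributions, can be illustrated with a minimal sketch. The abstract does not say which overlap measure or features the study used; the Bhattacharyya coefficient between one-dimensional Gaussians is an assumption chosen here for simplicity, and the function names and all numbers are hypothetical.

```python
import math


def relative_loss(matched_acc: float, mismatched_acc: float) -> float:
    """Fractional drop in accuracy from matched to mismatched test data."""
    return (matched_acc - mismatched_acc) / matched_acc


def bhattacharyya_coefficient(mu1: float, var1: float,
                              mu2: float, var2: float) -> float:
    """Overlap of two 1-D Gaussian classes: 1.0 if identical, near 0 if well separated.

    Assumed measure for illustration only; not taken from the study.
    """
    avg_var = 0.5 * (var1 + var2)
    # Bhattacharyya distance for univariate Gaussians
    distance = (0.125 * (mu1 - mu2) ** 2 / avg_var
                + 0.5 * math.log(avg_var / math.sqrt(var1 * var2)))
    return math.exp(-distance)


if __name__ == "__main__":
    # Hypothetical accuracies, not results from the paper.
    print("relative loss:", relative_loss(0.90, 0.45))

    # Two hypothetical phonetic classes with the same means:
    # broader distributions overlap more than narrow ones.
    print("narrow classes:", bhattacharyya_coefficient(0.0, 1.0, 2.0, 1.0))
    print("broad classes: ", bhattacharyya_coefficient(0.0, 4.0, 2.0, 4.0))
```

In this toy setting, widening both class distributions while keeping their means fixed raises the overlap coefficient, which is the kind of effect the distributional analysis in the abstract refers to.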
DOI: http://dx.doi.org/10.1121/1.1869172
Sensors (Basel)
January 2025
SHCCIG Yubei Coal Industry Co., Ltd., Xi'an 710900, China.
The coal mining industry in Northern Shaanxi is robust, with a prevalent use of the local dialect, known as "Shapu", characterized by a distinct Northern Shaanxi accent. This study addresses the practical need for speech recognition in this dialect. We propose an end-to-end speech recognition model for the North Shaanxi dialect, leveraging the Conformer architecture.
J Clin Med
January 2025
Assistance Publique-Hôpitaux de Paris, Hôpital Bicêtre, Service d'Oto-Rhino-Laryngologie, 78 Rue du Général Leclerc, 94270 Le Kremlin-Bicêtre, France.
Hearing aids (HAs) have been used for standard high-frequency hearing loss and tinnitus, but their effects on speech intelligibility in noise (SIN) in people with normal hearing, including hidden hearing loss (HHL), have been little explored. We included in a prospective cohort study patients who experience poor SIN and have normal pure tone average in quiet conditions or slight HL. We used open-fit HAs.
Georgian Med News
November 2024
Department of Nursing, Hangzhou Geriatric Hospital, Gongshu District, Zhejiang, China.
Objective: The integration of physical therapy (PT), occupational therapy (OT), and speech therapy (ST) into a triple therapy approach has gained recognition in the rehabilitation of patients. The integration of PT-OT-ST triple therapy with accelerated recovery strategies in pulmonary rehabilitation for elderly mechanically ventilated patients is anticipated to overcome the limitations of traditional rehabilitation approaches.
Methods: By applying stringent inclusion and exclusion criteria, a total of 60 elderly patients over 60 years old requiring mechanical ventilation were selected.
Disabil Rehabil Assist Technol
January 2025
School of Rehabilitation Therapy, Queen's University, Kingston, Ontario, Canada.
This article explores the existing research evidence on the potential effectiveness of lipreading as a communication strategy to enhance speech recognition in individuals with hearing impairment. A scoping review was conducted, involving a search of six electronic databases (MEDLINE, Embase, Web of Science, Engineering Village, CINAHL, and PsycINFO) for research papers published between January 2013 and June 2023. This study included original research papers with full texts available in English, covering all study designs: qualitative, quantitative, and mixed methods.
J Imaging
January 2025
Department of Electrical and Computer Engineering, Illinois Institute of Technology, Chicago, IL 60616, USA.
The integration of artificial intelligence into daily life significantly enhances the autonomy and quality of life of visually impaired individuals. This paper introduces the Visual Impairment Spatial Awareness (VISA) system, designed to holistically assist visually impaired users in indoor activities through a structured, multi-level approach. At the foundational level, the system employs augmented reality (AR) markers for indoor positioning, neural networks for advanced object detection and tracking, and depth information for precise object localization.