A fully automated approach for baby cry signal segmentation and boundary detection of expiratory and inspiratory episodes.

Lina Abou-Abbas, Chakib Tadj, Hesam Alaie Fersaie

J Acoust Soc Am

Department of Electrical Engineering, École de Technologie Supérieure, Quebec University, 1100 Rue Notre Dame Ouest, Montréal, Quebec H3C 1K3, Canada.

Published: September 2017

The detection of cry sounds is an important pre-processing step for many applications involving cry analysis, such as diagnostic systems, electronic monitoring systems, emotion detection, and robotics for baby caregivers. Given its complexity, automatic cry segmentation is a challenging task. This paper proposes a framework for automatic cry sound segmentation for use in a cry-based diagnostic system. It studies how additional time- and frequency-domain features contribute to the robustness of a Gaussian mixture model/hidden Markov model (GMM/HMM)-based cry segmentation system in noisy environments. A fully automated segmentation algorithm is introduced to extract the cry sound components, namely audible expiration and inspiration. The algorithm combines two approaches: statistical analysis based on GMM or HMM classifiers, and a post-processing method based on intensity, zero crossing rate, and fundamental frequency feature extraction. The main focus of the paper is to extend the systems developed in previous works with a post-processing stage that provides a set of corrective and enhancing tools to improve classification performance. The full approach precisely determines the start and end points of the expiratory and inspiratory components of a cry signal, EXP and INSV, respectively, in any given sound signal. Experimental results indicate the effectiveness of the proposed solution: EXP and INSV detection rates of approximately 94.29% and 92.16%, respectively, were achieved using tenfold cross-validation to avoid over-fitting.
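To make two of the post-processing features named in the abstract concrete, the following is a minimal sketch of frame-level intensity and zero crossing rate extraction. It is an illustration under stated assumptions, not the authors' implementation: the function name `frame_features`, the 25 ms frame length, the 10 ms hop, and the dB floor are all hypothetical choices, and fundamental frequency estimation is left out.

```python
import numpy as np

def frame_features(signal, sample_rate, frame_ms=25.0, hop_ms=10.0):
    """Return (intensity_db, zcr), one value per frame.

    Frame and hop sizes are illustrative defaults, not values
    taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    hop_len = int(round(sample_rate * hop_ms / 1000.0))
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")

    n_frames = 1 + (len(signal) - frame_len) // hop_len
    intensity_db = np.empty(n_frames)
    zcr = np.empty(n_frames)

    for i in range(n_frames):
        frame = signal[i * hop_len : i * hop_len + frame_len]
        # Intensity as RMS level in dB; the small floor avoids log(0).
        rms = np.sqrt(np.mean(frame ** 2))
        intensity_db[i] = 20.0 * np.log10(rms + 1e-12)
        # Zero crossing rate: fraction of adjacent sample pairs
        # whose signs differ.
        signs = np.signbit(frame)
        zcr[i] = np.mean(signs[1:] != signs[:-1])

    return intensity_db, zcr
```

In a pipeline like the one described, per-frame values of this kind could be compared against thresholds to correct or refine the segment boundaries proposed by a statistical classifier; how the paper actually combines them is not specified in the abstract.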

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5593797
DOI: http://dx.doi.org/10.1121/1.5001491

Similar Publications

MGFusion: a multimodal large language model-guided information perception for infrared and visible image fusion.

Front Neurorobot

December 2024

Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan, China.

Zengyi Yang Yunping Li Xin Tang MingHong Xie

Existing image fusion methods primarily focus on complex network structure designs while neglecting the limitations of simple fusion strategies in complex scenarios. To address this issue, this study proposes a new method for infrared and visible image fusion based on a multimodal large language model. The method proposed in this paper fully considers the high demand for semantic information in enhancing image quality as well as the fusion strategies in complex scenes.

Predicting largest expected aftershock ground motions using automated machine learning (AutoML)-based scheme.

Sci Rep

January 2025

College of Civil and Transportation Engineering, Hohai University, No. 1 Xikang Road, Nanjing City, 210098, Jiangsu Province, People's Republic of China.

Xiaohui Yu Meng Wang Chaolie Ning Kun Ji

Aftershocks can cause additional damage or even lead to the collapse of structures already weakened by a mainshock. Scarcity of in-situ recorded aftershock accelerograms heightens the need to develop synthetic aftershock ground motions. These synthesized motions are crucial for assessing the cumulative seismic demand on structures subjected to mainshock-aftershock sequences.

Automated estimation of individualized organ-specific dose and noise from clinical CT scans.

Phys Med Biol

January 2025

Radiology, Stanford University, 1201 Welch Rd, P270, Stanford, California, 94305-6104, United States.

Sen Wang Maria Jose Medrano Abdullah-Al-Zubaer Imran Wonkyeong Lee Jennie Cao

Radiation dose and diagnostic image quality are opposing constraints in x-ray CT. Conventional methods do not fully account for organ-level radiation dose and noise when considering radiation risk and clinical task. In this work, we develop a pipeline to generate individualized organ-specific dose and noise at desired dose levels from clinical CT scans.

Fully automated segmentation of brain and scalp blood vessels on multi-parametric magnetic resonance imaging using multi-view cascaded networks.

Comput Methods Programs Biomed

January 2025

Medical AI Lab, School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, 518060, China.

Songxiong Wu Zilong Huang Mingyu Wang Ping Zeng Biwen Tan

Background And Objective: Neurosurgical navigation is a critical element of brain surgery, and accurate segmentation of brain and scalp blood vessels is crucial for surgical planning and treatment. However, conventional methods for segmenting blood vessels based on statistical or thresholding techniques have limitations. In recent years, deep learning-based methods have emerged as a promising solution for blood vessel segmentation, but the segmentation of small blood vessels and scalp blood vessels remains challenging.

Autonomous International Classification of Diseases Coding Using Pretrained Language Models and Advanced Prompt Learning Techniques: Evaluation of an Automated Analysis System Using Medical Text.

JMIR Med Inform

January 2025

Medical Big Data Research Center, Chinese PLA General Hospital, Beijing, China.

Yan Zhuang Junyan Zhang Xiuxing Li Chao Liu Yue Yu

Background: Machine learning models can reduce the burden on doctors by converting medical records into International Classification of Diseases (ICD) codes in real time, thereby enhancing the efficiency of diagnosis and treatment. However, it faces challenges such as small datasets, diverse writing styles, unstructured records, and the need for semimanual preprocessing. Existing approaches, such as naive Bayes, Word2Vec, and convolutional neural networks, have limitations in handling missing values and understanding the context of medical texts, leading to a high error rate.
