Classification of Infant Crying Sounds Using SE-ResNet-Transformer.

Sensors (Basel)

Department of Computer Science and Technology, Anhui University of Finance and Economics, Bengbu 233030, China.

Published: October 2024

Emotion analysis has recently come to play an important role in artificial intelligence, particularly speech emotion analysis, which helps in understanding speech, one of the most direct channels of human emotional communication. This study focuses on the emotion analysis of infant crying. Cries carry a variety of information, including hunger, pain, and discomfort. This paper proposes an improved classification model that combines ResNet and a transformer. It uses modified Mel-frequency cepstral coefficient (MFCC) features obtained through feature engineering from infant cries and integrates squeeze-and-excitation (SE) attention modules into the residual blocks to enhance the model's ability to adjust channel weights. The proposed method achieved a 93% accuracy rate in experiments, with shorter training time and higher accuracy than the traditional models it was compared with. It provides an efficient and stable solution for infant cry classification.
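
To illustrate what "adjusting channel weights" means for an SE module inside a residual block, the following is a minimal NumPy sketch of the generic squeeze-and-excitation idea. It is not the authors' implementation: the abstract does not give layer sizes, so the feature-map shape, the reduction ratio `r`, the random weights, and the stand-in `transform` (a placeholder for the block's convolution layers) are all assumptions made for illustration.

```python
import numpy as np

def se_block(x, w1, b1, w2, b2):
    """Generic squeeze-and-excitation over a (C, H, W) feature map.

    Returns the channel-reweighted map and the per-channel gates.
    """
    # Squeeze: global average pooling gives one descriptor per channel, shape (C,)
    z = x.mean(axis=(1, 2))
    # Excitation: bottleneck layer with ReLU, then a sigmoid gate per channel
    h = np.maximum(0.0, w1 @ z + b1)            # shape (C // r,)
    s = 1.0 / (1.0 + np.exp(-(w2 @ h + b2)))    # shape (C,), values in (0, 1)
    # Scale: multiply every channel by its gate
    return x * s[:, None, None], s

def se_residual_block(x, transform, se_params):
    """Residual block with SE applied to the residual branch before the skip add."""
    y = transform(x)                  # hypothetical stand-in for the conv layers
    y, s = se_block(y, *se_params)
    return np.maximum(0.0, x + y), s  # skip connection followed by ReLU

# Assumed sizes: 8 channels, reduction ratio 4, a 13 x 20 MFCC-derived map.
rng = np.random.default_rng(0)
C, r = 8, 4
x = rng.standard_normal((C, 13, 20))
se_params = (
    rng.standard_normal((C // r, C)) * 0.1, np.zeros(C // r),
    rng.standard_normal((C, C // r)) * 0.1, np.zeros(C),
)
out, gates = se_residual_block(x, lambda t: 0.1 * t, se_params)
print(out.shape, gates.round(3))
```

Each gate is a value between 0 and 1 computed from the whole feature map, so the block can suppress or emphasize individual channels before the skip connection is added.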


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11510884
DOI: http://dx.doi.org/10.3390/s24206575
