Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features.

Sensors (Basel)

Department of Computer and Communication Engineering, Ming Chuan University, Taoyuan 333, Taiwan.

Published: June 2022

In recent years, the use of Artificial Intelligence for emotion recognition has attracted much attention. The industrial applicability of emotion recognition is quite comprehensive and has good development potential. This research uses voice emotion recognition technology to apply it to Chinese speech emotion recognition. The main purpose of this research is to transform gradually popularized smart home voice assistants or AI system service robots from a touch-sensitive interface to a voice operation. This research proposed a specifically designed Deep Neural Network (DNN) model to develop a Chinese speech emotion recognition system. In this research, 29 acoustic characteristics in acoustic theory are used as the training attributes of the proposed model. This research also proposes a variety of audio adjustment methods to amplify datasets and enhance training accuracy, including waveform adjustment, pitch adjustment, and pre-emphasize. This study achieved an average emotion recognition accuracy of 88.9% in the CASIA Chinese sentiment corpus. The results show that the deep learning model and audio adjustment method proposed in this study can effectively identify the emotions of Chinese short sentences and can be applied to Chinese voice assistants or integrated with other dialogue applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269147PMC
http://dx.doi.org/10.3390/s22134744DOI Listing

Publication Analysis

Top Keywords

emotion recognition
28
chinese speech
12
speech emotion
12
deep neural
8
neural network
8
voice assistants
8
audio adjustment
8
emotion
7
recognition
7
chinese
6

Similar Publications

Background: Facial emotion recognition testing in Alzheimer's disease (AD) patients has been identified as key for early detection and as a marker for disease progression. Emotion recognition remains one of the most difficult domains to assess in culturally diverse populations due to a lack of culturally adapted tools. This study assessed the feasibility of a cross-cultural test for emotion recognition, the TIE-93, in French and North African populations living in France.

View Article and Find Full Text PDF

Dementia Care Practice.

Alzheimers Dement

December 2024

Sounds Good Choir, NFP, Downers Grove, IL, USA.

Background: Singing improves mood, social, and physical well-being (Pentikainen et al., 2021). Choral singing has therefore gained recognition as a highly recommended activity for older adults and persons with dementia to fight isolation (Petrovsky et al.

View Article and Find Full Text PDF

Technology and Dementia Preconference.

Alzheimers Dement

December 2024

Harvard Medical School, Belmont, MA, USA.

There is growing recognition of the importance of changes in behavior as an early clinical marker of the onset of Alzheimer's disease. Behavior symptoms may precede the onset of cognitive symptoms by as many as three years. However, these symptoms are often confused for primary psychiatric pathology and there is an urgent need for markers that can help quantify behaviors with the eventual goal of helping distinguish behavior changes related to psychiatric pathology from behavior changes related to the onset of dementia.

View Article and Find Full Text PDF

Objectives: Knowledge about the long-term course and prognosis of persistent somatic symptoms (PSS) is important to improve clinical decision-making and guidance for patients with PSS. Therefore, we aimed to: (1) identify distinct 5-year trajectories of symptom severity, physical and mental functioning in adult patients with PSS and (2) explore patient characteristics associated with these trajectories.

Design: We used longitudinal data (seven measurements over a 5-year period) of the PROSPECTS study: a prospective cohort of adult patients with PSS.

View Article and Find Full Text PDF

Background: Recognition of emotion changes is of great significance to a person's physical and mental health. At present, EEG-based emotion recognition methods are mainly focused on time or frequency domains, but rarely on spatial information. Therefore, the goal of this study is to improve the performance of emotion recognition by integrating frequency and spatial domain information under multi-frequency bands.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!