Hierarchical transformer speech depression detection model research based on Dynamic window and Attention merge.

Xiaoping Yue, Chunna Zhang, Zhijian Wang, Yang Yu, Shengqiang Cong, Yuming Shen, Jinchi Zhao

PeerJ Comput Sci

School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, Liaoning, China.

Published: September 2024

Speech-based depression detection is widely used because speech is easy to acquire and carries emotional information. However, existing approaches struggle to segment depressed speech effectively and to integrate the resulting segments, and repeated merging can blur the original information. These problems limit the effectiveness of existing models. This article proposes DWAM-Former, a Hierarchical Transformer model for speech depression detection based on dynamic window and attention merge. DWAM-Former uses a Learnable Speech Split module (LSSM) to separate the phonemes and words within a complete speech segment. An Adaptive Attention Merge module (AAM) then generates a representative feature for each phoneme and word in the sentence. A Variable-Length Residual module (VL-RM) links the original feature information to the merged features, reducing the feature loss caused by repeated merging. DWAM-Former achieves highly competitive results on the DAIC-WOZ depression detection dataset, with an MF1 score of 0.788 in the experiment, a 7.5% improvement over previous research.
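The merge-and-residual idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the window boundaries are fixed by hand here (the paper's LSSM learns them), and the single `query` vector, the function names, and the use of the window mean as the residual term are all assumptions made for illustration only.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_merge(frames, query):
    # Collapse a variable-length window of frame vectors into one vector,
    # weighting each frame by softmax(frame . query).
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frames]
    weights = softmax(scores)
    dim = len(frames[0])
    return [sum(w * frame[d] for w, frame in zip(weights, frames))
            for d in range(dim)]

def merge_with_residual(frames, boundaries, query):
    # For each window [start, end): attention-merge the frames, then add
    # the mean of the original frames back as a residual (an assumed
    # stand-in for a variable-length residual connection).
    merged = []
    for start, end in boundaries:
        window = frames[start:end]
        pooled = attention_merge(window, query)
        mean = [sum(col) / len(window) for col in zip(*window)]
        merged.append([p + m for p, m in zip(pooled, mean)])
    return merged

# Five 2-dimensional frames split into two windows of different lengths.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 0.0], [0.0, 2.0]]
boundaries = [(0, 2), (2, 5)]  # hypothetical; a learned module would supply these
query = [0.0, 0.0]             # zero query gives uniform attention weights
segments = merge_with_residual(frames, boundaries, query)
```

With a zero query the attention weights are uniform, so each merged vector is simply twice the window mean; a non-zero query shifts the weight toward the frames that align with it, while the residual term keeps a trace of the unweighted original features.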

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622959	PMC
http://dx.doi.org/10.7717/peerj-cs.2348	DOI Listing

