Speaker recognition is an important classification task that can be solved using several approaches. Although building a speaker recognition model on a closed set of speakers under neutral speaking conditions is a well-researched task with solutions that provide excellent performance, the classification accuracy of such models decreases significantly when they are applied to emotional speech or in the presence of interference. Furthermore, deep models may require a large number of parameters, so constrained solutions are desirable for implementation on edge devices in Internet of Things (IoT) systems for real-time detection. The aim of this paper is to propose a simple, constrained convolutional neural network for speaker recognition and to examine its robustness under emotional speech conditions. We examine three quantization methods for developing a constrained network: floating-point eight (FP8) format, ternary scalar quantization, and binary scalar quantization. The results are demonstrated on the recently recorded SEAC dataset.
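To make the two scalar quantization schemes named in the abstract concrete, the sketch below shows the common form of binary and ternary weight quantization (per-tensor sign quantization with a single scale, and the thresholded three-level variant). This is a minimal illustration of the general techniques, not the paper's exact scheme; the threshold factor `t=0.7` follows a widely used ternary-weight heuristic and is an assumption here.

```python
import numpy as np

def binarize(w):
    """Binary scalar quantization: every weight becomes +/-alpha,
    where alpha is the mean absolute weight (one scale per tensor)."""
    alpha = np.abs(w).mean()
    return alpha * np.sign(w)

def ternarize(w, t=0.7):
    """Ternary scalar quantization: weights below a threshold snap to 0,
    the rest to +/-alpha. delta = t * mean(|w|) is a common heuristic
    (assumed here, not taken from the paper)."""
    delta = t * np.abs(w).mean()
    mask = np.abs(w) > delta
    alpha = np.abs(w[mask]).mean() if mask.any() else 0.0
    return alpha * np.sign(w) * mask

rng = np.random.default_rng(0)
w = rng.standard_normal((3, 3)).astype(np.float32)
wb = binarize(w)   # at most 2 distinct levels: -alpha, +alpha
wt = ternarize(w)  # at most 3 distinct levels: -alpha, 0, +alpha
```

Either scheme shrinks each weight to 1-2 bits plus one shared scale, which is what makes such networks attractive for edge deployment.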


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947568
DOI: http://dx.doi.org/10.3390/e24030414
