Non-intrusive speech quality assessment with attention-based ResNet-BiLSTM.

Signal Image Video Process

Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China.

Published: April 2023

Speech quality in online conferencing applications is frequently affected by a variety of factors, such as background noise, reverberation, packet loss and network jitter. In real scenarios, it is impossible to obtain a clean reference signal for evaluating the quality of the conferencing speech. Therefore, an effective non-intrusive speech quality assessment (NISQA) method is necessary. In this paper, we propose a new network framework for NISQA based on ResNet and BiLSTM. ResNet is utilized to extract local features, while BiLSTM is used to integrate representative features with long-term time dependencies and sequential characteristics. Considering that ResNet may result in the loss of context information when applied to the NISQA task, we propose a variant of ResNet which can preserve the time series information of the conferencing speech. The experimental results demonstrate that the scores predicted by the proposed method correlate highly with the mean opinion score (MOS) of clean, noisy and processed speech.
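
As an illustration of how agreement with the mean opinion score is typically quantified, the following minimal sketch computes a Pearson correlation coefficient and a root-mean-square error between subjective and predicted scores. The abstract does not say which correlation measure or error metric the authors report, so the choice of Pearson and RMSE here is an assumption. The function names and the toy score lists are hypothetical and are not taken from the paper.

```python
import math


def pearson(x, y):
    """Pearson linear correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def rmse(x, y):
    """Root-mean-square error between two equal-length sequences."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


# Toy values invented for illustration only (NOT results from the paper).
subjective_mos = [1.5, 2.0, 3.0, 4.0, 4.5]  # listener ratings on a 1-5 scale
predicted_mos = [1.7, 2.2, 2.9, 3.8, 4.6]   # hypothetical model outputs

print("Pearson correlation:", round(pearson(subjective_mos, predicted_mos), 3))
print("RMSE:", round(rmse(subjective_mos, predicted_mos), 3))
```

A correlation close to 1 means the predicted scores rank and scale the speech samples much as human listeners do, while RMSE shows how far the predictions are from the ratings in absolute terms.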

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10088708
DOI: http://dx.doi.org/10.1007/s11760-023-02559-2

Similar Publications

Exploring the Intersection of Hegemonic Masculinity, Sexuality, and Addiction in Men: A Qualitative Study.

Healthcare (Basel)

December 2024

Department of Personality, Evaluation and Psychological Treatment, Faculty of Psychology and Speech Therapy, University of Murcia, Building 31, 30100 Murcia, Spain.

In our society, as well as in many other parts of the world, sexuality is shaped through gender-differentiated socialization. This process compels individuals to align their desires, behaviors, emotions, and thoughts with the expectations of normative sexuality, especially hegemonic heterosexuality. The primary objective of this current research was to examine the influence of hegemonic masculinity on the sexuality of men struggling with addiction.

Construction of prediction model of early glottic cancer based on machine learning.

Acta Otolaryngol

January 2025

Department of Otorhinolaryngology Head and Neck Surgery, Tianjin First Central Hospital, Tianjin, China.

Background: The early diagnosis of glottic laryngeal cancer is the key to successful treatment, and machine learning (ML) combined with narrow-band imaging (NBI) laryngoscopy provides a new idea for the early diagnosis of glottic laryngeal cancer.

Objective: To explore the clinical applicability of the diagnosis of early glottic cancer based on ML combined with NBI.

Material And Methods: A retrospective study was conducted on 200 patients diagnosed with laryngeal mass, and the general clinical characteristics and pathological results of the patients were collected.

Motherese Directed at Prelinguistic Infants at Risk for Neurological Disorders: An Exploratory Study.

J Child Lang

January 2025

Department of Developmental Neuroscience, IRCCS Stella Maris Foundation, Pisa, Italy.

To investigate how a high risk for infant neurological impairment affects the quality of infant verbal interactions, and in particular properties of infant-directed speech, spontaneous interactions between 14 mothers and their 4.5-month-old infants at high risk for neurological disorders (7 female) were recorded and acoustically compared with those of 14 dyads with typically developing infants (8 female). Mothers of at-risk infants had proportionally less voicing, and the proportion of voicing decreased with increasing severity of the infants' long-term outcome.

Purpose: This study aimed to assess the levels and sources of noise in the emergency intensive care unit (EICU) of an emergency department and investigate their effects on the sleep quality of conscious patients.

Methods: A study was conducted on patients admitted to the EICU from December 2020 to December 2023. They were categorised according to their sleep quality with the Pittsburgh Sleep Quality Index.

Importance: Family-centered care (FCC) in neonatal intensive care units (NICUs) is critical for parental involvement and infant well-being, yet few studies have evaluated the impact of FCC interventions on practice or examined how implementation fidelity may affect these outcomes.

Objectives: To evaluate the association between the Close Collaboration With Parents intervention and FCC practices and how implementation fidelity may modify these outcomes.

Design, Setting, And Participants: This nonrandomized clinical trial had a before-and-after design.
