A multi-stage transfer learning strategy for diagnosing a class of rare laryngeal movement disorders.

Comput Biol Med

The Department of Electrical Engineering and Computer Science, Vanderbilt University, 2301 Vanderbilt Place, Nashville, 37235, TN, USA.

Published: November 2023

Background: It remains hard to directly apply deep learning-based methods to assist diagnosing essential tremor of voice (ETV) and abductor and adductor spasmodic dysphonia (ABSD and ADSD). One of the main challenges is that, as a class of rare laryngeal movement disorders (LMDs), there are limited available databases to be investigated. Another worthy explored research question is which above sub-disorder benefits most from diagnosis based on sustained phonations. The question is from the fact that sustained phonations can help detect pathological voice from healthy voice.

Method: A transfer learning strategy is developed for LMD diagnosis with limited data, which consists of three fundamental parts. (1) An extra vocally healthy database from the International Dialects of English Archive (IDEA) is employed to pre-train a convolutional autoencoder. (2) The transferred proportion of the pre-trained encoder is explored. And its impact on LMD diagnosis is also evaluated, yielding a two-stage transfer model. (3) A third stage is designed following the initial two stages to embed information of pathological sustained phonation into the model. This stage verifies the different effects of applying sustained phonation on diagnosing the three sub-disorders, and helps boost the final diagnostic performance.

Results: The analysis in this study is based on clinician-labeled LMD data obtained from the Vanderbilt University Medical Center (VUMC). We find that diagnosing ETV shows sensitivity to sustained phonation within the current database. Meanwhile, the results show that the proposed multi-stage transfer learning strategy can produce (1) accuracy of 65.3% on classifying normal and other three sub-disorders all at once, (2) accuracy of 85.3% in differentiating normal, ABSD, and ETV, and (3) accuracy of 77.7% for normal, ADSD and ETV. These findings demonstrate the effectiveness of the proposed approach.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2023.107534DOI Listing

Publication Analysis

Top Keywords

transfer learning
12
learning strategy
12
sustained phonation
12
multi-stage transfer
8
class rare
8
rare laryngeal
8
laryngeal movement
8
movement disorders
8
sustained phonations
8
lmd diagnosis
8

Similar Publications

The feasibility of using machine learning to predict COVID-19 cases.

Int J Med Inform

January 2025

School of Geography and the Environment, University of Oxford, South Parks Road, Oxford OX1 3QY, United Kingdom. Electronic address:

Background: Coronavirus Disease 2019 (COVID-19), caused by the SARS-CoV-2 virus, emerged as a global health crisis in 2019, resulting in widespread morbidity and mortality. A persistent challenge during the pandemic has been the accuracy of reported epidemic data, particularly in underdeveloped regions with limited access to COVID-19 test kits and healthcare infrastructure. In the post-COVID era, this issue remains crucial.

View Article and Find Full Text PDF

Identification of an ANCA-associated vasculitis cohort using deep learning and electronic health records.

Int J Med Inform

January 2025

Rheumatology and Allergy Clinical Epidemiology Research Center and Division of Rheumatology, Allergy, and Immunology, and Mongan Institute, Department of Medicine, Massachusetts General Hospital Boston MA USA. Electronic address:

Background: ANCA-associated vasculitis (AAV) is a rare but serious disease. Traditional case-identification methods using claims data can be time-intensive and may miss important subgroups. We hypothesized that a deep learning model analyzing electronic health records (EHR) can more accurately identify AAV cases.

View Article and Find Full Text PDF

Background: The application of natural language processing in medicine has increased significantly, including tasks such as information extraction and classification. Natural language processing plays a crucial role in structuring free-form radiology reports, facilitating the interpretation of textual content, and enhancing data utility through clustering techniques. Clustering allows for the identification of similar lesions and disease patterns across a broad dataset, making it useful for aggregating information and discovering new insights in medical imaging.

View Article and Find Full Text PDF

Diagnosis of lung cancer using salivary miRNAs expression and clinical characteristics.

BMC Pulm Med

January 2025

Universal Scientific Education and Research Network (USERN), Tehran, Iran.

Objective: Lung cancer (LC), the primary cause for cancer-related death globally is a diverse illness with various characteristics. Saliva is a readily available biofluid and a rich source of miRNA. It can be collected non-invasively as well as transported and stored easily.

View Article and Find Full Text PDF

Background: Drug-drug interactions (DDIs) especially antagonistic ones present significant risks to patient safety, underscoring the urgent need for reliable prediction methods. Recently, substructure-based DDI prediction has garnered much attention due to the dominant influence of functional groups and substructures on drug properties. However, existing approaches face challenges regarding the insufficient interpretability of identified substructures and the isolation of chemical substructures.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!