AI Article Synopsis

  • The paper discusses using a deep learning model to objectively assess speech functions during awake craniotomies, aiming to improve surgical outcomes by minimizing reliance on clinician observations.
  • It involved analyzing 1883 audio clips from surgeries in Japan and France to train a Wav2Vec2-based model, which achieved an F1-score of 84.12% for Japanese data and 74.68% when tested across languages.
  • While the initial results are promising, further evaluation and integration of noise reduction techniques are necessary to enhance the model's performance and accuracy.

Article Abstract

Purpose: Awake craniotomy presents a unique opportunity to map and preserve critical brain functions, particularly speech, during tumor resection. The ability to accurately assess linguistic functions in real-time not only enhances surgical precision, but also contributes significantly to improving postoperative outcomes. However, today, its evaluation is subjective as it relies on a clinician's observations only. This paper explores the use of a deep learning based model for the objective assessment of speech arrest and speech impairments during awake craniotomy.

Methods: We extracted 1883 3-second audio clips containing the patient's response following direct electrical stimulation from 23 awake craniotomies recorded from two operating rooms of the Tokyo Women's Medical University Hospital (Japan) and two awake craniotomies recorded from the University Hospital of Brest (France). A Wav2Vec2-based model has been trained and used to detect speech arrests and speech impairments. Experiments were performed with different datasets settings and preprocessing techniques and the performances of the model were evaluated using the F1-score.

Results: The F1-score was 84.12% when the model was trained and tested on Japanese data only. In a cross-language situation, the F1-score was 74.68% when the model was trained on Japanese data and tested on French data.

Conclusions: The results are encouraging even in a cross-language situation but further evaluation is required. The integration of preprocessing techniques, in particular noise reduction, improved the results significantly.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s11548-024-03301-0DOI Listing

Publication Analysis

Top Keywords

speech impairments
12
model trained
12
speech arrests
8
arrests speech
8
impairments awake
8
awake craniotomy
8
awake craniotomies
8
craniotomies recorded
8
university hospital
8
preprocessing techniques
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!