Phoneme pronunciations are usually considered as basic skills for learning a foreign language. Practicing the pronunciations in a computer-assisted way is helpful in a self-directed or long-distance learning environment. Recent researches indicate that machine learning is a promising method to build high-performance computer-assisted pronunciation training modalities. Many data-driven classifying models, such as support vector machines, back-propagation networks, deep neural networks and convolutional neural networks, are increasingly widely used for it. Yet, the acoustic waveforms of phoneme are essentially modulated from the base vibrations of vocal cords, and this fact somehow makes the predictors collinear, distorting the classifying models. A commonly-used solution to address this issue is to suppressing the collinearity of predictors via partial least square regressing algorithm. It allows to obtain high-quality predictor weighting results via predictor relationship analysis. However, as a linear regressor, the classifiers of this type possess very simple topology structures, constraining the universality of the regressors. For this issue, this paper presents an heterogeneous phoneme recognition framework which can further benefit the phoneme pronunciation diagnostic tasks by combining the partial least square with support vector machines. A French phoneme data set containing 4830 samples is established for the evaluation experiments. The experiments of this paper demonstrates that the new method improves the accuracy performance of the phoneme classifiers by 0.21 - 8.47% comparing to state-of-the-arts with different data training data density.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8523060PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0257901PLOS

Publication Analysis

Top Keywords

machine learning
8
french phoneme
8
phoneme pronunciation
8
pronunciation training
8
classifying models
8
support vector
8
vector machines
8
neural networks
8
partial square
8
phoneme
7

Similar Publications

In the context of Chinese clinical texts, this paper aims to propose a deep learning algorithm based on Bidirectional Encoder Representation from Transformers (BERT) to identify privacy information and to verify the feasibility of our method for privacy protection in the Chinese clinical context. We collected and double-annotated 33,017 discharge summaries from 151 medical institutions on a municipal regional health information platform, developed a BERT-based Bidirectional Long Short-Term Memory Model (BiLSTM) and Conditional Random Field (CRF) model, and tested the performance of privacy identification on the dataset. To explore the performance of different substructures of the neural network, we created five additional baseline models and evaluated the impact of different models on performance.

View Article and Find Full Text PDF

Human vs Machine: The Future of Decision-making in Plastic and Reconstructive Surgery.

Aesthet Surg J

January 2025

Department of Plastic, Reconstructive and Aesthetic Surgery, Faculty of Medicine, Altınbas University, Istanbul, Turkey.

Background: Artificial intelligence (AI)-driven technologies offer transformative potential in plastic surgery, spanning pre-operative planning, surgical procedures, and post-operative care, with the promise of improved patient outcomes.

Objectives: To compare the web-based ChatGPT-4o (omni; OpenAI, San Francisco, CA) and Gemini Advanced (Alphabet Inc., Mountain View, CA), focusing on their data upload feature and examining outcomes before and after exposure to CME articles, particularly regarding their efficacy relative to human participants.

View Article and Find Full Text PDF

How Outcome Prediction Could Aid Clinical Practice.

Br J Hosp Med (Lond)

January 2025

Department of Surgery & Cancer, Imperial College London, London, UK.

Predictive algorithms have myriad potential clinical decision-making implications from prognostic counselling to improving clinical trial efficiency. Large observational (or "real world") cohorts are a common data source for the development and evaluation of such tools. There is significant optimism regarding the benefits and use cases for risk-based care, but there is a notable disparity between the volume of clinical prediction models published and implementation into healthcare systems that drive and realise patient benefit.

View Article and Find Full Text PDF

Tryptophan catabolism is a central pathway in many cancers, serving to sustain an immunosuppressive microenvironment. The key enzymes involved in this tryptophan metabolism such as indoleamine 2,3-dioxygenase 1 (IDO1) and tryptophan 2,3-dioxygenase (TDO) are reported as promising novel targets in cancer immunotherapy. IDO1 and TDO overexpression in TNBC cells promote resistance to cell death, proliferation, invasion, and metastasis.

View Article and Find Full Text PDF

Radio frequency identification (RFID) technology and marker recognition algorithms can offer an efficient and non-intrusive means of tracking animal positions. As such, they have become important tools for invertebrate behavioral research. Both approaches require fixing a tag or marker to the study organism, and so it is useful to quantify the effects such procedures have on behavior before proceeding with further research.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!