Efficient incremental training using a novel NMT-SMT hybrid framework for translation of low-resource languages.

Front Artif Intell

School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.

Published: September 2024

The data-hungry statistical machine translation (SMT) and neural machine translation (NMT) models offer state-of-the-art results for languages with abundant data resources. However, extensive research is imperative to make these models perform equally well for low-resource languages. This paper proposes a novel approach to integrate the best features of the NMT and SMT systems for improved translation performance of low-resource English-Tamil language pair. The suboptimal NMT model trained with the small parallel corpus translates the monolingual corpus and selects only the best translations, to retrain itself in the next iteration. The proposed method employs the SMT phrase-pair table to determine the best translations, based on the maximum match between the words of the phrase-pair dictionary and each of the individual translations. This repeating cycle of translation and retraining generates a large quasi-parallel corpus, thus making the NMT model more powerful. SMT-integrated incremental training demonstrates a substantial difference in translation performance as compared to the existing approaches for incremental training. The model is strengthened further by adopting a beam search decoding strategy to produce best possible translations for each input sentence. Empirical findings prove that the proposed model with BLEU scores of 19.56 and 23.49 outperforms the baseline NMT with scores 11.06 and 17.06 for Eng-to-Tam and Tam-to-Eng translations, respectively. METEOR score evaluation further corroborates these results, proving the supremacy of the proposed model.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11461459PMC
http://dx.doi.org/10.3389/frai.2024.1381290DOI Listing

Publication Analysis

Top Keywords

incremental training
12
best translations
12
low-resource languages
8
machine translation
8
translation performance
8
nmt model
8
proposed model
8
translation
6
nmt
5
model
5

Similar Publications

Question: Cognitive-behavioural therapy (CBT) is frequently implemented for individuals with attention-deficit hyperactivity disorder (ADHD). It is still unknown which specific components are effective, because CBT is a complex intervention with several components. The objective of this review was to assess the efficacy of CBT components for ADHD.

View Article and Find Full Text PDF

Introduction: Despite recommendations from the WHO, antenatal care (ANC) coverage remains low in many low-income and middle-income countries (LMICs). Community health workers (CHWs) can play an important role in expanding ANC coverage through pregnancy identification, provision of health education, screening for complications, delivery of therapeutic care and referral to higher levels of care. However, despite the success of CHW programmes in various countries, WHO has called for additional research to develop evidence-based models that optimise CHW service delivery and that can be replicated across geographies.

View Article and Find Full Text PDF

Deep learning systems are prone to catastrophic forgetting when learning from a sequence of tasks, as old data from previous tasks is unavailable when learning a new task. To address this, some methods propose replaying data from previous tasks during new task learning, typically using extra memory to store replay data. However, it is not expected in practice due to memory constraints and data privacy issues.

View Article and Find Full Text PDF

Unlabelled: A cost-effectiveness analysis of FRAX® intervention thresholds (ITs) in Indian women over 50 years indicated that generic alendronate was cost-effective for age-dependent major osteoporotic fracture (MOF) ITs and hip fracture (HF) ITs starting at ages 60 and 65 years for full and real-world adherence, respectively. Alendronate was cost-effective at fixed MOF IT of 14% and HF IT of 3.5%, regardless of age.

View Article and Find Full Text PDF

Objective: Inflammatory characteristics in pericoronary adipose tissue (PCAT) may enhance the diagnostic capability of radiomics techniques for identifying vulnerable plaques. This study aimed to evaluate the incremental value of PCAT radiomics scores in identifying vulnerable plaques defined by intravascular ultrasound imaging (IVUS).

Methods: In this retrospective study, a PCAT radiomics model was established and validated using IVUS as the reference standard.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!