This article introduces the Saudi Learner Translation Corpus (SauLTC), an innovative multi-version English-Arabic parallel corpus featuring part-of-speech tagging. We describe the corpus parameters and compilation process and explain how textual processing and sentence alignment are conducted. The participants include 366 student translators, 48 instructors, and 23 alignment verifiers. The corpus provides access to two target versions of every ST to allow the detection of the changes in the translation and revision processes from the initial to the final draft. The translations were collected over three years, yielding 5,160,386 tokens. The metadata of 23 sentence alignment verifiers were added to the analysis as a unique variable to investigate individual differences in the manual verification process. This unidirectional corpus can be used to identify student translators' strategies and errors in translation and analyze the efficacy of instructors' feedback. Furthermore, it is accessible via an application and a website. It provides translation teachers and researchers with a database that can help develop corpus-based and corpus-driven teaching materials.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11498738PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0303729PLOS

Publication Analysis

Top Keywords

learner translation
12
translation corpus
12
saudi learner
8
sentence alignment
8
alignment verifiers
8
corpus
7
translation
6
corpus design
4
design compilation
4
compilation english-arabic
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!