Computer-aided translation tools based on translation memories are widely used to assist professional translators. A translation memory (TM) consists of a set of translation units (TU) made up of source- and target-language segment pairs. For the translation of a new source segment s, these tools search the TM and retrieve the TUs (s,t) whose source segments are more similar to s. The translator then chooses a TU and edit the target segment t to turn it into an adequate translation of s. Fuzzy-match repair (FMR) techniques can be used to automatically modify the parts of t that need to be edited. We describe a language-independent FMR method that first uses machine translation to generate, given s and (s,t), a set of candidate fuzzy-match repaired segments, and then chooses the best one by estimating their quality. An evaluation on three different language pairs shows that the selected candidate is a good approximation to the best (oracle) candidate produced and is closer to reference translations than machine-translated segments and unrepaired fuzzy matches ( t). In addition, a single quality estimation model trained on a mix of data from all the languages performs well on any of the languages used.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2020.3021361DOI Listing

Publication Analysis

Top Keywords

fuzzy-match repair
8
quality estimation
8
translation
7
repair guided
4
guided quality
4
estimation computer-aided
4
computer-aided translation
4
translation tools
4
tools based
4
based translation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!