Automatic Classification, Visualization and Analysis of Errors in Machine Translation

dc.contributor.authorJayaweera, Chathuri
dc.contributor.authorDias, Gihan
dc.date.accessioned2026-05-20T05:18:23Z
dc.date.issued2020
dc.description.abstractAlthough the quality of machine translation (MT) has improved in recent years, machine translated documents still contain errors. MT quality is often evaluated using a single numeric score. However, this may not adequately characterise the system. We provide an error visualizer, which shows differences between corresponding lines of two translations. In addition to insertions, deletions and substitutions, our system also shows transpositions. We also provide an error analyzer which gives statistics of each type of error in the document. In addition, it shows errors in context: the words commonly adjacent to each error, and also the adjacent parts of speech (POS). This feature - unique to our system - allows the identification of the context in which errors occur, so they can be rectified easily. The system was evaluated by three MT system developers, who identified useful features and provided feedback which was used to improve the system.
dc.identifier.citationJayaweera, Chathuri., & Dias, Gihan. (2020). Automatic Classification, Visualization and Analysis of Errors in Machine Translation. Sri Lanka Technology Campus, Padukka, Sri Lanka.
dc.identifier.urihttps://res.sltc.ac.lk/handle/789/58
dc.language.isoen
dc.publisherSri Lanka Technology Campus
dc.subjectcomparison
dc.subjecterror analysis
dc.subjecterror classification
dc.subjectevaluation
dc.subjectmachine translation
dc.subjectMT
dc.titleAutomatic Classification, Visualization and Analysis of Errors in Machine Translation
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Automatic Classification....pdf
Size:
67.94 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: