Skip to main navigation Skip to search Skip to main content

Hybrid translation with classification: Revisiting rule-based and neural machine translation

  • Jin Xia Huang*
  • , Kyung Soon Lee
  • , Young Kil Kim
  • *Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

This paper proposes a hybrid machine-translation system that combines neural machine translation with well-developed rule-based machine translation to utilize the stability of the latter to compensate for the inadequacy of neural machine translation in rare-resource domains. A classifier is introduced to predict which translation from the two systems is more reliable. We explore a set of features that reflect the reliability of translation and its process, and training data is automatically expanded with a small, human-labeled dataset to solve the insufficient-data problem. A series of experiments shows that the hybrid system’s translation accuracy is improved, especially in out-of-domain translations, and classification accuracy is greatly improved when using the proposed features and the automatically constructed training set. A comparison between feature-and text-based classification is also performed, and the results show that the feature-based model achieves better classification accuracy, even when compared to neural network text classifiers.

Original languageEnglish
Article number201
JournalElectronics (Switzerland)
Volume9
Issue number2
DOIs
StatePublished - 2020.02

Keywords

  • Feature-based classification
  • Hybrid machine translation
  • Neural machine translation
  • Rule-based machine translation

Fingerprint

Dive into the research topics of 'Hybrid translation with classification: Revisiting rule-based and neural machine translation'. Together they form a unique fingerprint.

Cite this