Abstract
This paper proposes a hybrid machine-translation system that combines neural machine translation with well-developed rule-based machine translation to utilize the stability of the latter to compensate for the inadequacy of neural machine translation in rare-resource domains. A classifier is introduced to predict which translation from the two systems is more reliable. We explore a set of features that reflect the reliability of translation and its process, and training data is automatically expanded with a small, human-labeled dataset to solve the insufficient-data problem. A series of experiments shows that the hybrid system’s translation accuracy is improved, especially in out-of-domain translations, and classification accuracy is greatly improved when using the proposed features and the automatically constructed training set. A comparison between feature-and text-based classification is also performed, and the results show that the feature-based model achieves better classification accuracy, even when compared to neural network text classifiers.
| Original language | English |
|---|---|
| Article number | 201 |
| Journal | Electronics (Switzerland) |
| Volume | 9 |
| Issue number | 2 |
| DOIs | |
| State | Published - 2020.02 |
Keywords
- Feature-based classification
- Hybrid machine translation
- Neural machine translation
- Rule-based machine translation
Fingerprint
Dive into the research topics of 'Hybrid translation with classification: Revisiting rule-based and neural machine translation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver