Memory-augmented neural machine translation
Feng, Yang and Zhang, Shiyue and Zhang, Andi and Wang, Dong and Abel, Andrew; (2017) Memory-augmented neural machine translation. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics (ACL), DNK, pp. 1390-1399. ISBN 9781945626838 (https://doi.org/10.18653/v1/d17-1146)
Preview |
Text.
Filename: Feng_etal_EMNLP2017_Memory_augmented_neural_machine_translation.pdf
Final Published Version License: Download (371kB)| Preview |
Abstract
Neural machine translation (NMT) has achieved notable success in recent times, however it is also widely recognized that this approach has limitations with handling infrequent words and word pairs. This paper presents a novel memory-augmented NMT (M-NMT) architecture, which stores knowledge about how words (usually infrequently encountered ones) should be translated in a memory and then utilizes them to assist the neural model. We use this memory mechanism to combine the knowledge learned from a conventional statistical machine translation system and the rules learned by an NMT system, and also propose a solution for out-of-vocabulary (OOV) words based on this framework. Our experiments on two Chinese-English translation tasks demonstrated that the M-NMT architecture outperformed the NMT baseline by 9.0 and 2.7 BLEU points on the two tasks, respectively. Additionally, we found this architecture resulted in a much more effective OOV treatment compared to competitive methods.
ORCID iDs
Feng, Yang, Zhang, Shiyue, Zhang, Andi, Wang, Dong and Abel, Andrew ORCID: https://orcid.org/0000-0002-3631-8753;-
-
Item type: Book Section ID code: 86690 Dates: DateEvent11 September 2017PublishedSubjects: Science > Mathematics > Electronic computers. Computer science Department: Faculty of Engineering > Electronic and Electrical Engineering
Faculty of Science > Computer and Information SciencesDepositing user: Pure Administrator Date deposited: 07 Sep 2023 01:02 Last modified: 11 Nov 2024 15:33 URI: https://strathprints.strath.ac.uk/id/eprint/86690