Current syntactic machine translation (MT) systems implicitly assume an unlimited beam-width in search when learning model parameters (e.g., feature values for each translation rule). However, a limited beam-width has to be adopted when decoding new sentences, and the MT output is generally evaluated with various metrics, such as BLEU and TER. In this paper, we address two problems: 1) the mismatch between the beam-widths adopted in training and decoding; and 2) the mismatch between training criteria and MT evaluation metrics. Unlike previous work, we model both problems simultaneously in a single training paradigm. We design a loss-augmented approach that explicitly takes the limited beam-width and the evaluation metric into account during training, and present a simple but effective method to learn the model. Using beam search and BLEU-related losses, our approach improves a state-of-the-art syntactic MT system by +1.0 BLEU on Chinese-to-English and English-to-Chinese translation tasks. It even outperforms seven previous training approaches by over 0.8 BLEU points. Moreover, promising improvements are observed when our approach is used with TER.
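To make the idea of loss-augmented training concrete, the following is a minimal, illustrative sketch of one way an evaluation-metric loss (here, sentence-level BLEU) can be folded into parameter updates over a limited beam of candidate translations; it is not the paper's exact algorithm, and all names (`Candidate`, `loss_augmented_update`, the learning rate, etc.) are hypothetical.

```python
# Illustrative sketch only (not the paper's method): a perceptron-style,
# loss-augmented update restricted to a limited beam of candidates, so that
# training sees the same limited search space and the same metric (BLEU)
# that are used at decoding and evaluation time.

from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Candidate:
    features: Dict[str, float]  # sparse feature vector of a translation hypothesis
    sent_bleu: float            # sentence-level BLEU of this hypothesis, in [0, 1]


def score(weights: Dict[str, float], feats: Dict[str, float]) -> float:
    """Linear model score: dot product of feature weights and feature values."""
    return sum(weights.get(f, 0.0) * v for f, v in feats.items())


def loss_augmented_update(weights: Dict[str, float],
                          beam: List[Candidate],
                          lr: float = 0.1) -> None:
    """One update over a limited beam of candidates.

    'hope' = candidate with high model score and high BLEU;
    'fear' = candidate with high model score but high metric loss (1 - BLEU).
    Moving the weights toward hope and away from fear pushes the model to
    prefer hypotheses that the evaluation metric also prefers.
    """
    hope = max(beam, key=lambda c: score(weights, c.features) + c.sent_bleu)
    fear = max(beam, key=lambda c: score(weights, c.features) + (1.0 - c.sent_bleu))
    if hope is fear:
        return  # the model already agrees with the metric on this beam
    for f, v in hope.features.items():
        weights[f] = weights.get(f, 0.0) + lr * v
    for f, v in fear.features.items():
        weights[f] = weights.get(f, 0.0) - lr * v
```

In this sketch, `beam` would be produced by decoding each training sentence with the same limited beam-width used at test time, which is what ties the training procedure to the decoding condition described above.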