Boxing Chen
National Research Council
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Boxing Chen.
workshop on statistical machine translation | 2014
Boxing Chen; Colin Cherry
BLEU is the de facto standard machine translation (MT) evaluation metric. How- ever, because BLEU computes a geo- metric mean of n-gram precisions, it of- ten correlates poorly with human judg- ment on the sentence-level. There- fore, several smoothing techniques have been proposed. This paper systemati- cally compares 7 smoothing techniques for sentence-level BLEU. Three of them are first proposed in this paper, and they correlate better with human judgments on the sentence-level than other smoothing techniques. Moreover, we also compare the performance of using the 7 smoothing techniques in statistical machine transla- tion tuning.
meeting of the association for computational linguistics | 2017
Boxing Chen; Colin Cherry; George F. Foster; Samuel Larkin
In this paper, we propose a new domain adaptation technique for neural machine translation called cost weighting, which is appropriate for adaptation scenarios in which a small in-domain data set and a large general-domain data set are available. Cost weighting incorporates a domain classifier into the neural machine translation training algorithm, using features derived from the encoder representation in order to distinguish in-domain from out-of-domain data. Classifier probabilities are used to weight sentences according to their domain similarity when updating the parameters of the neural translation model. We compare cost weighting to two traditional domain adaptation techniques developed for statistical machine translation: data selection and sub-corpus weighting. Experiments on two large-data tasks show that both the traditional techniques and our novel proposal lead to significant gains, with cost weighting outperforming the traditional methods.
workshop on statistical machine translation | 2015
Boxing Chen; Hongyu Guo; Roland Kuhn
Translations generated by current statistical systems often have a large variance, in terms of their quality against human references. To cope with such variation, we propose to evaluate translations using a multi-level framework. The method varies the evaluation criteria based on the clusters to which a translation belongs. Our experiments on the WMT metric task data show that the multi-level framework consistently improves the performance of two benchmarking metrics, resulting in better correlation with human judgment.
meeting of the association for computational linguistics | 2013
Boxing Chen; Roland Kuhn; George F. Foster
workshop on statistical machine translation | 2012
Boxing Chen; Roland Kuhn; George F. Foster
workshop on statistical machine translation | 2011
Boxing Chen; Roland Kuhn
meeting of the association for computational linguistics | 2012
Boxing Chen; Roland Kuhn; Samuel Larkin
meeting of the association for computational linguistics | 2010
Boxing Chen; George F. Foster; Roland Kuhn
north american chapter of the association for computational linguistics | 2013
Boxing Chen; George F. Foster; Roland Kuhn
international conference on computational linguistics | 2010
Roland Kuhn; Boxing Chen; George F. Foster; Evan Stratford