Qin Gao
Carnegie Mellon University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Qin Gao.
Software Engineering, Testing, and Quality Assurance for Natural Language Processing | 2008
Qin Gao; Stephan Vogel
Training word alignment models on large corpora is a very time-consuming processes. This paper describes two parallel implementations of GIZA++ that accelerate this word alignment process. One of the implementations runs on computer clusters, the other runs on multi-processor system using multi-threading technology. Results show a near-linear speed-up according to the number of CPUs used, and alignment quality is preserved.
The Prague Bulletin of Mathematical Linguistics | 2010
Qin Gao; Stephan Vogel
Training Phrase-Based Machine Translation Models on the Cloud: Open Source Machine Translation Toolkit Chaski In this paper we present an opensource machine translation toolkit Chaski which is capable of training phrase-based machine translation models on Hadoop clusters. The toolkit provides a full training pipeline including distributed word alignment, word clustering and phrase extraction. The toolkit also provides an extended error-tolerance mechanism over standard Hadoop error-tolerance framework. The paper will describe the underlying methodology and the design of the system, together with instructions of how to run the system on Hadoop clusters.
workshop on statistical machine translation | 2008
Nguyen Bach; Qin Gao; Stephan Vogel
This paper describes the statistical machine translation systems submitted to the ACL-WMT 2008 shared translation task. Systems were submitted for two translation directions: English→Spanish and Spanish→English. Using sentence pair confidence scores estimated with source and target language models, improvements are observed on the News-Commentary test sets. Genre-dependent sentence pair confidence score and integration of sentence pair confidence score into phrase table are also investigated.
workshop on statistical machine translation | 2010
Qin Gao; Nguyen Bach; Stephan Vogel
north american chapter of the association for computational linguistics | 2010
Qin Gao; Stephan Vogel
meeting of the association for computational linguistics | 2011
Qin Gao; Stephan Vogel
Archive | 2009
Nguyen Bach; Qin Gao; Stephan Vogel
Machine Translation Summit XII | 2009
Francisco Guzmán; Qin Gao; Stephan Vogel
meeting of the association for computational linguistics | 2011
Qin Gao; Stephan Vogel
international joint conference on natural language processing | 2011
Nguyen Bach; Qin Gao; Stephan Vogel; Alex Waibel