Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Qin Gao is active.

Publication


Featured researches published by Qin Gao.


Software Engineering, Testing, and Quality Assurance for Natural Language Processing | 2008

Parallel Implementations of Word Alignment Tool

Qin Gao; Stephan Vogel

Training word alignment models on large corpora is a very time-consuming processes. This paper describes two parallel implementations of GIZA++ that accelerate this word alignment process. One of the implementations runs on computer clusters, the other runs on multi-processor system using multi-threading technology. Results show a near-linear speed-up according to the number of CPUs used, and alignment quality is preserved.


The Prague Bulletin of Mathematical Linguistics | 2010

Training Phrase-Based Machine Translation Models on the Cloud Open Source Machine Translation Toolkit Chaski

Qin Gao; Stephan Vogel

Training Phrase-Based Machine Translation Models on the Cloud: Open Source Machine Translation Toolkit Chaski In this paper we present an opensource machine translation toolkit Chaski which is capable of training phrase-based machine translation models on Hadoop clusters. The toolkit provides a full training pipeline including distributed word alignment, word clustering and phrase extraction. The toolkit also provides an extended error-tolerance mechanism over standard Hadoop error-tolerance framework. The paper will describe the underlying methodology and the design of the system, together with instructions of how to run the system on Hadoop clusters.


workshop on statistical machine translation | 2008

Improving Word Alignment with Language Model Based Confidence Scores

Nguyen Bach; Qin Gao; Stephan Vogel

This paper describes the statistical machine translation systems submitted to the ACL-WMT 2008 shared translation task. Systems were submitted for two translation directions: English→Spanish and Spanish→English. Using sentence pair confidence scores estimated with source and target language models, improvements are observed on the News-Commentary test sets. Genre-dependent sentence pair confidence score and integration of sentence pair confidence score into phrase table are also investigated.


workshop on statistical machine translation | 2010

A Semi-Supervised Word Alignment Algorithm with Partial Manual Alignments

Qin Gao; Nguyen Bach; Stephan Vogel


north american chapter of the association for computational linguistics | 2010

Consensus versus expertise: a case study of word alignment with Mechanical Turk

Qin Gao; Stephan Vogel


meeting of the association for computational linguistics | 2011

Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules

Qin Gao; Stephan Vogel


Archive | 2009

Source-side Dependency Tree Reordering Models with Subtree Movements and Constraints

Nguyen Bach; Qin Gao; Stephan Vogel


Machine Translation Summit XII | 2009

Reassessment of the role of phrase extraction in pbsmt

Francisco Guzmán; Qin Gao; Stephan Vogel


meeting of the association for computational linguistics | 2011

Utilizing Target-Side Semantic Role Labels to Assist Hierarchical Phrase-based Machine Translation

Qin Gao; Stephan Vogel


international joint conference on natural language processing | 2011

TriS: A Statistical Sentence Simplifier with Log-linear Models and Margin-based Discriminative Training

Nguyen Bach; Qin Gao; Stephan Vogel; Alex Waibel

Collaboration


Dive into the Qin Gao's collaboration.

Top Co-Authors

Avatar

Stephan Vogel

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Nguyen Bach

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Francisco Guzmán

Qatar Computing Research Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Vamshi Ambati

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Alex Waibel

Karlsruhe Institute of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge