Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Tagyoung Chung is active.

Publication


Featured researches published by Tagyoung Chung.


empirical methods in natural language processing | 2009

Unsupervised Tokenization for Machine Translation

Tagyoung Chung; Daniel Gildea

Training a statistical machine translation starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creates a challenge for tokenization. Moreover, morphologically rich languages such as Korean present an even bigger challenge, since optimal token boundaries for machine translation in these languages are often unclear. Both rule-based solutions and statistical solutions are currently used. In this paper, we present unsupervised methods to solve tokenization problem. Our methods incorporate information available from parallel corpus to determine a good tokenization for machine translation.


Computational Linguistics | 2014

Sampling tree fragments from forests

Tagyoung Chung; Licheng Fang; Daniel Gildea; Daniel Stefankovic

We study the problem of sampling trees from forests, in the setting where probabilities for each tree may be a function of arbitrarily large tree fragments. This setting extends recent work for sampling to learn Tree Substitution Grammars to the case where the tree structure (TSG derived tree) is not fixed. We develop a Markov chain Monte Carlo algorithm which corrects for the bias introduced by unbalanced forests, and we present experiments using the algorithm to learn Synchronous Context-Free Grammar rules for machine translation. In this application, the forests being sampled represent the set of Hiero-style rules that are consistent with fixed input word-level alignments. We demonstrate equivalent machine translation performance to standard techniques but with much smaller grammars.


empirical methods in natural language processing | 2010

Effects of Empty Categories on Machine Translation

Tagyoung Chung; Daniel Gildea


north american chapter of the association for computational linguistics | 2010

Factors Affecting the Accuracy of Korean Parsing

Tagyoung Chung; Matt Post; Daniel Gildea


meeting of the association for computational linguistics | 2011

Issues Concerning Decoding with Synchronous Context-free Grammar

Tagyoung Chung; Licheng Fang; Daniel Gildea


north american chapter of the association for computational linguistics | 2012

Tuning as Linear Regression

Marzieh Bazrafshan; Tagyoung Chung; Daniel Gildea


international conference on information technology: new generations | 2011

Dynamic Item Recommendation by Topic Modeling for Social Networks

Sang Su Lee; Tagyoung Chung; Dennis McLeod


workshop on statistical machine translation | 2012

Direct Error Rate Minimization for Statistical Machine Translation

Tagyoung Chung; Michel Galley


meeting of the association for computational linguistics | 2011

Terminal-Aware Synchronous Binarization

Licheng Fang; Tagyoung Chung; Daniel Gildea


IWSLT | 2011

SCFG latent annotation for machine translation.

Tagyoung Chung; Licheng Fang; Daniel Gildea

Collaboration


Dive into the Tagyoung Chung's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Licheng Fang

University of Rochester

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dennis McLeod

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Karolina Owczarzak

National Institute of Standards and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge