Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Xianchao Wu is active.

Publication


Featured researches published by Xianchao Wu.


north american chapter of the association for computational linguistics | 2009

Semi-Supervised Lexicon Mining from Parenthetical Expressions in Monolingual Web Pages

Xianchao Wu; Naoaki Okazaki; Jun’ichi Tsujii

This paper presents a semi-supervised learning framework for mining Chinese-English lexicons from large amount of Chinese Web pages. The issue is motivated by the observation that many Chinese neologisms are accompanied by their English translations in the form of parenthesis. We classify parenthetical translations into bilingual abbreviations, transliterations, and translations. A frequency-based term recognition approach is applied for extracting bilingual abbreviations. A self-training algorithm is proposed for mining transliteration and translation lexicons. In which, we employ available lexicons in terms of morpheme levels, i.e., phoneme correspondences in transliteration and grapheme (e.g., suffix, stem, and prefix) correspondences in translation. The experimental results verified the effectiveness of our approaches.


ACM Transactions on Asian Language Information Processing | 2013

Syntax-Based Post-Ordering for Efficient Japanese-to-English Translation

Katsuhito Sudoh; Xianchao Wu; Kevin Duh; Hajime Tsukada; Masaaki Nagata

This article proposes a novel reordering method for efficient two-step Japanese-to-English statistical machine translation (SMT) that isolates reordering from SMT and solves it after lexical translation. This reordering problem, called post-ordering, is solved as an SMT problem from Head-Final English (HFE) to English. HFE is syntax-based reordered English that is very successfully used for reordering with English-to-Japanese SMT. The proposed method incorporates its advantage into the reverse direction, Japanese-to-English, and solves the post-ordering problem by accurate syntax-based SMT with target language syntax. Two-step SMT with the proposed post-ordering empirically reduces the decoding time of the accurate but slow syntax-based SMT by its good approximation using intermediate HFE. The proposed method improves the decoding speed of syntax-based SMT decoding by about six times with comparable translation accuracy in Japanese-to-English patent translation experiments.


Machine Translation | 2010

Improve syntax-based translation using deep syntactic structures

Xianchao Wu; Takuya Matsuzaki; Jun’ichi Tsujii

This paper introduces deep syntactic structures to syntax-based Statistical Machine Translation (SMT). We use a Head-driven Phrase Structure Grammar (HPSG) parser to obtain the deep syntactic structures of a sentence, which include not only a fine-grained syntactic property description but also a semantic representation. Considering the abundant information included in the deep syntactic structures, it is interesting to investigate whether or not they improve the traditional syntax-based translation models based on PCFG parsers. In order to use deep syntactic structures for SMT, this paper focuses on extracting tree-to-string translation rules from aligned HPSG tree–string pairs. The major challenge is to properly localize the non-local relations among nodes in an HPSG tree. To localize the semantic dependencies among words and phrases, which can be inherently non-local, a minimum covering tree is defined by taking a predicate word and its lexical/phrasal arguments as the frontier nodes. Starting from this definition, a linear-time algorithm is proposed to extract translation rules through one-time traversal of the leaf nodes in an HPSG tree. Extensive experiments on a tree-to-string translation system testified the effectiveness of our proposal.


Archive | 2011

Post-ordering in Statistical Machine Translation

Katsuhito Sudoh; Xianchao Wu; Kevin Duh; Haji; Eri Tsukada; Masaaki Nagata


international joint conference on natural language processing | 2011

Extracting Pre-ordering Rules from Predicate-Argument Structures

Xianchao Wu; Katsuhito Sudoh; Kevin Duh; Hajime Tsukada; Masaaki Nagata


NTCIR | 2011

NTT-UT Statistical Machine Translation in NTCIR-9 PatentMT

Katsuhito Sudoh; Kevin Duh; Hajime Tsukada; Masaaki Nagata; Xianchao Wu; Takuya Matsuzaki; Jun’ichi Tsujii


meeting of the association for computational linguistics | 2010

Fine-Grained Tree-to-String Translation Rule Extraction

Xianchao Wu; Takuya Matsuzaki; Jun’ichi Tsujii


meeting of the association for computational linguistics | 2012

Head Finalization Reordering for Chinese-to-Japanese Machine Translation

Dan Han; Katsuhito Sudoh; Xianchao Wu; Kevin Duh; Hajime Tsukada; Masaaki Nagata


meeting of the association for computational linguistics | 2012

Learning to Translate with Multiple Objectives

Kevin Duh; Katsuhito Sudoh; Xianchao Wu; Hajime Tsukada; Masaaki Nagata


meeting of the association for computational linguistics | 2011

Effective Use of Function Words for Rule Generalization in Forest-Based Translation

Xianchao Wu; Takuya Matsuzaki; Jun’ichi Tsujii

Collaboration


Dive into the Xianchao Wu's collaboration.

Top Co-Authors

Avatar

Masaaki Nagata

Nippon Telegraph and Telephone

View shared research outputs
Top Co-Authors

Avatar

Hajime Tsukada

Nippon Telegraph and Telephone

View shared research outputs
Top Co-Authors

Avatar

Katsuhito Sudoh

Nippon Telegraph and Telephone

View shared research outputs
Top Co-Authors

Avatar

Kevin Duh

Nara Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Takuya Matsuzaki

National Institute of Informatics

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dan Han

Graduate University for Advanced Studies

View shared research outputs
Top Co-Authors

Avatar

Yusuke Miyao

National Institute of Informatics

View shared research outputs
Researchain Logo
Decentralizing Knowledge