Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Chenchen Ding is active.

Publication


Featured researches published by Chenchen Ding.


acm transactions on asian and low resource language information processing | 2016

Inter-, Intra-, and Extra-Chunk Pre-Ordering for Statistical Japanese-to-English Machine Translation

Chenchen Ding; Keisuke Sakanushi; Hirona Touji; Mikio Yamamoto

A rule-based pre-ordering approach is proposed for statistical Japanese-to-English machine translation using the dependency structure of source-side sentences. A Japanese sentence is pre-ordered to an English-like order at the morpheme level for a statistical machine translation system during the training and decoding phase to resolve the reordering problem. In this article, extra-chunk pre-ordering of morphemes is proposed, which allows Japanese functional morphemes to move across chunk boundaries. This contrasts with the intra-chunk reordering used in previous approaches, which restricts the reordering of morphemes within a chunk. Linguistically oriented discussions show that correct pre-ordering cannot be realized without extra-chunk movement of morphemes. The proposed approach is compared with five rule-based pre-ordering approaches designed for Japanese-to-English translation and with a language independent statistical pre-ordering approach on a standard patent dataset and on a news dataset obtained by crawling Internet news sites. Two state-of-the-art statistical machine translation systems, one phrase-based and the other hierarchical phrase-based, are used in experiments. Experimental results show that the proposed approach outperforms the compared approaches on automatic reordering measures (Kendall’s τ, Spearman’s ρ, fuzzy reordering score, and test set RIBES) and on the automatic translation precision measure of test set BLEU score.


international conference on audio, language and image processing | 2014

To filter discontinuous word alignment for statistical machine translationaper

Chenchen Ding; Mikio Yamamoto

We propose a language-independent approach to clean up word alignment errors in an aligned parallel corpus, which are caused by the unsupervised word-align process. In such an aligned corpus, we evaluate the alignment patterns of one-to-many discontinuous words by statistical measures of collocation. The alignment of discontinuous words without strong collocation tendencies will be taken as errors and deleted. We conduct experiments on two-directional Japanese-English and German-English translation tasks. The experiment results show the state-of-the-art word alignment filtered by the proposed approach can lead to a better translation performance.


conference of the european chapter of the association for computational linguistics | 2014

Dependency Tree Abstraction for Long-Distance Reordering in Statistical Machine Translation

Chenchen Ding; Yuki Arase

Word reordering is a crucial technique in statistical machine translation in which syntactic information plays an important role. Synchronous context-free grammar has typically been used for this purpose with various modifications for adding flexibilities to its synchronized tree generation. We permit further flexibilities in the synchronous context-free grammar in order to translate between languages with drastically different word order. Our method pre-processes a parallel corpus by


Archive | 2014

Empirical Dependency-Based Head Finalization for Statistical Chinese-, English-, and French-to-Myanmar (Burmese) Machine Translation

Chenchen Ding; Ye Kyaw Thu; Masao Utiyama; Andrew M. Finch; Eiichiro Sumita


international joint conference on natural language processing | 2013

An Unsupervised Parameter Estimation Algorithm for a Generative Dependency N-gram Language Model

Chenchen Ding; Mikio Yamamoto


IWSLT | 2011

Long-distance hierarchical structure transformation rules utilizing function words.

Chenchen Ding; Takashi Inui; Mikio Yamamoto


Journal of Information Processing | 2014

A Generative Dependency N-gram Language Model: Unsupervised Parameter Estimation and Application

Chenchen Ding; Mikio Yamamoto


meeting of the association for computational linguistics | 2018

Simplified Abugidas

Chenchen Ding; Masao Utiyama; Eiichiro Sumita


international conference on computational linguistics | 2016

Similar Southeast Asian Languages: Corpus-Based Case Study on Thai-Laotian and Malay-Indonesian.

Chenchen Ding; Masao Utiyama; Eiichiro Sumita


情報科学技術フォーラム講演論文集 | 2014

E-020 An Improvement in Quantifiers Movement of Head Finalization Reordering for E-J Machine Translation

Masanori Taniguchi; Chenchen Ding; Mikio Yamamoto

Collaboration


Dive into the Chenchen Ding's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Eiichiro Sumita

National Institute of Information and Communications Technology

View shared research outputs
Top Co-Authors

Avatar

Masao Utiyama

National Institute of Information and Communications Technology

View shared research outputs
Top Co-Authors

Avatar

Andrew M. Finch

National Institute of Information and Communications Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Takashi Inui

Tokyo Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Ye Kyaw Thu

Okayama Prefectural University

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge