Ahmed El Kholy
Columbia University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ahmed El Kholy.
Machine Translation | 2012
Ahmed El Kholy; Nizar Habash
Much of the work on statistical machine translation (SMT) from morphologically rich languages has shown that morphological tokenization and orthographic normalization help improve SMT quality because of the sparsity reduction they contribute. In this article, we study the effect of these processes on SMT when translating into a morphologically rich language, namely Arabic. We explore a space of tokenization schemes and normalization options. We also examine a set of six detokenization techniques and evaluate on detokenized and orthographically correct (enriched) output. Our results show that the best performing tokenization scheme is that of the Penn Arabic Treebank. Additionally, training on orthographically normalized (reduced) text then jointly enriching and detokenizing the output outperforms training on enriched text.
language resources and evaluation | 2014
Arfath Pasha; Mohamed Al-Badrashiny; Mona T. Diab; Ahmed El Kholy; Ramy Eskander; Nizar Habash; Manoj Pooleery; Owen Rambow; Ryan M. Roth
meeting of the association for computational linguistics | 2013
Ahmed El Kholy; Nizar Habash; Gregor Leusch; Hassan Sawaf
Archive | 2012
Ahmed El Kholy; Nizar Habash
Archive | 2011
Ahmed El Kholy; Nizar Habash
workshop on statistical machine translation | 2011
Yuval Marton; Ahmed El Kholy; Nizar Habash
international joint conference on natural language processing | 2013
Ahmed El Kholy; Nizar Habash; Gregor Leusch; Hassan Sawaf
international conference on natural language generation | 2012
Ahmed El Kholy; Nizar Habash
international joint conference on natural language processing | 2013
Mohammad Sadegh Rasooli; Ahmed El Kholy; Nizar Habash
arXiv: Computation and Language | 2016
Ahmed El Kholy; Nizar Habash