Ossama Obeid
Carnegie Mellon University
Publication
Featured research published by Ossama Obeid.
empirical methods in natural language processing | 2014
Behrang Mohit; Alla Rozovskaya; Nizar Habash; Wajdi Zaghouani; Ossama Obeid
We present a summary of the first shared task on automatic text correction for Arabic text. The shared task received 18 system submissions from nine teams in six countries and represented a diversity of approaches. Our report includes an overview of the QALB corpus, which was the source of the datasets used for training and evaluation, an overview of participating systems, results of the competition, and an analysis of the results and systems.
meeting of the association for computational linguistics | 2015
Alla Rozovskaya; Houda Bouamor; Nizar Habash; Wajdi Zaghouani; Ossama Obeid; Behrang Mohit
We present a summary of QALB-2015, the second shared task on automatic text correction of Arabic texts. The shared task extends QALB-2014, which focused on correcting errors in Arabic texts produced by native speakers of Arabic. The competition this year, in addition to native data, includes texts produced by learners of Arabic as a foreign language. The report includes an overview of the QALB corpus, which is the dataset used for training and evaluation, an overview of participating systems, results of the competition and an analysis of the results and systems.
meeting of the association for computational linguistics | 2015
Houda Bouamor; Wajdi Zaghouani; Mona T. Diab; Ossama Obeid; Kemal Oflazer; Mahmoud Ghoneim; Abdelati Hawwari
Arabic script writing is typically underspecified for short vowels and other markup, referred to as diacritics. Apart from the lexical ambiguity found in words, similar to that exhibited in other languages, the lack of diacritics in written Arabic script adds another layer of ambiguity, which is an artifact of the orthography. Diacritization of written text has a significant impact on Arabic NLP applications. In this paper, we present a pilot study on building a diacritized multi-genre corpus in Arabic. We annotate a sample of non-diacritized words extracted from five text genres. We explore different annotation strategies: Basic, where we present only the bare undiacritized forms to the annotators; Intermediate (Basic forms + their POS tags); and Advanced (automatically diacritized words). We present the impact of the annotation strategy on annotation quality. Moreover, we study different diacritization schemes in the process.
language resources and evaluation | 2014
Wajdi Zaghouani; Behrang Mohit; Nizar Habash; Ossama Obeid; Nadi Tomeh; Alla Rozovskaya; Noura Farra; Sarah Alkuhlani; Kemal Oflazer
international joint conference on natural language processing | 2013
Ossama Obeid; Wajdi Zaghouani; Behrang Mohit; Nizar Habash; Kemal Oflazer; Nadi Tomeh
language resources and evaluation | 2016
Wajdi Zaghouani; Houda Bouamor; Abdelati Hawwari; Mona T. Diab; Ossama Obeid; Mahmoud Ghoneim; Sawsan Alqahtani; Kemal Oflazer
language resources and evaluation | 2016
Wajdi Zaghouani; Nizar Habash; Ossama Obeid; Behrang Mohit; Houda Bouamor; Kemal Oflazer
Archive | 2015
Houda Bouamor; Wajdi Zaghouani; Mona T. Diab; Ossama Obeid; Kemal Oflazer; Mahmoud Ghoneim; Abdelati Hawwari
language resources and evaluation | 2018
Houda Bouamor; Nizar Habash; Mohammad Salameh; Wajdi Zaghouani; Owen Rambow; Dana Abdulrahim; Ossama Obeid; Salam Khalifa; Fadhl Eryani; Alexander Erdmann; Kemal Oflazer
language resources and evaluation | 2018
Ossama Obeid; Salam Khalifa; Nizar Habash; Houda Bouamor; Wajdi Zaghouani; Kemal Oflazer