Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Mohammad Salameh is active.

Publication


Featured researches published by Mohammad Salameh.


north american chapter of the association for computational linguistics | 2015

Sentiment after Translation: A Case-Study on Arabic Social Media Posts

Mohammad Salameh; Saif M. Mohammad; Svetlana Kiritchenko

When text is translated from one language into another, sentiment is preserved to varying degrees. In this paper, we use Arabic social media posts as stand-in for source language text, and determine loss in sentiment predictability when they are translated into English, manually and automatically. As benchmarks, we use manually and automatically determined sentiment labels of the Arabic texts. We show that sentiment analysis of English translations of Arabic texts produces competitive results, w.r.t. Arabic sentiment analysis. We discover that even though translation significantly reduces the human ability to recover sentiment, automatic sentiment systems are still able to capture sentiment information from the translations.


Journal of Artificial Intelligence Research | 2016

How translation alters sentiment

Saif M. Mohammad; Mohammad Salameh; Svetlana Kiritchenko

Sentiment analysis research has predominantly been on English texts. Thus there exist many sentiment resources for English, but less so for other languages. Approaches to improve sentiment analysis in a resource-poor focus language include: (a) translate the focus language text into a resource-rich language such as English, and apply a powerful English sentiment analysis system on the text, and (b) translate resources such as sentiment labeled corpora and sentiment lexicons from English into the focus language, and use them as additional resources in the focus-language sentiment analysis system. In this paper we systematically examine both options. We use Arabic social media posts as stand-in for the focus language text. We show that sentiment analysis of English translations of Arabic texts produces competitive results, w.r.t. Arabic sentiment analysis. We show that Arabic sentiment analysis systems benefit from the use of automatically translated English sentiment lexicons. We also conduct manual annotation studies to examine why the sentiment of a translation is different from the sentiment of the source word or text. This is especially relevant for building better automatic translation systems. In the process, we create a state-of-the-art Arabic sentiment analysis system, a new dialectal Arabic sentiment lexicon, and the first Arabic-English parallel corpus that is independently annotated for sentiment by Arabic and English speakers.


north american chapter of the association for computational linguistics | 2016

SemEval-2016 Task 7: Determining Sentiment Intensity of English and Arabic Phrases

Svetlana Kiritchenko; Saif M. Mohammad; Mohammad Salameh

We present a shared task on automatically determining sentiment intensity of a word or a phrase. The words and phrases are taken from three domains: general English, English Twitter, and Arabic Twitter. The phrases include those composed of negators, modals, and degree adverbs as well as phrases formed by words with opposing polarities. For each of the three domains, we assembled the datasets that include multi-word phrases and their constituent words, both manually annotated for real-valued sentiment intensity scores. The three datasets were presented as the test sets for three separate tasks (each focusing on a specific domain). Five teams submitted nine system outputs for the three tasks. All datasets created for this shared task are freely available to the research community.


meeting of the association for computational linguistics | 2014

Lattice Desegmentation for Statistical Machine Translation

Mohammad Salameh; Colin Cherry; Grzegorz Kondrak

Morphological segmentation is an effective sparsity reduction strategy for statistical machine translation (SMT) involving morphologically complex languages. When translating into a segmented language, an extra step is required to desegment the output; previous studies have desegmented the 1-best output from the decoder. In this paper, we expand our translation options by desegmentingn-best lists or lattices. Our novel lattice desegmentation algorithm effectively combines both segmented and desegmented views of the target language for a large subspace of possible translation outputs, which allows for inclusion of features related to the desegmentation process, as well as an unsegmented language model (LM). We investigate this technique in the context of English-to-Arabic and English-to-Finnish translation, showing significant improvements in translation quality over desegmentation of 1-best decoder outputs.


meeting of the association for computational linguistics | 2015

Multiple System Combination for Transliteration

Garrett Nicolai; B. D. Hauer; Mohammad Salameh; Adam St Arnaud; Ying Xu; Lei Yao; Grzegorz Kondrak

We report the results of our experiments in the context of the NEWS 2015 Shared Task on Transliteration. We focus on methods of combining multiple base systems, and leveraging transliterations from multiple languages. We show error reductions over the best base system of up to 10% when using supplemental transliterations, and up to 20% when using system combination. We also discuss the quality of the shared task datasets.


north american chapter of the association for computational linguistics | 2015

What Matters Most in Morphologically Segmented SMT Models

Mohammad Salameh; Colin Cherry; Grzegorz Kondrak

Morphological segmentation is an effective strategy for addressing difficulties caused by morphological complexity. In this study, we use an English-to-Arabic test bed to determine what steps and components of a phrase-based statistical machine translation pipeline benefit the most from segmenting the target language. We test several scenarios that differ primarily in when desegmentation is applied, showing that the most important criterion for success in segmentation is to allow the system to build target words from morphemes that span phrase boundaries. We also investigate the impact of segmented and unsegmented target language models (LMs) on translation quality. We show that an unsegmented LM is helpful according to BLEU score, but also leads to a drop in the overall usage of compositional morphology, bringing it to well below the amount observed in human references.


north american chapter of the association for computational linguistics | 2016

Integrating Morphological Desegmentation into Phrase-based Decoding

Mohammad Salameh; Colin Cherry; Grzegorz Kondrak

When translating into a morphologically complex language, segmenting the target language can reduce data sparsity, while introducing the complication of desegmenting the system output. We present a method for decoderintegrated desegmentation, allowing features that consider the desegmented target, such as a word-level language model, to be considered throughout the entire search space. Our results on a large-scale, English to Arabic translation task show significant improvement over the 1-best desegmentation baseline.


north american chapter of the association for computational linguistics | 2018

SemEval-2018 Task 1: Affect in Tweets.

Saif M. Mohammad; Felipe Bravo-Marquez; Mohammad Salameh; Svetlana Kiritchenko


workshop on innovative use of nlp for building educational applications | 2013

Cognate and Misspelling Features for Natural Language Identification

Garrett Nicolai; B. D. Hauer; Mohammad Salameh; Lei Yao; Grzegorz Kondrak


language resources and evaluation | 2016

Sentiment Lexicons for Arabic Social Media.

Saif M. Mohammad; Mohammad Salameh; Svetlana Kiritchenko

Collaboration


Dive into the Mohammad Salameh's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Colin Cherry

National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lei Yao

University of Alberta

View shared research outputs
Top Co-Authors

Avatar

Hind Saddiki

New York University Abu Dhabi

View shared research outputs
Top Co-Authors

Avatar

Nasser Zalmout

New York University Abu Dhabi

View shared research outputs
Top Co-Authors

Avatar

Nizar Habash

New York University Abu Dhabi

View shared research outputs
Researchain Logo
Decentralizing Knowledge