Publication


Featured research published by Majdi Sawalha.


international conference on communications | 2013

SALMA: Standard Arabic Language Morphological Analysis

Majdi Sawalha; Eric Atwell; Mohammad A. M. Abushariah

Morphological analyzers are preprocessors for text analysis, and many text analytics applications depend on them. This paper reviews the SALMA-Tools (Standard Arabic Language Morphological Analysis) [1]. The SALMA-Tools are a collection of open-source standards, tools and resources that widen the scope of Arabic word structure analysis, particularly morphological analysis, to process Arabic text corpora of different domains, formats and genres, in both vowelized and non-vowelized text. Tag assignment is significantly more complex for Arabic than for many languages. The morphological analyzer should add the appropriate linguistic information to each part or morpheme of the word (proclitic, prefix, stem, suffix and enclitic); in effect, instead of one tag per word, we need a subtag for each part. Very fine-grained distinctions may cause problems for automatic morphosyntactic analysis, particularly for probabilistic taggers that require training data, if some words can change grammatical tag depending on function and context; on the other hand, fine-grained distinctions may actually help to disambiguate other words in the local context. The SALMA-Tagger is a fine-grained morphological analyzer that depends mainly on linguistic information extracted from traditional Arabic grammar books and on a prior-knowledge broad-coverage lexical resource, the SALMA-ABCLexicon. More fine-grained tag sets may be more appropriate for some tasks. The SALMA-Tag Set is a standard tag set for encoding, which captures long-established traditional fine-grained morphological features of Arabic in a notation format intended to be compact yet transparent.
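The idea of one subtag per word part, described in the abstract above, can be illustrated with a small data structure. The following is only a sketch under stated assumptions: the class names `Morpheme` and `AnalyzedWord`, the simplified romanization, and the subtag labels (`CONJ`, `FUT`, and so on) are all hypothetical and are not the SALMA-Tag Set notation or the SALMA-Tagger's API. Only the five slot names come from the abstract.

```python
from dataclasses import dataclass
from typing import List

# The five word parts named in the abstract, in surface order.
SLOTS = ("proclitic", "prefix", "stem", "suffix", "enclitic")


@dataclass(frozen=True)
class Morpheme:
    form: str    # surface form of this word part (simplified romanization)
    slot: str    # one of SLOTS
    subtag: str  # hypothetical label, NOT an actual SALMA tag

    def __post_init__(self):
        if self.slot not in SLOTS:
            raise ValueError(f"unknown slot: {self.slot}")


@dataclass(frozen=True)
class AnalyzedWord:
    morphemes: List[Morpheme]

    def __post_init__(self):
        # Parts must appear in slot order; a slot may repeat (e.g. two proclitics).
        order = [SLOTS.index(m.slot) for m in self.morphemes]
        if order != sorted(order):
            raise ValueError("morphemes are out of slot order")
        if sum(m.slot == "stem" for m in self.morphemes) != 1:
            raise ValueError("expected exactly one stem")

    def surface(self) -> str:
        """Rebuild the full word by concatenating its parts."""
        return "".join(m.form for m in self.morphemes)

    def tag(self) -> str:
        """Join the per-part subtags into one word-level tag string."""
        return "+".join(m.subtag for m in self.morphemes)


# Illustrative segmentation of "wa-sa-ya-ktub-uuna-haa" ("and they will write it").
word = AnalyzedWord([
    Morpheme("wa", "proclitic", "CONJ"),
    Morpheme("sa", "proclitic", "FUT"),
    Morpheme("ya", "prefix", "IMPF"),
    Morpheme("ktub", "stem", "VERB"),
    Morpheme("uuna", "suffix", "SUBJ_3MP"),
    Morpheme("haa", "enclitic", "OBJ_3FS"),
])

print(word.surface())  # wasayaktubuunahaa
print(word.tag())      # CONJ+FUT+IMPF+VERB+SUBJ_3MP+OBJ_3FS
```

Keeping the subtags attached to individual parts, and joining them only on request, means a downstream tool can read either the whole-word tag or the analysis of a single clitic.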


international conference on computational linguistics | 2008

Comparative Evaluation of Arabic Language Morphological Analysers and Stemmers

Majdi Sawalha; Eric Atwell


Archive | 2011

An artificial intelligence approach to Arabic and Islamic content on the internet

Eric Atwell; Claire Brierley; Kais Dukes; Majdi Sawalha; Abdul-Baquee M. Sharaf


language resources and evaluation | 2010

Fine-grain morphological analyzer and part-of-speech tagger for Arabic text

Majdi Sawalha; Eric Atwell


language resources and evaluation | 2012

Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing

Claire Brierley; Majdi Sawalha; Eric Atwell


language resources and evaluation | 2010

Constructing and using broad-coverage lexical resource for enhancing morphological analysis of Arabic

Majdi Sawalha; Eric Atwell


Journal of Semitic Studies | 2016

A Verified Arabic-IPA Mapping for Arabic Transcription Technology, Informed by Quranic Recitation, Traditional Arabic Linguistics, and Modern Phonetics

Claire Brierley; Majdi Sawalha; Barry Heselwood; Eric Atwell


Archive | 2014

A proposed model for Quranic Arabic WordNet

Manal AlMaayah; Majdi Sawalha; Mohammad A. M. Abushariah


language resources and evaluation | 2012

Predicting Phrase Breaks in Classical and Modern Standard Arabic Text

Majdi Sawalha; Claire Brierley; Eric Atwell


Archive | 2014

Automatically generated, phonemic Arabic-IPA pronunciation tiers for the boundary annotated Qur’an dataset for machine learning (version 2.0)

Majdi Sawalha; Claire Brierley; Eric Atwell
