Lamia Tounsi
Dublin City University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Lamia Tounsi.
international conference on computational linguistics | 2014
Joachim Wagner; Piyush Arora; Santiago Cortes; Utsab Barman; Dasha Bogdanova; Jennifer Foster; Lamia Tounsi
We describe the work carried out by DCU on the Aspect Based Sentiment Analysis task at SemEval 2014. Our team submitted one constrained run for the restaurant domain and one for the laptop domain for sub-task B (aspect term polarity prediction), ranking highest out of 36 systems on the restaurant test set and joint highest out of 32 systems on the laptop test set.
systems and frameworks for computational morphology | 2011
Mohammed Attia; Pavel Pecina; Antonio Toral; Lamia Tounsi; Josef van Genabith
Current Arabic lexicons, whether computational or otherwise, make no distinction between entries from Modern Standard Arabic (MSA) and Classical Arabic (CA), and tend to include obsolete words that are not attested in current usage. We address this problem by building a large-scale, corpus-based lexical database that is representative of MSA. We use an MSA corpus of 1,089,111,204 words, a pre-annotation tool, machine learning techniques, and knowledge-based templatic matching to automatically acquire and filter lexical knowledge about morpho-syntactic attributes and inflection paradigms. Our lexical database is scalable, interoperable and suitable for constructing a morphological analyser, regardless of the design approach and programming language used. The database is formatted according to the international ISO standard in lexical resource representation, the Lexical Markup Framework (LMF). This lexical database is used in developing an open-source finite-state morphological processing toolkit. We build a web application, AraComLex (Arabic Computer Lexicon), for managing and curating the lexical database.
asia information retrieval symposium | 2011
Iman Saleh; Lamia Tounsi; Josef van Genabith
In this paper we investigate automatic identification of Arabic temporal and numerical expressions. The objectives of this paper are 1) to describe ZamAn , a machine learning method we have developed to label Arabic temporals, processing the functional dashtag -TMP used in the Arabic treebank to mark a temporal modifier which represents a reference to a point in time or a span of time, and 2) to present Raqm , a machine learning method applied to identify different forms of numerical expressions in order to normalise them into digits. We present a series of experiments evaluating how well ZamAn (resp. Raqm ) copes with the enriched Arabic data achieving state-of-the-art results of F1-measure of 88.5% (resp. 96%) for bracketing and 73.1% (resp. 94.4%) for detection.
Proceedings of the First Celtic Language Technology Workshop | 2014
Teresa Lynn; Jennifer Foster; Mark Dras; Lamia Tounsi
We present a study of cross-lingual direct transfer parsing for the Irish language. Firstly we discuss mapping of the annotation scheme of the Irish Dependency Treebank to a universal dependency scheme. We explain our dependency label mapping choices and the structural changes required in the Irish Dependency Treebank. We then experiment with the universally annotated treebanks of ten languages from four language family groups to assess which languages are the most useful for cross-lingual parsing of Irish by using these treebanks to train delexicalised parsing models which are then applied to sentences from the Irish Dependency Treebank. The best results are achieved when using Indonesian, a language from the Austronesian language family.
algorithmic decision theory | 2013
Léa Amandine Deleris; Stéphane Deparis; Bogdan Sacaleanu; Lamia Tounsi
By exploiting advances in natural language processing, we believe that information contained in unstructured texts can be leveraged to facilitate risk modeling and decision support in healthcare. In this paper, we present our initial investigations into dependence relation extraction and aggregation into a Bayesian Belief Network structure. Our results are based on a corpus composed of MEDLINE® abstracts dealing with breast cancer risk factors.
international conference natural language processing | 2005
Lamia Tounsi; Denis Maurel; B. Beatrice
This paper we present a new method to detect and compute a set of sub structures of an automaton. This method is applied through a search algorithm for sub automata recognition and used in natural language processing (NLP) applications such as dictionaries. This algorithm is based on the notion of height and cardinality of states; it visits the states of a minimal deterministic finite state automaton in a depth first order where each state is inspected once.
north american chapter of the association for computational linguistics | 2010
Reut Tsarfaty; Djamé Seddah; Yoav Goldberg; Sandra Kuebler; Yannick Versley; Marie Candito; Jennifer Foster; Ines Rehbein; Lamia Tounsi
language resources and evaluation | 2010
Mohammed Attia; Antonio Toral; Lamia Tounsi; Pavel Pecina; Josef van Genabith
north american chapter of the association for computational linguistics | 2010
Mohammed Attia; Jennifer Foster; Deirdre Hogan; Joseph Le Roux; Lamia Tounsi; Josef van Genabith
language resources and evaluation | 2010
Mohammed Attia; Antonio Toral; Lamia Tounsi; Monica Monachini; Josef van Genabith