Steven Neale
University of Lisbon
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Steven Neale.
processing of the portuguese language | 2016
João António Rodrigues; António Branco; Steven Neale; João Ricardo Silva
In this article we describe the creation and distribution of the first publicly available word embeddings for Portuguese. Our embeddings are evaluated on their own and also compared with the original English models on a well-known analogy task. We gathered a large Portuguese corpus of 1.7 billion tokens, developed the first distributional semantic analogies test set for Portuguese, and proceeded with the first parametrization and evaluation of Portuguese word embeddings models.
meeting of the association for computational linguistics | 2016
Rosa Del Gaudio; Gorka Labaka; Eneko Agirre; Petya Osenova; Kiril Simov; Martin Popel; Dieke Oele; Gertjan van Noord; Luís Gomes; João António Rodrigues; Steven Neale; João Ricardo Silva; Andreia Querido; Nuno Rendeiro; António Branco
This paper presents the description of 12 systems submitted to the WMT16 IT-task, covering six different languages, namely Basque, Bulgarian, Dutch, Czech, Portuguese and Spanish. All these systems were developed under the scope of the QTLeap project, presenting a common strategy. For each language two different systems were submitted, namely a phrasebased MT system built using Moses, and a system exploiting deep language engineering approaches, that in all the languages but Bulgarian was implemented using TectoMT. For 4 of the 6 languages, the TectoMT-based system performs better than the Moses-based one.
processing of the portuguese language | 2016
João António Rodrigues; Luís Gomes; Steven Neale; Andreia Querido; Nuno Rendeiro; Sanja Štajner; João Ricardo Silva; António Branco
Machine translation (MT) from English to Portuguese has not typically received much attention in existing research. In this paper, we focus on MT from English to Portuguese for the specific domain of information technology (IT), building a small in-domain parallel corpus to address the lack of IT-specific and publicly-available parallel corpora and then adapted an existing hybrid MT system to the new language pair (English to Portuguese). We further improved the initial version of the EN-PT hybrid system by adding various modules to address the most frequently occurring errors in the initial system. In order to assess the improvements achieved by each of these dedicated modules, we compared all versions of our MT system automatically. In addition, we conduct and report on a detailed error analysis of the initial and final versions of our system.
processing of the portuguese language | 2016
Steven Neale; Rita Valadas Pereira; João Ricardo Silva; António Branco
The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them.
Lecture Notes in Artificial Intelligence | 2016
Steven Neale; Rita Valadas; João de Abreu e Silva; António Branco
Autism Spectrum Disorder (ASD) often comprises difficulties in the acquisition of communication and language skills. Several researchers and companies have developed software to help individuals with ASD developing those skills; however, there is a lack of applications in Portuguese that are tailored to the individual needs of each child. In this context, we present VITHEA-Kids, a platform where caregivers can create exercises and customize the interaction between each child and the platform. We also developed a module for the automatic generation of multiple choice exercises, meant to be integrated in VITHEA-Kids. We evaluated this work with caregivers (which provided promising indicators), with a child (ongoing upon thesis delivery) and we also evaluated the generation of incorrect answers in multiple choice exercises (achieving acceptance rates between 61.11 % and 92.22 %).
language resources and evaluation | 2016
Steven Neale; Luís Gomes; Eneko Agirre; Oier Lopez de Lacalle; António Branco
language resources and evaluation | 2016
Arantxa Otegi; Nora Aranberri; António Branco; Jan Hajic; Martin Popel; Kiril Simov; Eneko Agirre; Petya Osenova; Rita Valadas Pereira; João Ricardo Silva; Steven Neale
language resources and evaluation | 2018
Steven Neale; Kevin Donnelly; Gareth Watkins; Dawn Knight
language resources and evaluation | 2018
Steven Neale
Archive | 2017
Steven Neale; Irena Spasic; Jennifer Needs; Gareth Watkins; Steve Morris; Teresa Fitzpatrick; Lindsay Marshall; Dawn Knight