Publication


Featured research published by Aingeru Mayor.


International Conference on Computational Linguistics | 2009

Transfer-Based MT from Spanish into Basque: Reusability, Standardization and Open Source

Iñaki Alegria; Arantza Díaz de Ilarraza; Gorka Labaka; Mikel Lersundi; Aingeru Mayor; Kepa Sarasola

We present an open architecture that we designed in a project for rule-based machine translation from Spanish into Basque. The main objective has been the construction of an open, reusable and interoperable framework that can be improved in the near future by combining it with the statistical model. The MT architecture reuses several open tools and is based on a unique XML format for the flow between the different modules, which eases the interaction among different developers of tools and resources. Since Basque is a resource-poor language, this is a key feature in our aim for future improvements and extensions of the engine. The result is open-source software that can be downloaded from matxin.sourceforge.net, and we think it could be adapted to translate between other languages with few resources.


Machine Translation | 2011

Matxin, an open-source rule-based machine translation system for Basque

Aingeru Mayor; Iñaki Alegria; Arantza Díaz de Ilarraza; Gorka Labaka; Mikel Lersundi; Kepa Sarasola

We present the first publicly available machine translation (MT) system for Basque. The fact that Basque is both a morphologically rich and less-resourced language makes the use of statistical approaches difficult, and raises the need to develop a rule-based architecture which can be combined in the future with statistical techniques. The MT architecture proposed reuses several open-source tools and is based on a unique XML format to facilitate the flow between the different modules, which eases the interaction among different developers of tools and resources. The result is the rule-based Matxin MT system, an open-source toolkit, whose first implementation translates from Spanish to Basque. We have performed innovative work on the following tasks: construction of a dependency analyser for Spanish, use of rich linguistic information to translate prepositions and syntactic functions (such as subject and object markers), construction of an efficient module for verbal chunk transfer, and design and implementation of modules for ordering words and phrases, independently of the source language.


Finite-State Methods and Natural Language Processing | 2005

An FST grammar for verb chain transfer in a Spanish-Basque MT system

Iñaki Alegria; Arantza Díaz de Ilarraza; Gorka Labaka; Mikel Lersundi; Aingeru Mayor; Kepa Sarasola

We are developing a Spanish-Basque MT system using the traditional transfer model, based on shallow and dependency parsing. The project builds on the previous work of our group but is integrated in the OpenTrad initiative [2]. This abstract summarizes the current status of development of an FST grammar for the structural transfer of verb chains. This task is quite complex because of the large linguistic distance between the two languages. In the current implementation we are using the XRCE finite-state tools [1].


International Conference on Computational Linguistics | 2002

Semiautomatic labelling of semantic features

Arantza Díaz de Ilarraza; Aingeru Mayor; Kepa Sarasola

This paper presents the strategy and design of a highly efficient semiautomatic method for labelling the semantic features of common nouns, using semantic relationships between words and based on information extracted from an electronic monolingual dictionary. The method, which uses genus data, specific relators and synonymy information, obtains an accuracy of over 99% and a scope of 68.2% with regard to all the common nouns contained in a real corpus of over 1 million words, after the manual labelling of only 100 nouns.


The People's Web Meets NLP: Collaboratively Constructed Language Resources. Springer | 2013

Reciprocal Enrichment Between Basque Wikipedia and Machine Translation

Iñaki Alegria; Unai Cabezon; Unai Fernandez de Betoño; Gorka Labaka; Aingeru Mayor; Kepa Sarasola; Arkaitz Zubiaga

In this chapter, we define a collaboration framework that enables Wikipedia editors to generate new articles while helping the development of Machine Translation (MT) systems by providing post-editing logs. This collaboration framework was tested with editors of Basque Wikipedia. Their post-editing of Computer Science articles has been used to improve the output of a Spanish-to-Basque MT system called Matxin. For the collaboration between editors and researchers, we selected a set of 100 articles from the Spanish Wikipedia. These articles were then used as the source texts to be translated into Basque using the MT engine. A group of volunteers from Basque Wikipedia reviewed and corrected the raw MT translations. This collaboration ultimately produced two main benefits: (i) change logs that could help improve the MT engine by means of an automated statistical post-editing system, and (ii) the growth of Basque Wikipedia. The results show that this process can improve the accuracy of a rule-based MT (RBMT) system by nearly 10%, benefiting from the post-editing of 50,000 words in the Computer Science domain. We believe that our conclusions can be extended to MT engines involving other less-resourced languages lacking large parallel corpora or frequently updated lexical knowledge, as well as to other domains.


Language and Technology Conference | 2009

Valuable language resources and applications supporting the use of Basque

Iñaki Alegria; Maxux J. Aranzabe; Xabier Arregi; Xabier Artola; Arantza Díaz de Ilarraza; Aingeru Mayor; Kepa Sarasola

We present some Language Technology applications and resources that have proven to be valuable tools for promoting the use of Basque, a low-density language. We also present the strategy we have followed for almost twenty years to develop those tools and derived applications as the top of an integrated environment of language resources, language tools and other applications. In our opinion, if Basque is now in a fairly good position in Language Technology, it is because those guidelines have been followed.


Archive | 2005

An Open-Source Shallow-Transfer Machine Translation Engine for the Romance Languages of Spain

Antonio M. Corbí-Bellot; Mikel L. Forcada; Sergio Ortiz-Rojas; Juan Antonio Pérez; Gema Ramírez-Sánchez; Felipe Sánchez-Martínez; Iñaki Alegria; Aingeru Mayor; Kepa Sarasola; Euskal Herriko Unibertsitatea


Archive | 2005

An Open Architecture for Transfer-based Machine Translation between Spanish and Basque

Iñaki Alegria; Arantza Díaz de Ilarraza Sánchez; Gorka Labaka; Mikel Lersundi; Aingeru Mayor; Kepa Sarasola; Mikel L. Forcada; Sergio Ortiz-Rojas; Lluís Padró; Ixa Taldea; Informatika Fakultatea


Archive | 2009

Matxin: moving towards language independence

Aingeru Mayor; Ixa Taldea; Euskal Herriko Unibertsitatea; Francis M. Tyers


International Joint Conference on Natural Language Processing | 2008

Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source

Iñaki Alegria; Xabier Arregi; Xabier Artola; A. Diaz de Ilarraza; Gorka Labaka; Mikel Lersundi; Aingeru Mayor; Kepa Sarasola

Collaboration

Top Co-Authors

Kepa Sarasola, University of the Basque Country

Gorka Labaka, University of the Basque Country

Iñaki Alegria, University of the Basque Country

Mikel Lersundi, University of the Basque Country

Arantza Díaz de Ilarraza, University of the Basque Country

Unai Cabezon, University of the Basque Country