Olatz Ansa
University of the Basque Country
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Olatz Ansa.
MWE '04 Proceedings of the Workshop on Multiword Expressions: Integrating Processing | 2004
Iñaki Alegria; Olatz Ansa; Xabier Artola; Nerea Ezeiza; Koldo Gojenola; Ruben Urizar
This paper describes the representation of Basque Multiword Lexical Units and the automatic processing of Multiword Expressions. After discussing and stating which kind of multiword expressions we consider to be processed at the current stage of the work, we present the representation schema of the corresponding lexical units in a general-purpose lexical database. Due to its expressive power, the schema can deal not only with fixed expressions but also with morphosyntactically flexible constructions. It also allows us to lemmatize word combinations as a unit and yet to parse the components individually if necessary. Moreover, we describe HABIL, a tool for the automatic processing of these expressions, and we give some evaluation results. This work must be placed in a general framework of written Basque processing tools, which currently ranges from the tokenization and segmentation of single words up to the syntactic tagging of general texts.
Knowledge and Information Systems | 2015
Arantxa Otegi; Xabier Arregi; Olatz Ansa; Eneko Agirre
Traditional information retrieval (IR) systems use keywords to index and retrieve documents. The limitations of keywords were recognized since the early days, specially when different but closely related words are used in the query and the relevant document. Query expansion techniques like pseudo-relevance feedback (PRF) and document clustering techniques rely on the target document set in order to bridge the gap between those words. This paper explores the use of knowledge-based semantic relatedness techniques to overcome the vocabulary mismatch between the query and documents, both on IR and Passage Retrieval for question answering. We performed query expansion and document expansion using WordNet, with positive effects over a language modeling baseline on three datasets, and over PRF on two of those datasets. Our analysis shows that our models and PRF are complementary; in that, PRF is better for easy queries, and our models are stronger for difficult queries and that our models generalize better to other collections, being more robust to parameter adjustments. In addition, we show that our method has a positive impact in an end-to-end question answering system for Basque and that it can be readily applied to other knowledge bases, as our good results using Wikipedia show, paving the way for the use of other knowledge structures such as medical ontologies and linked data repositories.
cross language evaluation forum | 2009
Eneko Agirre; Olatz Ansa; Xabier Arregi; Maddalen Lopez de Lacalle; Arantxa Otegi; Xabier Saralegi; Hugo Zaragoza
This article describes the participation of the joint Elhuyar-IXA group in the ResPubliQA exercise at QA&CLEF. In particular, we participated in the English-English monolingual task and in the Basque-English crosslingual one. Our focus has been threefold: (1) to check to what extent information retrieval (IR) can achieve good results in passage retrieval without question analysis and answer validation, (2) to check Machine Readable Dictionary (MRD) techniques for the Basque to English retrieval when faced with the lack of parallel corpora for Basque in this domain, and (3) to check the contribution of semantic relatedness based on WordNet to expand the passages to related words. Our results show that IR provides good results in the monolingual task, that our crosslingual system performs lower than the monolingual runs, and that semantic relatedness improves the results in both tasks (by 6 and 2 points, respectively).
flexible query answering systems | 2006
Olatz Ansa; Xabier Arregi; Arantxa Otegi; Andoni Valverde
This paper presents a general platform for a Basque monolingual question answering (QA) system. It focuses on the architecture of the platform, paying special attention to: 1) the integration of the development and evaluation environments, and 2) the systematic use of XML declarative files to control the execution of the modules and the communication between them. Moreover, a first pilot experiment is discussed.
arXiv: Computation and Language | 2000
Eneko Agirre; Olatz Ansa; Eduard H. Hovy; David Martinez
arXiv: Computation and Language | 2001
Eneko Agirre; Olatz Ansa; Eduard H. Hovy; David Martinez
Archive | 1998
Eneko Agirre; Izaskun Aldezabal; Iñaki Alegria; Olatz Ansa; Xabier Arregi; Jose Maria Arriola
Proceedings of the IRCS Workshop on linguistic databases | 2006
Izaskun Aldezabal; Olatz Ansa; Bertol Arrieta; Xabier Artola; Aitzol Ezeiza; Gregorio Hernández; Mikel Lersundi
cross language evaluation forum | 2008
Olatz Ansa; Xabier Arregi; Arantxa Otegi; Ander Soraluze
CLEF (Working Notes) | 2008
Olatz Ansa; Xabier Arregi; Arantxa Otegi; Ander Soraluze