Kostadin Cholakov
University of Groningen
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kostadin Cholakov.
international conference on computational linguistics | 2008
Kostadin Cholakov; Valia Kordoni; Yi Zhang
In this paper we illustrate and underline the importance of making detailed linguistic information a central part of the process of automatic acquisition of large-scale lexicons as a means for enhancing robustness and at the same time ensuring maintainability and re-usability of deep lexicalised grammars. Using the error mining techniques proposed in (van Noord, 2004) we show very convincingly that the main hindrance to portability of deep lexicalised grammars to domains other than the ones originally developed in, as well as to robustness of systems using such grammars is low lexical coverage. To this effect, we develop linguistically-driven methods that use detailed morphosyntactic information to automatically enhance the performance of deep lexicalised grammars maintaining at the same time their usually already achieved high linguistic quality.
Natural Language Engineering | 2014
Kostadin Cholakov
In recent studies it has been shown that syntax-based semantic space models outperform models in which the context is represented as a bag-of-words in several semantic analysis tasks. This has been generally attributed to the fact that syntax-based models employ corpora that are syntactically annotated by a parser and a computational grammar. However, if the corpora processed contain words which are unknown to the parser and the grammar, a syntax-based model may lose its advantage since the syntactic properties of such words are unavailable. On the other hand, bag-of-words models do not face this issue since they operate on raw, non-annotated corpora and are thus more robust. In this paper, we compare the performance of syntax-based and bag-of-words models when applied to the task of learning the semantics of unknown words. In our experiments, unknown words are considered the words which are not known to the Alpino parser and grammar of Dutch. In our study, the semantics of an unknown word is defined by finding its most similar word in cornetto , a Dutch lexico-semantic hierarchy. We show that for unknown words the syntax-based model performs worse than the bag-of-words approach. Furthermore, we show that if we first learn the syntactic properties of unknown words by an appropriate lexical acquisition method, then in fact the syntax-based model does outperform the bag-of-words approach. The conclusion we draw is that, for words unknown to a given grammar, a bag-of-words model is more robust than a syntax-based model. However, the combination of lexical acquisition and syntax-based semantic models is best suited for learning the semantics of unknown words.
Archive | 2009
J. (John) Nerbonne; G.J.M. (Gertjan) van Noord; Kostadin Cholakov
empirical methods in natural language processing | 2010
Kostadin Cholakov; Gertjan van Noord
recent advances in natural language processing | 2009
Kostadin Cholakov; Gertjan van Noord
international conference on computational linguistics | 2010
Kostadin Cholakov; Gertjan van Noord
international joint conference on natural language processing | 2011
Kostadin Cholakov; Gertjan van Noord; Valia Kordoni; Yi Zhang
Archive | 2016
Valia Kordoni; Lexi Birch; Ioana Buliga; Kostadin Cholakov; Markus Egg; Federico Gaspari; Yota Georgakopoulou; Maria Gialama; I.H.E. Hendrickx; Mitja Jermol; Katia Lida Kermanidis; Joss Moorkens; Davor Orlic; Michael Papadopoulos; Maja Popović; Rico Sennrich; Vilelmini Sosoni; Dimitrios Tsoumakos; Antal van den Bosch; Menno van Zaanen; Andy Way
recent advances in natural language processing | 2011
Kostadin Cholakov; Gertjan van Noord; Valia Kordoni; Yi Zhang
language resources and evaluation | 2016
Valia Kordoni; A.P.J. van den Bosch; Katia Lida Kermanidis; Vilelmini Sosoni; Kostadin Cholakov; I.H.E. Hendrickx; Matthias Huck