D.L. Theijssen
Radboud University Nijmegen
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by D.L. Theijssen.
Information Retrieval | 2011
Suzan Verberne; Hans van Halteren; D.L. Theijssen; Stephan Raaijmakers; Lou Boves
In this paper, we evaluate a number of machine learning techniques for the task of ranking answers to why-questions. We use TF-IDF together with a set of 36 linguistically motivated features that characterize questions and answers. We experiment with a number of machine learning techniques (among which several classifiers and regression techniques, Ranking SVM and SVMmap) in various settings. The purpose of the experiments is to assess how the different machine learning approaches can cope with our highly imbalanced binary relevance data, with and without hyperparameter tuning. We find that with all machine learning techniques, we can obtain an MRR score that is significantly above the TF-IDF baseline of 0.25 and not significantly lower than the best score of 0.35. We provide an in-depth analysis of the effect of data imbalance and hyperparameter tuning, and we relate our findings to previous research on learning to rank for Information Retrieval.
Corpus Linguistics and Linguistic Theory | 2013
D.L. Theijssen; L.F.M. ten Bosch; Bert Cranen; H. van Halteren; Lou Boves
Abstract In existing research on syntactic alternations such as the dative alternation, (give her the apple vs. give the apple to her), the linguistic data is often analysed with the help of logistic regression models. In this article, we evaluate the use of logistic regression for this type of research, and present two different approaches: Bayesian Networks and Memory-based learning. For the Bayesian Network, we use the higher-level semantic features suggested in the literature, while we limit ourselves to lexical items in the memory-based approach. We evaluate the suitability of the three approaches by applying them to a large data set (>11,000 instances) extracted from the British National Corpus, and comparing their quality in terms of classification accuracy, their interpretability in the context of linguistic research, and their actual classification of individual cases. Our main finding is that the classifications are very similar across the three approaches, also when employing lexical items instead of the higher-level features, because most of the alternation is determined by the verb and the length of the two objects (here: her and the apple).
language resources and evaluation | 2012
D.L. Theijssen; Lou Boves; Hans van Halteren; Nelleke Oostdijk
In this article, we automatically create two large and richly annotated data sets for studying the English dative alternation. With an intrinsic and an extrinsic evaluation, we address the question of whether such data sets that are obtained and enriched automatically are suitable for linguistic research, even if they contain errors. The extrinsic evaluation consists of building logistic regression models with these data sets. We conclude that the automatic approach for detecting instances of the dative alternation still needs human intervention, but that it is indeed possible to annotate the instances with features that are syntactic, semantic and discourse-related in nature. Only the automatic classification of the concreteness of nouns is problematic.
computational linguistics in the netherlands | 2008
D.L. Theijssen; H. van Halteren; Suzan Verberne; Lou Boves
Aly, R.B.N.;Hauff, C.;Hamer-Mulder, I.C. den [et al.] (ed.), Proceedings of the Dutch-Belgium Information Retrieval workshop (DIR 2009) | 2009
Suzan Verberne; Stephan Raaijmakers; D.L. Theijssen; Lou Boves
Linguistic Inquiry | 2008
D.L. Theijssen
CLIN Journal | 2011
D.L. Theijssen; H. van Halteren; Lou Boves; Nelleke Oostdijk
computational linguistics in the netherlands | 2007
D.L. Theijssen; Suzan Verberne; Nelleke Oostdijk; Lou Boves
computational linguistics in the netherlands | 2010
D.L. Theijssen; H. van Halteren; T. Boonpiyapat; A. Lohfink; B. Ruiter; H. Westerbeek
Li, H.;Liu, T.Y.;Zhai, C. [et al.] (ed.), Proceedings of the Workshop Learning to Rank for Information Retrieval (LR4IR 2009) | 2009
Suzan Verberne; H. van Halteren; Stephan Raaijmakers; D.L. Theijssen; Lou Boves