Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where D.L. Theijssen is active.

Publication


Featured researches published by D.L. Theijssen.


Information Retrieval | 2011

Learning to rank for why-question answering

Suzan Verberne; Hans van Halteren; D.L. Theijssen; Stephan Raaijmakers; Lou Boves

In this paper, we evaluate a number of machine learning techniques for the task of ranking answers to why-questions. We use TF-IDF together with a set of 36 linguistically motivated features that characterize questions and answers. We experiment with a number of machine learning techniques (among which several classifiers and regression techniques, Ranking SVM and SVMmap) in various settings. The purpose of the experiments is to assess how the different machine learning approaches can cope with our highly imbalanced binary relevance data, with and without hyperparameter tuning. We find that with all machine learning techniques, we can obtain an MRR score that is significantly above the TF-IDF baseline of 0.25 and not significantly lower than the best score of 0.35. We provide an in-depth analysis of the effect of data imbalance and hyperparameter tuning, and we relate our findings to previous research on learning to rank for Information Retrieval.


Corpus Linguistics and Linguistic Theory | 2013

Choosing alternatives: Using Bayesian Networks and memory based learning to study the dative alternation

D.L. Theijssen; L.F.M. ten Bosch; Bert Cranen; H. van Halteren; Lou Boves

Abstract In existing research on syntactic alternations such as the dative alternation, (give her the apple vs. give the apple to her), the linguistic data is often analysed with the help of logistic regression models. In this article, we evaluate the use of logistic regression for this type of research, and present two different approaches: Bayesian Networks and Memory-based learning. For the Bayesian Network, we use the higher-level semantic features suggested in the literature, while we limit ourselves to lexical items in the memory-based approach. We evaluate the suitability of the three approaches by applying them to a large data set (>11,000 instances) extracted from the British National Corpus, and comparing their quality in terms of classification accuracy, their interpretability in the context of linguistic research, and their actual classification of individual cases. Our main finding is that the classifications are very similar across the three approaches, also when employing lexical items instead of the higher-level features, because most of the alternation is determined by the verb and the length of the two objects (here: her and the apple).


language resources and evaluation | 2012

Evaluating automatic annotation: automatically detecting and enriching instances of the dative alternation

D.L. Theijssen; Lou Boves; Hans van Halteren; Nelleke Oostdijk

In this article, we automatically create two large and richly annotated data sets for studying the English dative alternation. With an intrinsic and an extrinsic evaluation, we address the question of whether such data sets that are obtained and enriched automatically are suitable for linguistic research, even if they contain errors. The extrinsic evaluation consists of building logistic regression models with these data sets. We conclude that the automatic approach for detecting instances of the dative alternation still needs human intervention, but that it is indeed possible to annotate the instances with features that are syntactic, semantic and discourse-related in nature. Only the automatic classification of the concreteness of nouns is problematic.


computational linguistics in the netherlands | 2008

Features for automatic discourse analysis of paragraphs

D.L. Theijssen; H. van Halteren; Suzan Verberne; Lou Boves


Aly, R.B.N.;Hauff, C.;Hamer-Mulder, I.C. den [et al.] (ed.), Proceedings of the Dutch-Belgium Information Retrieval workshop (DIR 2009) | 2009

Learning to Rank Answers to Why-Questions

Suzan Verberne; Stephan Raaijmakers; D.L. Theijssen; Lou Boves


Linguistic Inquiry | 2008

Using the ICE-GB Corpus to model the English dative alternation

D.L. Theijssen


CLIN Journal | 2011

On the difficulty of making concreteness concrete

D.L. Theijssen; H. van Halteren; Lou Boves; Nelleke Oostdijk


computational linguistics in the netherlands | 2007

Evaluating deep syntactic parsing

D.L. Theijssen; Suzan Verberne; Nelleke Oostdijk; Lou Boves


computational linguistics in the netherlands | 2010

To ish or not to ish

D.L. Theijssen; H. van Halteren; T. Boonpiyapat; A. Lohfink; B. Ruiter; H. Westerbeek


Li, H.;Liu, T.Y.;Zhai, C. [et al.] (ed.), Proceedings of the Workshop Learning to Rank for Information Retrieval (LR4IR 2009) | 2009

Learning to Rank QA Data: Evaluating Machine Learning Techniques for Ranking Answers to Why-Questions

Suzan Verberne; H. van Halteren; Stephan Raaijmakers; D.L. Theijssen; Lou Boves

Collaboration


Dive into the D.L. Theijssen's collaboration.

Top Co-Authors

Avatar

Lou Boves

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

H. van Halteren

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

Nelleke Oostdijk

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

Suzan Verberne

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

Hans van Halteren

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

Bert Cranen

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

E. van de Sande

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

F. Groothoff

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

J. Tiems

Radboud University Nijmegen

View shared research outputs
Top Co-Authors

Avatar

K.M. Fikkers

Radboud University Nijmegen

View shared research outputs
Researchain Logo
Decentralizing Knowledge