Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Marten van Schijndel is active.

Publication


Featured researches published by Marten van Schijndel.


Topics in Cognitive Science | 2013

A model of language processing as hierarchic sequential prediction.

Marten van Schijndel; Andrew Exley; William Schuler

Computational models of memory are often expressed as hierarchic sequence models, but the hierarchies in these models are typically fairly shallow, reflecting the tendency for memories of superordinate sequence states to become increasingly conflated. This article describes a broad-coverage probabilistic sentence processing model that uses a variant of a left-corner parsing strategy to flatten sentence processing operations in parsing into a similarly shallow hierarchy of learned sequences. The main result of this article is that a broad-coverage model with constraints on hierarchy depth can process large newspaper corpora with the same accuracy as a state-of-the-art parser not defined in terms of sequential working memory operations.


north american chapter of the association for computational linguistics | 2015

Hierarchic syntax improves reading time prediction

Marten van Schijndel; William Schuler

Previous work has debated whether humans make use of hierarchic syntax when processing language (Frank and Bod, 2011; Fossum and Levy, 2012). This paper uses an eye-tracking corpus to demonstrate that hierarchic syntax significantly improves reading time prediction over a strong n-gram baseline. This study shows that an interpolated 5-gram baseline can be made stronger by combining n-gram statistics over entire eye-tracking regions rather than simply using the last n-gram in each region, but basic hierarchic syntactic measures are still able to achieve significant improvements over this improved baseline.


Frontiers in Psychology | 2016

Salience and Attention in Surprisal-Based Accounts of Language Processing

Alessandra Zarcone; Marten van Schijndel; Jorrig Vogels; Vera Demberg

The notion of salience has been singled out as the explanatory factor for a diverse range of linguistic phenomena. In particular, perceptual salience (e.g., visual salience of objects in the world, acoustic prominence of linguistic sounds) and semantic-pragmatic salience (e.g., prominence of recently mentioned or topical referents) have been shown to influence language comprehension and production. A different line of research has sought to account for behavioral correlates of cognitive load during comprehension as well as for certain patterns in language usage using information-theoretic notions, such as surprisal. Surprisal and salience both affect language processing at different levels, but the relationship between the two has not been adequately elucidated, and the question of whether salience can be reduced to surprisal / predictability is still open. Our review identifies two main challenges in addressing this question: terminological inconsistency and lack of integration between high and low levels of representations in salience-based accounts and surprisal-based accounts. We capitalize upon work in visual cognition in order to orient ourselves in surveying the different facets of the notion of salience in linguistics and their relation with models of surprisal. We find that work on salience highlights aspects of linguistic communication that models of surprisal tend to overlook, namely the role of attention and relevance to current goals, and we argue that the Predictive Coding framework provides a unified view which can account for the role played by attention and predictability at different levels of processing and which can clarify the interplay between low and high levels of processes and between predictability-driven expectation and attention-driven focus.


Cognition | 2016

Investigating locality effects and surprisal in written English syntactic choice phenomena

Rajakrishnan Rajkumar; Marten van Schijndel; Michael White; William Schuler

We investigate the extent to which syntactic choice in written English is influenced by processing considerations as predicted by Gibsons (2000) Dependency Locality Theory (DLT) and Surprisal Theory (Hale, 2001; Levy, 2008). A long line of previous work attests that languages display a tendency for shorter dependencies, and in a previous corpus study, Temperley (2007) provided evidence that this tendency exerts a strong influence on constituent ordering choices. However, Temperleys study included no frequency-based controls, and subsequent work on sentence comprehension with broad-coverage eye-tracking corpora found weak or negative effects of DLT-based measures when frequency effects were statistically controlled for (Demberg & Keller, 2008; van Schijndel, Nguyen, & Schuler 2013; van Schijndel & Schuler, 2013), calling into question the actual impact of dependency locality on syntactic choice phenomena. Going beyond Temperleys work, we show that DLT integration costs are indeed a significant predictor of syntactic choice in written English even in the presence of competing frequency-based and cognitively motivated control factors, including n-gram probability and PCFG surprisal as well as embedding depth (Wu, Bachrach, Cardenas, & Schuler, 2010; Yngve, 1960). Our study also shows that the predictions of dependency length and surprisal are only moderately correlated, a finding which mirrors Dember & Kellers (2008) results for sentence comprehension. Further, we demonstrate that the efficacy of dependency length in predicting the corpus choice increases with increasing head-dependent distances. At the same time, we find that the tendency towards dependency locality is not always observed, and with pre-verbal adjuncts in particular, non-locality cases are found more often than not. In contrast, surprisal is effective in these cases, and the embedding depth measures further increase prediction accuracy. We discuss the implications of our findings for theories of language comprehension and production, and conclude with a discussion of questions our work raises for future research.


north american chapter of the association for computational linguistics | 2015

Evidence of syntactic working memory usage in MEG data

Marten van Schijndel; Brian Murphy; William Schuler

While reading times are often used to measure working memory load, frequency effects (such as surprisal or n-gram frequencies) also have strong confounding effects on reading times. This work uses a naturalistic audio corpus with magnetoencephalographic (MEG) annotations to measure working memory load during sentence processing. Alpha oscillations in posterior regions of the brain have been found to correlate with working memory load in non-linguistic tasks (Jensen et al., 2002), and the present study extends these findings to working memory load caused by syntactic center embeddings. Moreover, this work finds that frequency effects in naturally-occurring stimuli do not significantly contribute to neural oscillations in any frequency band, which suggests that many modeling claims could be tested on this sort of data even without controlling for frequency effects.


north american chapter of the association for computational linguistics | 2015

AZMAT: Sentence Similarity Using Associative Matrices

Evan Jaffe; Lifeng Jin; David King; Marten van Schijndel

This work uses recursive autoencoders (Socher et al., 2011), word embeddings (Pennington et al., 2014), associative matrices (Schuler, 2014) and lexical overlap features to model human judgments of sentential similarity on SemEval-2015 Task 2: English STS (Agirre et al., 2015). Results show a modest positive correlation between system predictions and human similarity scores, ranking 69th out of 74 submitted systems.


meeting of the association for computational linguistics | 2014

Bootstrapping into Filler-Gap: An Acquisition Story

Marten van Schijndel; Micha Elsner

Analyses of filler-gap dependencies usually involve complex syntactic rules or heuristics; however recent results suggest that filler-gap comprehension begins earlier than seemingly simpler constructions such as ditransitives or passives. Therefore, this work models filler-gap acquisition as a byproduct of learning word orderings (e.g. SVO vs OSV), which must be done at a very young age anyway in order to extract meaning from language. Specifically, this model, trained on part-of-speech tags, represents the preferred locations of semantic roles relative to a verb as Gaussian mixtures over real numbers. This approach learns role assignment in filler-gap constructions in a manner consistent with current developmental findings and is extremely robust to initialization variance. Additionally, this model is shown to be able to account for a characteristic error made by learners during this period (A and B gorped interpreted as A gorped B).


international conference on computational linguistics | 2012

Accurate Unbounded Dependency Recovery using Generalized Categorial Grammars

Luan Nguyen; Marten van Schijndel; William Schuler


north american chapter of the association for computational linguistics | 2013

An Analysis of Frequency- and Memory-Based Processing Costs

Marten van Schijndel; William Schuler


international conference on computational linguistics | 2016

Memory access during incremental sentence processing causes reading time latency.

Cory Shain; Marten van Schijndel; Richard Futrell; Edward Gibson; William Schuler

Collaboration


Dive into the Marten van Schijndel's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Luan Nguyen

University of Minnesota

View shared research outputs
Top Co-Authors

Avatar

Andrew Exley

University of Minnesota

View shared research outputs
Top Co-Authors

Avatar

Edward Gibson

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Richard Futrell

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Lifeng Jin

Health Protection Agency

View shared research outputs
Top Co-Authors

Avatar

Rajakrishnan Rajkumar

Indian Institute of Technology Delhi

View shared research outputs
Researchain Logo
Decentralizing Knowledge