Shane Bergsma
University of Alberta
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Shane Bergsma.
meeting of the association for computational linguistics | 2006
Shane Bergsma; Dekang Lin
We present an approach to pronoun resolution based on syntactic paths. Through a simple bootstrapping procedure, we learn the likelihood of coreference between a pronoun and a candidate noun based on the path in the parse tree between the two entities. This path information enables us to handle previously challenging resolution instances, and also robustly addresses traditional syntactic coreference constraints. Highly coreferent paths also allow mining of precise probabilistic gender/number information. We combine statistical knowledge with well known features in a Support Vector Machine pronoun resolution classifier. Significant gains in performance are observed on several datasets.
empirical methods in natural language processing | 2008
Shane Bergsma; Dekang Lin; Randy Goebel
We present a discriminative method for learning selectional preferences from unlabeled text. Positive examples are taken from observed predicate-argument pairs, while negatives are constructed from unobserved combinations. We train a Support Vector Machine classifier to distinguish the positive from the negative instances. We show how to partition the examples for efficient training with 57 thousand features and 6.5 million training instances. The model outperforms other recent approaches, achieving excellent correlation with human plausibility judgments. Compared to Mutual Information, it identifies 66% more verb-object pairs in unseen text, and resolves 37% more pronouns correctly in a pronoun resolution experiment.
conference on computational natural language learning | 2005
Colin Cherry; Shane Bergsma
We propose an unsupervised Expectation Maximization approach to pronoun resolution. The system learns from a fixed list of potential antecedents for each pronoun. We show that unsupervised learning is possible in this context, as the performance of our system is comparable to supervised methods. Our results indicate that a probabilistic gender/number model, determined automatically from unlabeled text, is a powerful feature for this task.
international joint conference on artificial intelligence | 2011
Shane Bergsma; Benjamin Van Durme
Speakers of many different languages use the Internet. A common activity among these users is uploading images and associating these images with words (in their own language) as captions, filenames, or surrounding text. We use these explicit, monolingual, image-to-word connections to successfully learn implicit, bilingual, word-to-word translations. Bilingual pairs of words are proposed as translations if their corresponding images have similar visual features. We generate bilingual lexicons in 15 language pairs, focusing on words that have been automatically identified as physical objects. The use of visual similarity substantially improves performance over standard approaches based on string similarity: for generated lexicons with 1000 translations, including visual information leads to an absolute improvement in accuracy of 8-12% over string edit distance alone.
discourse anaphora and anaphor resolution colloquium | 2011
Shane Bergsma; David Yarowsky
We present
canadian conference on artificial intelligence | 2005
Shane Bergsma
\textsc{Nada}
international joint conference on natural language processing | 2009
Qing Dou; Shane Bergsma; Sittichai Jiampojamarn; Grzegorz Kondrak
: the Non-Anaphoric Detection Algorithm.
meeting of the association for computational linguistics | 2014
Charley Beller; Rebecca Knowles; Craig Harman; Shane Bergsma; Margaret Mitchell; Benjamin Van Durme
\textsc{Nada}
conference on computational natural language learning | 2009
Shane Bergsma; Dekang Lin; Randy Goebel
is a novel, publicly-available program that accurately distinguishes between the referential and non-referential pronoun it in raw English text. Like recent state-of-the-art approaches,
Archive | 2010
Shane Bergsma
\textsc{Nada}