Sujay Kumar Jauhar
Carnegie Mellon University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sujay Kumar Jauhar.
north american chapter of the association for computational linguistics | 2015
Sujay Kumar Jauhar; Chris Dyer; Eduard H. Hovy
Words are polysemous. However, most approaches to representation learning for lexical semantics assign a single vector to every surface word type. Meanwhile, lexical ontologies such as WordNet provide a source of complementary knowledge to distributional information, including a word sense inventory. In this paper we propose two novel and general approaches for generating sense-specific word embeddings that are grounded in an ontology. The first applies graph smoothing as a postprocessing step to tease the vectors of different senses apart, and is applicable to any vector space model. The second adapts predictive maximum likelihood models that learn word embeddings with latent variables representing senses grounded in an specified ontology. Empirical results on lexical semantic tasks show that our approaches effectively captures information from both the ontology and distributional statistics. Moreover, in most cases our sense-specific models outperform other models we compare against.
meeting of the association for computational linguistics | 2016
Sujay Kumar Jauhar; Peter D. Turney; Eduard H. Hovy
Question answering requires access to a knowledge base to check facts and reason about information. Knowledge in the form of natural language text is easy to acquire, but difficult for automated reasoning. Highly-structured knowledge bases can facilitate reasoning, but are difficult to acquire. In this paper we explore tables as a semi-structured formalism that provides a balanced compromise to this trade-off. We first use the structure of tables to guide the construction of a dataset of over 9000 multiple-choice questions with rich alignment annotations, easily and efficiently via crowd-sourcing. We then use this annotated data to train a semi-structured feature-driven model for question answering that uses tables as a knowledge base. In benchmark evaluations, we significantly outperform both a strong unstructured retrieval baseline and a highly structured Markov Logic Network model.
joint conference on lexical and computational semantics | 2015
Sujay Kumar Jauhar; Raul D. Guerra; Edgar Gonzàlez Pellicer; Marta Recasens
Discourse deixis is a linguistic phenomenon in which pronouns have verbal or clausal, rather than nominal, antecedents. Studies have estimated that between 5% and 10% of pronouns in non-conversational data are discourse deictic. However, current coreference resolution systems ignore this phenomenon. This paper presents an automatic system for the detection and resolution of discourse-deictic pronouns. We introduce a two-step approach that first recognizes instances of discourse-deictic pronouns, and then resolves them to their verbal antecedent. Both components rely on linguistically motivated features. We evaluate the components in isolation and in combination with two state-of-the-art coreference resolvers. Results show that our system outperforms several baselines, including the only comparable discourse deixis system, and leads to small but statistically significant improvements over the full coreference resolution systems. An error analysis lays bare the need for a less strict evaluation of this task.
north american chapter of the association for computational linguistics | 2015
Manaal Faruqui; Jesse Dodge; Sujay Kumar Jauhar; Chris Dyer; Eduard H. Hovy; Noah A. Smith
joint conference on lexical and computational semantics | 2012
Lucia Specia; Sujay Kumar Jauhar; Rada Mihalcea
Proceedings of the First Workshop on Metaphor in NLP | 2013
Dirk Hovy; Shashank Shrivastava; Sujay Kumar Jauhar; Mrinmaya Sachan; Kartik Goyal; Huying Li; Whitney Sanders; Eduard H. Hovy
joint conference on lexical and computational semantics | 2012
Sujay Kumar Jauhar; Lucia Specia
meeting of the association for computational linguistics | 2013
Kartik Goyal; Sujay Kumar Jauhar; Huiying Li; Mrinmaya Sachan; Shashank Srivastava; Eduard H. Hovy
meeting of the association for computational linguistics | 2013
Kartik Goyal; Sujay Kumar Jauhar; Huiying Li; Mrinmaya Sachan; Shashank Srivastava; Eduard H. Hovy
joint conference on lexical and computational semantics | 2017
Sujay Kumar Jauhar; Eduard H. Hovy