Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sujay Kumar Jauhar is active.

Publication


Featured researches published by Sujay Kumar Jauhar.


north american chapter of the association for computational linguistics | 2015

Ontologically Grounded Multi-sense Representation Learning for Semantic Vector Space Models.

Sujay Kumar Jauhar; Chris Dyer; Eduard H. Hovy

Words are polysemous. However, most approaches to representation learning for lexical semantics assign a single vector to every surface word type. Meanwhile, lexical ontologies such as WordNet provide a source of complementary knowledge to distributional information, including a word sense inventory. In this paper we propose two novel and general approaches for generating sense-specific word embeddings that are grounded in an ontology. The first applies graph smoothing as a postprocessing step to tease the vectors of different senses apart, and is applicable to any vector space model. The second adapts predictive maximum likelihood models that learn word embeddings with latent variables representing senses grounded in an specified ontology. Empirical results on lexical semantic tasks show that our approaches effectively captures information from both the ontology and distributional statistics. Moreover, in most cases our sense-specific models outperform other models we compare against.


meeting of the association for computational linguistics | 2016

Tables as Semi-structured Knowledge for Question Answering.

Sujay Kumar Jauhar; Peter D. Turney; Eduard H. Hovy

Question answering requires access to a knowledge base to check facts and reason about information. Knowledge in the form of natural language text is easy to acquire, but difficult for automated reasoning. Highly-structured knowledge bases can facilitate reasoning, but are difficult to acquire. In this paper we explore tables as a semi-structured formalism that provides a balanced compromise to this trade-off. We first use the structure of tables to guide the construction of a dataset of over 9000 multiple-choice questions with rich alignment annotations, easily and efficiently via crowd-sourcing. We then use this annotated data to train a semi-structured feature-driven model for question answering that uses tables as a knowledge base. In benchmark evaluations, we significantly outperform both a strong unstructured retrieval baseline and a highly structured Markov Logic Network model.


joint conference on lexical and computational semantics | 2015

Resolving Discourse-Deictic Pronouns: A Two-Stage Approach to Do It

Sujay Kumar Jauhar; Raul D. Guerra; Edgar Gonzàlez Pellicer; Marta Recasens

Discourse deixis is a linguistic phenomenon in which pronouns have verbal or clausal, rather than nominal, antecedents. Studies have estimated that between 5% and 10% of pronouns in non-conversational data are discourse deictic. However, current coreference resolution systems ignore this phenomenon. This paper presents an automatic system for the detection and resolution of discourse-deictic pronouns. We introduce a two-step approach that first recognizes instances of discourse-deictic pronouns, and then resolves them to their verbal antecedent. Both components rely on linguistically motivated features. We evaluate the components in isolation and in combination with two state-of-the-art coreference resolvers. Results show that our system outperforms several baselines, including the only comparable discourse deixis system, and leads to small but statistically significant improvements over the full coreference resolution systems. An error analysis lays bare the need for a less strict evaluation of this task.


north american chapter of the association for computational linguistics | 2015

Retrofitting Word Vectors to Semantic Lexicons

Manaal Faruqui; Jesse Dodge; Sujay Kumar Jauhar; Chris Dyer; Eduard H. Hovy; Noah A. Smith


joint conference on lexical and computational semantics | 2012

SemEval-2012 Task 1: English Lexical Simplification

Lucia Specia; Sujay Kumar Jauhar; Rada Mihalcea


Proceedings of the First Workshop on Metaphor in NLP | 2013

Identifying Metaphorical Word Use with Tree Kernels

Dirk Hovy; Shashank Shrivastava; Sujay Kumar Jauhar; Mrinmaya Sachan; Kartik Goyal; Huying Li; Whitney Sanders; Eduard H. Hovy


joint conference on lexical and computational semantics | 2012

UOW-SHEF: SimpLex -- Lexical Simplicity Ranking based on Contextual and Psycholinguistic Features

Sujay Kumar Jauhar; Lucia Specia


meeting of the association for computational linguistics | 2013

A Structured Distributional Semantic Model for Event Co-reference

Kartik Goyal; Sujay Kumar Jauhar; Huiying Li; Mrinmaya Sachan; Shashank Srivastava; Eduard H. Hovy


meeting of the association for computational linguistics | 2013

A Structured Distributional Semantic Model : Integrating Structure with Semantics

Kartik Goyal; Sujay Kumar Jauhar; Huiying Li; Mrinmaya Sachan; Shashank Srivastava; Eduard H. Hovy


joint conference on lexical and computational semantics | 2017

Embedded Semantic Lexicon Induction with Joint Global and Local Optimization.

Sujay Kumar Jauhar; Eduard H. Hovy

Collaboration


Dive into the Sujay Kumar Jauhar's collaboration.

Top Co-Authors

Avatar

Eduard H. Hovy

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Kartik Goyal

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Mrinmaya Sachan

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Chris Dyer

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Huiying Li

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lucia Specia

University of Sheffield

View shared research outputs
Top Co-Authors

Avatar

Peter D. Turney

National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dirk Hovy

University of Southern California

View shared research outputs
Researchain Logo
Decentralizing Knowledge