Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Xuchen Yao is active.

Publication


Featured researches published by Xuchen Yao.


meeting of the association for computational linguistics | 2014

Information Extraction over Structured Data: Question Answering with Freebase

Xuchen Yao; Benjamin Van Durme

Answering natural language questions using the Freebase knowledge base has recently been explored as a platform for advancing the state of the art in open domain semantic parsing. Those efforts map questions to sophisticated meaning representations that are then attempted to be matched against viable answer candidates in the knowledge base. Here we show that relatively modest information extraction techniques, when paired with a webscale corpus, can outperform these sophisticated approaches by roughly 34% relative gain.


north american chapter of the association for computational linguistics | 2015

Lean Question Answering over Freebase from Scratch

Xuchen Yao

For the task of question answering (QA) over Freebase on the WEBQUESTIONS dataset (Berant et al., 2013), we found that 85% of all questions (in the training set) can be directly answered via a single binary relation. Thus we turned this task into slot-filling for tuples: predicting relations to get answers given a question’s topic. We design efficient data structures to identify question topics organically from 46 million Freebase topic names, without employing any NLP processing tools. Then we present a lean QA system that runs in real time (in offline batch testing it answered two thousand questions in 51 seconds on a laptop). The system also achieved 7.8% better F1 score (harmonic mean of average precision and recall) than the previous state of the art.


meeting of the association for computational linguistics | 2014

Freebase QA: Information Extraction or Semantic Parsing?

Xuchen Yao; Jonathan Berant; Benjamin Van Durme

We contrast two seemingly distinct approaches to the task of question answering (QA) using Freebase: one based on information extraction techniques, the other on semantic parsing. Results over the same test-set were collected from two state-ofthe-art, open-source systems, then analyzed in consultation with those systems’ creators. We conclude that the differences between these technologies, both in task performance, and in how they get there, is not significant. This suggests that the semantic parsing community should target answering more compositional open-domain questions that are beyond the reach of more direct information extraction methods.


international joint conference on natural language processing | 2015

Domain-Specific Paraphrase Extraction

Ellie Pavlick; Juri Ganitkevitch; Tsz Ping Chan; Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch

The validity of applying paraphrase rules depends on the domain of the text that they are being applied to. We develop a novel method for extracting domainspecific paraphrases. We adapt the bilingual pivoting paraphrase method to bias the training data to be more like our target domain of biology. Our best model results in higher precision while retaining complete recall, giving a 10% relative improvement in AUC.


north american chapter of the association for computational linguistics | 2013

Answer Extraction as Sequence Tagging with Tree Edit Distance

Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch; Peter Clark


graph based methods for natural language processing | 2011

Nonparametric Bayesian word sense induction

Xuchen Yao; Benjamin Van Durme


Archive | 2010

Question Generation with Minimal Recursion Semantics

Xuchen Yao


meeting of the association for computational linguistics | 2013

A Lightweight and High Performance Monolingual Word Aligner

Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch; Peter Clark


empirical methods in natural language processing | 2013

Semi-Markov Phrase-Based Monolingual Alignment

Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch; Peter Clark


Dialogue & Discourse | 2012

Semantics-based Question Generation and Implementation

Xuchen Yao; Gosse Bouma; Yi Zhang

Collaboration


Dive into the Xuchen Yao's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

S. Duarte

University of Groningen

View shared research outputs
Top Co-Authors

Avatar

Gosse Bouma

University of Groningen

View shared research outputs
Top Co-Authors

Avatar

David R. Traum

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Kenji Sagae

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Ron Artstein

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Jianqiang Ma

University of Tübingen

View shared research outputs
Top Co-Authors

Avatar

Charley Beller

Johns Hopkins University

View shared research outputs
Researchain Logo
Decentralizing Knowledge