Xuchen Yao
Johns Hopkins University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Xuchen Yao.
meeting of the association for computational linguistics | 2014
Xuchen Yao; Benjamin Van Durme
Answering natural language questions using the Freebase knowledge base has recently been explored as a platform for advancing the state of the art in open domain semantic parsing. Those efforts map questions to sophisticated meaning representations that are then attempted to be matched against viable answer candidates in the knowledge base. Here we show that relatively modest information extraction techniques, when paired with a webscale corpus, can outperform these sophisticated approaches by roughly 34% relative gain.
north american chapter of the association for computational linguistics | 2015
Xuchen Yao
For the task of question answering (QA) over Freebase on the WEBQUESTIONS dataset (Berant et al., 2013), we found that 85% of all questions (in the training set) can be directly answered via a single binary relation. Thus we turned this task into slot-filling for tuples: predicting relations to get answers given a question’s topic. We design efficient data structures to identify question topics organically from 46 million Freebase topic names, without employing any NLP processing tools. Then we present a lean QA system that runs in real time (in offline batch testing it answered two thousand questions in 51 seconds on a laptop). The system also achieved 7.8% better F1 score (harmonic mean of average precision and recall) than the previous state of the art.
meeting of the association for computational linguistics | 2014
Xuchen Yao; Jonathan Berant; Benjamin Van Durme
We contrast two seemingly distinct approaches to the task of question answering (QA) using Freebase: one based on information extraction techniques, the other on semantic parsing. Results over the same test-set were collected from two state-ofthe-art, open-source systems, then analyzed in consultation with those systems’ creators. We conclude that the differences between these technologies, both in task performance, and in how they get there, is not significant. This suggests that the semantic parsing community should target answering more compositional open-domain questions that are beyond the reach of more direct information extraction methods.
international joint conference on natural language processing | 2015
Ellie Pavlick; Juri Ganitkevitch; Tsz Ping Chan; Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch
The validity of applying paraphrase rules depends on the domain of the text that they are being applied to. We develop a novel method for extracting domainspecific paraphrases. We adapt the bilingual pivoting paraphrase method to bias the training data to be more like our target domain of biology. Our best model results in higher precision while retaining complete recall, giving a 10% relative improvement in AUC.
north american chapter of the association for computational linguistics | 2013
Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch; Peter Clark
graph based methods for natural language processing | 2011
Xuchen Yao; Benjamin Van Durme
Archive | 2010
Xuchen Yao
meeting of the association for computational linguistics | 2013
Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch; Peter Clark
empirical methods in natural language processing | 2013
Xuchen Yao; Benjamin Van Durme; Chris Callison-Burch; Peter Clark
Dialogue & Discourse | 2012
Xuchen Yao; Gosse Bouma; Yi Zhang