Jisheng Liang | Researchain

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Jisheng Liang is active.

Explore More

Publication

Featured researches published by Jisheng Liang.

Sigkdd Explorations | 2005

Extracting statistical data frames from text

Jisheng Liang; Krzysztof Koperski; Thien Nguyen; Giovanni B. Marchisio

We present a framework that bridges the gap between natural language processing (NLP) and text mining. Central to this is a new approach to text parameterization that captures many interesting attributes of text usually ignored by standard indices, like the term-document matrix. By storing NLP tags, the new index supports a higher degree of knowledge discovery and pattern finding from text. The index is relatively compact, enabling dynamic search of arbitrary relationships and events in large document collections. We can export search results in formats and data structures that are transparent to statistical analysis tools like S-PLUSID®. In a number of experiments, we demonstrate how this framework can turn mountains of unstructured information into informative statistical graphs.

Proceedings of the 3rd International Semantic Search Workshop on | 2010

A large-scale system for annotating and querying quotations in news feeds

Jisheng Liang; Navdeep S. Dhillon; Krzysztof Koperski

In this paper, we describe a system that automatically extracts quotations from news feeds, and allows efficient retrieval of the semantically annotated quotes. APIs for real-time querying of over 10 million quotes extracted from recent news feeds are publicly available. In addition, each day we add around 60 thousand new quotes extracted from around 50 thousand news articles or blogs. We apply computational linguistic techniques such as coreference resolution, entity recognition and disambiguation to improve both precision and recall of the quote detection. We support faceted search on both speakers and entities mentioned in the quotes.

Archive | 2007

A Case Study in Natural Language Based Web Search

Giovanni B. Marchisio; Navdeep S. Dhillon; Jisheng Liang; Carsten Tusk; Krzysztof Koperski; Thien Nguyen; Dan White; Lubos Pochman

Is there a public for natural language based search? This study, based on our experience with a Web portal, attempts to address criticisms on the lack of scalability and usability of natural language approaches to search. Our solution is based on InFact®, a natural language search engine that combines the speed of keyword search with the power of natural language processing. InFact performs clause level indexing, and offers a full spectrum of functionality that ranges from Boolean keyword operators to linguistic pattern matching in real time, which include recognition of syntactic roles, such as subject/object and semantic categories, such as people and places. A user of our search can navigate and retrieve information based on an understanding of actions, roles and relationships. In developing InFact, we ported the functionality of a deep text analysis platform to a modern search engine architecture. Our distributed indexing and search services are designed to scale to large document collections and large numbers of users. We tested the operational viability of InFact as a search platform by powering a live search on the Web. Site statistics and user logs demonstrate that a statistically significant segment of the user population is relying on natural language search functionality. Going forward, we will focus on promoting this functionality to an even greater percentage of users through a series of creative interfaces.

north american chapter of the association for computational linguistics | 2006

Ontology-Based Natural Language Query Processing for the Biological Domain

Jisheng Liang; Thien Huu Nguyen; Krzysztof Koperski; Giovanni B. Marchisio

This paper describes a natural language query engine that enables users to search for entities, relationships, and events that are extracted from biological literature. The query interpretation is guided by a domain ontology, which provides a mapping between linguistic structures and domain conceptual relations. We focus on the usability of the natural language interface to users who are used to keyword-based information retrieval. Preliminary evaluation of our approach using the GENIA corpus and ontology shows promising results.

Archive | 2009

Method and system for extending keyword searching to syntactically and semantically annotated data

Giovanni B. Marchisio; Krzysztof Koperski; Jisheng Liang; Thien Nguyen; Carsten Tusk; Navdeep S. Dhillon; Lubos Pochman; Matthew E. Brown

Archive | 2004

Method and system for enhanced data searching

Giovanni B. Marchisio; Krzysztof Koperski; Jisheng Liang; Alejandro Murua; Thien Nguyen; Carsten Tusk; Navdeep S. Dhillon; Lubos Pochman

Archive | 2008

NLP-based entity recognition and disambiguation

Jisheng Liang; Krzysztof Koperski; Navdeep S. Dhillon; Carsten Tusk; Satish Bhatti

Archive | 2006

Extending keyword searching to syntactically and semantically annotated data

Giovanni B. Marchisio; Navdeep S. Dhillon; Carsten Tusk; Krzysztof Koperski; Jisheng Liang; Thien Nguyen; Matthew E. Brown

Archive | 2011

Content recommendation based on collections of entities

Krzysztof Koperski; Jisheng Liang; Neil S. Roseman

Archive | 2011

Recommending mobile device activities

Jisheng Liang; Will Hunsinger; Satish Bhatti

Explore More

Collaboration

Dive into the Jisheng Liang's collaboration.

Top Co-Authors

Krzysztof Koperski

Mathsoft

View shared research outputs

Top Co-Authors

Giovanni B. Marchisio

Mathsoft

View shared research outputs

Top Co-Authors

Thien Huu Nguyen

New York University

View shared research outputs

Explore More

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot

Dive into the research topics where Jisheng Liang is active.

Publication

Featured researches published by Jisheng Liang.

Extracting statistical data frames from text

A large-scale system for annotating and querying quotations in news feeds

A Case Study in Natural Language Based Web Search

Ontology-Based Natural Language Query Processing for the Biological Domain

Method and system for extending keyword searching to syntactically and semantically annotated data

Method and system for enhanced data searching

NLP-based entity recognition and disambiguation

Extending keyword searching to syntactically and semantically annotated data

Content recommendation based on collections of entities

Recommending mobile device activities

Collaboration

Dive into the Jisheng Liang's collaboration.