Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zhaoqi Chen is active.

Publication


Featured researches published by Zhaoqi Chen.


IEEE Transactions on Knowledge and Data Engineering | 2008

Web People Search via Connection Analysis

Dmitri V. Kalashnikov; Zhaoqi Chen; Sharad Mehrotra

Nowadays, searches for Webpages of a person with a given name constitute a notable fraction of queries to web search engines. Such a query would normally return Webpages related to several namesakes, who happened to have the queried name, leaving the burden of disambiguating and collecting pages relevant to a particular person (from among the namesakes) on the user. In this article we develop a Web People Search approach that clusters Webpages based on their association to different people. Our method exploits a variety of semantic information extracted from Web pages, such as named entities and hyperlinks, to disambiguate among namesakes referred to on the Web pages. We demonstrate the effectiveness of our approach by testing the efficacy of the disambiguation algorithms and its impact on person search.


international conference on management of data | 2009

Exploiting context analysis for combining multiple entity resolution systems

Zhaoqi Chen; Dmitri V. Kalashnikov; Sharad Mehrotra

Entity Resolution (ER) is an important real world problem that has attracted significant research interest over the past few years. It deals with determining which object descriptions co-refer in a dataset. Due to its practical significance for data mining and data analysis tasks many different ER approaches has been developed to address the ER challenge. This paper proposes a new ER Ensemble framework. The task of ER Ensemble is to combine the results of multiple base-level ER systems into a single solution with the goal of increasing the quality of ER. The framework proposed in this paper leverages the observation that often no single ER method always performs the best, consistently outperforming other ER techniques in terms of quality. Instead, different ER solutions perform better in different contexts. The framework employs two novel combining approaches, which are based on supervised learning. The two approaches learn a mapping of the clustering decisions of the base-level ER systems, together with the local context, into a combined clustering decision. The paper empirically studies the framework by applying it to different domains. The experiments demonstrate that the proposed framework achieves significantly higher disambiguation quality compared to the current state of the art solutions.


acm/ieee joint conference on digital libraries | 2007

Adaptive graphical approach to entity resolution

Zhaoqi Chen; Dmitri V. Kalashnikov; Sharad Mehrotra

Entity resolution is a very common Information Quality (IQ) problem with many different applications. In digital libraries, it is related to problems of citation matching and author name disambiguation; in Natural Language Processing, it is related to coreference matching and object identity; in Web application, it is related to Web page disambiguation. The problem of Entity Resolution arises because objects/entities in real world datasets are often referred to by descriptions, which might not be unique identifiers of these entities, leading to ambiguity. The goal is to group all the entity descriptions that refer to the same real world entities. In this paper we present a graphical approach for entity resolution. It complements the traditional methodology with the analysis of the entity-relationship graph constructed for the dataset being analyzed. The paper demonstrates that a technique that measures the degree of interconnectedness between various pairs of nodes in the graph can significantly improve the quality of entity resolution. Furthermore, the paper presents an algorithm for making that technique self-adaptive to the underlying data, thus minimizing the required participation from the domain-analyst and potentially further improving the disambiguation quality.


international conference on data engineering | 2009

WEST: Modern Technologies for Web People Search

Dmitri V. Kalashnikov; Zhaoqi Chen; Sharad Mehrotra; Zheng Zhang

In this paper we describe WEST (Web Entity Search Technologies) system that we have developed to improve people search over the Internet. Recently the problem of Web People Search (WePS) has attracted significant attention from both the industry and academia. In the classic formulation of WePS problem the user issues a query to a web search engine that consists of a name of a person of interest. For such a query, a traditional search engine such as Yahoo or Google would return webpages that are related to any people who happened to have the queried name. The goal of WePS, instead, is to output a set of clusters of webpages, one cluster per each distinct person, containing all of the webpages related to that person. The user then can locate the desired cluster and explore the webpages it contains.


siam international conference on data mining | 2005

Exploiting relationships for domain-independent data cleaning †

Dmitri V. Kalashnikov; Sharad Mehrotra; Zhaoqi Chen


information quality in information systems | 2005

Exploiting relationships for object consolidation

Zhaoqi Chen; Dmitri V. Kalashnikov; Sharad Mehrotra


international conference on data engineering | 2007

Disambiguation Algorithm for People Search on the Web

Dmitri V. Kalashnikov; Sharad Mehrotra; Zhaoqi Chen; Naveen Ashish


Archive | 2009

Exploiting Web querying for Web People Search in WePS2

Zhaoqi Chen; Dmitri V. Kalashnikov; Sharad Mehrotra


Archive | 2008

Entity resolution based on graph analysis and multi-system ensembles

Sharad Mehrotra; Zhaoqi Chen


Archive | 2004

Ex-ploiting relationships for data cleaning

Dmitri V. Kalashnikov; Sharad Mehrotra; Zhaoqi Chen

Collaboration


Dive into the Zhaoqi Chen's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Naveen Ashish

University of California

View shared research outputs
Top Co-Authors

Avatar

Zheng Zhang

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge