Yunxiao Ma
Microsoft
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yunxiao Ma.
international world wide web conferences | 2007
Zaiqing Nie; Yunxiao Ma; Shuming Shi; Ji-Rong Wen; Wei-Ying Ma
The primary function of current Web search engines is essentially relevance ranking at the document level. However, myriad structured information about real-world objects is embedded in static Web pages and online Web databases. Document-level information retrieval can unfortunately lead to highly inaccurate relevance ranking in answering object-oriented queries. In this paper, we propose a paradigm shift to enable searching at the object level. In traditional information retrieval models, documents are taken as the retrieval units and the content of a document is considered reliable. However, this reliability assumption is no longer valid in the object retrieval context when multiple copies of information about the same object typically exist. These copies may be inconsistent because of diversity of Web site qualities and the limited performance of current information extraction techniques. If we simply combine the noisy and inaccurate attribute information extracted from different sources, we may not be able to achieve satisfactory retrieval performance. In this paper, we propose several language models for Web object retrieval, namely an unstructured object retrieval model, a structured object retrieval model, and a hybrid model with both structured and unstructured retrieval features. We test these models on a paper search engine and compare their performances. We conclude that the hybrid model is the superior by taking into account the extraction errors at varying levels.
conference on information and knowledge management | 2009
Shuming Shi; Bin Lu; Yunxiao Ma; Ji-Rong Wen
Mainstream link-based static-rank algorithms (e.g. PageRank and its variants) express the importance of a page as the linear combination of its in-links and compute page importance scores by solving a linear system in an iterative way. Such linear algorithms, however, may give apparently unreasonable static-rank results for some link structures. In this paper, we examine the static-rank computation problem from the viewpoint of evidence combination and build a probabilistic model for it. Based on the model, we argue that a nonlinear formula should be adopted, due to the correlation or dependence between links. We focus on examining some simple formulas which only consider the correlation between links in the same domain. Experiments conducted on 100 million web pages (with multiple static-rank quality evaluation metrics) show that higher quality static-rank could be yielded by the new nonlinear algorithms. The convergence of the new algorithms is also proved in this paper by nonlinear functional analysis.
Archive | 2011
Zaiqing Nie; Yong Cao; Gang Luo; Ruochi Zhang; Xiaojiang Liu; Yunxiao Ma; Bo Zhang; Ying-Qing Xu; Ji-Rong Wen
text retrieval conference | 2009
Zhicheng Dou; Kun Chen; Ruihua Song; Yunxiao Ma; Shuming Shi; Ji-Rong Wen
Archive | 2006
Ji-Rong Wen; Shuming Shi; Wei-Ying Ma; Yunxiao Ma; Zaiqing Nie
Archive | 2005
Zaiqing Nie; Yunxiao Ma; Ji-Rong Wen; Wei-Ying Ma
Archive | 2009
Ji-Rong Wen; Yu Chen; Guomao Xin; Yunxiao Ma; Yi Liu; Zhicheng Dou; Qing Yu; Shuming Shi
Archive | 2009
Ji-Rong Wen; Guomao Xin; Yunxiao Ma; Yu Chen; Qing Yu; Yi Liu; Zhicheng Dou; Shuming Shi
Archive | 2009
Bin Lu; Shuming Shi; Yunxiao Ma; Ji-Rong Wen
Archive | 2011
Zaiqing Nie; Yong Cao; Gang Luo; Ruochi Zhang; Xiaojiang Liu; Yunxiao Ma; Bo Zhang; Ying-Qing Xu; Ji-Rong Wen