Publication


Featured research published by Yuheng Hu.


International Conference on Big Data | 2014

BayesWipe: A multimodal system for data cleaning and consistent query answering on structured bigdata

Sushovan De; Yuheng Hu; Yi Chen; Subbarao Kambhampati

Recent efforts in data cleaning of structured data have focused exclusively on problems like data deduplication, record matching, and data standardization; none of these focus on fixing incorrect attribute values in tuples. Correcting values in tuples is typically performed by a minimum cost repair of tuples that violate static constraints like conditional functional dependencies (CFDs), which have to be provided by domain experts or learned from a clean sample of the database. In this paper, we provide a method for correcting individual attribute values in a structured database using a Bayesian generative model and a statistical error model learned from the noisy database directly. We thus avoid the necessity for a domain expert or clean master data. We also show how to efficiently perform consistent query answering using this model over a dirty database, in case write permissions to the database are unavailable. We evaluate our methods over both synthetic and real data.


Journal of Data and Information Quality | 2016

BayesWipe: A Scalable Probabilistic Framework for Improving Data Quality

Sushovan De; Yuheng Hu; Venkata Vamsikrishna Meduri; Yi Chen; Subbarao Kambhampati

Recent efforts in data cleaning of structured data have focused exclusively on problems like data deduplication, record matching, and data standardization; none of the approaches addressing these problems focus on fixing incorrect attribute values in tuples. Correcting values in tuples is typically performed by a minimum cost repair of tuples that violate static constraints like Conditional Functional Dependencies (which have to be provided by domain experts or learned from a clean sample of the database). In this article, we provide a method for correcting individual attribute values in a structured database using a Bayesian generative model and a statistical error model learned from the noisy database directly. We thus avoid the necessity for a domain expert or clean master data. We also show how to efficiently perform consistent query answering using this model over a dirty database, in case write permissions to the database are unavailable. We evaluate our methods over both synthetic and real data.


International Conference on Weblogs and Social Media | 2014

What we Instagram: A first analysis of Instagram photo content and user types

Yuheng Hu; Lydia Manikonda; Subbarao Kambhampati


National Conference on Artificial Intelligence | 2012

ET-LDA: joint topic modeling for aligning events and their Twitter feedback

Yuheng Hu; Ajita John; Fei Wang; Subbarao Kambhampati


International Conference on Weblogs and Social Media | 2012

What Were the Tweets About? Topical Associations between Public Events and Twitter Feeds

Yuheng Hu; Ajita John; Doree Duncan Seligmann; Fei Wang


International Conference on Weblogs and Social Media | 2013

Dude, srsly?: The Surprisingly Formal Nature of Twitter's Language

Yuheng Hu; Kartik Talamadupula; Subbarao Kambhampati


International Joint Conference on Artificial Intelligence | 2013

Listening to the crowd: automated analysis of events via aggregated Twitter sentiment

Yuheng Hu; Fei Wang; Subbarao Kambhampati


arXiv: Social and Information Networks | 2014

Analyzing User Activities, Demographics, Social Network Structure and User-Generated Content on Instagram

Lydia Manikonda; Yuheng Hu; Subbarao Kambhampati


National Conference on Artificial Intelligence | 2013

Herding the Crowd: Automated Planning for Crowdsourced Planning

Kartik Talamadupula; Subbarao Kambhampati; Yuheng Hu; Tuan Anh Nguyen; Hankz Hankui Zhuo


arXiv: Learning | 2012

ET-LDA: Joint Topic Modeling For Aligning, Analyzing and Sensemaking of Public Events and Their Twitter Feeds

Yuheng Hu; Ajita John; Fei Wang; Doree Duncan Seligmann; Subbarao Kambhampati

Collaboration


Dive into Yuheng Hu's collaborations.

Top Co-Authors

Sushovan De

Arizona State University

Yi Chen

New Jersey Institute of Technology
