Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Rahul Kapoor is active.

Publication


Featured researches published by Rahul Kapoor.


international conference on management of data | 2005

Data cleaning in microsoft SQL server 2005

Surajit Chaudhuri; Kris Ganjam; Venkatesh Ganti; Rahul Kapoor; Vivek R. Narasayya; Theo Vassilakis

When collecting and combining data from various sources into a data warehouse, ensuring high data quality and consistency becomes a significant, often expensive, challenge. Common data quality problems include inconsistent data conventions amongst sources such as different abbreviations or synonyms; data entry errors such as spelling mistakes; missing, incomplete, outdated or otherwise incorrect attribute values. These data defects generally manifest themselves as foreign-key mismatches and approximately duplicate records, both of which make further data mining and decision support analyses either impossible or suspect. We demonstrate two new data cleansing operators, Fuzzy Lookup and Fuzzy Grouping, which address these problems in a scalable and domain-independent manner. These operators are implemented within Microsoft SQL Server 2005 Integration Services. Our demo will explain their functionality and highlight multiple real-world scenarios in which they can be used to achieve high data quality.


Archive | 2003

Duplicate data elimination system

Rahul Kapoor; Venkatesh Ganti; Surajit Chaudhuri


Archive | 2007

Techniques to manage event notifications

LiHui V Xu; Satish R. Thatte; Rahul Kapoor; Rolando Jimenez Salgado; Todd J. Abel; Anuj Bansal


Archive | 2004

Method for efficient query execution using dynamic queries in database environments

Rahul Kapoor; Nigel R. Ellis; Prakash Sundaresan


Archive | 2006

Single virtual client for multiple client access and equivalency

Rahul Kapoor; Rolando Jimenez Salgado; Satish R. Thatte; Yi Mao; Ricard Roma i Dalfó; Anuj Bansal; Saji Varkey


Archive | 2007

Techniques for a web services data access layer

Ricard Roma i Dalfó; Constantin Stanciu; Rolando Jimenez Salgado; Satish R. Thatte; Sundar Paranthaman; Rahul Kapoor


Archive | 2005

Template-driven approach to extract, transform, and/or load

Rahul Kapoor; Sandhya D. Jain


Archive | 2006

Detecting and managing changes in business data integration solutions

Burra Gopal; Oleg Gregory Ovanesyan; Rahul Kapoor; Parul Manek; Sandhya D. Jain; Muthiah K. Annamalai; Sharon E. Edelstein; Peiwei Cao; Alexandru Croicu


Archive | 2005

Fuzzy lookup table maintenance

Rahul Kapoor; Theodore Vassilakis


Archive | 2006

Versioning and concurrency control for multiple client access of data

Rahul Kapoor; Rolando Jimenez Salgado; Kaushik Raj; Satish R. Thatte; Xiaoyu Wu

Collaboration


Dive into the Rahul Kapoor's collaboration.

Researchain Logo
Decentralizing Knowledge