Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hernan Laffitte is active.

Publication


Featured researches published by Hernan Laffitte.


international conference on big data | 2016

Extensive large-scale study of error surfaces in sampling-based distinct value estimators for databases

Vinay Deolalikar; Hernan Laffitte

The problem of distinct value estimation has many applications. Being a critical component of query optimizers in databases, it also has high commercial impact. Many distinct value estimators have been proposed, using various statistical approaches. However, characterizing the errors incurred by these estimators is an open problem: existing analytical approaches are not powerful enough, and extensive empirical studies at large scale do not exist. We conduct an extensive large-scale empirical study of 11 distinct value estimators from four different approaches to the problem over families of Zipfian distributions whose parameters model real-world applications. Our study is the first that scales to the size of a billion-rows that todays large commercial databases have to operate in. This allows us to characterize the error that is encountered in real-world applications of distinct value estimation. By mining the generated data, we show that estimator error depends on a key latent parameter — the average uniform class size — that has not been studied previously. This parameter also allows us to unearth error patterns that were previously unknown. Importantly, ours is the first approach that provides a framework for visualizing the error patterns in distinct value estimation, facilitating discussion of this problem in enterprise settings. Our characterization of errors can be used for several problems in distinct value estimation, such as the design of hybrid estimators. This work aims at the practitioner and the researcher alike, and addresses questions frequently asked by both audiences.


file and storage technologies | 2009

Provenance as data mining: combining file system metadata with content analysis

Vinay Deolalikar; Hernan Laffitte


Archive | 2010

SYSTEM AND METHOD FOR IDENTIFYING FRESH INFORMATION IN A DOCUMENT SET

Vinay Deolalikar; Hernan Laffitte


Archive | 2010

System and method for displaying documents

Vinay Deolalikar; Alistair Veitch; Hernan Laffitte; Ixai Lanzagorta Ochoa; Charles B. Morrey


Archive | 2011

INDICATING DOCUMENTS IN A THREAD REACHING A THRESHOLD

Vinay Deolalikar; Hernan Laffitte


Archive | 2010

SYSTEM AND METHOD FOR DETERMINING THE PROVENANCE OF A DOCUMENT

Vinay Deolalikar; Hernan Laffitte


Archive | 2010

SYSTEM AND METHOD FOR IDENTIFYING THE PRINCIPAL DOCUMENTS IN A DOCUMENT SET

Vinay Deolalikar; Hernan Laffitte


Archive | 2013

DETERMINING TOPIC RELEVANCE OF AN EMAIL THREAD

Vinay Deolalikar; Hernan Laffitte


Archive | 2012

ADAPTIVE HIERARCHICAL CLUSTERING ALGORITHM

Vinay Deolalikar; Hernan Laffitte


Archive | 2014

Virtual node deployments of cluster-based applications

Jun Li; Hernan Laffitte; Donald E Bollinger; Eric Wu

Collaboration


Dive into the Hernan Laffitte's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Fei Chen

University of Wisconsin-Madison

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge