Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Boulos Harb is active.

Publication


Featured researches published by Boulos Harb.


spoken language technology workshop | 2010

Query language modeling for voice search

Ciprian Chelba; Johan Schalkwyk; Thorsten Brants; Vida Ha; Boulos Harb; Will Neveitt; Carolina Parada; Peng Xu

The paper presents an empirical exploration of google.com query stream language modeling. We describe the normalization of the typed query stream resulting in out-of-vocabulary (OoV) rates below 1% for a one million word vocabulary. We present a comprehensive set of experiments that guided the design decisions for a voice search service. In the process we re-discovered a less known interaction between Kneser-Ney smoothing and entropy pruning, and found empirical evidence that hints at non-stationarity of the query stream, as well as strong dependence on various English locales—USA, Britain and Australia.


international conference on data engineering | 2014

Near neighbor join

Herald Kllapi; Boulos Harb; Cong Yu

An increasing number of Web applications such as friends recommendation depend on the ability to join objects at scale. The traditional approach taken is nearest neighbor join (also called similarity join), whose goal is to find, based on a given join function, the closest set of objects or all the objects within a distance threshold to each object in the input. The scalability of techniques utilizing this approach often depends on the characteristics of the objects and the join function. However, many real-world join functions are intricately engineered and constantly evolving, which makes the design of white-box methods that rely on understanding the join function impractical. Finding a technique that can join extremely large number of objects with complex join functions has always been a tough challenge. In this paper, we propose a practical alternative approach called near neighbor join that, although does not find the closest neighbors, finds close neighbors, and can do so at extremely large scale when the join functions are complex. In particular, we design and implement a super-scalable system we name SAJ that is capable of best-effort joining of billions of objects for complex functions. Extensive experimental analysis over real-world large datasets shows that SAJ is scalable and generates good results.


international conference on data engineering | 2013

Recent progress towards an ecosystem of structured data on the Web

Nitin Gupta; Alon Y. Halevy; Boulos Harb; Heidi Lam; Hongrae Lee; Jayant Madhavan; Fei Wu; Cong Yu

Google Fusion Tables aims to support an ecosystem of structured data on the Web by providing a tool for managing and visualizing data on the one hand, and for searching and exploring for data on the other. This paper describes a few recent developments in our efforts to further the ecosystem.


international conference on acoustics, speech, and signal processing | 2012

Mobile music modeling, analysis and recognition

Pavel Golik; Boulos Harb; Ananya Misra; Michael Riley; Alex Rudnick; Eugene Weinstein

We present an analysis of music modeling and recognition techniques in the context of mobile music matching, substantially improving on the techniques presented in [1]. We accomplish this by adapting the features specifically to this task, and by introducing new modeling techniques that enable using a corpus of noisy and channel-distorted data to improve mobile music recognition quality. We report the results of an extensive empirical investigation of the systems robustness under realistic channel effects and distortions. We show an improvement of recognition accuracy by explicit duration modeling of music phonemes and by integrating the expected noise environment into the training process. Finally, we propose the use of frame-to-phoneme alignment for high-level structure analysis of polyphonic music.


conference on innovative data systems research | 2015

Applying WebTables in Practice

Sreeram Balakrishnan; Alon Y. Halevy; Boulos Harb; Hongrae Lee; Jayant Madhavan; Afshin Rostamizadeh; Warren Shen; Kenneth Wilder; Fei Wu; Cong Yu


conference of the international speech communication association | 2009

Back-off language model compression

Boulos Harb; Ciprian Chelba; Jeffrey A. Dean; Sanjay Ghemawat


Archive | 2009

Compounded Text Segmentation

Carolina Parada; Boulos Harb; Johan Schalkwyk


very large data bases | 2015

Efficient evaluation of object-centric exploration queries for visualization

You Wu; Boulos Harb; Jun Yang; Cong Yu


Archive | 2011

Managing information about entities using clusters of received observations

Joseph Janos; Alan C. Strohm; Steven M. Stern; Arnaud Sahuguet; Ademir de Alvarenga Oliveira; Boulos Harb


Archive | 2011

Managing information about entities using observations generated from user modified values

Joseph Janos; Alan C. Strohm; Boulos Harb; Steven M. Stern; Arnaud Sahuguet; Ademir de Alvarenga Oliveira

Collaboration


Dive into the Boulos Harb's collaboration.

Researchain Logo
Decentralizing Knowledge