Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Diamantino Antonio Caseiro is active.

Publication


Featured researches published by Diamantino Antonio Caseiro.


international conference on acoustics, speech, and signal processing | 2010

Use of geographical meta-data in ASR language and acoustic models

Enrico Bocchieri; Diamantino Antonio Caseiro

The query distribution, in the speech recognition applications of directory assistance (DA) and voice-search, depends on the customers location. This motivates the research on query models conditioned on the user location, here denoted as local models. We describe and test our methods for the estimation of local models with various degrees of spacial “granularity”, for the recognition of city-state (sub-task of DA) and for the recognition of business listings, spoken over iPhones in a nation-wide business-listing voice-search service. Our local language models improve the accuracy of city-state by 2.4% absolute (32% relative error reduction), and of voice-search by 2.2% (7% relative).


international conference on acoustics, speech, and signal processing | 2011

Speech recognition modeling advances for mobile voice search

Enrico Bocchieri; Diamantino Antonio Caseiro; Dimitrios Dimitriadis

This paper reports on the development and advances in automatic speech recognition for the AT&T Speak4it® voice-search application. With Speak4it as real-life example, we show the effectiveness of acoustic model (AM) and language model (LM) estimation (adaptation and training) on relatively small amounts of application field-data. We then introduce algorithmic improvements concerning the use of sentence length in LM, of non-contextual features in AM decision-trees, and of the Teager energy in the acoustic front-end. The combination of these algorithms, integrated into the AT&T Watson recognizer, yields substantial accuracy improvements. LM and AM estimation on field-data samples increases the word accuracy from 66.4% to 77.1%, a relative word error reduction of 32%. The algorithmic improvements increase the accuracy to 79.7%, an additional 11.3% relative error reduction.


international conference on acoustics, speech, and signal processing | 2011

An alternative front-end for the AT&T WATSON LV-CSR system

Dimitrios Dimitriadis; Enrico Bocchieri; Diamantino Antonio Caseiro

In previously published work, we have proposed a novel feature extraction algorithm, based on the Teager-Kaiser energy estimates, that approximates human auditory characteristics and that is more robust to sub-band noise than the mean-square estimates of standard MFCCs. We refer to the novel features as Teager energy cepstrum coefficients (TECC). Herein, we study the TECC performance under additive noise and suggest how to predict the noisy TECC deviations by estimating the subband SNR values. Then, we report on the effectiveness of the TECCs when they are used in the acoustic front-end of the state-of-the-art AT&T WATSON large-vocabulary recognizer. The TECC front-end is tested in the real-life voice-search Speak4it application for mobile devices. It provides a 6% relative word error rate reduction w.r.t. the MFCC front-end, using the same high performance language model, lexicon and acoustic model training.


Archive | 2009

Systems and Methods for Creating and Using Geo-Centric Language Models

Amanda Stent; Diamantino Antonio Caseiro; Ilija Zeljkovic; Jay Gordon Wilpon


Archive | 2009

System and method for combining geographic metadata in automatic speech recognition language and acoustic models

Enrico Bocchieri; Diamantino Antonio Caseiro


Archive | 2009

SYSTEM AND METHOD FOR HANDLING REPEAT QUERIES DUE TO WRONG ASR OUTPUT

Andrej Ljolje; Diamantino Antonio Caseiro


Archive | 2011

System and method for optimizing speech recognition and natural language parameters with user feedback

Andrej Ljolje; Diamantino Antonio Caseiro; Mazin Gilbert Gilbert; Vincent Goffin; Taniya Mishra


Archive | 2011

SYSTEM AND METHOD FOR SPEECH RECOGNITION MODELING FOR MOBILE VOICE SEARCH

Enrico Bocchieri; Diamantino Antonio Caseiro; Dimitrios Dimitriadis


Archive | 2013

System and method for selecting network-based versus embedded speech processing

Benjamin J. Stern; Enrico Bocchieri; Diamantino Antonio Caseiro; Danilo Giulianelli; Ladan Golipour


conference of the international speech communication association | 2011

SpeechForms: From Web to Speech and Back.

Luciano De Andrade Barbosa; Diamantino Antonio Caseiro; Giuseppe Di Fabbrizio

Researchain Logo
Decentralizing Knowledge