Matteo Contolini
Panasonic
Publications
Featured research published by Matteo Contolini.
International Conference on Acoustics, Speech, and Signal Processing | 1999
Roland Kuhn; Patrick Nguyen; Jean-Claude Junqua; Robert Boman; Nancy Niedzielski; Steven Fincke; Kenneth L. Field; Matteo Contolini
Previously, we presented a radically new class of fast adaptation techniques for speech recognition, based on prior knowledge of speaker variation. To obtain this prior knowledge, one applies a dimensionality reduction technique to T vectors of dimension D derived from T speaker-dependent (SD) models. This offline step yields T basis vectors, the eigenvoices. We constrain the model for new speaker S to be located in the space spanned by the first K eigenvoices. Speaker adaptation involves estimating K eigenvoice coefficients for the new speaker; typically, K is very small compared to the original dimension D. Here, we review how to find the eigenvoices, give a maximum-likelihood estimator for the new speaker's eigenvoice coefficients, and summarize mean adaptation experiments carried out on the Isolet database. We present new results which assess the impact on performance of changes in training of the SD models. Finally, we interpret the first few eigenvoices obtained.
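The two steps described in this abstract (an offline dimensionality reduction over T speaker-dependent vectors, then estimation of K coefficients for a new speaker) can be illustrated with a minimal sketch. It assumes PCA via singular value decomposition as the dimensionality reduction technique and treats the training mean as a fixed offset. It also swaps the paper's maximum-likelihood estimator for a plain least-squares projection, which is a simplification and not the authors' method. The function names `find_eigenvoices`, `estimate_coefficients`, and `adapted_model` are hypothetical.

```python
import numpy as np


def find_eigenvoices(supervectors, k):
    """Offline step: derive the first k basis vectors from T speaker vectors.

    supervectors: array of shape (T, D), one row per speaker-dependent model.
    Returns the mean vector (D,) and the first k eigenvoices (k, D).
    Assumption: PCA via SVD stands in for the dimensionality reduction.
    """
    mean = supervectors.mean(axis=0)
    centered = supervectors - mean
    # Rows of vt are orthonormal principal directions, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:k]


def estimate_coefficients(observed, mean, eigenvoices):
    """Adaptation step: estimate k coefficients for a new speaker.

    Simplification: least-squares projection onto the eigenvoice space,
    not the maximum-likelihood estimator given in the paper.
    """
    return eigenvoices @ (observed - mean)


def adapted_model(mean, eigenvoices, coeffs):
    """Rebuild the D-dimensional model from the k estimated coefficients."""
    return mean + coeffs @ eigenvoices


# Toy data: T = 10 speakers, D = 50 parameters, K = 3 eigenvoices.
rng = np.random.default_rng(0)
training = rng.normal(size=(10, 50))
mean, voices = find_eigenvoices(training, k=3)

# A new speaker that lies exactly in the eigenvoice space.
new_speaker = mean + 2.0 * voices[0] - 1.0 * voices[1]
coeffs = estimate_coefficients(new_speaker, mean, voices)
rebuilt = adapted_model(mean, voices, coeffs)
```

The point of the constraint is visible in the shapes: the new speaker is described by 3 numbers instead of 50, so far less adaptation data is needed to estimate the model.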
Journal of the Acoustical Society of America | 2004
Roland Kuhn; Matteo Contolini; Robert Boman
The call screener employs a telephone system interface connected between a telephone network and a telephone device of a user. The interface selectively routes calls (and refrains from routing calls) based on the results from the dialogue system. The dialogue system elicits speech from an incoming caller and causes the telephone system interface to route calls from the incoming caller based on a comparison of the elicited speech with a set of stored speaker models. The stored speaker models may be maintained automatically by the system, using either a passive mode, in which callers whose calls exceed a predetermined duration are assumed to be “acceptable,” or a proactive mode, in which the system prompts the user at the end of the call to elect whether to save the speech models developed during that call in the acceptable user database. If desired, the user can attach other attributes or special tags to the stored models, indicating special handling or call routing rules to be applied when that caller calls again.
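The routing and model-maintenance logic described in this abstract can be sketched roughly as follows. Everything here is an illustrative assumption: the class name `CallScreener`, the threshold values, and the use of cosine similarity over plain feature vectors as a stand-in for comparing elicited speech against stored speaker models. The abstract does not specify any of these details.

```python
import math
from dataclasses import dataclass, field


@dataclass
class CallScreener:
    accept_threshold: float = 0.8   # hypothetical similarity cutoff
    min_duration_s: float = 60.0    # hypothetical "predetermined duration"
    models: dict = field(default_factory=dict)  # caller tag -> feature vector

    @staticmethod
    def _similarity(a, b):
        # Cosine similarity, used here only as a placeholder scoring rule.
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    def should_route(self, features):
        """Route the call only if the speech matches a stored model."""
        return any(
            self._similarity(features, model) >= self.accept_threshold
            for model in self.models.values()
        )

    def end_of_call(self, tag, features, duration_s, user_accepts=None):
        """Decide whether to store the caller's model after the call.

        Passive mode (user_accepts is None): keep the model when the call
        lasted longer than min_duration_s.
        Proactive mode: keep the model only if the user said so.
        """
        if user_accepts is None:
            keep = duration_s > self.min_duration_s
        else:
            keep = user_accepts
        if keep:
            self.models[tag] = list(features)
        return keep


screener = CallScreener()
unknown_routed = screener.should_route([1.0, 0.0])           # no models yet
saved_passive = screener.end_of_call("alice", [1.0, 0.0], 120.0)
known_routed = screener.should_route([0.9, 0.1])             # close to "alice"
stranger_routed = screener.should_route([0.0, 1.0])
saved_short = screener.end_of_call("bob", [0.0, 1.0], 10.0)
saved_proactive = screener.end_of_call("bob", [0.0, 1.0], 10.0, user_accepts=True)
```

The tag used as the dictionary key is where per-caller attributes or routing rules could be attached in a fuller version.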
ACM Transactions on Speech and Language Processing | 1998
Roland Kuhn; Patrick Nguyen; Jean-Claude Junqua; Lloyd Goldwasser; Nancy Niedzielski; Steven Fincke; Ken Field; Matteo Contolini
Archive | 1999
Matteo Contolini; Jean-Claude Junqua; Roland Kuhn
Journal of the Acoustical Society of America | 1998
Roland Kuhn; Jean-Claude Junqua; Matteo Contolini
Archive | 2000
Jean-Claude Junqua; Matteo Contolini
Archive | 2004
Matteo Contolini; Jean-Claude Junqua
International Conference on Acoustics, Speech, and Signal Processing | 1999
Roland Kuhn; Patrick Nguyen; Jean-Claude Junqua; Robert Boman; Nancy Niedzielski; Steven Fincke; Ken Field; Matteo Contolini
Journal of the Acoustical Society of America | 1999
Jean-Claude Junqua; Roland Kuhn; Matteo Contolini; Murat Karaorman; Ken Field; Michael Galler; Yi Zhao
Archive | 1998
Matteo Contolini; Jean-Claude Junqua; Roland Kuhn