Pratibha Jain
DuPont
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Pratibha Jain.
international conference on acoustics, speech, and signal processing | 2000
Sangita Sharma; Daniel P. W. Ellis; Sachin S. Kajarekar; Pratibha Jain; Hynek Hermansky
We evaluate the performance of several feature sets on the Aurora task as defined by ETSI. We show that after a non-linear transformation, a number of features can be effectively used in a HMM-based recognition system. The non-linear transformation is computed using a neural network which is discriminatively trained on the phonetically labeled (forcibly aligned) training data. A combination of the non-linearly transformed PLP (perceptive linear predictive coefficients), MSG (modulation filtered spectrogram) and TRAP (temporal pattern) features yields a 63% improvement in error rate as compared to baseline me frequency cepstral coefficients features. The use of the non-linearly transformed RASTA-like features, with system parameters scaled down to take into account the ETSI imposed memory and latency constraints, still yields a 40% improvement in error rate.
Journal of the Acoustical Society of America | 2007
Harinath Garudadri; Sunil Sivadas; Hynek Hermansky; Nelson Morgan; Chuck Wooters; André Gustavo Adami; Maria Carmen Benitez Ortuzar; Lukas Burget; Stephane N. Dupont; Frantisek Grezl; Pratibha Jain; Sachin S. Kajarekar; Petr Motlicek
A distributed voice recognition system and method for obtaining acoustic features and speech activity at multiple frequencies by extracting high frequency components thereof on a device, such as a subscriber station and transmitting them to a network server having multiple stream processing capability, including cepstral feature processing, MLP nonlinear transformation processing, and multiband temporal pattern architecture processing. The features received at the network server are processed using all three streams, wherein each of the three streams provide benefits not available in the other two, thereby enhancing feature interpretation. Feature extraction and feature interpretation may operate at multiple frequencies, including but not limited to 8 kHz, 11 kHz, and 16 kHz.
conference of the international speech communication association | 2002
André Gustavo Adami; Lukas Burget; Stéphane Dupont; Harinath Garudadri; Frantisek Grezl; Hynek Hermansky; Pratibha Jain; Sachin S. Kajarekar; Nelson Morgan; Sunil Sivadas
Archive | 2002
Harinath Garudadri; Hynek Hermansky; Lukas Burget; Pratibha Jain; Sachin S. Kajarekar; Sunil Sivadas; Stéphane Dupont; Maria Carmen Benitez Ortuzar; Nelson Morgan
Archive | 2002
Harinath Garudadri; Sunil Sivadas; Hynek Hermansky; Nelson Morgan; Chuck Wooters; André Gustavo Adami; Maria Carmen Benitez Ortuzar; Lukas Burget; Stéphane Dupont; Frantisek Grezl; Pratibha Jain; Sachin S. Kajarekar; Petr Motlicek
conference of the international speech communication association | 2001
M. Carmen Benítez; Lukas Burget; Barry Y. Chen; Stéphane Dupont; Harinath Garudadri; Hynek Hermansky; Pratibha Jain; Sachin S. Kajarekar; Nelson Morgan; Sunil Sivadas
international conference on acoustics, speech, and signal processing | 1999
Hynek Hermansky; Sangita Sharma; Pratibha Jain
conference of the international speech communication association | 2003
Pratibha Jain; Hynek Hermansky
conference of the international speech communication association | 2002
Pratibha Jain; Hynek Hermansky; Brian Kingsbury
conference of the international speech communication association | 2002
Brian Kingsbury; Pratibha Jain; André Gustavo Adami