Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Amir Hossein Harati Nejad Torbati is active.

Publication


Featured researches published by Amir Hossein Harati Nejad Torbati.


IEEE Transactions on Audio, Speech, and Language Processing | 2016

A doubly hierarchical Dirichlet process hidden Markov model with a non-ergodic structure

Amir Hossein Harati Nejad Torbati; Joseph Picone

Nonparametric Bayesian models use a Bayesian framework to learn model complexity automatically from the data, eliminating the need for a complex model selection process. A Hierarchical Dirichlet Process Hidden Markov Model (HDPHMM) is the nonparametric Bayesian equivalent of a hidden Markov model (HMM), but is restricted to an ergodic topology that uses a Dirichlet Process Model to achieve a mixture distribution-like model. For applications involving ordered sequences (e.g., speech recognition), it is desirable to impose a left-to-right structure on the model. In this paper, we introduce a model based on HDPHMM that: 1) shares data points between states, 2) models non-ergodic structures, and 3) models non-emitting states. The first point is particularly important because Gaussian mixture models, which support such sharing, have been very effective at modeling modalities in a signal (e.g., speaker variability). Further, sharing data points allows models to be estimated more accurately, an important consideration for applications such as speech recognition in which some mixture components occur infrequently. We demonstrate that this new model produces a 20% relative reduction in error rate for phoneme classification and an 18% relative reduction on a speech recognition task on the TIMIT Corpus compared to a baseline system consisting of a parametric HMM.


conference on information sciences and systems | 2014

A left-to-right HDP-HMM with HDPM emissions

Amir Hossein Harati Nejad Torbati; Joseph Picone; Marc Sobel

Nonparametric Bayesian models use a Bayesian framework to learn the model complexity automatically from the data and eliminate the need for a complex model selection process. The Hierarchical Dirichlet Process hidden Markov model (HDP-HMM) is the nonparametric Bayesian equivalent of an HMM. However, HDP-HMM is restricted to an ergodic topology and uses a Dirichlet Process Model (DPM) to achieve a mixture distribution-like model. For applications such as speech recognition, where we deal with ordered sequences, it is desirable to impose a left-to-right structure on the model to improve its ability to model the sequential nature of the speech signal. In this paper, we introduce three enhancements to HDP-HMM: (1) a left-to-right structure: needed for sequential decoding of speech, (2) non-emitting initial and final states: required for modeling finite length sequences, (3) HDP mixture emissions: allows sharing of data across states. The latter is particularly important for speech recognition because Gaussian mixture models have been very effective at modeling speaker variability. Further, due to the nature of language, some models occur infrequently and have a small number of data points associated with them, even for large corpora. Sharing allows these models to be estimated more accurately. We demonstrate that this new HDP-HMM model produces a 15% increase in likelihoods and a 15% relative reduction in error rate on a phoneme classification task based on the TIMIT Corpus.


international conference on acoustics, speech, and signal processing | 2012

Applications of Dirichlet Process Mixtures to speaker adaptation

Amir Hossein Harati Nejad Torbati; Joseph Picone; Marc Sobel

Balancing unique acoustic characteristics of a speaker such as identity and accent, with general acoustic behavior that describes phoneme identity, is one of the great challenges in applying nonparametric Bayesian approaches to speaker adaptation. The Dirichlet Process Mixture (DPM) is a relatively new model that provides an elegant framework in which individual characteristics can be balanced with aggregate behavior without diluting the quality of the individual models. Unlike Gaussian Mixture models (GMMs), which tend to smear multimodal behavior through averaging, the DPM model attempts to preserve unique behaviors through use of an infinite mixture model. In this paper, we present some exploratory research on applying these models to the acoustic modeling component of the speaker adaptation problem. DPM based models are shown to provide up to 10% reduction in WER over maximum likelihood linear regression (MLLR) on a speaker adaptation task based on the Resource Management database.


conference of the international speech communication association | 2016

A Nonparametric Bayesian Approach for Spoken Term Detection by Example Query.

Amir Hossein Harati Nejad Torbati; Joseph Picone

State of the art speech recognition systems use data-intensive context-dependent phonemes as acoustic units. However, these approaches do not translate well to low resourced languages where large amounts of training data is not available. For such languages, automatic discovery of acoustic units is critical. In this paper, we demonstrate the application of nonparametric Bayesian models to acoustic unit discovery. We show that the discovered units are correlated with phonemes and therefore are linguistically meaningful. We also present a spoken term detection (STD) by example query algorithm based on these automatically learned units. We show that our proposed system produces a P@N of 61.2% and an EER of 13.95% on the TIMIT dataset. The improvement in the EER is 5% while P@N is only slightly lower than the best reported system in the literature.


IEEE Sensors Journal | 2014

Characterization of Mammary Tumors Using Noninvasive Tactile and Hyperspectral Sensors

Amrita Sahu; Firdous Saleheen; Vira Oleksyuk; Cushla McGoverin; Nancy Pleshko; Amir Hossein Harati Nejad Torbati; Joseph Picone; Karin U. Sorenmo; Chang-Hee Won

The use of both tactile and hyperspectral imaging sensors, which exploit the mechanical and physiological changes in tissues, can significantly increase the performance in automatic identification of tumors with malignant histopathology. Tactile imaging measures the elastic modulus of a tumor, whereas hyperspectral imaging detects important biochemical markers. Spontaneous mammary tumors in canines were used to demonstrate the efficacy of our approach. The tactile sensor achieved a sensitivity of 50% and a specificity of 100% in identifying malignant tumors. The sensitivity and specificity of the hyperspectral sensor were 71% and 76%, respectively. We investigated several machine learning techniques for fusing the tactile and spectral data, which increased the sensitivity and specificity to 86% and 97%, respectively. Our tactile and hyperspectral imaging sensors are noninvasive and harmless (no ionized radiation is used). These imaging sensors may not only eliminate unnecessary surgeries, but will also motivate the development of similar sensors for human clinical use, due to the fact that canine and human tumors have similar physiology and biology.


spoken language technology workshop | 2016

A nonparametric Bayesian approach for automatic discovery of a lexicon and acoustic units

Amir Hossein Harati Nejad Torbati; Joseph Picone

State of the art speech recognition systems use context-dependent phonemes as acoustic units. However, these approaches do not work well for low resourced languages where large amounts of training data or resources such as a lexicon are not available. For such languages, automatic discovery of acoustic units can be important. In this paper, we demonstrate the application of nonparametric Bayesian models to acoustic unit discovery. We show that the discovered units are linguistically meaningful. We also present a semi-supervised learning algorithm that uses a nonparametric Bayesian model to learn a mapping between words and acoustic units. We demonstrate that a speech recognition system using these discovered resources can approach the performance of a speech recognizer trained using resources developed by experts. We show that unsupervised discovery of acoustic units combined with semi-supervised discovery of the lexicon achieved performance (9.8% WER) comparable to other published high complexity systems. This nonparametric approach enables the rapid development of speech recognition systems in low resourced languages.


conference of the international speech communication association | 2013

Speech acoustic unit segmentation using hierarchical dirichlet processes.

Amir Hossein Harati Nejad Torbati; Joseph Picone; Marc Sobel


International Journal of Speech Technology | 2014

Predicting search term reliability for spoken term detection systems

Amir Hossein Harati Nejad Torbati; Joseph Picone


ieee international multi disciplinary conference on cognitive methods in situation awareness and decision support | 2013

Assessing search term strength in spoken term detection

Amir Hossein Harati Nejad Torbati; Joseph Picone


arXiv: Learning | 2017

Automatic Analysis of EEGs Using Big Data and Hybrid Deep Learning Architectures.

Meysam Golmohammadi; Amir Hossein Harati Nejad Torbati; Silvia Lopez de Diego; Iyad Obeid; Joseph Picone

Collaboration


Dive into the Amir Hossein Harati Nejad Torbati's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Karin U. Sorenmo

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge