Publication


Featured research published by John Kominek.


ACM Multimedia | 1999

Multimodal people ID for a multimedia meeting browser

Jie Yang; Xiaojin Zhu; Ralph Gross; John Kominek; Yue Pan; Alex Waibel

A meeting browser is a system that allows users to review a multimedia meeting record from a variety of indexing methods. Identification of meeting participants is essential for creating such a multimedia meeting record. Moreover, knowing who is speaking can enhance the performance of speech recognition and indexing meeting transcription. In this paper, we present an approach that identifies meeting participants by fusing multimodal inputs. We use face ID, speaker ID, color appearance ID, and sound source directional ID to identify and track meeting participants. After describing the different modules in detail, we will discuss a framework for combining the information sources. Integration of the multimodal people ID into the multimedia meeting browser is in its preliminary stage.


Language and Technology Conference | 2006

Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies

John Kominek; Alan W. Black

The speed with which pronunciation dictionaries can be bootstrapped depends on the efficiency of learning algorithms and on the ordering of words presented to the user. This paper presents an active-learning word selection strategy that is mindful of human limitations. Learning rates approach that of an oracle system that knows the final LTS rule set.


International Conference on Acoustics, Speech, and Signal Processing | 2009

Optimizing segment label boundaries for statistical speech synthesis

Alan W. Black; John Kominek

This paper introduces a new optimization technique for moving segment labels (phone and subphonetic) to optimize statistical parametric speech synthesis models. The choice of objective measures is investigated thoroughly, and listening tests show that the results significantly improve the quality of the generated speech, equivalent to increasing the database size threefold.


SSW | 2004

The CMU Arctic speech databases

John Kominek; Alan W. Black


SSW | 2004

Impact of durational outlier removal from unit selection catalogs

John Kominek; Alan W. Black


Archive | 2003

The CMU ARCTIC speech databases for speech synthesis research

John Kominek; Alan W. Black


Conference of the International Speech Communication Association | 2003

Evaluating and Correcting Phoneme Segmentation for Unit Selection Synthesis

John Kominek; Christina L. Bennett; Alan W. Black


Conference of the International Speech Communication Association | 2007

SPICE: Web-based Tools for Rapid Language Adaptation in Speech Processing Systems

Tanja Schultz; Alan W. Black; Sameer Badaskar; Matthew Hornyak; John Kominek


Workshop on Spoken Language Technologies for Under-Resourced Languages | 2008

Synthesizer Voice Quality of New Languages Calibrated with Mean Mel Cepstral Distortion

John Kominek; Tanja Schultz; Alan W. Black


Archive | 2007

CMU Blizzard 2007: A Hybrid Acoustic Unit Selection System from Statistically Predicted Parameters

Alan W. Black; Christina L. Bennett; Benjamin C. Blanchard; John Kominek; Brian Langner; Kishore Prahallad; Arthur R. Toth

Collaboration


Dive into John Kominek's collaborations.

Top Co-Authors

Alan W. Black

Carnegie Mellon University

Arthur R. Toth

Carnegie Mellon University

Brian Langner

Carnegie Mellon University

Sameer Badaskar

Carnegie Mellon University

Kishore Prahallad

International Institute of Information Technology

Gregory Aist

Carnegie Mellon University

Jack Mostow

Carnegie Mellon University
