Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Gopala Krishna Anumanchipalli is active.

Publication


Featured researches published by Gopala Krishna Anumanchipalli.


international conference on acoustics, speech, and signal processing | 2008

Significance of early tagged contextual graphemes in grapheme based speech synthesis and recognition systems

Gopala Krishna Anumanchipalli; Kishore Prahallad; Alan W. Black

In this paper we present our argument that context information could be used in early stages i.e., during the definition of mapping of the words into sequence of graphemes. We show that the early tagged contextual graphemes play a significant role in improving the performance of grapheme based speech synthesis and speech recognition systems.


spoken language technology workshop | 2012

Intent transfer in speech-to-speech machine translation

Gopala Krishna Anumanchipalli; Luís C. Oliveira; Alan W. Black

This paper presents an approach for transfer of speaker intent in speech-to-speech machine translation (S2SMT). Specifically, we describe techniques to retain the prominence patterns of the source language utterance through the translation pipeline and impose this information during speech synthesis in the target language. We first present an analysis of word focus across languages to motivate the problem of transfer. We then propose an approach for training an appropriate transfer function for intonation on a parallel speech corpus in the two languages within which the translation is carried out. We present our analysis and experiments on English↔Portuguese and English↔German language pairs and evaluate the proposed transformation techniques through objective measures.


Journal of the Acoustical Society of America | 2013

Data-driven intonational phonology

Gopala Krishna Anumanchipalli; Alan W. Black; Luís C. Oliveira

Intonational Phonology deals with the systematic way in which speakers effectively use pitch to add appropriate emphasis to the underlying string of words in an utterance. Two widely discussed aspects of pitch are the pitch accents and boundary events. These provide an insight into the sentence type, speaker attitude, linguistic background, and other aspects of prosodic form. The main hurdle, however, is the difficulty in getting annotations of these attributes in “real” speech. Besides being language independent, these attributes are known to be subjective and prone to high inter-annotator disagreements. Our investigations aim to automatically derive phonological aspects of intonation from large speech databases. Recurring and salient patterns in the pitch contours, observed jointly with an underlying linguistic context are automatically detected. Our computational framework unifies complementary paradigms such as the physiological Fujisaki model, Autosegmental Metrical phonology, and elegant pitch styli...


international conference on acoustics, speech, and signal processing | 2013

A style capturing approach to F0 transformation in voice conversion

Gopala Krishna Anumanchipalli; Luís C. Oliveira; Alan W. Black

In this paper, we present a new approach to F0 transformation, that can capture aspects of speaking style. Instead of using the traditional 5ms frames as units in transformation, we propose a method that looks at longer phonological regions such as metrical feet. We automatically detect metrical feet in the source speech, and for each of source speakers feet, we find its phonological correspondence in target speech. We use a statistical phrase accent model to represent the F0 contour, where a 4-dimensional TILT representation is used for the F0 is parameterized over each feet region for the source and target speakers. This forms the parallel data that is the training data for our transformation. We transform the phrase component using simple z-score mapping. We use a joint density Gaussian mixture model to transform the accent contours. Our transformation method generates F0 contours that are significantly more correlated with the target speech than a baseline, frame-based method.


conference of the international speech communication association | 2011

A Statistical Phrase/Accent Model for Intonation Modeling.

Gopala Krishna Anumanchipalli; Luís C. Oliveira; Alan W. Black


conference of the international speech communication association | 2013

Analysis and Modeling of "Focus" in Context

Dirk Hovy; Gopala Krishna Anumanchipalli; Alok Parlikar; Caroline Vaughn; Adam C. Lammert; Eduard H. Hovy; Alan W. Black


SSW | 2013

Text to Speech in New Languages without a Standardized Orthography

Sunayana Sitaram; Gopala Krishna Anumanchipalli; Justin Chiu; Alok Parlikar; Alan W. Black


Archive | 2013

Intra-Lingual and Cross-Lingual Prosody Modelling

Gopala Krishna Anumanchipalli


SSW | 2010

Improving speech synthesis for noisy environments.

Gopala Krishna Anumanchipalli; Prasanna Kumar Muthukumar; Udhyakumar Nallasamy; Alok Parlikar; Alan W. Black; Brian Langner


SSW | 2010

KLATTSTAT: knowledge-based parametric speech synthesis.

Gopala Krishna Anumanchipalli; Ying-Chang Cheng; Joseph Fernandez; Xiaohan Huang; Qi Mao; Alan W. Black

Collaboration


Dive into the Gopala Krishna Anumanchipalli's collaboration.

Top Co-Authors

Avatar

Alan W. Black

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Alok Parlikar

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Sunayana Sitaram

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Adam C. Lammert

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Brian Langner

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Caroline Vaughn

Carnegie Mellon University

View shared research outputs
Top Co-Authors

Avatar

Dirk Hovy

University of Southern California

View shared research outputs
Top Co-Authors

Avatar

Eduard H. Hovy

Carnegie Mellon University

View shared research outputs
Researchain Logo
Decentralizing Knowledge