Victor Popa
Nokia
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Victor Popa.
international conference on acoustics, speech, and signal processing | 2012
Victor Popa; Hanna Silén; Jani Nurminen; Moncef Gabbouj
Many popular approaches to spectral conversion involve linear transformations determined for particular acoustic classes and compute the converted result as a linear combination between different local transformations in an attempt to ensure a continuous conversion. These methods often produce over-smoothed spectra and parameter tracks. The proposed method computes an individual linear transformation for every feature vector based on a small neighborhood in the acoustic space thus preserving local details. The method effectively reduces the over-smoothing by eliminating undesired contributions from acoustically remote regions. The method is evaluated in listening tests against the well-known Gaussian Mixture Model based conversion, representative of the class of methods involving linear transformations. Perceptual results indicate a clear preference for the proposed scheme.
Journal of Signal and Information Processing | 2011
Victor Popa; Jani Nurminen; Moncef Gabbouj
This paper presents a voice conversion technique based on bilinear models and introduces the concept of contextual modeling. The bilinear approach reformulates the spectral envelope representation from line spectral frequencies feature to a two-factor parameterization corresponding to speaker identity and phonetic information, the so-called style and content factors. This decomposition offers a flexible representation suitable for voice conversion and facilitates the use of efficient training algorithms based on singular value decomposition. In a contextual approach (bilinear) models are trained on subsets of the training data selected on the fly at conversion time depending on the characteristics of the feature vector to be converted. The performance of bilinear models and context modeling is evaluated in objective and perceptual tests by comparison with the popular GMM-based voice conversion method for several sizes and different types of training data.
Archive | 2007
Jani Nurminen; Victor Popa; Jilei Tian
Archive | 2006
Jani Nurminen; Victor Popa; Jilei Tian; Yuezhong Tang; Imre Kiss
conference of the international speech communication association | 2009
Victor Popa; Jani Nurminen; Moncef Gabbouj
Archive | 2006
Jilei Tian; Jani Nurminen; Victor Popa
Archive | 2006
Victor Popa; Jani Nurminen; Jilei Tian
Archive | 2007
Jilei Tian; Victor Popa; Jani Nurminen
Archive | 2007
Jani Nurminen; Victor Popa; Elina Helander; Jilie Tian
Archive | 2006
Jilei Tian; Jani Nurminen; Victor Popa