Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Umut Simsekli is active.

Publication


Featured researches published by Umut Simsekli.


IEEE Signal Processing Letters | 2015

Alpha-Stable Matrix Factorization

Umut Simsekli; Antoine Liutkus; Ali Taylan Cemgil

Matrix factorization (MF) models have been widely used in data analysis. Even though they have been shown to be useful in many applications, classical MF models often fall short when the observed data are impulsive and contain outliers. In this study, we present αMF, a MF model with α-stable observations. Stable distributions are a family of heavy-tailed distributions that is particularly suited for such impulsive data. We develop a Markov Chain Monte Carlo method, namely a Gibbs sampler, for making inference in the model. We evaluate our model on both synthetic and real audio applications. Our experiments on speech enhancement show that αMF yields superior performance to a popular audio processing model in terms of objective measures. Furthermore, αMF provides a theoretically sound justification for recent empirical results obtained in audio processing.


international conference on acoustics, speech, and signal processing | 2012

Score guided audio restoration via generalised coupled tensor factorisation

Umut Simsekli; Y. Kenan Yılmaz; A. Taylan Cemgil

Generalised coupled tensor factorisation is a recently proposed algorithmic framework for simultaneously estimating tensor factorisation models where several observed tensors can share a set of latent factors. This paper proposes a model in this framework for coupled factorisation of piano spectrograms and piano roll representations to solve audio interpolation and restoration problem. The model incorporates temporal and harmonic information from an approximate musical score (not necessarily belonging to the played piece), and spectral information from isolated piano sounds. The performance of the proposed approach is evaluated on the restoration of classical music pieces where we get about 5dB SNR improvement when 50% of data frames are missing.


workshop on applications of signal processing to audio and acoustics | 2011

Probabilistic latent tensor factorization framework for audio modeling

Ali Taylan Cemgil; Umut Simsekli; Yusuf Cem Sübakan

This paper introduces probabilistic latent tensor factorization (PLTF) as a general framework for hierarchical modeling of audio. This framework combines practical aspects of graphical modeling of machine learning with tensor factorization models. Once a model is constructed in the PLTF framework, the estimation algorithm is immediately available. We illustrate our approach using several popular models such as NMF or NMF2D and provide extensions with simulation results on real data for key audio processing tasks such as restoration and source separation.


international conference on acoustics, speech, and signal processing | 2014

Non-negative source-filter dynamical system for speech enhancement

Umut Simsekli; Jonathan Le Roux; John R. Hershey

Model-based speech enhancement methods, which rely on separately modeling the speech and the noise, have been shown to be powerful in many different problem settings. When the structure of the noise can be arbitrary, which is often the case in practice, modelbased methods have to focus on developing good speech models, whose quality will be key to their performance. In this study, we propose a novel probabilistic model for speech enhancement which precisely models the speech by taking into account the underlying speech production process as well as its dynamics. The proposed model follows a source-filter approach where the excitation and filter parts are modeled as non-negative dynamical systems. We present convergence-guaranteed update rules for each latent factor. In order to assess performance, we evaluate our model on a challenging speech enhancement task where the speech is observed under non-stationary noises recorded in a car. We show that our model outperforms state-of-the-art methods in terms of objective measures.


international conference on acoustics, speech, and signal processing | 2017

Alpha-stable multichannel audio source separation

Simon Leglaive; Umut Simsekli; Antoine Liutkus; Roland Badeau; Gaël Richard

In this paper, we focus on modeling multichannel audio signals in the short-time Fourier transform domain for the purpose of source separation. We propose a probabilistic model based on a class of heavy-tailed distributions, in which the observed mixtures and the latent sources are jointly modeled by using a certain class of multivariate alpha-stable distributions. As opposed to the conventional Gaussian models, where the observations are constrained to lie just within a few standard deviations from the mean, the proposed heavy-tailed model allows us to account for spurious data or important uncertainties in the model. We develop a Monte Carlo Expectation-Maximization algorithm for inferring the sources from the proposed model. We show that our approach leads to significant performance improvements in audio source separation under corrupted mixtures and in spatial audio object coding.


international conference on acoustics, speech, and signal processing | 2015

Section-level modeling of musical audio for linking performances to scores in Turkish makam music

Andre Holzapfel; Umut Simsekli; Sertan Sentürk; Ali Taylan Cemgil

Section linking aims at relating structural units in the notation of a piece of music to their occurrences in a performance of the piece. In this paper, we address this task by presenting a score-informed hierarchical Hidden Markov Model (HHMM) for modeling musical audio signals on the temporal level of sections present in a composition, where the main idea is to explicitly model the long range and hierarchical structure of music signals. So far, approaches based on HHMM or similar methods were mainly developed for a note-to-note alignment, i.e. an alignment based on shorter temporal units than sections. Such approaches, however, are conceptually problematic when the performances differ substantially from the reference score due to interpretation and improvisation, a very common phenomenon, for instance, in Turkish makam music. In addition to having low computational complexity compared to note-to-note alignment and achieving a transparent and elegant model, the experimental results show that our method outperforms a previously presented approach on a Turkish makam music corpus.


workshop on applications of signal processing to audio and acoustics | 2013

Hierarchical and coupled non-negative dynamical systems with application to audio modeling

Umut Simsekli; Jonathan Le Roux; John R. Hershey

Many kinds of non-negative data, such as power spectra and count data, have been modeled using non-negative matrix factorization. Even though this modeling paradigm has yielded successful applications, it falls short when the data have certain hierarchical and temporal structure. In this study, we propose a novel dynamical system model that can handle these kinds of complex structures that often arise in non-negative data. We show that our model can be extended to handle heterogeneous data for data-driven regularization. We present convergence-guaranteed update rules for each latent factor. In order to assess the performance, we evaluate our model on the transcription of classical piano pieces, and show that it outperforms related models. We also illustrate that the performance can be further improved by making use of symbolic data.


international workshop on machine learning for signal processing | 2012

Markov Chain Monte Carlo inference for probabilistic latent tensor factorization

Umut Simsekli; A. Taylan Cemgil

Probabilistic Latent Tensor Factorization (PLTF) is a recently proposed probabilistic framework for modeling multi-way data. Not only the popular tensor factorization models but also any arbitrary tensor factorization structure can be realized by the PLTF framework. This paper presents Markov Chain Monte Carlo procedures (namely the Gibbs sampler) for making inference on the PLTF framework. We provide the abstract algorithms that are derived for the general case and the overall procedure is illustrated on both synthetic and real data.


international conference on acoustics, speech, and signal processing | 2015

Learning mixed divergences in coupled matrix and tensor factorization models

Umut Simsekli; Ali Taylan Cemgil; Beyza Ermis

Coupled tensor factorization methods are useful for sensor fusion, combining information from several related datasets by simultaneously approximating them by products of latent tensors. In these methods, the choice of a suitable optimization criteria becomes difficult as observed datasets may have different statistical characteristics and their relative importance for the task at hand can vary. In this paper, we present an algorithmic framework for coupled factorization that, while estimating a latent factorization also estimates a specific ß-divergence for each dataset as well as the relative weights in an overall additive cost function. We evaluate the proposed method on both synthetical and real datasets, where we apply our methods on a link prediction problem. The results show that our method outperforms the state-of-the-art by a significant margin.


signal processing and communications applications conference | 2012

SVD-based polyphonic music transcription

Ismail Ari; Umut Simsekli; Ali Taylan Cemgil; Lale Akarun

The aim of this work is to perform polyphonic music transcription in an efficient way. The problem is formulated as a linear model and the speed is improved by a randomized SVD-based method. The method is shown to compete with the best resulting approaches in literature. The conventional methods seem to fail in this era of big data whereas the proposed method efficiently handles this by use of randomized algorithms for matrix decompositions. The method is able to improve time and space complexity without compromising the high success rate.

Collaboration


Dive into the Umut Simsekli's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Gaël Richard

Université Paris-Saclay

View shared research outputs
Top Co-Authors

Avatar

Roland Badeau

Université Paris-Saclay

View shared research outputs
Top Co-Authors

Avatar

John R. Hershey

Mitsubishi Electric Research Laboratories

View shared research outputs
Top Co-Authors

Avatar

Jonathan Le Roux

Mitsubishi Electric Research Laboratories

View shared research outputs
Top Co-Authors

Avatar

Alain Durmus

École normale supérieure de Cachan

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge