Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Roberto Togneri is active.

Publication


Featured researches published by Roberto Togneri.


IEEE Transactions on Pattern Analysis and Machine Intelligence | 2010

Linear Regression for Face Recognition

Imran Naseem; Roberto Togneri; Mohammed Bennamoun

In this paper, we present a novel approach of face identification by formulating the pattern recognition problem in terms of linear regression. Using a fundamental concept that patterns from a single-object class lie on a linear subspace, we develop a linear model representing a probe image as a linear combination of class-specific galleries. The inverse problem is solved using the least-squares method and the decision is ruled in favor of the class with the minimum reconstruction error. The proposed Linear Regression Classification (LRC) algorithm falls in the category of nearest subspace classification. The algorithm is extensively evaluated on several standard databases under a number of exemplary evaluation protocols reported in the face recognition literature. A comparative study with state-of-the-art algorithms clearly reflects the efficacy of the proposed approach. For the problem of contiguous occlusion, we propose a Modular LRC approach, introducing a novel Distance-based Evidence Fusion (DEF) algorithm. The proposed methodology achieves the best results ever reported for the challenging problem of scarf occlusion.


IEEE Transactions on Audio, Speech, and Language Processing | 2006

Statistical Voice Activity Detection Using Low-Variance Spectrum Estimation and an Adaptive Threshold

Alan Davis; Sven Nordholm; Roberto Togneri

Traditionally, voice activity detection algorithms are based on any combination of general speech properties such as temporal energy variations, periodicity, and spectrum. This paper describes a novel statistical method for voice activity detection using a signal-to-noise ratio measure. The method employs a low-variance spectrum estimate and determines an optimal threshold based on the estimated noise statistics. A possible implementation is presented and evaluated over a large test set and compared to current modern standardized algorithms. The evaluations indicate promising results with the proposed scheme being comparable or favorable over the whole test set.


IEEE Circuits and Systems Magazine | 2011

An Overview of Speaker Identification: Accuracy and Robustness Issues

Roberto Togneri; Daniel Pullella

This paper presents the main paradigms for speaker identification, and recent work on missing data methods to increase robustness. The feature extraction, speaker modeling and system classification are discussed. Evaluations of speaker identification performance subject to environmental noise are presented. While performance is impressive in clean speech conditions, there is rapid degradation with mismatched additive noise. Missing data methods can compensate against arbitrary disturbances and remove environmental mismatches. An overview of missing data methods is provided and applications to robust speaker identification summarized. Finally combined approaches involving bottom-up estimation and top-down processing are reviewed, and their significance discussed.


Pattern Recognition | 2012

Robust regression for face recognition

Imran Naseem; Roberto Togneri; Mohammed Bennamoun

In this paper we address the problem of robust face recognition by formulating the pattern recognition task as a problem of robust estimation. Using a fundamental concept that in general, patterns from a single object class lie on a linear subspace (Barsi and Jacobs, 2003 [1]), we develop a linear model representing a probe image as a linear combination of class specific galleries. In the presence of noise, the well-conditioned inverse problem is solved using the robust Huber estimation and the decision is ruled in favor of the class with the minimum reconstruction error. The proposed Robust Linear Regression Classification (RLRC) algorithm is extensively evaluated for two important cases of robustness i.e. illumination variations and random pixel corruption. Illumination invariant face recognition is demonstrated on three standard databases under exemplary evaluation protocols reported in the literature. Comprehensive comparative analysis with the state-of-art illumination tolerant approaches indicates a comparable performance index for the proposed RLRC algorithm. The efficiency of the proposed approach in the presence of severe random noise is validated under several exemplary noise models such as dead-pixel problem, salt and pepper noise, speckle noise and Additive White Gaussian Noise (AWGN). The RLRC algorithm is found to be favorable compared with the benchmark generative approaches.


IEEE Transactions on Speech and Audio Processing | 2004

Convolutive blind signal separation with post-processing

Siow Yong Low; Sven Nordholm; Roberto Togneri

A new subband based speech enhancement scheme is presented. It integrates spatial and temporal signal processing methods to enhance speech signals in a noisy environment. The approach makes use of the popular blind signal separation (BSS) to spatially separate the target signal from the interference. Due to the multipath/reverberant environment, BSS has its fundamental limitation in its separation quality. To overcome that, an adaptive noise canceller (ANC) is employed to perform further interference reduction. The reference for the ANC in this case is simply the interference dominant output from the BSS. A higher order statistical method is proposed for the selection of the reference signal. This post processing acts as a spectral decorrelator and experimental results show that even in under-determined (more sources than elements) case, the structure offers impressive enhancement capability. Further, a remarkable improvement in recognition rate is registered when tested in automatic speech recognition (ASR).


international conference on pattern recognition | 2010

Robust Regression for Face Recognition

Imran Naseem; Roberto Togneri; Mohammed Bennamoun

In this paper we address the problem of illumination invariant face recognition. Using a fundamental concept that in general, patterns from a single object class lie on a linear subspace [2], we develop a linear model representing a probe image as a linear combination of class-specific galleries. In the presence of noise, the well-conditioned inverse problem is solved using the robust Huber estimation and the decision is ruled in favor of the class with the minimum reconstruction error. The proposed Robust Linear Regression Classification (RLRC) algorithm is extensively evaluated for two standard databases and has shown good performance index compared to the state-of-art robust approaches.


international conference on pattern recognition | 2010

Sparse Representation for Speaker Identification

Imran Naseem; Roberto Togneri; Mohammed Bennamoun

We address the closed-set problem of speaker identification by presenting a novel sparse representation classification algorithm. We propose to develop an over complete dictionary using the GMM mean super vector kernel for all the training utterances. A given test utterance corresponds to only a small fraction of the whole training database. We therefore propose to represent a given test utterance as a linear combination of all the training utterances, thereby generating a naturally sparse representation. Using this sparsity, the unknown vector of coefficients is computed via l1minimization which is also the sparsest solution [12]. Ideally, the vector of coefficients so obtained has nonzero entries representing the class index of the given test utterance. Experiments have been conducted on the standard TIMIT [14] database and a comparison with the state-of-art speaker identification algorithms yields a favorable performance index for the proposed algorithm.


IEEE Transactions on Pattern Analysis and Machine Intelligence | 2016

Automatic Shadow Detection and Removal from a Single Image

Salman Hameed Khan; Mohammed Bennamoun; Ferdous Ahmed Sohel; Roberto Togneri

We present a framework to automatically detect and remove shadows in real world scenes from a single image. Previous works on shadow detection put a lot of effort in designing shadow variant and invariant hand-crafted features. In contrast, our framework automatically learns the most relevant features in a supervised manner using multiple convolutional deep neural networks (ConvNets). The features are learned at the super-pixel level and along the dominant boundaries in the image. The predicted posteriors based on the learned features are fed to a conditional random field model to generate smooth shadow masks. Using the detected shadow masks, we propose a Bayesian formulation to accurately extract shadow matte and subsequently remove shadows. The Bayesian formulation is based on a novel model which accurately models the shadow generation process in the umbra and penumbra regions. The model parameters are efficiently estimated using an iterative optimization procedure. Our proposed framework consistently performed better than the state-of-the-art on all major shadow databases collected under a variety of conditions.


computer vision and pattern recognition | 2014

Automatic Feature Learning for Robust Shadow Detection

Salman Hameed Khan; Mohammed Bennamoun; Ferdous Ahmed Sohel; Roberto Togneri

We present a practical framework to automatically detect shadows in real world scenes from a single photograph. Previous works on shadow detection put a lot of effort in designing shadow variant and invariant hand-crafted features. In contrast, our framework automatically learns the most relevant features in a supervised manner using multiple convolutional deep neural networks (ConvNets). The 7-layer network architecture of each ConvNet consists of alternating convolution and sub-sampling layers. The proposed framework learns features at the super-pixel level and along the object boundaries. In both cases, features are extracted using a context aware window centered at interest points. The predicted posteriors based on the learned features are fed to a conditional random field model to generate smooth shadow contours. Our proposed framework consistently performed better than the state-of-the-art on all major shadow databases collected under a variety of conditions.


international conference on acoustics, speech, and signal processing | 2011

Speaker verification using sparse representation classification

Jia Min Karen Kua; Eliathamby Ambikairajah; Julien Epps; Roberto Togneri

Sparse representations of signals have received a great deal of attention in recent years, and the sparse representation classifier has very lately appeared in a speaker recognition system. This approach represents the (sparse) GMM mean supervector of an unknown speaker as a linear combination of an over-complete dictionary of GMM supervectors of many speaker models, and ℓ1-norm minimization results in a non-zero coefficient corresponding to the unknown speaker class index. Here this approach is tested on large databases, introducing channel-/session-variability compensation, and fused with a GMM-SVM system. Evaluations on the NIST 2001 SRE and NIST 2006 SRE database show that when the outputs of the MFCC UBM-GMM based classifier (for NIST 2001 SRE) or MFCC GMM-SVM based classifier (for NIST 2006 SRE) are fused with the MFCC GMM-Sparse Representation Classifier (GMM-SRC) based classifier, an absolute gain of 1.27% and 0.25% in EER can be achieved respectively.

Collaboration


Dive into the Roberto Togneri's collaboration.

Top Co-Authors

Avatar

Mohammed Bennamoun

University of Western Australia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Imran Naseem

Karachi Institute of Economics and Technology

View shared research outputs
Top Co-Authors

Avatar

Serajul Haque

University of Western Australia

View shared research outputs
Top Co-Authors

Avatar

Victor Sreeram

University of Western Australia

View shared research outputs
Top Co-Authors

Avatar

Wei Liu

University of Western Australia

View shared research outputs
Top Co-Authors

Avatar

Lyndon White

University of Western Australia

View shared research outputs
Top Co-Authors

Avatar

Y. Attikiouzel

University of Western Australia

View shared research outputs
Top Co-Authors

Avatar

Ahmad Jazlan

University of Western Australia

View shared research outputs
Researchain Logo
Decentralizing Knowledge