Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Luis Miguel Mazaira-Fernández is active.

Publication


Featured researches published by Luis Miguel Mazaira-Fernández.


non linear speech processing | 2009

Glottal Source biometrical signature for voice pathology detection

Pedro Gómez-Vilda; Roberto Fernández-Baíllo; Victoria Rodellar-Biarge; Victor Nieto Lluis; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández; Rafael Martínez-Olalla; Juan Ignacio Godino-Llorente

The Glottal Source is an important component of voice as it can be considered as the excitation signal to the voice apparatus. The use of the Glottal Source for pathology detection or the biometric characterization of the speaker are important objectives in the acoustic study of the voice nowadays. Through the present work a biometric signature based on the speakers power spectral density of the Glottal Source is presented. It may be shown that this spectral density is related to the vocal fold cover biomechanics, and from literature it is well-known that certain speakers features as gender, age or pathologic condition leave changes in it. The paper describes the methodology to estimate the biometric signature from the power spectral density of the mucosal wave correlate, which after normalization can be used in pathology detection experiments. Linear Discriminant Analysis is used to confront the detection capability of the parameters defined on this glottal signature among themselves and compared to classical perturbation parameters. A database of 100 normal and 100 pathologic subjects equally balanced in gender and age is used to derive the best parameter cocktails for pathology detection and quantification purposes to validate this methodology in voice evaluation tests. In a study case presented to illustrate the detection capability of the methodology exposed a control subset of 24+24 subjects is used to determine a subjects voice condition in a pre- and post-surgical evaluation. Possible applications of the study can be found in pathology detection and grading and in rehabilitation assessment after treatment.


Cognitive Computation | 2013

Characterizing Neurological Disease from Voice Quality Biomechanical Analysis

Pedro Gómez-Vilda; Victoria Rodellar-Biarge; Víctor Nieto-Lluis; Cristina Muñoz-Mulas; Luis Miguel Mazaira-Fernández; Rafael Martínez-Olalla; Agustín Álvarez-Marquina; Carlos Ramírez-Calvo; Mario Fernández-Fernández

The dramatic impact of neurological degenerative pathologies in life quality is a growing concern nowadays. Many techniques have been designed for the detection, diagnosis, and monitoring of the neurological disease. Most of them are too expensive or complex for being used by primary attention medical services. On the other hand, it is well known that many neurological diseases leave a signature in voice and speech. Through the present paper, a new method to trace some neurological diseases at the level of phonation will be shown. In this way, the detection and grading of the neurological disease could be based on a simple voice test. This methodology is benefiting from the advances achieved during the last years in detecting and grading organic pathologies in phonation. The paper hypothesizes that some of the underlying neurological mechanisms affecting phonation produce observable correlates in vocal fold biomechanics and that these correlates behave differentially in neurological diseases than in organic pathologies. A general description about the main hypotheses involved and their validation by acoustic voice analysis based on biomechanical correlates of the neurological disease is given. The validation is carried out on a balanced database of normal and organic dysphonic patients of both genders. Selected study cases will be presented to illustrate the possibilities offered by this methodology.


Neurocomputing | 2011

Neuromorphic detection of speech dynamics

Pedro Gómez-Vilda; José Manuel Ferrández-Vicente; Victoria Rodellar-Biarge; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández; Rafael Martínez Olalla; Cristina Muñoz-Mulas

Speech and voice technologies are experiencing a profound review as new paradigms are sought to overcome some specific problems which cannot be completely solved by classical approaches. Neuromorphic Speech Processing is an emerging area in which research is turning the face to understand the natural neural processing of speech by the Human Auditory System in order to capture the basic mechanisms solving difficult tasks in an efficient way. In the present paper a further step ahead is presented in the approach to mimic basic neural speech processing by simple neuromorphic units standing on previous work to show how formant dynamics - and henceforth consonantal features - can be detected by using a general neuromorphic unit which can mimic the functionality of certain neurons found in the upper auditory pathways. Using these simple building blocks a General Speech Processing Architecture can be synthesized as a layered structure. Results from different simulation stages are provided as well as a discussion on implementation details. Conclusions and future work are oriented to describe the functionality to be covered in the next research steps.


international conference on digital signal processing | 2013

Estimating Tremor in Vocal Fold Biomechanics for Neurological Disease Characterization

Pedro Gómez-Vilda; Víctor Nieto-Lluis; Victoria Rodellar-Biarge; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández; Rafael Martínez-Olalla; Cristina Muñoz-Mulas; Mario Fernández-Fernández; Carlos Ramírez-Calvo

Neurological Diseases (ND) are affecting larger segments of aging population every year. Treatment is dependent on expensive accurate and frequent monitoring. It is well known that ND leave correlates in speech and phonation. The present work shows a method to detect alterations in vocal fold tension during phonation. These may appear either as hypertension or as cyclical tremor. Estimations of tremor may be produced by auto-regressive modeling of the vocal fold tension series in sustained phonation. The correlates obtained are a set of cyclicality coefficients, the frequency and the root mean square amplitude of the tremor. Statistical distributions of these correlates obtained from a set of male and female subjects are presented. Results from five study cases of female voice are also given.


non-linear speech processing | 2011

Neurological disease detection and monitoring from voice production

Pedro Gómez-Vilda; Victoria Rodellar-Biarge; Víctor Nieto-Lluis; Cristina Muñoz-Mulas; Luis Miguel Mazaira-Fernández; Carlos Ramírez-Calvo; Mario Fernández-Fernández; Elvira Toribio-Díaz

The dramatic impact of neurological degenerative pathologies in life quality is a growing concern. It is well known that many neurological diseases leave a fingerprint in voice and speech production. Many techniques have been designed for the detection, diagnose and monitoring the neurological disease. Most of them are costly or difficult to extend to primary attention medical services. Through the present paper it will be shown how some neurological diseases can be traced at the level of phonation. The detection procedure would be based on a simple voice test. The availability of advanced tools and methodologies to monitor the organic pathology of voice would facilitate the implantation of these tests. The paper hypothesizes that some of the underlying mechanisms affecting the production of voice produce measurable correlates in vocal fold biomechanics. A general description of the methodological foundations for the voice analysis system which can estimate correlates to the neurological disease is shown. Some study cases will be presented to illustrate the possibilities of the methodology to monitor neurological diseases by voice.


international work conference on the interplay between natural and artificial computation | 2007

A Bio-inspired Architecture for Cognitive Audio

Pedro Gómez-Vilda; José Manuel Ferrández-Vicente; Victoria Rodellar-Biarge; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández

A comprehensive view of speech and voice technologies is now demanding better and more complex tools amenable of extracting as much knowledge about sound and speech as possible. Many knowledge-extraction tasks from speech and voice share well-known procedures at the algorithmic level under the point of view of bio-inspiration. The same resources employed to decode speech phones may be used in the characterization of the speaker (gender, age, speaking group, etc.). Based on these facts the present paper examines a hierarchy of sound processing levels at the auditory and perceptual levels on the brain neural paths which can be translated into a bio-inspired audio-processing architecture. Through this paper its fundamental characteristics are analyzed in relation with current tendencies in cognitive audio processing. Examples extracted from speech processing applications in the domain of acoustic-phonetics are presented. These may find applicability in speakers characterization, forensics, and biometry, among others.


Frontiers in Bioengineering and Biotechnology | 2015

Improving Speaker Recognition by Biometric Voice Deconstruction

Luis Miguel Mazaira-Fernández; Agustín Álvarez-Marquina; Pedro Gómez-Vilda

Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benefits from the advances achieved during last years in understanding and modeling voice production. The paper hypothesizes that a gender-dependent characterization of speakers combined with the use of a set of features derived from the components, resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract the gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.


non-linear speech processing | 2011

KPCA vs. PCA study for an age classification of speakers

Cristina Muñoz-Mulas; Rafael Martínez-Olalla; Pedro Gómez-Vilda; Elmar Wolfgang Lang; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández; Víctor Nieto-Lluis

Kernel-PCA and PCA techniques are compared in the task of age and gender separation. A feature extraction process that discriminates between vocal tract and glottal source is implemented. The reason why speech is processed in that way is because vocal tract length and resonant characteristics are related to gender and age and there is also a great relationship between glottal source and age and gender. The obtained features are then processed with PCA and kernel-PCA techniques. The results show that gender and age separation is possible and that kernel-PCA (especially with RBF kernel) clearly outperforms classical PCA or no preprocessing features.


international conference on digital signal processing | 2013

Wavelet description of the Glottal Gap

Pedro Gómez-Vilda; Víctor Nieto-Lluis; Victoria Rodellar-Biarge; Rafael Martínez-Olalla; Cristina Muñoz-Mulas; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández; Bartolomé Scola-Yurrita; Carlos Ramírez-Calvo; Daniel Poletti-Serafini

The Glottal Source correlates reconstructed from the phonated parts of voice may render interesting information with applicability in different fields. One of them is defective closure (gap) detection. Through the paper the background to explain the physical foundations of defective gap are reviewed. A possible method to estimate defective gap is also presented based on a Wavelet Description of the Glottal Source. The method is validated using results from the analysis of a gender-balanced speakers database. Normative values for the different parameters estimated are given. A set of study cases with deficient glottal closure is presented and discussed.


international work conference on the interplay between natural and artificial computation | 2009

Detection of Speech Dynamics by Neuromorphic Units

Pedro Gómez-Vilda; José Manuel Ferrández-Vicente; Victoria Rodellar-Biarge; Agustín Álvarez-Marquina; Luis Miguel Mazaira-Fernández; Rafael Martínez-Olalla; Cristina Muñoz-Mulas

Speech and voice technologies are experiencing a profound review as new paradigms are sought to overcome some specific problems which can not be completely solved by classical approaches. Neuromorphic Speech Processing is an emerging area in which research is turning the face to understand the natural neural processing of speech by the Human Auditory System in order to capture the basic mechanisms solving difficult tasks in an efficient way. In the present paper a further step ahead is presented in the approach to mimic basic neural speech processing by simple neuromorphic units standing on previous work to show how formant dynamics -and henceforth consonantal features-, can be detected by using a general neuromorphic unit which can mimic the functionality of certain neurons found in the Upper Auditory Pathways. Using these simple building blocks a General Speech Processing Architecture can be synthesized as a layered structure. Results from different simulation stages are provided as well as a discussion on implementation details. Conclusions and future work are oriented to describe the functionality to be covered in the next research steps.

Collaboration


Dive into the Luis Miguel Mazaira-Fernández's collaboration.

Top Co-Authors

Avatar

Cristina Muñoz-Mulas

Technical University of Madrid

View shared research outputs
Top Co-Authors

Avatar

Pedro Gómez-Vilda

Technical University of Madrid

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Rafael Martínez-Olalla

Technical University of Madrid

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Víctor Nieto-Lluis

Technical University of Madrid

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge