Ahmed Al-nasheri
King Saud University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ahmed Al-nasheri.
Journal of Voice | 2017
Ahmed Al-nasheri; Ghulam Muhammad; Mansour Alsulaiman; Zulfiqar Ali; Tamer A. Mesallam; Mohamed Farahat; Khalid H. Malki; Mohamed A. Bencherif
BACKGROUND AND OBJECTIVE Automatic voice-pathology detection and classification systems may help clinicians to detect the existence of any voice pathologies and the type of pathology from which patients suffer in the early stages. The main aim of this paper is to investigate Multidimensional Voice Program (MDVP) parameters to automatically detect and classify the voice pathologies in multiple databases, and then to find out which parameters performed well in these two processes. MATERIALS AND METHODS Samples of the sustained vowel /a/ of normal and pathological voices were extracted from three different databases, which have three voice pathologies in common. The selected databases in this study represent three distinct languages: (1) the Arabic voice pathology database; (2) the Massachusetts Eye and Ear Infirmary database (English database); and (3) the Saarbruecken Voice Database (German database). A computerized speech lab program was used to extract MDVP parameters as features, and an acoustical analysis was performed. The Fisher discrimination ratio was applied to rank the parameters. A t test was performed to highlight any significant differences in the means of the normal and pathological samples. RESULTS The experimental results demonstrate a clear difference in the performance of the MDVP parameters using these databases. The highly ranked parameters also differed from one database to another. The best accuracies were obtained by using the three highest ranked MDVP parameters arranged according to the Fisher discrimination ratio: these accuracies were 99.68%, 88.21%, and 72.53% for the Saarbruecken Voice Database, the Massachusetts Eye and Ear Infirmary database, and the Arabic voice pathology database, respectively.
Journal of Voice | 2017
Ahmed Al-nasheri; Ghulam Muhammad; Mansour Alsulaiman; Zulfiqar Ali
OBJECTIVES AND BACKGROUND Automatic voice pathology detection and classification systems effectively contribute to the assessment of voice disorders, which helps clinicians to detect the existence of any voice pathologies and the type of pathology from which patients suffer in the early stages. This work concentrates on developing an accurate and robust feature extraction for detecting and classifying voice pathologies by investigating different frequency bands using correlation functions. In this paper, we extracted maximum peak values and their corresponding lag values from each frame of a voiced signal by using correlation functions as features to detect and classify pathological samples. These features are investigated in different frequency bands to see the contribution of each band on the detection and classification processes. MATERIAL AND METHODS Various samples of sustained vowel /a/ of normal and pathological voices were extracted from three different databases: English, German, and Arabic. A support vector machine was used as a classifier. We also performed a t test to investigate the significant differences in mean of normal and pathological samples. RESULTS The best achieved accuracies in both detection and classification were varied depending on the band, the correlation function, and the database. The most contributive bands in both detection and classification were between 1000 and 8000 Hz. In detection, the highest acquired accuracies when using cross-correlation were 99.809%, 90.979%, and 91.168% in the Massachusetts Eye and Ear Infirmary, Saarbruecken Voice Database, and Arabic Voice Pathology Database databases, respectively. However, in classification, the highest acquired accuracies when using cross-correlation were 99.255%, 98.941%, and 95.188% in the three databases, respectively.
acs/ieee international conference on computer systems and applications | 2014
Ahmed Al-nasheri; Zulfiqar Ali; Ghulam Muhammad; Mansour Alsulaiman
This paper investigates the contribution of frequency bands for automatic voice pathology detection. First, the input voice signal is passed through a number of time-domain band-pass filters. The center frequencies are spaced on an octave scale. Each filter output is then divided into overlapping frames. Auto-correlation function is applied to each block to find the first largest peak, in areas other than near the dc value, and its corresponding lag. Therefore, each frame is having only these two features (peak value and lag). As classifier, we use Gaussian mixture models (GMM) and support vector machine (SVM), separately. Two well-known available databases, one in English (MEEI) and the other one in German (SVD), are used in the investigation. The results demonstrate that the most significant frequency range to detect voice pathology is between 1500 Hz and 3500 Hz. Using this filter band and with only two features, the accuracy is above 97% in case of the MEEI database.
IEEE Access | 2018
Ahmed Al-nasheri; Ghulam Muhammad; Mansour Alsulaiman; Zulfiqar Ali; Khalid H. Malki; Tamer A. Mesallam; Mohamed F. Ibrahim
Automatic voice pathology detection and classification systems effectively contribute to the assessment of voice disorders, enabling the early detection of voice pathologies and the diagnosis of the type of pathology from which patients suffer. This paper concentrates on developing an accurate and robust feature extraction for detecting and classifying voice pathologies by investigating different frequency bands using autocorrelation and entropy. We extracted maximum peak values and their corresponding lag values from each frame of a voiced signal by using autocorrelation as features to detect and classify pathological samples. We also extracted the entropy for each frame of the voice signal after we normalized its values to be used as the features. These features were investigated in distinct frequency bands to assess the contribution of each band to the detection and classification processes. Various samples of the sustained vowel /a/ for both normal and pathological voices were extracted from three different databases in English, German, and Arabic. A support vector machine was used as a classifier. We also performed u-tests to investigate if there is a significant difference between the means of the normal and pathological samples. The best achieved accuracies in both detection and classification varied depending on the used band, method, and database. The most contributive bands in both detection and classification were between 1000 and 8000 Hz. The highest obtained accuracies in the case of detection were 99.69%, 92.79%, and 99.79% for Massachusetts eye and ear infirmary (MEEI), Saarbrücken voice database (SVD), and Arabic voice pathology database (AVPD), respectively. However, the highest achieved accuracies for classification were 99.54%, 99.53%, and 96.02% for MEEI, SVD, and AVPD, correspondingly, using the combined feature.
Journal of Voice | 2017
Zulfiqar Ali; Mansour Alsulaiman; Ghulam Muhammad; Irraivan Elamvazuthi; Ahmed Al-nasheri; Tamer A. Mesallam; Mohamed Farahat; Khalid H. Malki
A large population around the world has voice complications. Various approaches for subjective and objective evaluations have been suggested in the literature. The subjective approach strongly depends on the experience and area of expertise of a clinician, and human error cannot be neglected. On the other hand, the objective or automatic approach is noninvasive. Automatic developed systems can provide complementary information that may be helpful for a clinician in the early screening of a voice disorder. At the same time, automatic systems can be deployed in remote areas where a general practitioner can use them and may refer the patient to a specialist to avoid complications that may be life threatening. Many automatic systems for disorder detection have been developed by applying different types of conventional speech features such as the linear prediction coefficients, linear prediction cepstral coefficients, and Mel-frequency cepstral coefficients (MFCCs). This study aims to ascertain whether conventional speech features detect voice pathology reliably, and whether they can be correlated with voice quality. To investigate this, an automatic detection system based on MFCC was developed, and three different voice disorder databases were used in this study. The experimental results suggest that the accuracy of the MFCC-based system varies from database to database. The detection rate for the intra-database ranges from 72% to 95%, and that for the inter-database is from 47% to 82%. The results conclude that conventional speech features are not correlated with voice, and hence are not reliable in pathology detection.
Journal of Healthcare Engineering | 2017
Tamer A. Mesallam; Mohamed Farahat; Khalid H. Malki; Mansour Alsulaiman; Zulfiqar Ali; Ahmed Al-nasheri; Ghulam Muhammad
A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.
2015 5th National Symposium on Information Technology: Towards New Smart World (NSITNSW) | 2015
Ahmed Al-nasheri; Zulfiqar Ali; Ghulam Muhammad; Mansour Alsulaiman; Khalid H. Almalki; Tamer A. Mesallam; Mohamed Farahat
This paper investigates the use of Multi-Dimensional Voice Program (MDVP) parameters to automatically detect voice pathology in Arabic voice pathology database (AVPD). MDVP parameters are very popular among the physician / clinician to detect voice pathology; however, MDVP is a commercial software. AVPD is a newly developed speech database designed to suit a wide range of experiments in the field of automatic voice pathology detection, classification, and automatic speech recognition. This paper is the first step to evaluate MDVP parameters in AVPD using sustained vowel /a/. The experimental results demonstrate that some of the acoustic features show an excellent ability to discriminate between normal and pathological voices. The overall best accuracy is 81.33% by using SVM classifier.
2017 International Conference on Informatics, Health & Technology (ICIHT) | 2017
Zulfiqar Ali; Mansour Alsulaiman; Ghulam Muhammad; Ahmed Al-nasheri; Awais Mahmood
Data mining has a great potential in different areas of health informatics. Data mining in health industry can minimize the health cost as well as reduces the risk of life by informing a person at initial stage. An automatic classification system capable of mining pathological data may contribute in health informatics significantly. In this paper, an automatic system to differentiate between pathological and normal data is developed. The developed system mines the pathological data on the basis of an acoustic analysis. The purpose of the acoustic analysis is to estimate the auditory spectrum of a voice sample by using the principle of the human auditory system called as critical bandwidths. The estimated auditory spectrum simulates the behavior of a human ear and acts like an expert clinician who can identify a pathological voice by hearing. The pathological data used for this study is recorded from the people who are suffering from more than 100 different types of voice disorders. Voice of a disordered patient feels noisy, harsh, strain, breathy and unpleasant to ears. During the training phase of the proposed system, it takes labeled normal and pathological data to generate acoustic models by using the Gaussian mixture model. While in deployment phase, an unknown and unlabeled voice sample is given to the system to determine its type, i.e. normal or pathological. The best obtained accuracy of the system is 99.50%.
Biomedical Signal Processing and Control | 2017
Ghulam Muhammad; Mansour Alsulaiman; Zulfiqar Ali; Tamer A. Mesallam; Mohamed A. Farahat; Khalid H. Malki; Ahmed Al-nasheri; Mohamed A. Bencherif
Biocybernetics and Biomedical Engineering | 2016
Ghulam Muhammad; Ghadir Altuwaijri; Mansour Alsulaiman; Zulfiqar Ali; Tamer A. Mesallam; Mohamed Farahat; Khalid H. Malki; Ahmed Al-nasheri