John Sirigos
University of Patras
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by John Sirigos.
Speech Communication | 2002
John Sirigos; Nikos Fakotakis; George K. Kokkinakis
In this paper we present a hybrid ANN/HMM syllable recognition system based on vowel spotting. Using an advanced multilevel vowel-spotting module we track all vowel phonemes in speech signals from where we model the speech segments located between two successive vowels which are defined as syllables. In order to achieve minimum vowel losses and accurate detection, we focus on taking special care of the vowel spotter which is based on three different techniques: discrete hidden Markov models (DHMMs), multilayer perceptrons and heuristic rules.To set up the models of the syllable segments, hybrid DHMMs with multiple codebooks are used. The usual DHMM probability parameters are replaced by combined neural network outputs. For this purpose, we use both context dependent and context independent neural networks.The syllable recognition system was tested with the TIMIT and NTIMIT databases and the results obtained showed 75.09% and 59.30% average syllable recognition accuracy, respectively. It has to be noted that to achieve the above results no grammars or syllable-based lexicons were used.
international conference on acoustics speech and signal processing | 1996
Nikos Fakotakis; John Sirigos
We present a text independent speaker recognition system based on vowel spotting and feed forward multilayer perceptrons (MLPs). The perceptual linear predictive (PLP) speech analysis technique was used for parameter estimation, a feed forward MLP for vowel spotting and a simple MLP for the classification procedure. To train and test the system we used the TIMIT database. We conclude with results of the speaker verification and identification process, showing that the system described has a high recognition accuracy (/spl sim/98%) using short test utterances (2.5 sec). It also has a real-time response, is easily adapted to new speakers and requires a small amount of data for training purposes (three sentences per speaker).
international conference on digital signal processing | 1997
John Sirigos; Nikos Fakotakis; G. Kokkinakis
This paper presents a method for improving speech recognition in noisy environment by using neural networks. Two multilayer perceptrons (MLPs) are used. The first MLP minimises the difference between noisy and clean speech and the second one measures the degree of noise in the speech signal and adjusts the time interval between subsequent frames of the processed speech signal accordingly. If we use the technique presented in this paper as a pre-processing stage of a speech recognition system we can extend the application of the system to different environments without re-training it. We need only to train the preprocessing stage with a small portion of noisy data which is created by conducting part of the original clean speech database used for training the speech recognizer through the desired environment. There is no need for creating a new database in the desired working environment. Our method was tested on a vowel spotting system, and is trained with two well known databases: TIMIT and NTIMIT. The evaluation of the system through a vowel spotting process, shows a significant improvement of the recognition rate of the system.
international conference on electronics circuits and systems | 1996
John Sirigos; Vassilios Darsinos; Nikos Fakotakis; G. Kokkinakis
This paper describes a speaker independent vowel/non-vowel classifier based on neural networks and several rules. RASTA-PLP analysis of the speech signal resulting to mel-cepstral coefficients and a formant tracking method are used in order to provide the feature vectors for the MLP. To train and test the system we used a part of the TIMIT database. The results indicate that the performance of this classifier for speaker independent vowel classification is approximately 98.5% so it can be favorably used for speaker recognition or speech labeling purposes.
conference of the international speech communication association | 1995
John Sirigos; Nikos Fakotakis; George K. Kokkinakis
european signal processing conference | 1996
John Sirigos; Vassilios Darsinos; Nikos Fakotakis; George K. Kokkinakis
conference of the international speech communication association | 1999
Nikos Fakotakis; John Sirigos; George K. Kokkinakis
european signal processing conference | 1998
John Sirigos; Nikos Fakotakis; George K. Kokkinakis
conference of the international speech communication association | 1999
John Sirigos; Nikos Fakotakis; George K. Kokkinakis
european signal processing conference | 1996
D.A. Valkaniotis; John Sirigos; Nikos Fakotakis; George K. Kokkinakis