Is this you? Create Your Porfile

Gregor Rozinaj

Slovak University of Technology in Bratislava

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Gregor Rozinaj is active.

Explore More

Publication

Featured researches published by Gregor Rozinaj.

international conference on interactive mobile communication technologies and learning | 2015

Application of immersive technologies for education: State of the art

Péter Tamás Kovács; Niall Murray; Gregor Rozinaj; Yevgeniya Sulema; Renata Rybárová

Existing multimedia systems used in education mostly address only two senses by using two communication channels (visual and audio) of the five human senses (sight, hearing, taste, smell, and touch), limiting the potential efficiency of learning. This paper presents a survey on existing technical opportunities for the development of an immersive learning environment. Four components of the immersive environment - visual, audio, olfactory, and haptic are described and discussed in the paper. In particular 3D displays, head mounted devices, 3D sound systems, olfactory displays, haptic devices, and interaction devices are presented.

IEEE Computer | 2014

Smart AppStore: Expanding the Frontiers of Smartphone Ecosystems

Félix Gómez Mármol; Gregor Rozinaj; Sebastian Schumann; Ondrej Labaj; Juraj Kacur

Smart AppStore offers five important features for todays smartphone users: biometric authentication, multilevel authorization, gesture recognition and navigation, user-tailored reputation scores, and identity management. The Web extra at http://youtu.be/15KPOUp5H_A is a video demonstration of Smart AppStore, which offers five important features for todays smartphone users: biometric authentication, multilevel authorization, gesture recognition and navigation, user-tailored reputation scores, and identity management.

Telecommunication Systems | 2013

Building accurate and robust HMM models for practical ASR systems

Juraj Kacur; Gregor Rozinaj

In this article the relevant training aspects for building robust and accurate HMM models for large vocabulary recognition system are discussed and adjusted, namely: speech features, training steps, and the tying options for context dependent (CD) phonemes. As the basis for building HMM models the well known MASPER training scheme is assumed. First the incorporation of the voicing information and its effect on the classical extraction methods like MFCC and PLP will be shown together with the derivative features, where the relative error reductions are up to 50%. Next the suggested enhancement of the standard training procedure by introducing garbled speech models will be presented and tested on real data. As it will be shown it brings more than a 5% drop in the error rate. Finally, the options for tying states of CD phonemes using decision trees and phoneme classification will be adjusted, tested, and explained.

ELMAR 2007 | 2007

A hybrid pitch period estimation method based on HNM model

Martin Turi Nagy; Gregor Rozinaj; Andrej Palenik

Pitch period estimation (also called fundamental frequency estimation) is widely needed in speech processing for many purposes. In our system for prosodic modification of speech, the pitch period estimation is used as a basis for frame length detection. The pitch period estimation method used in the system is a hybrid method that is based on YIN fundamental frequency estimation algorithm and a method for fundamental frequency detection on magnitude of the speech signal. The experiments show, that the method is useful in sinusoidal modeling domain, as in other domains, too.

eurasip conference focused on video image processing and multimedia communications | 2003

Forward masking phenomenon in concatenative speech synthesis

M. Cernak; Gregor Rozinaj

The approach described in the paper tries to get more knowledge to the concatenative text-to-speech system design. The knowledge is based on masking phenomenon of the inner ear, particularly of its temporal (forward) masking properties. Designing such knowledge-based system is suggested to use in the unit selection-based speech synthesis, as contemporary a prominent technique in concatenative synthesis, which utilizes a big speech corpus. The more prosodic variability the corpus captures, the more natural a synthetic voice sounds and there are more possibilities to occur a forward masking events during concatenation of selected candidate units from the corpus.

eurasip conference focused on video image processing and multimedia communications | 2003

Speech detection in the noisy environment using wavelet transform

Juraj Kacur; Juraj Frank; Gregor Rozinaj

In this article we present speech detection systems based on Daubechie, Coiflet and Symlet wavelet transforms respectively. For each a selection of the most eligible levels of signal decomposition for the corrupted speech detection problem was made. Using those levels the distinction between noise and corrupted signal can be amplified as far as 100 times. Tests were accomplished using a set of Slovak words artificially noised to several SNR by white WSS noise.

international symposium on telecommunications | 2012

ZCPA features for speech recognition

Juraj Kacur; Mario Varga; Gregor Rozinaj

In this article we present implementation, modifications and optimization of zero-crossing peak amplitude (ZCPA) speech feature extraction method into Slovak speech recognition system. ZCPA features are closely mimicking the human auditory system in the time domain, and thus they should be more robust against common noises. Except the basic configuration several modifications have been suggested, implemented and evaluated. Furthermore, optimization of settings on a real system using professional database and MASPER training procedure have been found and compared to classical features presented by MFCC and PLP in different scenarios and noise conditions.

international conference on systems signals and image processing | 2007

MABox - Multimodal Microphone Array Algorithm Development System

Jan Vrabec; Gregor Rozinaj; Juraj Vojtko

In this work a design and a realization of multimodal microphone array algorithm development system which is proposed to develop new microphone array algorithm named MABox is presented. This device incorporates microphone array with four microphones, ADC cards and development software. Microphones are integrated in a separate directional box pointed to the speaker, the box is connected via USB and analog line to the computer. The development software environment allows us to test new beamforming algorithm. This system runs in real-time what allows us to change the structure of algorithm and its parameters.

international conference on industrial technology | 2003

Facial features detection for 3D face modeling

Gregor Rozinaj; F.-L. Mistral

This paper presents the detection algorithm developed for the specific application of 3D face modeling. The main goal of this algorithm developed under VC++ is to localize accurately some specific and needed points of facial features, to allow a 3D face creation. Instead of using some predefined 3D face model, the program process on two photos of a real subject, one from front view and the other from profile. It permits us to obtain the three coordinates for each localized point on the face. A data normalization allows face modeling using the openGL engine.

international conference on electronics circuits and systems | 1996

New approach of fast ICT and MICT algorithms development

A. Marcek; J. Kotuliakova; Gregor Rozinaj

New approach of fast ICT (Integer Cosine Transform) and MICT (Modified ICT) algorithms development is described in the paper. Both-ICT and MICT-transforms are the integer approximations of DCT. As the properties of ICT and MICT are close to those of DCT and their transform kernel is based on integers their computational complexity is less than in the case of DCT. Based on this DCT can be replaced by ICT and MICT, for example in transform coding of images (JPEG, MPEG, ITU-T H.261, etc.).

Explore More