Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Stanisław Kacprzak is active.

Publication


Featured researches published by Stanisław Kacprzak.


SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing | 2013

Speech/music discrimination via energy density analysis

Stanisław Kacprzak; Mariusz Ziółko

In this paper we suggest to apply a new feature, called Minimum Energy Density (MED), in discrimination of audio signals between speech and music. Our method is based on the analysis of local energy for 1 or 2.5 seconds audio signals. An elementary analysis of the probability for the power distribution is an effective tool supporting the decision making system. We compare our feature with Percentage of Low Energy Frames (LEF), Modified Low Energy Ratio (MLER) and examine their efficiency for two separate speech/music corpora.


international conference on systems signals and image processing | 2017

Spoken language clustering in the i-vectors space

Stanisław Kacprzak

This paper presents the results of language clustering in the i-vectors space, a method to determine in an unsupervised manner how many languages are in a data set and which recordings contain the same language. The most dense i-vectors clusters are found using the DBSCAN algorithm in a low dimensional space obtained by the t-SNE method. Quality of clustering for spherical k-means and the proposed method are tested with the data from NIST 2015 i-Vector Challenge. Usefulness of obtained clustering is tested in the challenge evaluation system. The results demonstrate that the proposed method allows to find 109 dense clusters with low impurity for 50 target languages.


international conference on systems signals and image processing | 2017

Speech/music discrimination for analysis of radio stations

Stanisław Kacprzak; Blazej Chwiecko; Bartosz Ziółko

A computationally efficient feature, called Minimum Energy Density (MED) was applied to discriminate audio signals between speech and music in the radio stations programs. The presented binary classifier is based on testing two features: energy distribution and differences between energy in channels. We analyzed 240 hours of signals, from 10 Polish radio stations. Our analysis enables us to provide information about content of particular radio stations.


ieee international conference on signal and image processing | 2017

On the extraction of early reflection signals for automatic speech recognition

Konrad Kowalczyk; Stanisław Kacprzak; Mariusz Ziółko

Room reverberation caused by multipath sound wave propagation in acoustic enclosures constitutes an unwanted distortion for automatic speech recognition systems. Multichannel speech enhancement methods often aim to enhance the signal impinging at the microphone array from the source direction while reducing late reverberation. In this paper, we investigate the applicability of spatial filters which constructively combine the direct-path signal with distinct early room reflection signals to increase the direct-to-reverberation ratio and to reduce the word error rate (WER) of automatic speech recognition systems. We present suitable filters and compare them with existing approaches. Results for the simulated acoustic environments indicate that an improvement in WER can indeed be achieved by the spatial filters which account for strong early reflections.


signal processing algorithms architectures arrangements and applications | 2016

Automatic extraction and clustering of phones

Stanisław Kacprzak; Mariusz Masior; Mariusz Ziółko

The automatic segmentation and parametrization based on the frequency analysis was used to compare with manually annotated phones. The phones boundaries were fixed in places of relatively large changes in the energy distribution between the frequency bands. Frequency parametrization and clustering enabled the division of phones into groups (clusters) according to their acoustic similarities. The results of performed experiments showed that analysis of the frequency properties only, results in correct segmentation but accuracy of recognition was about 20% only.


Studia Informatica | 2013

Database of speech recordings for comparative analysis of multi-language phonems

Mariusz Mąsior; Magdalena Igras; Mariusz Ziółko; Stanisław Kacprzak


conference of the international speech communication association | 2017

Audio Replay Attack Detection Using High-Frequency Features.

Marcin Witkowski; Stanisław Kacprzak; Piotr Zelasko; Konrad Kowalczyk; Jakub Gałka


Theoria et Historia Scientiarum | 2015

The acoustic diversity in the phoneme inventories of the world’s languages

Magdalena Igras; Stanisław Kacprzak; Mariusz Mąsior; Mariusz Ziółko


Prace Filologiczne | 2015

Automatyczna ekstrakcja i klasteryzacja głosek w sygnale mowy dla wielojęzykowej analizy porównawczej

Stanisław Kacprzak; Mariusz Mąsior; Mariusz Ziółko


Studia Informatica | 2014

Data analysis and management engine for signal processing

Mariusz Mąsior; Stanisław Kacprzak

Collaboration


Dive into the Stanisław Kacprzak's collaboration.

Top Co-Authors

Avatar

Mariusz Ziółko

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Mariusz Mąsior

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Magdalena Igras

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Konrad Kowalczyk

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar

Bartosz Ziółko

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Blazej Chwiecko

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Jakub Gałka

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Marcin Witkowski

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Mariusz Masior

AGH University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Piotr Zelasko

AGH University of Science and Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge