Roman Meshcheryakov
Tomsk State University of Control Systems and Radio-electronics
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Roman Meshcheryakov.
international conference on speech and computer | 2013
Lidiya N. Balatskaya; Evgeny L. Choinzonov; Svetlana Yu. Chizevskaya; Eugeny U. Kostyuchenko; Roman Meshcheryakov
Restoration of speech functions after operations on the organs of speech production requires the development of procedures and support programs for rehabilitation. Available means to restore the original voice function for the patient and for speech therapy. Software tool feature of is the use of a combination of speech sounds, which are the most common in speech and which affect the naturalness and intelligibility of speech. It shows the effectiveness of procedures and programs.
international conference on speech and computer | 2017
Eugeny U. Kostyuchenko; Roman Meshcheryakov; Dariya Ignatieva; Alexander Pyatkov; Evgeny Choynzonov; L. N. Balatskaya
The paper considers the solution of aligning syllables in time problem. This kind of normalization allows to compare different implementations of the same syllable. This allows us to talk about a comparative evaluation of the syllables pronunciation quality in the event that one of the syllables is a reference implementation. If a patient’s record before the operative treatment of oral cancer is used as such a syllable, a comparative assessment of the quality of pronunciation of syllables in the process of speech rehabilitation can be made. In the process of normalization, an approach aimed at maximizing the correlation between individual fragments of the syllable is applied. Then, as a measure of similarity between the reference and the estimated syllable, the correlation coefficient is used. The work demonstrates the validity of such a decision based on the processing of records from healthy people and patients before and after surgical treatment. The results of this work allow us to approach the implementation of an automated software system for assessing the quality of pronunciation of syllables and proceed to implement its working prototype.
Journal of Physics: Conference Series | 2017
S Y Iskhakov; Alexander Shelupanov; Roman Meshcheryakov
In the article, the questions of modelling of complex security system networks are considered. The simulation model of operation of similar complexes and approbation of the offered approach to identification of the incidents are presented. The approach is based on detection of uncharacteristic alterations of the network operation mode. The results of the experiment allow one to draw a conclusion on possibility of the offered model application to analyse the current status of heterogeneous security systems. Also, it is confirmed that the application of short-term forecasting methods for the analysis of monitoring system data allows one to automate the process of formation the criteria to reveal the incidents.
international conference on speech and computer | 2015
Daniyar Volf; Roman Meshcheryakov; Sergey Kharchenko
A model of singular estimation process of speech fundamental pitch frequency is reviewed. Existing solutions for the known classes of mathematical problems (Singular spectrum analysis, fast Fourier transform, and convolution) are used to develop a numerical implementation of the model. The evaluation of the fundamental pitch frequency with existing algorithms.
international conference on speech and computer | 2014
Daria A. Suranova; Roman Meshcheryakov
Applying voice possibilities in modern speech billing systems can significantly reduce the operators’ time and costs and in some cases improve the accuracy of the initial data. For example, the software will evaluate the possibility of applying the technology of synthesis and speech recognition for billing systems. It considers individual user’s features. Experiments illustrate examples of voice technology effective implementation.
International Conference on Interactive Collaborative Robotics | 2017
Alexey Zalevsky; Oleg Osipov; Roman Meshcheryakov
The paper deals with the design robots platforms for warehouse. The experiment investigated the robot modes. Platform contains four omnidirectional wheels and standardized electronic control system. Proposed upgrade ways platform.
international conference on speech and computer | 2016
Ivan Rakhmanenko; Roman Meshcheryakov
This paper overviews the application sphere of speaker verification systems and illustrates the use of the Gaussian mixture model and the universal background model (GMM-UBM) in an automatic text-independent speaker verification task. The experimental evaluation of the GMM-UBM system using different speech features is conducted on a 50 speaker set and a result is presented. Equal error rate (EER) using 256 component Gaussian mixture model and feature vector containing 14 mel frequency cepstral coefficients (MFCC) and the voicing probability is 0,76 %. Comparing to standard 14 MFCC vector 23,7 % of EER improvement was acquired.
Archive | 2019
Evgeny Kostyuchenko; Roman Meshcheryakov; Dariya Ignatieva; Alexander Pyatkov; Evgeny Choynzonov; L. N. Balatskaya
Within this work, the application of the criterion based on correlation coefficient for comparative assessment of speech quality in speech rehabilitation for patients after surgical treatment of speech-producing tract oncological diseases is considered. The sequence of the actions intended for receiving comparative assessment of the speech quality by comparison of the sound recordings made before and after operation is considered. As a material for the assessment, a set of syllables from Standard GOST 50840-95 Speech transmission over varies communication channels. Techniques for measurements of speech quality, intelligibility, and voice identification are used. Also, a set of syllables, compiled on the basis of the analysis of most prone to postoperative change phonemes, is used. The previously proposed criteria based on the comparison of the intensity of time-normalized syllable records spectra have a fundamental drawback. They need an additional normalization of signal power. The proposed approach, based on the use of the linear correlation coefficient as a measure of similarity, does not have this drawback. The comparability of the values received using new criterion with the values received using previous version of the criteria is shown. Results of comparison confirm the possibility of the new criterion practical use.
international conference on speech and computer | 2018
Dariya Novokhrestova; Evgeny Kostyuchenko; Roman Meshcheryakov
The article describes an approach to assessing the intelligibility of speech in the process of speech rehabilitation by finding the measure of the similarity of the standard and distorted pronunciation of phonemes. The approach is based on the calculation of the correlation coefficient between the transformed signal envelopes. The envelope of the signal is constructed on the basis of the calculation of the short-term energy of signal. The selection of the short-term energy parameter (window size) is also described. The parameter selection is based on comparing the differences between the correlation coefficients for pairs with normal pronunciation and pairs with distorted pronunciation, calculated for different window sizes. The window sizes for each problem phoneme are selected.
Multimedia Tools and Applications | 2018
Oleg Evsutin; Anna Kokurina; Roman Meshcheryakov; Olga Shumskaya
Many effective methods of the data embedding into digital images are based on the frequency transformations. However use of similar transformations is connected to the following problem: the built-in message is distorted because of information losses in case of restoration of pixels’ integer values from the frequency domain. It represents a vital issue if the integrity of the transmitted data is critical. For example, an insignificant distortion of the ciphered message results in impossibility of deciphering and to loss of all ciphered information. In this paper is described the new algorithm of the information embedding into digital images on the basis of the discrete Fourier transformation allowing to provide unmistakable extraction of the built-in messages from the frequency domain. The faultlessness is reached through an iterative procedure of embedding and non-uniform distribution of the message parts for the image-container’s blocks. Our algorithm not only solves a problem of the built-in messages distortions, but also provides high visual quality of stego-image. Moreover, our approach to the unmistakable embedding of information into the digital images frequency domain is applicable not only for the discrete Fourier transformation, but also for other frequency transformations.
Collaboration
Dive into the Roman Meshcheryakov's collaboration.
Tomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputsTomsk State University of Control Systems and Radio-electronics
View shared research outputs