Jan Chorowski | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Jan Chorowski is active.

Explore More

Publication

Featured researches published by Jan Chorowski.

international conference on acoustics, speech, and signal processing | 2016

End-to-end attention-based large vocabulary speech recognition

Dzmitry Bahdanau; Jan Chorowski; Dmitriy Serdyuk; Philemon Brakel; Yoshua Bengio

Many state-of-the-art Large Vocabulary Continuous Speech Recognition (LVCSR) Systems are hybrids of neural networks and Hidden Markov Models (HMMs). Recently, more direct end-to-end methods have been investigated, in which neural architectures were trained to model sequences of characters [1,2]. To our knowledge, all these approaches relied on Connectionist Temporal Classification [3] modules. We investigate an alternative method for sequence modelling based on an attention mechanism that allows a Recurrent Neural Network (RNN) to learn alignments between sequences of input frames and output labels. We show how this setup can be applied to LVCSR by integrating the decoding RNN with an n-gram language model and by speeding up its operation by constraining selections made by the attention mechanism and by reducing the source sequence lengths by pooling information over time. Recognition accuracies similar to other HMM-free RNN-based approaches are reported for the Wall Street Journal corpus.

intelligent data engineering and automated learning | 2014

Continuous Population-Based Incremental Learning with Mixture Probability Modeling for Dynamic Optimization Problems

Adrian Lancucki; Jan Chorowski; Krzysztof Michalak; Patryk Filipiak; Piotr Lipinski

This paper proposes a multimodal extension of PBIL C based on Gaussian mixture models for solving dynamic optimization problems. By tracking multiple optima, the algorithm is able to follow the changes in objective functions more efficiently than in the unimodal case. The approach was validated on a set of synthetic benchmarks including Moving Peaks, dynamization of the Rosenbrock function and compositions of functions from the IEEE CEC’2009 competition. The results obtained in the experiments proved the efficiency of the approach in solving dynamic problems with a number of competing peaks.

text speech and dialogue | 2017

On Multilingual Training of Neural Dependency Parsers

Michał Zapotoczny; Pawel Rychlikowski; Jan Chorowski

We show that a recently proposed neural dependency parser can be improved by joint training on multiple languages from the same family. The parser is implemented as a deep neural network whose only input is orthographic representations of words. In order to successfully parse, the network has to discover how linguistically relevant concepts can be inferred from word spellings. We analyze the representations of characters and words that are learned by the network to establish which properties of languages were accounted for. In particular we show that the parser has approximately learned to associate Latin characters with their Cyrillic counterparts and that it can group Polish and Russian words that have a similar grammatical function. Finally, we evaluate the parser on selected languages from the Universal Dependencies dataset and show that it is competitive with other recently proposed state-of-the art methods, while having a simple structure.

text speech and dialogue | 2017

Multipoint Neighbor Embedding

Adrian Lancucki; Jan Chorowski

Dimensionality reduction methods for visualization attempt to preserve in the embedding as much of the original information as possible. However, projection to 2-D or 3-D heavily distorts the data. Instead, we propose a multipoint extension to neighbor embedding methods, which allows to express datapoints from a high-dimensional space as sets of datapoints in a low-dimensional space. Cardinality of those sets is not assumed a priori. Using gradient of the cost function, we derive an expression, which for every datapoint indicates its remote area of attraction. We use it as a heuristic that guides selection and placement of additional datapoints. We demonstrate the approach with multipoint t-SNE, and adapt the \(\mathcal {O}(N\log N)\) approximation for computing the gradient of t-SNE to our setting. Experiments show that the approach brings qualitative and quantitative gains, i.e., it expresses more pairwise similarities and multi-group memberships of individual datapoints, better preserving the local structure of the data.

neural information processing systems | 2015

Attention-based models for speech recognition

Jan Chorowski; Dzmitry Bahdanau; Dmitriy Serdyuk; Kyunghyun Cho; Yoshua Bengio

arXiv: Learning | 2015

Blocks and Fuel: Frameworks for deep learning

Bart van Merriënboer; Dzmitry Bahdanau; Vincent Dumoulin; Dmitriy Serdyuk; David Warde-Farley; Jan Chorowski; Yoshua Bengio

arXiv: Neural and Evolutionary Computing | 2014

End-to-end continuous speech recognition using attention-based recurrent nn: First results

Jan Chorowski; Dzmitry Bahdanau; Kyunghyun Cho; Yoshua Bengio

IEEE Transactions on Neural Networks | 2015

Learning Understandable Neural Networks With Nonnegative Weight Constraints

Jan Chorowski; Jacek M. Zurada

international conference on acoustics, speech, and signal processing | 2018

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

Chung-Cheng Chiu; Tara N. Sainath; Yonghui Wu; Rohit Prabhavalkar; Patrick Nguyen; Zhifeng Chen; Anjuli Kannan; Ron Weiss; Kanishka Rao; Katya Gonina; Navdeep Jaitly; Bo Li; Jan Chorowski; Michiel Bacchiani

conference of the international speech communication association | 2017