Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Tom Kocmi is active.

Publication


Featured researches published by Tom Kocmi.


text speech and dialogue | 2016

CzEng 1.6: Enlarged Czech-English Parallel Corpus with Processing Tools Dockered

Ondřej Bojar; Ondřej Dušek; Tom Kocmi; Jindřich Libovický; Michal Novák; Martin Popel; Roman Sudarikov; Dusan Varis

We present a new release of the Czech-English parallel corpus CzEng. CzEng 1.6 consists of about 0.5 billion words (“gigaword”) in each language. The corpus is equipped with automatic annotation at a deep syntactic level of representation and alternatively in Universal Dependencies. Additionally, we release the complete annotation pipeline as a virtual machine in the Docker virtualization toolkit.


text speech and dialogue | 2016

SubGram: Extending Skip-Gram Word Representation with Substrings

Tom Kocmi; Ondřej Bojar

Skip-gram (word2vec) is a recent method for creating vector representations of words (“distributed word representations”) using a neural network. The representation gained popularity in various areas of natural language processing, because it seems to capture syntactic and semantic information about words without any explicit supervision in this respect.


Proceedings of the Second Conference on Machine Translation | 2017

Results of the WMT17 Neural MT Training Task.

Ondrej Bojar; Jindrich Helcl; Tom Kocmi; Jindrich Libovický; Tomás Musil


Archive | 2017

An Exploration of Word Embedding Initialization in Deep-Learning Tasks.

Tom Kocmi; Ondřej Bojar


conference of the association for machine translation in the americas | 2018

Neural Monkey: The Current State and Beyond.

Jindrich Helcl; Jindrich Libovický; Tom Kocmi; Tomás Musil; Ondrej Cífka; Dusan Varis; Ondrej Bojar


arXiv: Computation and Language | 2018

Trivial Transfer Learning for Low-Resource Neural Machine Translation

Tom Kocmi; Ondřej Bojar


recent advances in natural language processing | 2017

Curriculum Learning and Minibatch Bucketing in Neural Machine Translation.

Tom Kocmi; Ondrej Bojar


conference of the european chapter of the association for computational linguistics | 2017

LanideNN: Multilingual Language Identification on Text Stream.

Tom Kocmi; Ondřej Bojar


arXiv: Computation and Language | 2017

LanideNN: Multilingual Language Identification on Character Window.

Tom Kocmi; Ondřej Bojar


Proceedings of the Second Conference on Machine Translation | 2017

CUNI submission in WMT17: Chimera goes neural.

Roman Sudarikov; David Marecek; Tom Kocmi; Dusan Varis; Ondrej Bojar

Collaboration


Dive into the Tom Kocmi's collaboration.

Top Co-Authors

Avatar

Ondřej Bojar

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Ondrej Bojar

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Dusan Varis

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jindrich Helcl

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jindrich Libovický

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Roman Sudarikov

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jindřich Libovický

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Martin Popel

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Michal Novák

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Ondřej Dušek

Charles University in Prague

View shared research outputs
Researchain Logo
Decentralizing Knowledge