Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Miroslav Spousta is active.

Publication


Featured researches published by Miroslav Spousta.


meeting of the association for computational linguistics | 2009

Semi-Supervised Training for the Averaged Perceptron POS Tagger

Drahomíra "johanka" Spoustová; Jan Hajiċ; Jan Raab; Miroslav Spousta

This paper describes POS tagging experiments with semi-supervised training as an extension to the (supervised) averaged perceptron algorithm, first introduced for this task by (Collins, 2002). Experiments with an iterative training on standard-sized supervised (manually annotated) dataset (106 tokens) combined with a relatively modest (in the order of 108 tokens) unsupervised (plain) data in a bagging-like fashion showed significant improvement of the POS classification task on typologically different languages, yielding better than state-of-the-art results for English and Czech (4.12 % and 4.86 % relative error reduction, respectively; absolute accuracies being 97.44 % and 95.89 %).


meeting of the association for computational linguistics | 2007

Towards the Automatic Extraction of Definitions in Slavic

Adam Przepiórkowski; Lukasz Degórski; Miroslav Spousta; Kiril Simov; Petya Osenova; Lothar Lemnitzer; Vladislav Kuboň; Beata Wójtowicz

This paper presents the results of the preliminary experiments in the automatic extraction of definitions (for semi-automatic glossary construction) from usually unstructured or only weakly structured e-learning texts in Bulgarian, Czech and Polish. The extraction is performed by regular grammars over XML-encoded morphosyntactically-annotated documents. The results are less than satisfying and we claim that the reason for that is the intrinsic difficulty of the task, as measured by the low interannotator agreement, which calls for more sophisticated deeper linguistic processing, as well as for the use of machine learning classification techniques.


The Prague Bulletin of Mathematical Linguistics | 2010

Dependency Parsing as a Sequence Labeling Task

Drahomíra "johanka" Spoustová; Miroslav Spousta

Dependency Parsing as a Sequence Labeling Task The aim of this paper is to explore the feasibility of solving the dependency parsing problem using sequence labeling tools. We introduce an algorithm to transform a dependency tree into a tag sequence suitable for a sequence labeling algorithm and evaluate several parameter settings on the standard treebank data. We focus mainly on Czech, as a high-inflective free-word-order language, which is not so easy to parse using traditional techniques, but we also test our approach on English for comparison.


Archive | 2007

Web Page Cleaning with Conditional Random Fields

Michal Marek; Pavel Pecina; Miroslav Spousta


language resources and evaluation | 2010

Building a Web Corpus of Czech.

Drahomíra "johanka" Spoustová; Miroslav Spousta; Pavel Pecina


language resources and evaluation | 2012

A High-Quality Web Corpus of Czech

Johanka Spoustová; Miroslav Spousta


text, speech and dialogue | 2010

Integration of speech and text processing modules into a real-time dialogue system

Jan Ptáček; Pavel Ircing; Miroslav Spousta; Jan Romportl; Zdeněk Loose; Silvie Cinková; José Relaño Gil; Raúl Santos


Lecture Notes in Artificial Intelligence | 2010

Integration of Speech and Text Processing Modules into a Real-Time Dialogue System

Jan Ptáček; Pavel Ircing; Miroslav Spousta; Jan Romportl; Zdeněk Loose; Silvie Cinková; José Relaño Gil; Raúl Santos


the florida ai research society | 2008

Multilingual Approach to e-Learning from a Monolingual Perspective.

Vladislav Kubon; Miroslav Spousta


language resources and evaluation | 2008

Validating the Quality of Full Morphological Annotation.

Drahomíra "johanka" Spoustová; Pavel Pecina; Jan Hajic; Miroslav Spousta

Collaboration


Dive into the Miroslav Spousta's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jan Romportl

University of West Bohemia

View shared research outputs
Top Co-Authors

Avatar

Pavel Pecina

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jan Hajic

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jan Ptáček

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Johanka Spoustová

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Pavel Ircing

Johns Hopkins University

View shared research outputs
Top Co-Authors

Avatar

Silvie Cinková

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Zdeněk Loose

University of West Bohemia

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge