Jenna Kanerva | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Jenna Kanerva is active.

Explore More

Publication

Featured researches published by Jenna Kanerva.

Proceedings of the 2nd Workshop on Continuous Vector Space Models and their Compositionality (CVSC) | 2014

Post-hoc Manipulations of Vector Space Models with Application to Semantic Role Labeling

Jenna Kanerva; Filip Ginter

In this paper, we introduce several vector space manipulation methods that are applied to trained vector space models in a post-hoc fashion, and present an application of these techniques in semantic role labeling for Finnish and English. Specifically, we show that the vectors can be circularly shifted to encode syntactic information and subsequently averaged to produce representations of predicate senses and arguments. Further, we show that it is possible to effectively learn a linear transformation between the vector representations of predicates and their arguments, within the same vector space.

Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw#N# Text to Universal Dependencies | 2017

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

Daniel Zeman; Martin Popel; Milan Straka; Jan Hajic; Joakim Nivre; Filip Ginter; Juhani Luotolahti; Sampo Pyysalo; Slav Petrov; Martin Potthast; Francis M. Tyers; Elena Badmaeva; Memduh Gokirmak; Anna Nedoluzhko; Silvie Cinková; Jaroslava Hlaváčová; Václava Kettnerová; Zdenka Uresová; Jenna Kanerva; Stina Ojala; Anna Missilä; Christopher D. Manning; Sebastian Schuster; Siva Reddy; Dima Taji; Nizar Habash; Herman Leung; Marie-Catherine de Marneffe; Manuela Sanguinetti; Maria Simi

The Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets. In 2017, the task was devoted to learning dependency parsers for a large number of languages, in a real-world setting without any gold-standard annotation on input. All test sets followed a unified annotation scheme, namely that of Universal Dependencies. In this paper, we define the task and evaluation methodology, describe how the data sets were prepared, report and analyze the main results, and provide a brief categorization of the different approaches of the participating systems.

workshop on statistical machine translation | 2015

Morphological Segmentation and OPUS for Finnish-English Machine Translation

Jörg Tiedemann; Filip Ginter; Jenna Kanerva

This paper describes baseline systems for Finnish-English and English-Finnish machine translation using standard phrasebased and factored models including morphological features. We experiment with compound splitting and morphological segmentation and study the effect of adding noisy out-of-domain data to the parallel and the monolingual training data. Our results stress the importance of training data and demonstrate the effectiveness of morphological pre-processing of Finnish.

north american chapter of the association for computational linguistics | 2015

SETS: Scalable and Efficient Tree Search in Dependency Graphs

Juhani Luotolahti; Jenna Kanerva; Sampo Pyysalo; Filip Ginter

We present a syntactic analysis query toolkit geared specifically towards massive dependency parsebanks and morphologically rich languages. The query language allows arbitrary tree queries, including negated branches, and is suitable for querying analyses with rich morphological annotation. Treebanks of over a million words can be comfortably queried on a low-end netbook, and a parsebank with over 100M words on a single consumer-grade server. We also introduce a web-based interface for interactive querying. All contributions are available under open licenses.

language resources and evaluation | 2015

The Finnish Proposition Bank

Katri Haverinen; Jenna Kanerva; Samuel Kohonen; Anna Missilä; Stina Ojala; Timo Viljanen; Veronika Laippala; Filip Ginter

We present the Finnish PropBank, a resource for semantic role labeling (SRL) of Finnish based on the Turku Dependency Treebank whose syntax is annotated in the well-known Stanford Dependency (SD) scheme. The contribution of this paper consists of the lexicon of the verbs and their arguments present in the treebank, as well as the predicate-argument annotation of all verb occurrences in the treebank text. We demonstrate that the annotation is of high quality, that the SD scheme is highly compatible with PropBank annotation, and further that the additional dependencies present in the Turku Dependency Treebank are clearly beneficial for PropBank annotation. Further, we also use the PropBank to provide a strong baseline for automated Finnish SRL using a machine learning SRL system developed for the SemEval’14 shared task on broad-coverage semantic dependency parsing. The PropBank as well as the SRL system are available under a free license at http://bionlp.utu.fi/.

Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies : August 3-4, 2017 Vancouver, Canada, 2017, ISBN 978-1-945626-70-8, págs. 119-125 | 2017

TurkuNLP: Delexicalized Pre-training of Word Embeddings for Dependency Parsing

Jenna Kanerva; Juhani Luotolahti; Filip Ginter

We present the TurkuNLP entry in the CoNLL 2017 Shared Task on Multilingual Parsing from Raw Text to Universal Dependencies. The system is based on the UDPipe parser with our focus being in ex- ploring various techniques to pre-train the word embeddings used by the parser in order to improve its performance especially on languages with small training sets. The system ranked 11th among the 33 participants overall, being 8th on the small treebanks, 10th on the large treebanks, 12th on the parallel test sets, and 26th on the sur- prise languages.

Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers | 2016

Phrase-Based SMT for Finnish with More Data, Better Models and Alternative Alignment and Translation Tools

Jörg Tiedemann; Fabienne Cap; Jenna Kanerva; Filip Ginter; Sara Stymne; Robert Östling; Marion Weller-Di Marco

This paper summarises the contributions of the teams at the University of Helsinki, Uppsala University and the University of Turku to the news translation tasks for translating from and to Finnish. Our models address the problem of treating morphology and data coverage in various ways. We introduce a new efficient tool for word alignment and discuss factorisations, gappy language models and reinflection techniques for generating proper Finnish output. The results demonstrate once again that training data is the most effective way to increase translation performance.

Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers | 2016

Cross-Lingual Pronoun Prediction with Deep Recurrent Neural Networks.

Juhani Luotolahti; Jenna Kanerva; Filip Ginter

In this paper we present our winning system in the WMT16 Shared Task on CrossLingual Pronoun Prediction, where the objective is to predict a missing target language pronoun based on the target and source sentences. Our system is a deep recurrent neural network, which reads both the source language and target language context with a softmax layer making the final prediction. Our system achieves the best macro recall on all four language pairs. The margin to the next best system ranges between less than 1pp and almost 12pp depending on the language pair.

north american chapter of the association for computational linguistics | 2015

Turku: Semantic Dependency Parsing as a Sequence Classification

Jenna Kanerva; Juhani Luotolahti; Filip Ginter

This paper presents the University of Turku entry to the SemEval-2015 task on BroadCoverage Semantic Dependency Parsing. The system uses an existing transition-based parser as a sequence classifier to jointly predict all arguments of one candidate predicate at a time. Compared to our 2014 entry, the 2015 system gains about 3pp in terms of F-score for a fraction of the development time. Depending on the subtask, the difference between our entry and the winning system ranges between 1 and 5pp.

Archive | 2015

Universal Dependencies 1.0

Joakim Nivre; Željko Agić; Maria Jesus Aranzabe; Masayuki Asahara; Aitziber Atutxa; Miguel Ballesteros; John Bauer; Kepa Bengoetxea; Riyaz Ahmad Bhat; Cristina Bosco; Sam Bowman; Giuseppe G. A. Celano; Miriam Connor; Marie-Catherine de Marneffe; Arantza Diaz de Ilarraza; Kaja Dobrovoljc; Timothy Dozat; Tomaž Erjavec; Richárd Farkas; Jennifer Foster; Daniel Galbraith; Filip Ginter; Iakes Goenaga; Koldo Gojenola; Yoav Goldberg; Berta Gonzales; Bruno Guillaume; Jan Hajic; Dag Haug; Radu Ion

Explore More