Simon Šuster
University of Groningen
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Simon Šuster.
north american chapter of the association for computational linguistics | 2016
Simon Šuster; Ivan Titov; Gerardus van Noord
We present an approach to learning multi-sense word embeddings relying both on monolingual and bilingual information. Our model consists of an encoder, which uses monolingual and bilingual context (i.e. a parallel sentence) to choose a sense for a given word, and a decoder which predicts context words based on the chosen sense. The two components are estimated jointly. We observe that the word representations induced from bilingual data outperform the monolingual counterparts across a range of evaluation tasks, even though crosslingual information is not available at test time.
meeting of the association for computational linguistics | 2017
Simon Šuster; Stéphan Tulkens; Walter Daelemans
Clinical NLP has an immense potential in contributing to how clinical practice will be revolutionized by the advent of large scale processing of clinical records. However, this potential has remained largely untapped due to slow progress primarily caused by strict data access policies for researchers. In this paper, we discuss the concern for privacy and the measures it entails. We also suggest sources of less sensitive data. Finally, we draw attention to biases that can compromise the validity of empirical research and lead to socially harmful applications.
BioNLP 2017 | 2017
Pieter Fivez; Simon Šuster; Walter Daelemans
We present an unsupervised contextsensitive spelling correction method for clinical free-text that uses word and character n-gram embeddings. Our method generates misspelling replacement candidates and ranks them according to their semantic fit, by calculating a weighted cosine similarity between the vectorized representation of a candidate and the misspelling context. We greatly outperform two baseline off-the-shelf spelling correction tools on a manually annotated MIMIC-III test set, and counter the frequency bias of an optimized noisy channel model, showing that neural embeddings can be successfully exploited to include context-awareness in a spelling correction model. Our source code, including a script to extract the annotated test data, can be found at https://github.com/ pieterfivez/bionlp2017.
meeting of the association for computational linguistics | 2016
Stéphan Tulkens; Simon Šuster; Walter Daelemans
arXiv: Computation and Language | 2015
Simon Šuster; Gerardus van Noord; Ivan Titov
arXiv: Computation and Language | 2017
Pieter Fivez; Simon Šuster; Walter Daelemans
north american chapter of the association for computational linguistics | 2018
Simon Šuster; Walter Daelemans
arXiv: Computation and Language | 2018
Madhumita Sushil; Simon Šuster; Walter Daelemans
arXiv: Computation and Language | 2017
Madhumita Sushil; Simon Šuster; Kim Luyckx; Walter Daelemans
AMIA | 2017
Simon Šuster; Walter Daelemans