Senja Pollak
University of Ljubljana
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Senja Pollak.
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing | 2017
Ben Verhoeven; Iza Škrjanec; Senja Pollak
We present results of the first gender classification experiments on Slovene text to our knowledge. Inspired by the TwiSty corpus and experiments (Verhoeven et al., 2016), we employed the Janes corpus (Erjavec et al., 2016) and its gender annotations to perform gender classification experiments on Twitter text comparing a token-based and a lemma-based approach. We find that the token-based approach (92.6% accuracy), containing gender markings related to the author, outperforms the lemma-based approach by about 5%. Especially in the lemmatized version, we also observe stylistic and contentbased differences in writing between men (e.g., more profane language, numerals and beer mentions) and women (e.g., more pronouns, emoticons and character flooding). Many of our findings corroborate previous research on other languages.
european conference on information retrieval | 2018
Jan Štihec; Martin Žnidaršič; Senja Pollak
The aim of this work is to reproduce the approach to detecting semantic orientations in economic texts that was presented in the paper Good Debt or Bad Debt: Detecting Semantic Orientations in Economic Texts by Malo et al. The approach employs the Linearized Phrase Structure model for sentence level classification of short economic texts into a positive, negative or neutral category from investor’s perspective and yields state-of-the-art results. The proposed method employs both rule based linguistic models and machine learning. Where possible we follow the same approach as described in the original paper, with some documented modifications. Our solution is simplified in at least two aspects, but its performance is comparable to the original and overall remains better than the reported results of other benchmark algorithms mentioned in the original paper. The differences between the two models and results are described in detail and lead to conclusion that the original approach is to a large extent repeatable and that our simplified version does not overly sacrifice performance for generalizability.
ieee symposium series on computational intelligence | 2015
Senja Pollak; Borut Lesjak; Janez Kranjc; Vid Podpečan; Martin nidaric; Nada Lavrač
Computational Creativity is a sub-field of Artificial Intelligence research, studying how to engineer software that exhibits behaviours which would reasonably be deemed creative. This paper addresses a creative task of question generation from scientific papers, using a pattern-based approach to finding relevant sentences from which questions should be generated, a natural language processing question construction mechanism, a crowd sourcing mechanism for question rating, and a robot interface for posing questions during a conference session, integrated in a creative Robo CHAIR solution. The system was trained on a set of 200 articles from past computer science conferences and evaluated on a set of articles of members of the local lab.
meeting of the association for computational linguistics | 2011
Darja Fišer; Nikola Ljubešić; Špela Vintar; Senja Pollak
recent advances in natural language processing | 2015
Nikola Ljubešić; Darja Fišer; Tomaž Erjavec; Jaka Čibej; Dafne Marko; Senja Pollak; Iza Škrjanec
ICCC | 2016
Pedro Martins; Senja Pollak; Tanja Urbančič; Amílcar Cardoso
ICCC | 2015
Pedro Martins; Tanja Urbančič; Senja Pollak; Nada Lavrač; Amílcar Cardoso
CLEF (Working Notes) | 2017
Matej Martinc; Iza Škrjanec; Katja Zupan; Senja Pollak
ICCC | 2016
Senja Pollak; Biljana Mileva-Boshkoska; Dragana Miljkovic; Geraint A. Wiggins; Nada Lavrač
Proceedings of KONVENS 2012 | 2012
Senja Pollak; Anze Vavpetic; Janez Kranjc; Nada Lavrač; Špela Vintar