Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Maria-Hendrike Peetz is active.

Publication


Featured researches published by Maria-Hendrike Peetz.


Information Retrieval | 2014

Using temporal bursts for query modeling

Maria-Hendrike Peetz; Edgar Meij; Maarten de Rijke

We present an approach to query modeling that leverages the temporal distribution of documents in an initially retrieved set of documents. In news-related document collections such distributions tend to exhibit bursts. Here, we define a burst to be a time period where unusually many documents are published. In our approach we detect bursts in result lists returned for a query. We then model the term distributions of the bursts using a reduced result list and select its most descriptive terms. Finally, we merge the sets of terms obtained in this manner so as to arrive at a reformulation of the original query. For query sets that consist of both temporal and non-temporal queries, our query modeling approach incorporates an effective selection method of terms. We consistently and significantly improve over various baselines, such as relevance models, on both news collections and a collection of blog posts.


european conference on information retrieval | 2012

Adaptive temporal query modeling

Maria-Hendrike Peetz; Edgar Meij; Maarten de Rijke; Wouter Weerkamp

We present an approach to query modeling that uses the temporal distribution of documents in an initially retrieved set of documents. Such distributions tend to exhibit bursts, especially in news-related document collections. We hypothesize that documents in those bursts are more likely to be relevant and update the query model with the most distinguishing terms in high-quality documents sampled from bursts. We evaluate the effectiveness of our models on a test collection of blog posts.


european conference on information retrieval | 2013

Cognitive temporal document priors

Maria-Hendrike Peetz; Maarten de Rijke

Temporal information retrieval exploits temporal features of document collections and queries. Temporal document priors are used to adjust the score of a document based on its publication time. We consider a class of temporal document priors that is inspired by retention functions considered in cognitive psychology that are used to model the decay of memory. Many such functions used as a temporal document prior have a positive effect on overall retrieval performance. We examine the stability of this effect across news and microblog collections and discover interesting differences between retention functions. We also study the problem of optimizing parameters of the retention functions as temporal document priors; some retention functions display consistent good performance across large regions of the parameter space. A retention function based on a Weibull distribution is the preferred choice for a temporal document prior.


theory and practice of digital libraries | 2012

Semantic document selection: historical research on collections that span multiple centuries

Daan Odijk; Ork de Rooij; Maria-Hendrike Peetz; Toine Pieters; Maarten de Rijke; Stephen Snelders

The availability of digitized collections of historical data, such as newspapers, increases every day. With that, so does the wish for historians to explore these collections. Methods that are traditionally used to examine a collection do not scale up to todays collection sizes. We propose a method that combines text mining with exploratory search to provide historians with a means of interactively selecting and inspecting relevant documents from very large collections. We assess our proposal with a case study on a prototype system.


Information Processing and Management | 2016

Estimating Reputation Polarity on Microblog Posts

Maria-Hendrike Peetz; Maarten de Rijke; Rianne Kaptein

We find that reputation polarity of a post is different from sentiment.We model reputation polarity using feature classes from communication theory.We introduce new features based on the replies to a post.We propose different ways to operationalise the RepLab 2012 and 2013 tasks. In reputation management, knowing what impact a tweet has on the reputation of a brand or company is crucial. The reputation polarity of a tweet is a measure of how the tweet influences the reputation of a brand or company. We consider the task of automatically determining the reputation polarity of a tweet. For this classification task, we propose a feature-based model based on three dimensions: the source of the tweet, the contents of the tweet and the reception of the tweet, i.e., how the tweet is being perceived. For evaluation purposes, we make use of the RepLab 2012 and 2013 datasets. We study and contrast three training scenarios. The first is independent of the entity whose reputation is being managed, the second depends on the entity at stake, but has over 90% fewer training samples per model, on average. The third is dependent on the domain of the entities. We find that reputation polarity is different from sentiment and that having less but entity-dependent training data is significantly more effective for predicting the reputation polarity of a tweet than an entity-independent training scenario. Features related to the reception of a tweet perform significantly better than most other features.


international acm sigir conference on research and development in information retrieval | 2014

Hierarchical multi-label classification of social text streams

Zhaochun Ren; Maria-Hendrike Peetz; Shangsong Liang; Willemijn van Dolen; Maarten de Rijke


CEUR Workshop Proceedings | 2013

Towards an active learning system for company name disambiguation in microblog streams

Maria-Hendrike Peetz; Damiano Spina; Julio Gonzalo; M. de Rijke


international acm sigir conference on research and development in information retrieval | 2015

Active Learning for Entity Filtering in Microblog Streams

Damiano Spina; Maria-Hendrike Peetz; Maarten de Rijke


CEUR Workshop Proceedings | 2014

yourHistory - Semantic linking for a personalized timeline of historic events

D. Graus; Maria-Hendrike Peetz; Daan Odijk; O. de Rooij; M. de Rijke


CLEF (Working Notes) | 2013

Towards an Active Learning System for Company Name Disambiguation in Microblog Streams.

Maria-Hendrike Peetz; Damiano Spina; Julio Gonzalo; Maarten de Rijke

Collaboration


Dive into the Maria-Hendrike Peetz's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Daan Odijk

University of Amsterdam

View shared research outputs
Top Co-Authors

Avatar

M. de Rijke

University of Amsterdam

View shared research outputs
Top Co-Authors

Avatar

Stephen Snelders

VU University Medical Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Anne Schuth

University of Amsterdam

View shared research outputs
Top Co-Authors

Avatar

O. de Rooij

University of Amsterdam

View shared research outputs
Top Co-Authors

Avatar

Ork de Rooij

University of Amsterdam

View shared research outputs
Researchain Logo
Decentralizing Knowledge