Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ville A. Satopää is active.

Publication


Featured researches published by Ville A. Satopää.


Journal of the American Statistical Association | 2016

Modeling Probability Forecasts via Information Diversity

Ville A. Satopää; Robin Pemantle; Lyle H. Ungar

ABSTRACT Randomness in scientific estimation is generally assumed to arise from unmeasured or uncontrolled factors. However, when combining subjective probability estimates, heterogeneity stemming from people’s cognitive or information diversity is often more important than measurement noise. This article presents a novel framework that uses partially overlapping information sources. A specific model is proposed within that framework and applied to the task of aggregating the probabilities given by a group of forecasters who predict whether an event will occur or not. Our model describes the distribution of information across forecasters in terms of easily interpretable parameters and shows how the optimal amount of extremizing of the average probability forecast (shifting it closer to its nearest extreme) varies as a function of the forecasters’ information overlap. Our model thus gives a more principled understanding of the historically ad hoc practice of extremizing average forecasts. Supplementary material for this article is available online.


The Annals of Applied Statistics | 2014

Probability aggregation in time-series: Dynamic hierarchical modeling of sparse expert beliefs

Ville A. Satopää; Shane T. Jensen; Barbara A. Mellers; Philip E. Tetlock; Lyle H. Ungar

Most subjective probability aggregation procedures use a single probability judgment from each expert, even though it is common for experts studying real problems to update their probability estimates over time. This paper advances into unexplored areas of probability aggregation by considering a dynamic context in which experts can update their beliefs at random intervals. The updates occur very infrequently, resulting in a sparse data set that cannot be modeled by standard time-series procedures. In response to the lack of appropriate methodology, this paper presents a hierarchical model that takes into account the experts level of self-reported expertise and produces aggregate probabilities that are sharp and well calibrated both in- and out-of-sample. The model is demonstrated on a real-world data set that includes over 2300 experts making multiple probability forecasts over two years on different subsets of 166 international political events.


Journal of the American Statistical Association | 2017

Mortality Rate Estimation and Standardization for Public Reporting: Medicare’s Hospital Compare

Edward I. George; Veronika Rockova; Paul R. Rosenbaum; Ville A. Satopää; Jeffrey H. Silber

ABSTRACT Bayesian models are increasingly fit to large administrative datasets and then used to make individualized recommendations. In particular, Medicare’s Hospital Compare webpage provides information to patients about specific hospital mortality rates for a heart attack or acute myocardial infarction (AMI). Hospital Compare’s current recommendations are based on a random-effects logit model with a random hospital indicator and patient risk factors. Except for the largest hospitals, these individual recommendations or predictions are not checkable against data, because data from smaller hospitals are too limited to provide a meaningful check. Before individualized Bayesian recommendations, people derived general advice from empirical studies of many hospitals, for example, prefer hospitals of Type 1 to Type 2 because the risk is lower at Type 1 hospitals. Here, we calibrate these Bayesian recommendation systems by checking, out of sample, whether their predictions aggregate to give correct general advice derived from another sample. This process of calibrating individualized predictions against general empirical advice leads to substantial revisions in the Hospital Compare model for AMI mortality. To make appropriately calibrated predictions, our revised models incorporate information about hospital volume, nursing staff, medical residents, and the hospital’s ability to perform cardiovascular procedures. For the ultimate purpose of comparisons, hospital mortality rates must be standardized to adjust for patient mix variation across hospitals. We find that indirect standardization, as currently used by Hospital Compare, fails to adequately control for differences in patient risk factors and systematically underestimates mortality rates at the low volume hospitals. To provide good control and correctly calibrated rates, we propose direct standardization instead. Supplementary materials for this article are available online.


Computational Statistics & Data Analysis | 2013

Simultaneous confidence intervals for comparing margins of multivariate binary data

Bernhard Klingenberg; Ville A. Satopää

In many applications two groups are compared simultaneously on several correlated binary variables for a more comprehensive assessment of group differences. Although the response is multivariate, the main interest is in comparing the marginal probabilities between the groups. Estimating the size of these differences under strong error control allows for a better evaluation of effects than can be provided by multiplicity adjusted P-values. Simultaneous confidence intervals for the differences in marginal probabilities are developed through inverting the maximum of correlated Wald, score or quasi-score statistics. Taking advantage of the available correlation information leads to improvements in the joint coverage probability and power compared to straightforward Bonferroni adjustments. Estimating the correlation under the null is also explored. While computationally complex even in small dimensions, it does not result in marked improvements. Based on extensive simulation results, a simple approach that uses univariate score statistics together with their estimated correlation is proposed and recommended. All methods are illustrated using data from a vaccine trial that investigated the incidence of four pre-specified adverse events between two groups and with data from the General Social Survey.


international conference on distributed computing systems workshops | 2011

Finding a "Kneedle" in a Haystack: Detecting Knee Points in System Behavior

Ville A. Satopää; Jeannie R. Albrecht; David E. Irwin; Barath Raghavan


International Journal of Forecasting | 2014

Combining multiple probability predictions using a simple logit model

Ville A. Satopää; Jonathan Baron; Dean P. Foster; Barbara A. Mellers; Philip E. Tetlock; Lyle H. Ungar


national conference on artificial intelligence | 2012

The Good Judgment Project: A Large Scale Test of Different Methods of Combining Expert Predictions

Lyle H. Ungar; Barbara A. Mellers; Ville A. Satopää; Philip E. Tetlock; Jon Baron


Archive | 2012

The Good Judgment Project: A Large Scale Test

Lyle H. Ungar; Barb Mellors; Ville A. Satopää; Jon Baron; Phil Tetlock; Jaime Ramos; Sam Swift


Electronic Journal of Statistics | 2017

Partial information framework: Model-based aggregation of estimates from diverse information sources

Ville A. Satopää; Shane T. Jensen; Robin Pemantle; Lyle H. Ungar


arXiv: Methodology | 2015

Combining and Extremizing Real-Valued Forecasts

Ville A. Satopää; Lyle H. Ungar

Collaboration


Dive into the Ville A. Satopää's collaboration.

Top Co-Authors

Avatar

Lyle H. Ungar

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Robin Pemantle

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Philip E. Tetlock

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Shane T. Jensen

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Edward I. George

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Jeffrey H. Silber

Children's Hospital of Philadelphia

View shared research outputs
Top Co-Authors

Avatar

Jon Baron

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Paul R. Rosenbaum

University of Pennsylvania

View shared research outputs
Top Co-Authors

Avatar

Veronika Rockova

University of Pennsylvania

View shared research outputs
Researchain Logo
Decentralizing Knowledge