Sander Scholtus
Statistics Netherlands
Publications
Featured research published by Sander Scholtus.
Journal of Official Statistics | 2013
Jeroen Pannekoek; Sander Scholtus; Mark Van der Loo
Abstract Data editing is arguably one of the most resource-intensive processes at NSIs. Forced by ever-increasing budget pressure, NSIs keep searching for more efficient forms of data editing. Efficiency gains can be obtained by selective editing, that is, limiting the manual editing to influential errors, and by automating the editing process as much as possible. In our view, an optimal mix of these two strategies should be aimed for. In this article we present a decomposition of the overall editing process into a number of different tasks and give an up-to-date overview of all the possibilities of automatic editing in terms of these tasks. During the design of an editing process, this decomposition may be helpful in deciding which tasks can be done automatically and for which tasks (additional) manual editing is required. Such decisions can be made a priori, based on the specific nature of the task, or by empirical evaluation, which is illustrated by examples. The decomposition into tasks, or statistical functions, also naturally leads to reusable components, resulting in efficiency gains in process design.
Journal of Official Statistics | 2015
Joep Burger; Arnout van Delden; Sander Scholtus
Abstract For policymakers and other users of official statistics, it is crucial to distinguish real differences underlying statistical outcomes from noise caused by various error sources in the statistical process. This has become more difficult as official statistics are increasingly based upon a mix of sources that typically do not involve probability sampling. In this article, we apply a resampling method to assess the sensitivity of mixed-source statistics to source-specific classification errors. Classification errors can be seen as coverage errors within a stratum. The method can be used to compare relative accuracies between strata and releases, it can assist in deciding how to optimally allocate resources in the statistical process, and it can be applied in evaluating potential estimators. A case study on short-term business statistics shows that bias occurs especially for those strata that deviate strongly from the mean value in other strata. It also suggests that shifting classification resources from small and medium-sized enterprises to large enterprises has virtually no net effect on accuracy, because the gain in precision is offset by the creation of bias. The resampling method can be extended to include other types of nonsampling error.
Journal of Official Statistics | 2016
Arnout van Delden; Sander Scholtus; Joep Burger
Abstract Publications in official statistics are increasingly based on a combination of sources. Although combining data sources may result in nearly complete coverage of the target population, the outcomes are not error free. Estimating the effect of nonsampling errors on the accuracy of mixed-source statistics is crucial for decision making, but it is not straightforward. Here we simulate the effect of classification errors on the accuracy of turnover-level estimates in car-trade industries. We combine an audit sample, the dynamics in the business register, and expert knowledge to estimate a transition matrix of classification-error probabilities. Bias and variance of the turnover estimates caused by classification errors are estimated by a bootstrap resampling approach. In addition, we study the extent to which manual selective editing at the micro level can improve the accuracy. Our analyses reveal which industries do not meet preset quality criteria. Surprisingly, more selective editing can result in less accurate estimates for specific industries, and a fixed allocation of editing effort over industries is more effective than an allocation in proportion to the accuracy and population size of each industry. We discuss how to develop a practical method that can be implemented in production to estimate the accuracy of register-based estimates.
Archive | 2011
Ton de Waal; Jeroen Pannekoek; Sander Scholtus
Wiley Interdisciplinary Reviews: Computational Statistics | 2012
Ton de Waal; Jeroen Pannekoek; Sander Scholtus
Archive | 2013
Sander Scholtus
New Techniques and Technologies for Statistics | 2015
Sander Scholtus; A. van Delden; B.F.M. Bakker
UN/ECE Work Session on Statistical Data Editing | 2017
Sander Scholtus; B.F.M. Bakker; Sam Robinson
Archive | 2017
Hanneke Posthumus; B.F.M. Bakker; Jamie Graham; Karolijne van der Houwen; Mersiha Tepic; Jeroen van den Tillaart; Sander Scholtus; Désirée Verhallen-Schumacher; Nander de Vette