Sander Scholtus | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Sander Scholtus is active.

Explore More

Publication

Featured researches published by Sander Scholtus.

Journal of Official Statistics | 2013

Automated and Manual Data Editing: A View on Process Design and Methodology

Jeroen Pannekoek; Sander Scholtus; Mark Van der Loo

Abstract Data editing is arguably one of the most resource-intensive processes at NSIs. Forced by everincreasing budget pressure, NSIs keep searching for more efficient forms of data editing. Efficiency gains can be obtained by selective editing, that is, limiting the manual editing to influential errors, and by automating the editing process as much as possible. In our view, an optimal mix of these two strategies should be aimed for. In this article we present a decomposition of the overall editing process into a number of different tasks and give an upto- date overview of all the possibilities of automatic editing in terms of these tasks. During the design of an editing process, this decomposition may be helpful in deciding which tasks can be done automatically and for which tasks (additional) manual editing is required. Such decisions can be made a priori, based on the specific nature of the task, or by empirical evaluation, which is illustrated by examples. The decomposition in tasks, or statistical functions, also naturally leads to reuseable components, resulting in efficiency gains in process design.

Journal of Official Statistics | 2015

Sensitivity of Mixed-Source Statistics to Classification Errors

Joep Burger; Arnout van Delden; Sander Scholtus

Abstract For policymakers and other users of official statistics, it is crucial to distinguish real differences underlying statistical outcomes from noise caused by various error sources in the statistical process. This has become more difficult as official statistics are increasingly based upon a mix of sources that typically do not involve probability sampling. In this article, we apply a resampling method to assess the sensitivity of mixed-source statistics to sourcespecific classification errors. Classification errors can be seen as coverage errors within a stratum. The method can be used to compare relative accuracies between strata and releases, it can assist in deciding how to optimally allocate resources in the statistical process, and it can be applied in evaluating potential estimators. A case study on short-term business statistics shows that bias occurs especially for those strata that deviate strongly from the mean value in other strata. It also suggests that shifting classification resources from small and mediumsized enterprises to large enterprises has virtually no net effect on accuracy, because the gain in precision is offset by the creation of bias. The resampling method can be extended to include other types of nonsampling error.

Journal of Official Statistics | 2016

Accuracy of Mixed-Source Statistics as Affected by Classification Errors

Arnout van Delden; Sander Scholtus; Joep Burger

Abstract Publications in official statistics are increasingly based on a combination of sources. Although combining data sources may result in nearly complete coverage of the target population, the outcomes are not error free. Estimating the effect of nonsampling errors on the accuracy of mixed-source statistics is crucial for decision making, but it is not straightforward. Here we simulate the effect of classification errors on the accuracy of turnover-level estimates in car-trade industries. We combine an audit sample, the dynamics in the business register, and expert knowledge to estimate a transition matrix of classification-error probabilities. Bias and variance of the turnover estimates caused by classification errors are estimated by a bootstrap resampling approach. In addition, we study the extent to which manual selective editing at micro level can improve the accuracy. Our analyses reveal which industries do not meet preset quality criteria. Surprisingly, more selective editing can result in less accurate estimates for specific industries, and a fixed allocation of editing effort over industries is more effective than an allocation in proportion with the accuracy and population size of each industry. We discuss how to develop a practical method that can be implemented in production to estimate the accuracy of register-based estimates.

Archive | 2011

Handbook of statistical data editing and imputation

Ton de Waal; Jeroen Pannekoek; Sander Scholtus

Archive | 2011

Handbook of Statistical Data Editing and Imputation: de Waal/Handbook Data Editing

Ton de Waal; Jeroen Pannekoek; Sander Scholtus

Wiley Interdisciplinary Reviews: Computational Statistics | 2012

The editing of statistical data: methods and techniques for the efficient detection and correction of errors and missing values

Ton de Waal; Jeroen Pannekoek; Sander Scholtus

Archive | 2013

Automatic editing with hard and soft edits

Sander Scholtus

New Techniques and Technologies for Statistics | 2015

Modelling measurement error to estimate bias in administrative and survey variables

Sander Scholtus; A. van Delden; B.F.M. Bakker

UN/ECE Work Session on Statistical Data Editing | 2017

Evaluating the Quality of Business Survey Data before and after Automatic Editing. Working Paper No. 6

Sander Scholtus; B.F.M. Bakker; Sam Robinson

Archive | 2017

Herziening gewichtenregeling primair onderwijs. Hoofdlijnenrapport

Hanneke Posthumus; B.F.M. Bakker; Jamie Graham; Karolijne van der Houwen; Mersiha Tepic; Jeroen van den Tillaart; Sander Scholtus; Désirée Verhallen-Schumacher; Nander de Vette

Explore More