Tobias Bachteler
University of Duisburg-Essen
Publication
Featured research published by Tobias Bachteler.
BMC Medical Informatics and Decision Making | 2009
Rainer Schnell; Tobias Bachteler; Jörg Reiher
Background: Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns.
Methods: A new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers.
Results: Tests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings.
Conclusion: We proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, it might be useful for many applications requiring privacy-preserving record linkage.
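The idea of Bloom filters on q-grams can be illustrated with a minimal sketch. It is not the authors' implementation: the filter length, the number of hash functions, the padding scheme, the double-hashing construction from two keyed HMACs, and the function names `qgrams`, `bloom_encode`, and `dice` are all assumptions chosen for illustration. Each identifier is split into bigrams, every bigram sets several bit positions, and two encoded identifiers are compared with the Dice coefficient of their set bits.

```python
import hashlib
import hmac


def qgrams(value, q=2):
    """Split a string into its set of q-grams, padded with blanks (assumed scheme)."""
    padded = " " * (q - 1) + value.lower() + " " * (q - 1)
    return {padded[i:i + q] for i in range(len(padded) - q + 1)}


def bloom_encode(value, key, length=1000, k=30, q=2):
    """Return the set of bit positions set for `value`.

    `length` and `k` are illustrative parameters, not values taken from the paper.
    The k positions per q-gram come from double hashing with two keyed hashes.
    """
    bits = set()
    for gram in qgrams(value, q):
        data = gram.encode("utf-8")
        h1 = int.from_bytes(hmac.new(key, data, hashlib.sha1).digest(), "big")
        h2 = int.from_bytes(hmac.new(key, data, hashlib.md5).digest(), "big")
        for i in range(k):
            bits.add((h1 + i * h2) % length)
    return bits


def dice(a, b):
    """Dice coefficient of two sets of bit positions."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


if __name__ == "__main__":
    secret = b"shared-secret"  # hypothetical key agreed on by both data owners
    smith = bloom_encode("smith", secret)
    smyth = bloom_encode("smyth", secret)
    jones = bloom_encode("jones", secret)
    print(dice(smith, smyth), dice(smith, jones))
```

Because similar strings share most of their q-grams, a spelling variant such as "smyth" should score noticeably higher against "smith" than an unrelated name does, which is what allows errors in encrypted identifiers to be tolerated.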
Evaluation Review | 2010
Rainer Schnell; Tobias Bachteler; Jörg Reiher
In panel studies on sensitive topics, respondent-generated identification codes are often used to link records across surveys. However, a substantial number of cases is usually lost because the codes fail to match. These losses may cause biased estimates. Using more components and linking the codes by the Levenshtein string distance function will reduce the losses. In a simulation study and two field experiments, the proposed procedure outperforms the methods previously applied.
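Linking codes by Levenshtein distance can be sketched as follows. This is a simplified illustration, not the procedure evaluated in the article: the threshold `max_distance`, the helper `link_codes`, and the example codes are hypothetical, and the sketch does not enforce one-to-one matches.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def link_codes(wave1, wave2, max_distance=1):
    """Link each code in wave1 to its closest code in wave2 if it is close enough."""
    links = {}
    for code in wave1:
        best = min(wave2, key=lambda other: levenshtein(code, other), default=None)
        if best is not None and levenshtein(code, best) <= max_distance:
            links[code] = best
    return links


if __name__ == "__main__":
    # Hypothetical respondent-generated codes from two survey waves.
    print(link_codes(["MA12BR", "XY99ZZ"], ["MA12BP", "QQ00QQ"]))
```

With exact matching, the first code would be lost because one character differs between waves; allowing a distance of 1 recovers it, while the unrelated code stays unlinked.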
Bundesgesundheitsblatt-gesundheitsforschung-gesundheitsschutz | 2010
Klaus Giersiepen; Tobias Bachteler; Tobias Gramlich; Jörg Reiher; B. Schubert; I. Novopashenny; Rainer Schnell
The evaluation of the German Mammography Screening Program requires record linkage with data from cancer registries in order to measure the number of false-negative mammograms and interval cancers. This study evaluates the performance of the established linkage method based on identifiers encrypted by the standard procedure of the German cancer registries. In addition, the results are compared with an alternative method based on plain text identifiers. A total of 16,572 records from the Bremen Mammography Screening Pilot Study were linked with data from the Bremen Cancer Registry. Based on a gold standard set of matching record pairs, homonym and synonym errors were determined. Given the customary threshold value in cancer registries, the plain text method showed a lower rate of synonym errors (2.1-5.1%) and a lower rate of homonym errors (0.01-0.15%). As 10.4 million women are invited to take part biennially in screening, the corresponding figures would be 3,237 homonym errors for the standard procedure and 294 for the plain text method, assuming equivalent conditions. The 11-fold increase in the homonym error rate documents the trade-off for better data protection using encrypted data.
AStA Wirtschafts- und Sozialstatistisches Archiv | 2010
Tobias Gramlich; Tobias Bachteler; Bernhard Schimpl-Neimanns; Rainer Schnell
Abstract: Panel data have numerous advantages over cross-sectional data. Data from official statistical offices (and other public authorities) are a valuable data source for the social and economic sciences. Many of these data are originally panel data (or can be combined to form panel data). This article gives an overview of panel data (households or persons) and panel surveys conducted by German public authorities, describing topics and contents, sampling, and access to these data.
Austrian Journal of Statistics | 2016
Rainer Schnell; Tobias Bachteler; Stefan Bender
Archive | 2011
Rainer Schnell; Tobias Bachteler; Jörg Reiher
methods, data, analyses | 2017
Rainer Schnell; Tobias Gramlich; Tobias Bachteler; Jörg Reiher; Mark Trappmann; Menno Smid; Inna Becher
FDZ Methodenreport | 2012
Theresa Scholz; Cerstin Rauscher; Jörg Reiher; Tobias Bachteler