Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Tobias Bachteler is active.

Publication


Featured researches published by Tobias Bachteler.


BMC Medical Informatics and Decision Making | 2009

Privacy-preserving record linkage using Bloom filters

Rainer Schnell; Tobias Bachteler; Jörg Reiher

BackgroundCombining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns.MethodsA new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers.ResultsTests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings.ConclusionWe proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, the protocol might be useful for many applications requiring privacy-preserving record linkage.


Evaluation Review | 2010

Improving the Use of Self-Generated Identification Codes

Rainer Schnell; Tobias Bachteler; Jörg Reiher

In panel studies on sensitive topics, respondent-generated identification codes are often used to link records across surveys. However, usually a substantial number of cases are lost due to the codes. These losses may cause biased estimates. Using more components and linking the codes by the Levenshtein string distance function will reduce the losses. In a simulation study and two field experiments, the proposed procedure outperforms the methods previously applied.


Bundesgesundheitsblatt-gesundheitsforschung-gesundheitsschutz | 2010

Performance of record linkage for cancer registry data linked with mammography screening data

Klaus Giersiepen; Tobias Bachteler; Tobias Gramlich; Jörg Reiher; B. Schubert; I. Novopashenny; Rainer Schnell

The evaluation of the German Mammography Screening Program requires record linkage with data from cancer registries in order to measure the number of false-negative mammograms and interval cancers. This study aims at evaluating the performance of the established linkage method based on identifiers encrypted by the standard procedure of the German cancer registries. In addition, the results are compared with an alternative method based on plain text identifiers. A total of 16,572 records from the Bremen Mammography Screening Pilot Study were linked with data from the Bremen Cancer Registry. Based on a gold standard set of matching record pairs, homonym and synonym errors were determined. Given the customary threshold value in cancer registries, the plain text method showed a lower rate of synonym errors (2.1-5.1%) and a lower rate of homonym errors (0.01-0.15%). As 10.4 million women are invited to take part biennially in screening, the corresponding figures would be 3,237 homonym errors for the standard procedure and 294 using the plain text method provided equivalent conditions. The 11-fold increase in the homonym error rate documents the trade-off for better data protection using encrypted data.


AStA Wirtschafts- und Sozialstatistisches Archiv | 2010

Panelerhebungen der amtlichen Statistik als Datenquellen für die Wirtschafts- und Sozialwissenschaften

Tobias Gramlich; Tobias Bachteler; Bernhard Schimpl-Neimanns; Rainer Schnell

ZusammenfassungPaneldaten haben gegenüber Querschnittsdaten zahlreiche Vorteile. Amtliche Daten sind zudem eine wichtige Quelle für die Sozial- und Wirtschaftswissenschaften. Viele amtliche Datenerhebungen sind als Panel konzipiert und durchgeführt oder können zu Panels zusammengefügt werden. Diese Arbeit gibt eine Übersicht über die Panelerhebungen oder zu Panels aufbereiteten Einzeldatensätze von Haushalten oder Personen der deutschen amtlichen Statistik und beschreibt Erhebungsinhalte, Stichprobe sowie Zugangsmöglichkeiten.AbstractPanel data have numerous advantages to cross sectional data. Data from official statistical offices (and other public authorities) are a valuable data source for the social and economic sciences. Many of these data originally are panel data (or can be combined to form panel data). This article gives an overview on panel data (households or persons) and panel surveys conducted by German public authorities, describing topic and contents, sampling and access to these data.


Bundesgesundheitsblatt-gesundheitsforschung-gesundheitsschutz | 2010

Zur Leistungsfähigkeit des Record-Linkage zwischen epidemiologischen Krebsregistern und dem Mammographie-Screening

Klaus Giersiepen; Tobias Bachteler; Tobias Gramlich; Jörg Reiher; B. Schubert; I. Novopashenny; Rainer Schnell

The evaluation of the German Mammography Screening Program requires record linkage with data from cancer registries in order to measure the number of false-negative mammograms and interval cancers. This study aims at evaluating the performance of the established linkage method based on identifiers encrypted by the standard procedure of the German cancer registries. In addition, the results are compared with an alternative method based on plain text identifiers. A total of 16,572 records from the Bremen Mammography Screening Pilot Study were linked with data from the Bremen Cancer Registry. Based on a gold standard set of matching record pairs, homonym and synonym errors were determined. Given the customary threshold value in cancer registries, the plain text method showed a lower rate of synonym errors (2.1-5.1%) and a lower rate of homonym errors (0.01-0.15%). As 10.4 million women are invited to take part biennially in screening, the corresponding figures would be 3,237 homonym errors for the standard procedure and 294 using the plain text method provided equivalent conditions. The 11-fold increase in the homonym error rate documents the trade-off for better data protection using encrypted data.


Bundesgesundheitsblatt-gesundheitsforschung-gesundheitsschutz | 2010

Zur Leistungsfähigkeit des Record-Linkage zwischen epidemiologischen Krebsregistern und dem Mammographie-Screening@@@Performance of record linkage for cancer registry data linked with mammography screening data

Klaus Giersiepen; Tobias Bachteler; Tobias Gramlich; Jörg Reiher; B. Schubert; I. Novopashenny; Rainer Schnell

The evaluation of the German Mammography Screening Program requires record linkage with data from cancer registries in order to measure the number of false-negative mammograms and interval cancers. This study aims at evaluating the performance of the established linkage method based on identifiers encrypted by the standard procedure of the German cancer registries. In addition, the results are compared with an alternative method based on plain text identifiers. A total of 16,572 records from the Bremen Mammography Screening Pilot Study were linked with data from the Bremen Cancer Registry. Based on a gold standard set of matching record pairs, homonym and synonym errors were determined. Given the customary threshold value in cancer registries, the plain text method showed a lower rate of synonym errors (2.1-5.1%) and a lower rate of homonym errors (0.01-0.15%). As 10.4 million women are invited to take part biennially in screening, the corresponding figures would be 3,237 homonym errors for the standard procedure and 294 using the plain text method provided equivalent conditions. The 11-fold increase in the homonym error rate documents the trade-off for better data protection using encrypted data.


Austrian Journal of Statistics | 2016

A Toolbox for Record Linkage

Rainer Schnell; Tobias Bachteler; Stefan Bender


Archive | 2011

A Novel Error-Tolerant Anonymous Linking Code

Rainer Schnell; Tobias Bachteler; Jörg Reiher


methods, data, analyses | 2017

A New Name-Based Sampling Method for Migrants

Rainer Schnell; Tobias Gramlich; Tobias Bachteler; Jörg Reiher; Mark Trappmann; Menno Smid; Inna Becher


FDZ Methodenreport | 2012

Geocoding of German administrative data: the case of the Institute for Employment Research

Theresa Scholz; Cerstin Rauscher; Jörg Reiher; Tobias Bachteler

Collaboration


Dive into the Tobias Bachteler's collaboration.

Top Co-Authors

Avatar

Rainer Schnell

University of Duisburg-Essen

View shared research outputs
Top Co-Authors

Avatar

Jörg Reiher

University of Duisburg-Essen

View shared research outputs
Top Co-Authors

Avatar

Tobias Gramlich

University of Duisburg-Essen

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jörg Reiher

University of Duisburg-Essen

View shared research outputs
Top Co-Authors

Avatar

Mark Trappmann

Institut für Arbeitsmarkt- und Berufsforschung

View shared research outputs
Researchain Logo
Decentralizing Knowledge