Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Christian Borgs is active.

Publication


Featured researches published by Christian Borgs.


Journal of Statistical Physics | 1997

Dobrushin states for classical spin systems with complex interactions

Christian Borgs; J. T. Chayes; Jürg Fröhlich

AbstractWe consider a classical spin system on the hypercubic lattice with a general interaction of the form


international conference on data mining | 2016

Randomized Response and Balanced Bloom Filters for Privacy Preserving Record Linkage

Rainer Schnell; Christian Borgs


international conference on data mining | 2015

Building a National Perinatal Data Base without the Use of Unique Personal Identifiers

Rainer Schnell; Christian Borgs

H = \frac{\beta } {4}\sum\limits_{\begin{array}{*{20}c} {x,y:} \\ {|x - y| = 1} \\ \end{array} } {|s_x - s_y | - h} \sum\limits_x {x{}_x + } \sum\limits_A {\lambda _A \prod\limits_{y \in A} {S_y } }


BMC Medical Informatics and Decision Making | 2017

Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets

Adrian Brown; Christian Borgs; Sean M. Randall; Rainer Schnell


international conference on health informatics | 2017

A Comparison of Statistical Linkage Keys with Bloom Filter-based Encryptions for Privacy-preserving Record Linkage using Real-world Mammography Data.

Rainer Schnell; Anke Richter; Christian Borgs

are the spin variables, Β is the inverse temperature,h is the magnetic field, andλA are translation-invariant coupling constants satisfyingλA = 0 if diamA > l. No symmetry relating the configurationss ={sinx} and-s=-sx is assumed. In dimension d-3, we construct low-temperature States which break the translation invariance of the system by introducing so-called Dobrushin boundary conditions which force a horizontal interface into the system. In contrast to previous constructions, our methods work equally well for complex interactions, and should therefore be generalizable to quantum spin systems.


BMC Health Services Research | 2018

Sociodemographic differences in linkage error: an examination of four large-scale datasets

Sean M. Randall; Adrian Brown; James H. Boyd; Rainer Schnell; Christian Borgs; Anna Ferrante

In most European settings, record linkage across different institutions is based on encrypted personal identifiers - such as names, birthdays, or places of birth - to protect privacy. However, in practice up to 20% of the records may contain errors in identifiers. Thus, exact record linkage on encrypted identifiers usually results in the loss of large subsets of the data. Such losses usually imply biased statistical estimates since the causes of errors might be correlated with the variables of interest in many applications. Over the past 10 years, the field of Privacy Preserving Record Linkage (PPRL) has developed different techniques to link data without revealing the identity of the described entity. However, only few techniques are suitable for applied research with large data bases that include millions of records, which is typical for administrative or medical data bases. Bloom filters were found to be one successful technique for PPRL when large scale applications are concerned. Yet, Bloom filters have been subject to cryptographic attacks. Previous research has shown that the straight application of Bloom filters has a non-zero re-identification risk. We present new results on recently developed techniques defying all known attacks on PPRL Bloom filters. The computationally inexpensive algorithms modify personal identifiers by combining different cryptographic techniques. The paper demonstrates these new algorithms and demonstrates their performance concerning precision, recall, and re-identification risk on large data bases.


Physical Review Letters | 1995

DOES THE ROUGHNESS OF THE SUBSTRATE ENHANCE WETTING

Christian Borgs; De Coninck J; Roman Kotecký; Zinque M

To assess the quality of hospital care, national databases of standard medical procedures are common. A widely known example are national databases of births. If unique personal identification numbers are available (as in Scandinavian countries), the construction of such databases is trivial from a computational point of view. However, due to privacy legislation, such identifiers are not available in all countries. Given such constraints, the construction of a national perinatal database has to rely on other patient identifiers, such as names and dates of birth. These kind of identifiers are prone to errors. Furthermore, some jurisdictions require the encryption of personal identifiers. The resulting problem is therefore an example of Privacy Preserving Record Linkage (PPRL). This contribution describes the design considerations for a national perinatal database using data of about 600,000 births in about 1,000 hospitals. Based on simulations, recommendations for parameter settings of Bloom filter based PPRL are given for this real world application.


Communications in Mathematical Physics | 1997

Dobrushin States in Quantum Lattice Systems

Christian Borgs; J. T. Chayes; Jürg Fröhlich

BackgroundIntegrating medical data using databases from different sources by record linkage is a powerful technique increasingly used in medical research. Under many jurisdictions, unique personal identifiers needed for linking the records are unavailable. Since sensitive attributes, such as names, have to be used instead, privacy regulations usually demand encrypting these identifiers. The corresponding set of techniques for privacy-preserving record linkage (PPRL) has received widespread attention. One recent method is based on Bloom filters. Due to superior resilience against cryptographic attacks, composite Bloom filters (cryptographic long-term keys, CLKs) are considered best practice for privacy in PPRL. Real-world performance of these techniques using large-scale data is unknown up to now.MethodsUsing a large subset of Australian hospital admission data, we tested the performance of an innovative PPRL technique (CLKs using multibit trees) against a gold-standard derived from clear-text probabilistic record linkage. Linkage time and linkage quality (recall, precision and F-measure) were evaluated.ResultsClear text probabilistic linkage resulted in marginally higher precision and recall than CLKs. PPRL required more computing time but 5 million records could still be de-duplicated within one day. However, the PPRL approach required fine tuning of parameters.ConclusionsWe argue that increased privacy of PPRL comes with the price of small losses in precision and recall and a large increase in computational burden and setup time. These costs seem to be acceptable in most applied settings, but they have to be considered in the decision to apply PPRL. Further research on the optimal automatic choice of parameters is needed.


BTW | 2015

Privacy Preserving Record Linkage with PPJoin.

Ziad Sehili; Lars Kolb; Christian Borgs; Rainer Schnell; Erhard Rahm

New EU regulations on the need to encrypt personal identifiers for linking data will increase the importance of Privacy-Preserving Record Linkage (PPRL) techniques over the course of the next years. Currently, the use of Anonymous Linkage Codes (ALCs) is the standard procedure for PPRL of medical databases. Recently, Bloom filter-based encodings of pseudo-identifiers such as names have received increasing attention for PPRL tasks. In contrast to most previous research in PPRL, which is based on simulated data, we compare the performance of ALCs and Bloom filter-based linkage keys using real data from a large regional breast cancer screening program. This large regional mammography data base contains nearly 200.000 records. We compare precision and recall for linking the data set existing at point t0 with new incident cases occuring after t0 using different encoding and matching strategies for the personal identifiers. Enhancing ALCs with an additional identifier (place of birth) yields better recall than standard ALCs. Using the same information for Bloom filters with recommended parameter settings exceeds ALCs in recall, while preserving precision.


International Journal for Population Data Science | 2017

High quality linkage using Multibit Trees for privacy-preserving blocking

Adrian Brown; Christian Borgs; Sean M. Randall; Rainer Schnell

BackgroundRecord linkage is an important tool for epidemiologists and health planners. Record linkage studies will generally contain some level of residual record linkage error, where individual records are either incorrectly marked as belonging to the same individual, or incorrectly marked as belonging to separate individuals. A key question is whether errors in linkage quality are distributed evenly throughout the population, or whether certain subgroups will exhibit higher rates of error. Previous investigations of this issue have typically compared linked and un-linked records, which can conflate bias caused by record linkage error, with bias caused by missing records (data capture errors).MethodsFour large administrative datasets were individually de-duplicated, with results compared to an available ‘gold-standard’ benchmark, allowing us to avoid methodological issues with comparing linked and un-linked records. Results were compared by gender, age, geographic remoteness (major cities, regional or remote) and socioeconomic status.ResultsResults varied between datasets, and by sociodemographic characteristic. The most consistent findings were worse linkage quality for younger individuals (seen in all four datasets) and worse linkage quality for those living in remote areas (seen in three of four datasets). The linkage quality within sociodemographic categories varied between datasets, with the associations with linkage error reversed across different datasets due to quirks of the specific data collection mechanisms and data sharing practices.ConclusionsThese results suggest caution should be taken both when linking younger individuals and those in remote areas, and when analysing linked data from these subgroups. Further research is required to determine the ramifications of worse linkage quality in these subpopulations on research outcomes.

Collaboration


Dive into the Christian Borgs's collaboration.

Top Co-Authors

Avatar

Rainer Schnell

University of Duisburg-Essen

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Roman Kotecky

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

J. T. Chayes

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Roman Kotecký

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge