Brunilda Balliu
Stanford University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Brunilda Balliu.
Nature Genetics | 2017
Manuel A. Ferreira; Judith M. Vonk; Hansjörg Baurecht; Ingo Marenholz; Chao Tian; Joshua Hoffman; Quinta Helmer; Annika Tillander; Vilhelmina Ullemar; Jenny van Dongen; Yi Lu; Franz Rüschendorf; Chris W Medway; Edward Mountjoy; Kimberley Burrows; Oliver Hummel; Sarah Grosche; Ben Michael Brumpton; John S. Witte; Jouke-Jan Hottenga; Gonneke Willemsen; Jie Zheng; Elke Rodriguez; Melanie Hotze; Andre Franke; Joana A. Revez; Jonathan Beesley; Melanie C. Matheson; Shyamali C. Dharmage; Lisa Bain
Asthma, hay fever (or allergic rhinitis) and eczema (or atopic dermatitis) often coexist in the same individuals, partly because of a shared genetic origin. To identify shared risk variants, we performed a genome-wide association study (GWAS; n = 360,838) of a broad allergic disease phenotype that considers the presence of any one of these three diseases. We identified 136 independent risk variants (P < 3 × 10−8), including 73 not previously reported, which implicate 132 nearby genes in allergic disease pathophysiology. Disease-specific effects were detected for only six variants, confirming that most represent shared risk factors. Tissue-specific heritability and biological process enrichment analyses suggest that shared risk variants influence lymphocyte-mediated immunity. Six target genes provide an opportunity for drug repositioning, while for 36 genes CpG methylation was found to influence transcription independently of genetic effects. Asthma, hay fever and eczema partly coexist because they share many genetic risk variants that dysregulate the expression of immune-related genes.
Genome Research | 2016
Kimberly R. Kukurba; Princy Parsana; Brunilda Balliu; Kevin S. Smith; Zachary Zappala; David A. Knowles; Marie Julie Favé; Joe R. Davis; Xin Li; Xiaowei Zhu; James B. Potash; Myrna M. Weissman; Jianxin Shi; Anshul Kundaje; Douglas F. Levinson; Alexis Battle; Stephen B. Montgomery
The X Chromosome, with its unique mode of inheritance, contributes to differences between the sexes at a molecular level, including sex-specific gene expression and sex-specific impact of genetic variation. Improving our understanding of these differences offers to elucidate the molecular mechanisms underlying sex-specific traits and diseases. However, to date, most studies have either ignored the X Chromosome or had insufficient power to test for the sex-specific impact of genetic variation. By analyzing whole blood transcriptomes of 922 individuals, we have conducted the first large-scale, genome-wide analysis of the impact of both sex and genetic variation on patterns of gene expression, including comparison between the X Chromosome and autosomes. We identified a depletion of expression quantitative trait loci (eQTL) on the X Chromosome, especially among genes under high selective constraint. In contrast, we discovered an enrichment of sex-specific regulatory variants on the X Chromosome. To resolve the molecular mechanisms underlying such effects, we generated chromatin accessibility data through ATAC-sequencing to connect sex-specific chromatin accessibility to sex-specific patterns of expression and regulatory variation. As sex-specific regulatory variants discovered in our study can inform sex differences in heritable disease prevalence, we integrated our data with genome-wide association study data for multiple immune traits identifying several traits with significant sex biases in genetic susceptibilities. Together, our study provides genome-wide insight into how genetic variation, the X Chromosome, and sex shape human gene regulation and disease.
European Journal of Human Genetics | 2014
Petra E A Huijts; Antoinette Hollestelle; Brunilda Balliu; Jeanine J. Houwing-Duistermaat; Caro M. Meijers; Jannet Blom; Bahar Ozturk; Elly M. M. Krol-Warmerdam; Juul T. Wijnen; Els M. J. J. Berns; John W.M. Martens; Caroline Seynaeve; Lambertus A. Kiemeney; Henricus F. M. van der Heijden; Rob A. E. M. Tollenaar; Peter Devilee; Christi J. van Asperen
The 1100delC mutation in the CHEK2 gene has a carrier frequency of up to 1.5% in individuals from North-West Europe. Women heterozygous for 1100delC have an increased breast cancer risk (odds ratio 2.7). To explore the prevalence and clinical consequences of 1100delC homozygosity in the Netherlands, we genotyped a sporadic breast cancer hospital-based cohort, a group of non-BRCA1/2 breast cancer families, and breast tumors from a tumor tissue bank. Three 1100delC homozygous patients were found in the cohort of 1434 sporadic breast cancer patients, suggesting an increased breast cancer risk for 1100delC homozygotes (odds ratio 3.4, 95% confidence interval 0.4–32.6, P=0.3). Another 1100delC homozygote was found in 592 individuals from 108 non-BRCA1/2 breast cancer families, and two more were found after testing 1706 breast tumors and confirming homozygosity on their wild-type DNA. Follow-up data was available for five homozygous patients, and remarkably, three of them had developed contralateral breast cancer. A possible relationship between 1100delC and lung cancer risk was investigated in 457 unrelated lung cancer patients but could not be confirmed. Due to the small number of 1100delC homozygotes identified, the breast cancer risk estimate associated with this genotype had limited accuracy but is probably higher than the risk in heterozygous females. Screening for CHEK2 1100delC could be beneficial in countries with a relatively high allele frequency.
Genetic Epidemiology | 2015
Brunilda Balliu; Roula Tsonaka; Stefan Boehringer; Jeanine J. Houwing-Duistermaat
Integrative omics, the joint analysis of outcome and multiple types of omics data, such as genomics, epigenomics, and transcriptomics data, constitute a promising approach for powerful and biologically relevant association studies. These studies often employ a case‐control design, and often include nonomics covariates, such as age and gender, that may modify the underlying omics risk factors. An open question is how to best integrate multiple omics and nonomics information to maximize statistical power in case‐control studies that ascertain individuals based on the phenotype. Recent work on integrative omics have used prospective approaches, modeling case‐control status conditional on omics, and nonomics risk factors. Compared to univariate approaches, jointly analyzing multiple risk factors with a prospective approach increases power in nonascertained cohorts. However, these prospective approaches often lose power in case‐control studies. In this article, we propose a novel statistical method for integrating multiple omics and nonomics factors in case‐control association studies. Our method is based on a retrospective likelihood function that models the joint distribution of omics and nonomics factors conditional on case‐control status. The new method provides accurate control of Type I error rate and has increased efficiency over prospective approaches in both simulated and real data.
PLOS ONE | 2014
Brunilda Balliu; Rolf P. Würtz; Bernhard Horsthemke; Dagmar Wieczorek; Stefan Böhringer
Data transformations prior to analysis may be beneficial in classification tasks. In this article we investigate a set of such transformations on 2D graph-data derived from facial images and their effect on classification accuracy in a high-dimensional setting. These transformations are low-variance in the sense that each involves only a fixed small number of input features. We show that classification accuracy can be improved when penalized regression techniques are employed, as compared to a principal component analysis (PCA) pre-processing step. In our data example classification accuracy improves from 47% to 62% when switching from PCA to penalized regression. A second goal is to visualize the resulting classifiers. We develop importance plots highlighting the influence of coordinates in the original 2D space. Features used for classification are mapped to coordinates in the original images and combined into an importance measure for each pixel. These plots assist in assessing plausibility of classifiers, interpretation of classifiers, and determination of the relative importance of different features.
Genetic Epidemiology | 2014
Han Chen; Dörthe Malzahn; Brunilda Balliu; Cong Li; Julia N. Bailey
With the advance of next‐generation sequencing technologies in recent years, rare genetic variant data have now become available for genetic epidemiology studies. For family samples, however, only a few statistical methods for association analysis of rare genetic variants have been developed. Rare variant approaches are of great interest, particularly for family data, because samples enriched for trait‐relevant variants can be ascertained and rare variants are putatively enriched through segregation. To facilitate the evaluation of existing and new rare variant testing approaches for analyzing family data, Genetic Analysis Workshop 18 (GAW18) provided genotype and next‐generation sequencing data and longitudinal blood pressure traits from extended pedigrees of Mexican American families from the San Antonio Family Study. Our GAW18 group members analyzed real and simulated phenotype data from GAW18 by using generalized linear mixed‐effects models or principal components to adjust for familial correlation or by testing binary traits using a correction factor for familial effects. With one exception, approaches dealt with the extended pedigrees in their original state using information based on the kinship matrix or alternative genetic similarity measures. For simulated data our group demonstrated that the family‐based kernel machine score test is superior in power to family‐based single‐marker or burden tests, except in a few specific scenarios. For real data three contributions identified significant associations. They substantially reduced the number of tests before performing the association analysis. We conclude from our real data analyses that further development of strategies for targeted testing or more focused screening of genetic variants is strongly desirable.
BMC Proceedings | 2014
Brunilda Balliu; Hae-Won Uh; Roula Tsonaka; Stefan Boehringer; Quinta Helmer; Jeanine J. Houwing-Duistermaat
In this analysis, we investigate the contributions that linkage-based methods, such as identical-by-descent mapping, can make to association mapping to identify rare variants in next-generation sequencing data. First, we identify regions in which cases share more segments identical-by-descent around a putative causal variant than do controls. Second, we use a two-stage mixed-effect model approach to summarize the single-nucleotide polymorphism data within each region and include them as covariates in the model for the phenotype. We assess the impact of linkage disequilibrium in determining identical-by-descent states between individuals by using markers with and without linkage disequilibrium for the first part and the impact of imputation in testing for association by using imputed genome-wide association studies or raw sequence markers for the second part. We apply the method to next-generation sequencing longitudinal family data from Genetic Association Workshop 18 and identify a significant region at chromosome 3: 40249244-41025167 (p-value = 2.3 × 10−3).
Genetic Epidemiology | 2012
Brunilda Balliu; Roula Tsonaka; Diane van der Woude; Stefan Boehringer; Jeanine J. Houwing-Duistermaat
It is hypothesized that certain alleles can have a protective effect not only when inherited by the offspring but also as noninherited maternal antigens (NIMA). To estimate the NIMA effect, large samples of families are needed. When large samples are not available, we propose a combined approach to estimate the NIMA effect from ascertained nuclear families and twin pairs. We develop a likelihood‐based approach allowing for several ascertainment schemes, to accommodate for the outcome‐dependent sampling scheme, and a family‐specific random term, to take into account the correlation between family members. We estimate the parameters using maximum likelihood based on the combined joint likelihood ( CJL ) approach. Simulations show that the CJL is more efficient for estimating the NIMA odds ratios as compared to a families‐only approach. To illustrate our approach, we used data from a family and a twin study from the United Kingdom on rheumatoid arthritis, and confirmed the protective NIMA effect, with an odds ratio of 0.477 (95% CI 0.264–0.864).
Genetics | 2016
Brunilda Balliu; Noah Zaitlen
Epistasis plays a significant role in the genetic architecture of many complex phenotypes in model organisms. To date, there have been very few interactions replicated in human studies due in part to the multiple-hypothesis burden implicit in genome-wide tests of epistasis. Therefore, it is of paramount importance to develop the most powerful tests possible for detecting interactions. In this work we develop a new SNP–SNP interaction test for use in case-only trio studies called the trio correlation (TC) test. The TC test computes the expected joint distribution of marker pairs in offspring conditional on parental genotypes. This distribution is then incorporated into a standard 1 d.f. correlation test of interaction. We show via extensive simulations under a variety of disease models that our test substantially outperforms existing tests of interaction in case-only trio studies. We also demonstrate a bias in a previous case-only trio interaction test and identify its origin. Finally, we show that a previously proposed permutation scheme in trio studies mitigates the known biases of case-only tests in the presence of population stratification. We conclude that the TC test shows improved power to identify interactions in existing, as well as emerging, trio association studies. The method is publicly available at www.github.com/BrunildaBalliu/TrioEpi.
bioRxiv | 2015
Brunilda Balliu; Noah Zaitlen
Epistasis plays a significant role in the genetic architecture of many complex phenotypes in model organisms. To date, there have been very few interactions replicated in human studies due in part to the multiple hypothesis burden implicit in genome-wide tests of epistasis. Therefore, it is of paramount importance to develop the most powerful tests possible for detecting interactions. In this work we develop a new gene-gene interaction test for use in trio studies called the trio correlation (TC) test. The TC test computes the expected joint distribution of marker pairs in offspring conditional on parental genotypes. This distribution is then incorporated into a standard one degree of freedom correlation test of interaction. We show via extensive simulations that our test substantially outperforms existing tests of interaction in trio studies. The gain in power under standard models of phenotype is large, with previous tests requiring more than twice the number of trios to obtain the power of our test. We also demonstrate a bias in a previous trio interaction test and identify its origin. We conclude that the TC test shows improved power to identify interactions in existing, as well as emerging, trio association studies. The method is publicly available at www.github.com/BrunildaBalliu/TrioEpi.