Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Andrew B. Nobel is active.

Publication


Featured researches published by Andrew B. Nobel.


Proceedings of the National Academy of Sciences of the United States of America | 2003

Repeated observation of breast tumor subtypes in independent gene expression data sets

Therese Sørlie; Robert Tibshirani; Joel S. Parker; Trevor Hastie; J. S. Marron; Andrew B. Nobel; Shibing Deng; Hilde Johnsen; Robert Pesich; Stephanie Geisler; Janos Demeter; Charles M. Perou; Per Eystein Lønning; Patrick O. Brown; Anne Lise Børresen-Dale; David Botstein

Characteristic patterns of gene expression measured by DNA microarrays have been used to classify tumors into clinically relevant subgroups. In this study, we have refined the previously defined subtypes of breast tumors that could be distinguished by their distinct patterns of gene expression. A total of 115 malignant breast tumors were analyzed by hierarchical clustering based on patterns of expression of 534 “intrinsic” genes and shown to subdivide into one basal-like, one ERBB2-overexpressing, two luminal-like, and one normal breast tissue-like subgroup. The genes used for classification were selected based on their similar expression levels between pairs of consecutive samples taken from the same tumor separated by 15 weeks of neoadjuvant treatment. Similar cluster analyses of two published, independent data sets representing different patient cohorts from different laboratories, uncovered some of the same breast cancer subtypes. In the one data set that included information on time to development of distant metastasis, subtypes were associated with significant differences in this clinical feature. By including a group of tumors from BRCA1 carriers in the analysis, we found that this genotype predisposes to the basal tumor subtype. Our results strongly support the idea that many of these breast tumor subtypes represent biologically distinct disease entities.


Bioinformatics | 2005

Significance analysis of functional categories in gene expression studies: a structured permutation approach

William T. Barry; Andrew B. Nobel; Fred A. Wright

MOTIVATION In high-throughput genomic and proteomic experiments, investigators monitor expression across a set of experimental conditions. To gain an understanding of broader biological phenomena, researchers have until recently been limited to post hoc analyses of significant gene lists. METHOD We describe a general framework, significance analysis of function and expression (SAFE), for conducting valid tests of gene categories ab initio. SAFE is a two-stage, permutation-based method that can be applied to various experimental designs, accounts for the unknown correlation among genes and enables permutation-based estimation of error rates. RESULTS The utility and flexibility of SAFE is illustrated with a microarray dataset of human lung carcinomas and gene categories based on Gene Ontology and the Protein Family database. Significant gene categories were observed in comparisons of (1) tumor versus normal tissue, (2) multiple tumor subtypes and (3) survival times. AVAILABILITY Code to implement SAFE in the statistical package R is available from the authors. SUPPLEMENTARY INFORMATION http://www.bios.unc.edu/~fwright/SAFE.


Genome Biology | 2005

ChIPOTle: a user-friendly tool for the analysis of ChIP-chip data

Michael J. Buck; Andrew B. Nobel; Jason D. Lieb

ChIPOTle (Chromatin ImmunoPrecipitation On Tiled arrays) takes advantage of two unique properties of ChIP-chip data: the single-tailed nature of the data, caused by specific enrichment but not specific depletion of genomic fragments; and the predictable enrichment of DNA fragments adjacent to sites of direct protein-DNA interaction. Implemented as a Microsoft Excel macro written in Visual Basic, ChIPOTle uses a sliding window approach that yields improvements in the identification of bona fide sites of protein-DNA interaction.ChIPOTle (Chromatin ImmunoPrecipitation On Tiled arrays) takes advantage of two unique properties of ChIP-chip data: the single-tailed nature of the data, caused by specific enrichment but not specific depletion of genomic fragments; and the predictable enrichment of DNA fragments adjacent to sites of direct protein-DNA interaction. Implemented as a Microsoft Excel macro written in Visual Basic, ChIPOTle uses a sliding window approach that yields improvements in the identification of bona fide sites of protein-DNA interaction.


Bioinformatics | 2008

Merging two gene-expression studies via cross-platform normalization

Andrey A. Shabalin; Håkon Tjelmeland; Cheng Fan; Charles M. Perou; Andrew B. Nobel

MOTIVATION Gene-expression microarrays are currently being applied in a variety of biomedical applications. This article considers the problem of how to merge datasets arising from different gene-expression studies of a common organism and phenotype. Of particular interest is how to merge data from different technological platforms. RESULTS The article makes two contributions to the problem. The first is a simple cross-study normalization method, which is based on linked gene/sample clustering of the given datasets. The second is the introduction and description of several general validation measures that can be used to assess and compare cross-study normalization methods. The proposed normalization method is applied to three existing breast cancer datasets, and is compared to several competing normalization methods using the proposed validation measures. AVAILABILITY The supplementary materials and XPN Matlab code are publicly available at website: https://genome.unc.edu/xpn


Journal of the American Statistical Association | 2008

Statistical Significance of Clustering for High-Dimension, Low–Sample Size Data

Yufeng Liu; David Neil Hayes; Andrew B. Nobel; J. S. Marron

Clustering methods provide a powerful tool for the exploratory analysis of high-dimension, low–sample size (HDLSS) data sets, such as gene expression microarray data. A fundamental statistical issue in clustering is which clusters are “really there,” as opposed to being artifacts of the natural sampling variation. We propose SigClust as a simple and natural approach to this fundamental statistical problem. In particular, we define a cluster as data coming from a single Gaussian distribution and formulate the problem of assessing statistical significance of clustering as a testing procedure. This Gaussian null assumption allows direct formulation of p values that effectively quantify the significance of a given clustering. HDLSS covariance estimation for SigClust is achieved by a combination of invariance principles, together with a factor analysis model. The properties of SigClust are studied. Simulated examples, as well as an application to a real cancer microarray data set, show that the proposed method works remarkably well for assessing significance of clustering. Some theoretical results also are obtained.


Molecular Cancer Therapeutics | 2006

Gene expression profiles do not consistently predict the clinical treatment response in locally advanced breast cancer

Therese Sørlie; Charles M. Perou; Cheng Fan; Stephanie Geisler; Turid Aas; Andrew B. Nobel; Gun Anker; Lars A. Akslen; David Botstein; Anne Lise Børresen-Dale; Per Eystein Lønning

Neoadjuvant treatment offers an opportunity to correlate molecular variables to treatment response and to explore mechanisms of drug resistance in vivo. Here, we present a statistical analysis of large-scale gene expression patterns and their relationship to response following neoadjuvant chemotherapy in locally advanced breast cancers. We analyzed cDNA expression data from 81 tumors from two patient series, one treated with doxorubicin alone (51) and the other treated with 5-fluorouracil and mitomycin (30), and both were previously studied for correlations between TP53 status and response to therapy. We observed a low frequency of progressive disease within the luminal A subtype from both series (2 of 36 versus 13 of 45 patients; P = 0.0089) and a high frequency of progressive disease among patients with luminal B type tumors treated with doxorubicin (5 of 8 patients; P = 0.0078); however, aside from these two observations, no other consistent associations between response to chemotherapy and tumor subtype were observed. These specific associations could possibly be explained by covariance with TP53 mutation status, which also correlated with tumor subtype. Using supervised analysis, we could not uncover a gene profile that could reliably (>70% accuracy and specificity) predict response to either treatment regimen. [Mol Cancer Ther 2006;5(11):2914–8]


The Annals of Applied Statistics | 2009

FINDING LARGE AVERAGE SUBMATRICES IN HIGH DIMENSIONAL DATA

Andrey A. Shabalin; Victor J. Weigman; Charles M. Perou; Andrew B. Nobel

The search for sample-variable associations is an important problem in the exploratory analysis of high dimensional data. Biclustering methods search for sample-variable associations in the form of distinguished submatrices of the data matrix. (The rows and columns of a submatrix need not be contiguous.) In this paper we propose and evaluate a statistically motivated biclustering procedure (LAS) that finds large average submatrices within a given real-valued data matrix. The procedure operates in an iterative-residual fashion, and is driven by a Bonferroni-based significance score that effectively trades off between submatrix size and average value. We examine the performance and potential utility of LAS, and compare it with a number of existing methods, through an extensive three-part validation study using two gene expression datasets. The validation study examines quantitative properties of biclusters, biological and clinical assessments using auxiliary information, and classification of disease subtypes using bicluster membership. In addition, we carry out a simulation study to assess the effectiveness and noise sensitivity of the LAS search procedure. These results suggest that LAS is an effective exploratory tool for the discovery of biologically relevant structures in high dimensional data. Software is available at https://genome.unc.edu/las/.


The Annals of Applied Statistics | 2013

Joint and individual variation explained (JIVE) for integrated analysis of multiple data types

Eric F. Lock; Katherine A. Hoadley; J. S. Marron; Andrew B. Nobel

Research in several fields now requires the analysis of datasets in which multiple high-dimensional types of data are available for a common set of objects. In particular, The Cancer Genome Atlas (TCGA) includes data from several diverse genomic technologies on the same cancerous tumor samples. In this paper we introduce Joint and Individual Variation Explained (JIVE), a general decomposition of variation for the integrated analysis of such datasets. The decomposition consists of three terms: a low-rank approximation capturing joint variation across data types, low-rank approximations for structured variation individual to each data type, and residual noise. JIVE quantifies the amount of joint variation between data types, reduces the dimensionality of the data, and provides new directions for the visual exploration of joint and individual structure. The proposed method represents an extension of Principal Component Analysis and has clear advantages over popular two-block methods such as Canonical Correlation Analysis and Partial Least Squares. A JIVE analysis of gene expression and miRNA data on Glioblastoma Multiforme tumor samples reveals gene-miRNA associations and provides better characterization of tumor types.


Breast Cancer Research and Treatment | 2012

Basal-like Breast cancer DNA copy number losses identify genes involved in genomic instability, response to therapy, and patient survival

Victor J. Weigman; Hann Hsiang Chao; Andrey A. Shabalin; Xiaping He; Joel S. Parker; Silje H. Nordgard; Tatyana A. Grushko; Dezheng Huo; Chika Nwachukwu; Andrew B. Nobel; Vessela N. Kristensen; Anne Lise Børresen-Dale; Olufunmilayo I. Olopade; Charles M. Perou

Breast cancer is a heterogeneous disease with known expression-defined tumor subtypes. DNA copy number studies have suggested that tumors within gene expression subtypes share similar DNA Copy number aberrations (CNA) and that CNA can be used to further sub-divide expression classes. To gain further insights into the etiologies of the intrinsic subtypes, we classified tumors according to gene expression subtype and next identified subtype-associated CNA using a novel method called SWITCHdna, using a training set of 180 tumors and a validation set of 359 tumors. Fisher’s exact tests, Chi-square approximations, and Wilcoxon rank-sum tests were performed to evaluate differences in CNA by subtype. To assess the functional significance of loss of a specific chromosomal region, individual genes were knocked down by shRNA and drug sensitivity, and DNA repair foci assays performed. Most tumor subtypes exhibited specific CNA. The Basal-like subtype was the most distinct with common losses of the regions containing RB1, BRCA1, INPP4B, and the greatest overall genomic instability. One Basal-like subtype-associated CNA was loss of 5q11–35, which contains at least three genes important for BRCA1-dependent DNA repair (RAD17, RAD50, and RAP80); these genes were predominantly lost as a pair, or all three simultaneously. Loss of two or three of these genes was associated with significantly increased genomic instability and poor patient survival. RNAi knockdown of RAD17, or RAD17/RAD50, in immortalized human mammary epithelial cell lines caused increased sensitivity to a PARP inhibitor and carboplatin, and inhibited BRCA1 foci formation in response to DNA damage. These data suggest a possible genetic cause for genomic instability in Basal-like breast cancers and a biological rationale for the use of DNA repair inhibitor related therapeutics in this breast cancer subtype.


Journal of Multivariate Analysis | 2013

Reconstruction of a low-rank matrix in the presence of Gaussian noise

Andrey A. Shabalin; Andrew B. Nobel

This paper addresses the problem of reconstructing a low-rank signal matrix observed with additive Gaussian noise. We first establish that, under mild assumptions, one can restrict attention to orthogonally equivariant reconstruction methods, which act only on the singular values of the observed matrix and do not affect its singular vectors. Using recent results in random matrix theory, we then propose a new reconstruction method that aims to reverse the effect of the noise on the singular value decomposition of the signal matrix. In conjunction with the proposed reconstruction method we also introduce a Kolmogorov–Smirnov based estimator of the noise variance.

Collaboration


Dive into the Andrew B. Nobel's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Fred A. Wright

North Carolina State University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

J. S. Marron

University of North Carolina at Chapel Hill

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Shankar Bhamidi

University of North Carolina at Chapel Hill

View shared research outputs
Top Co-Authors

Avatar

John Palowitch

University of North Carolina at Chapel Hill

View shared research outputs
Top Co-Authors

Avatar

Philip S. Bernard

University of North Carolina at Chapel Hill

View shared research outputs
Top Co-Authors

Avatar

Cheng Fan

University of North Carolina at Chapel Hill

View shared research outputs
Researchain Logo
Decentralizing Knowledge