Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Thomas W. Blackwell is active.

Publication


Featured researches published by Thomas W. Blackwell.


Proceedings of the National Academy of Sciences of the United States of America | 2008

Quantifying changes in the thiol redox proteome upon oxidative stress in vivo

Lars I. Leichert; Florian Gehrke; Harini V. Gudiseva; Thomas W. Blackwell; Marianne Ilbert; Angela K. Walker; John R. Strahler; Philip C. Andrews; Ursula Jakob

Antimicrobial levels of reactive oxygen species (ROS) are produced by the mammalian host defense to kill invading bacteria and limit bacterial colonization. One main in vivo target of ROS is the thiol group of proteins. We have developed a quantitative thiol trapping technique termed OxICAT to identify physiologically important target proteins of hydrogen peroxide (H2O2) and hypochlorite (NaOCl) stress in vivo. OxICAT allows the precise quantification of oxidative thiol modifications in hundreds of different proteins in a single experiment. It also identifies the affected proteins and defines their redox-sensitive cysteine(s). Using this technique, we identified a group of Escherichia coli proteins with significantly (30–90%) oxidatively modified thiol groups, which appear to be specifically sensitive to either H2O2 or NaOCl stress. These results indicate that individual oxidants target distinct proteins in vivo. Conditionally essential E. coli genes encode one-third of redox-sensitive proteins, a finding that might explain the bacteriostatic effect of oxidative stress treatment. We identified a select group of redox-regulated proteins, which protect E. coli against oxidative stress conditions. These experiments illustrate that OxICAT, which can be used in a variety of different cell types and organisms, is a powerful tool to identify, quantify, and monitor oxidative thiol modifications in vivo.


Nature Biotechnology | 2006

Challenges in deriving high-confidence protein identifications from data gathered by a HUPO plasma proteome collaborative study

David J. States; Gilbert S. Omenn; Thomas W. Blackwell; Damian Fermin; Jimmy K. Eng; David W. Speicher; Samir M. Hanash

The Human Proteome Organization (HUPO) recently completed the first large-scale collaborative study to characterize the human serum and plasma proteomes. The study was carried out in different locations and used diverse methods and instruments to compare and integrate tandem mass spectrometry (MS/MS) data on aliquots of pooled serum and plasma from healthy subjects. Liquid chromatography (LC)-MS/MS data sets from 18 laboratories were matched to the International Protein Index database, and an initial integration exercise resulted in 9,504 proteins identified with one or more peptides, and 3,020 proteins identified with two or more peptides. This article uses a rigorous statistical approach to take into account the length of coding regions in genes, and multiple hypothesis-testing techniques. On this basis, we now present a reduced set of 889 proteins identified with a confidence level of at least 95%. We also discuss the importance of such an integrated analysis in providing an accurate representation of a proteome as well as the value such data sets contain for the high-confidence identification of protein matches to novel exons, some of which may be localized in alternatively spliced forms of known plasma proteins and some in previously nonannotated gene sequences.


Genome Biology | 2011

The functional spectrum of low-frequency coding variation.

Gabor T. Marth; Fuli Yu; Amit Indap; Kiran Garimella; Simon Gravel; Wen Fung Leong; Chris Tyler-Smith; Matthew N. Bainbridge; Thomas W. Blackwell; Xiangqun Zheng-Bradley; Yuan Chen; Danny Challis; Laura Clarke; Edward V. Ball; Kristian Cibulskis; David Neil Cooper; Bob Fulton; Chris Hartl; Dan Koboldt; Donna M. Muzny; Richard Smith; Carrie Sougnez; Chip Stewart; Alistair Ward; Jin Yu; Yali Xue; David Altshuler; Carlos Bustamante; Andrew G. Clark; Mark J. Daly

BackgroundRare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency.ResultsThe 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants.ConclusionsThis study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variation.


Genome Biology | 2006

Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics

Damian Fermin; Baxter B. Allen; Thomas W. Blackwell; Rajasree Menon; Marcin Adamski; Yin Xu; Peter J. Ulintz; Gilbert S. Omenn; David J. States

BackgroundDefining the location of genes and the precise nature of gene products remains a fundamental challenge in genome annotation. Interrogating tandem mass spectrometry data using genomic sequence provides an unbiased method to identify novel translation products. A six-frame translation of the entire human genome was used as the query database to search for novel blood proteins in the data from the Human Proteome Organization Plasma Proteome Project. Because this target database is orders of magnitude larger than the databases traditionally employed in tandem mass spectra analysis, careful attention to significance testing is required. Confidence of identification is assessed using our previously described Poisson statistic, which estimates the significance of multi-peptide identifications incorporating the length of the matching sequence, number of spectra searched and size of the target sequence database.ResultsApplying a false discovery rate threshold of 0.05, we identified 282 significant open reading frames, each containing two or more peptide matches. There were 627 novel peptides associated with these open reading frames that mapped to a unique genomic coordinate placed within the start/stop points of previously annotated genes. These peptides matched 1,110 distinct tandem MS spectra. Peptides fell into four categories based upon where their genomic coordinates placed them relative to annotated exons within the parent gene.ConclusionThis work provides evidence for novel alternative splice variants in many previously annotated genes. These findings suggest that annotation of the genome is not yet complete and that proteomics has the potential to further add to our understanding of gene structures.


BMC Proceedings | 2014

Data for Genetic Analysis Workshop 18: human whole genome sequence, blood pressure, and simulated phenotypes in extended pedigrees

Laura Almasy; Thomas D. Dyer; Juan Manuel Peralta; Goo Jun; Andrew R. Wood; Christian Fuchsberger; Marcio Almeida; Jack W. Kent; Sharon P. Fowler; Thomas W. Blackwell; Sobha Puppala; Satish Kumar; Joanne E. Curran; Donna M. Lehman; Gonçalo R. Abecasis; Ravindranath Duggirala; John Blangero

Genetic Analysis Workshop 18 (GAW18) focused on identification of genes and functional variants that influence complex phenotypes in human sequence data. Data for the workshop were donated by the T2D-GENES Consortium and included whole genome sequences for odd-numbered autosomes in 464 key individuals selected from 20 Mexican American families, a dense set of single-nucleotide polymorphisms in 959 individuals in these families, and longitudinal data on systolic and diastolic blood pressure measured at 1-4 examinations over a period of 20 years. Simulated phenotypes were generated based on the real sequence data and pedigree structures. In the design of the simulation model, gene expression measures from the San Antonio Family Heart Study (not distributed as part of the GAW18 data) were used to identify genes whose mRNA levels were correlated with blood pressure. Observed variants within these genes were designated as functional in the GAW18 simulation if they were nonsynonymous and predicted to have deleterious effects on protein function or if they were noncoding and associated with mRNA levels. Two simulated longitudinal phenotypes were modeled to have the same trait distributions as the real systolic and diastolic blood pressure data, with effects of age, sex, and medication use, including a genotype-medication interaction. For each phenotype, more than 1000 sequence variants in more than 200 genes present on the odd-numbered autosomes individually explained less than 0.01-2.78% of phenotypic variance. Cumulatively, variants in the most influential gene explained 7.79% of trait variance. An additional simulated phenotype, Q1, was designed to be correlated among family members but to not be associated with any sequence variants. Two hundred replicates of the phenotypes were simulated, with each including data for 849 individuals.


Genetic Epidemiology | 2013

Recommended joint and meta-analysis strategies for case-control association testing of single low-count variants.

Clement Ma; Thomas W. Blackwell; Michael Boehnke; Laura J. Scott

In genome‐wide association studies of binary traits, investigators typically use logistic regression to test common variants for disease association within studies, and combine association results across studies using meta‐analysis. For common variants, logistic regression tests are well calibrated, and meta‐analysis of study‐specific association results is only slightly less powerful than joint analysis of the combined individual‐level data. In recent sequencing and dense chip based association studies, investigators increasingly test low‐frequency variants for disease association. In this paper, we seek to (1) identify the association test with maximal power among tests with well controlled type I error rate and (2) compare the relative power of joint and meta‐analysis tests. We use analytic calculation and simulation to compare the empirical type I error rate and power of four logistic regression based tests: Wald, score, likelihood ratio, and Firth bias‐corrected. We demonstrate for low‐count variants (roughly minor allele count [MAC] < 400) that: (1) for joint analysis, the Firth test has the best combination of type I error and power; (2) for meta‐analysis of balanced studies (equal numbers of cases and controls), the score test is best, but is less powerful than Firth test based joint analysis; and (3) for meta‐analysis of sufficiently unbalanced studies, all four tests can be anti‐conservative, particularly the score test. We also establish MAC as the key parameter determining test calibration for joint and meta‐analysis.


PLOS Computational Biology | 2007

Integration of Genome and Chromatin Structure with Gene Expression Profiles To Predict c-MYC Recognition Site Binding and Function

Yili Chen; Thomas W. Blackwell; Ji Chen; Jing Gao; Angel W. Lee; David J. States

The MYC genes encode nuclear sequence specific–binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c-MYC recognition sites can be identified in c-MYC responsive genes, other motif matches—even experimentally confirmed sites—are associated with genes showing no c-MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c-MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c-MYC targets. We assess the performance of our prediction results by comparing them with the c-myc targets identified in the biomedical literature. In total, we predict 460 likely c-MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c-MYC targets and suggests many novel sites. Our findings suggest that to identify c-MYC genomic targets, integration of different data sources helps to improve the accuracy.


Archive | 2007

The Human Plasma and Serum Proteome

Gilbert S. Omenn; Rajasree Menon; Marcin Adamski; Thomas W. Blackwell; Brian B. Haab; Weimin Gao; David J. States

Human plasma and serum are the preferred specimens for noninvasive studies of normal and disease-associated proteins in the circulation and arising from cells throughout the body. The attributes of extreme complexity, very wide dynamic range, genetic and physiological variation, endogenous and ex vivo modifications, and incompleteness of sampling by mass spectrometry all represent major challenges to reproducible, high-resolution, high-throughput analyses of the plasma proteome. This chapter summarizes the major reports to date identifying proteins in normal individuals and identifies paths to increased use of proteomics methods with human specimens for biomarker discovery and application in various diseases.


Proceedings of the National Academy of Sciences of the United States of America | 2018

Evaluating the contribution of rare variants to type 2 diabetes and related traits using pedigrees

Goo Jun; Alisa K. Manning; Marcio Almeida; Matthew Zawistowski; Andrew R. Wood; Tanya M. Teslovich; Christian Fuchsberger; Shuang Feng; Pablo Cingolani; Kyle J. Gaulton; Thomas D. Dyer; Thomas W. Blackwell; Han Chen; Peter S. Chines; Sungkyoung Choi; Claire Churchhouse; Pierre Fontanillas; Ryan King; Sungyoung Lee; Stephen E. Lincoln; Vasily Trubetskoy; Mark A. DePristo; Tasha E. Fingerlin; Robert L. Grossman; Jason Grundstad; A. C. Heath; Jayoun Kim; Young-Jin Kim; Jason M. Laramie; Jae-Hoon Lee

Significance Contributions of rare variants to common and complex traits such as type 2 diabetes (T2D) are difficult to measure. This paper describes our results from deep whole-genome analysis of large Mexican-American pedigrees to understand the role of rare-sequence variations in T2D and related traits through enriched allele counts in pedigrees. Our study design was well-powered to detect association of rare variants if rare variants with large effects collectively accounted for large portions of risk variability, but our results did not identify such variants in this sample. We further quantified the contributions of common and rare variants in gene expression profiles and concluded that rare expression quantitative trait loci explain a substantive, but minor, portion of expression heritability. A major challenge in evaluating the contribution of rare variants to complex disease is identifying enough copies of the rare alleles to permit informative statistical analysis. To investigate the contribution of rare variants to the risk of type 2 diabetes (T2D) and related traits, we performed deep whole-genome analysis of 1,034 members of 20 large Mexican-American families with high prevalence of T2D. If rare variants of large effect accounted for much of the diabetes risk in these families, our experiment was powered to detect association. Using gene expression data on 21,677 transcripts for 643 pedigree members, we identified evidence for large-effect rare-variant cis-expression quantitative trait loci that could not be detected in population studies, validating our approach. However, we did not identify any rare variants of large effect associated with T2D, or the related traits of fasting glucose and insulin, suggesting that large-effect rare variants account for only a modest fraction of the genetic risk of these traits in this sample of families. Reliable identification of large-effect rare variants will require larger samples of extended pedigrees or different study designs that further enrich for such variants.


Proteomics | 2005

Overview of the HUPO Plasma Proteome Project: Results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database

Gilbert S. Omenn; David J. States; Marcin Adamski; Thomas W. Blackwell; Rajasree Menon; Henning Hermjakob; Rolf Apweiler; Brian B. Haab; Richard J. Simpson; James S. Eddes; Eugene A. Kapp; Robert L. Moritz; Daniel W. Chan; Alex J. Rai; Arie Admon; Ruedi Aebersold; Jimmy K. Eng; William S. Hancock; Stanley A. Hefta; Helmut E. Meyer; Young-Ki Paik; Jong Shin Yoo; Peipei Ping; Joel G. Pounds; Joshua N. Adkins; Xiaohong Qian; Rong Wang; Valerie C. Wasinger; Chi Yue Wu; Xiaohang Zhao

Collaboration


Dive into the Thomas W. Blackwell's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Goo Jun

University of Texas Health Science Center at Houston

View shared research outputs
Top Co-Authors

Avatar

Marcio Almeida

Texas Biomedical Research Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Thomas D. Dyer

University of Texas at Austin

View shared research outputs
Researchain Logo
Decentralizing Knowledge