Adam M. Gustafson
Boston University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Adam M. Gustafson.
PLOS Biology | 2004
Zhixin Xie; Lisa K. Johansen; Adam M. Gustafson; Kristin D. Kasschau; Andrew D Lellis; Daniel Zilberman; Steven E. Jacobsen; James C. Carrington
Multicellular eukaryotes produce small RNA molecules (approximately 21–24 nucleotides) of two general types, microRNA (miRNA) and short interfering RNA (siRNA). They collectively function as sequence-specific guides to silence or regulate genes, transposons, and viruses and to modify chromatin and genome structure. Formation or activity of small RNAs requires factors belonging to gene families that encode DICER (or DICER-LIKE [DCL]) and ARGONAUTE proteins and, in the case of some siRNAs, RNA-dependent RNA polymerase (RDR) proteins. Unlike many animals, plants encode multiple DCL and RDR proteins. Using a series of insertion mutants of Arabidopsis thaliana, unique functions for three DCL proteins in miRNA (DCL1), endogenous siRNA (DCL3), and viral siRNA (DCL2) biogenesis were identified. One RDR protein (RDR2) was required for all endogenous siRNAs analyzed. The loss of endogenous siRNA in dcl3 and rdr2 mutants was associated with loss of heterochromatic marks and increased transcript accumulation at some loci. Defects in siRNA-generation activity in response to turnip crinkle virus in dcl2 mutant plants correlated with increased virus susceptibility. We conclude that proliferation and diversification of DCL and RDR genes during evolution of plants contributed to specialization of small RNA-directed pathways for development, chromatin structure, and defense.
Nature Genetics | 2004
Edwards Allen; Zhixin Xie; Adam M. Gustafson; Gi-Ho Sung; Joseph W. Spatafora; James C. Carrington
MicroRNAs (miRNAs) in plants and animals function as post-transcriptional regulators of target genes, many of which are involved in multicellular development. miRNAs guide effector complexes to target mRNAs through base-pair complementarity, facilitating site-specific cleavage or translational repression. Biogenesis of miRNAs involves nucleolytic processing of a precursor transcript with extensive foldback structure. Here, we provide evidence that genes encoding miRNAs in plants originated by inverted duplication of target gene sequences. Several recently evolved genes encoding miRNAs in Arabidopsis thaliana and other small RNA–generating loci possess the hallmarks of inverted duplication events that formed the arms on each side of their respective foldback precursors. We propose a model for miRNA evolution that suggests a mechanism for de novo generation of new miRNA genes with unique target specificities.
Proceedings of the National Academy of Sciences of the United States of America | 2009
Frank Schembri; Sriram Sridhar; Catalina Perdomo; Adam M. Gustafson; Xiaoling Zhang; Ayla Ergun; Jining Lü; Gang Liu; Xiaohui Zhang; Jessica Bowers; Cyrus Vaziri; Kristen Ott; Kelly Sensinger; James J. Collins; Jerome S. Brody; Robert C. Getts; Marc E. Lenburg; Avrum Spira
We have shown that smoking impacts bronchial airway gene expression and that heterogeneity in this response associates with smoking-related disease risk. In this study, we sought to determine whether microRNAs (miRNAs) play a role in regulating the airway gene expression response to smoking. We examined whole-genome miRNA and mRNA expression in bronchial airway epithelium from current and never smokers (n = 20) and found 28 miRNAs to be differentially expressed (P < 0.05) with the majority being down-regulated in smokers. We further identified a number of mRNAs whose expression level is highly inversely correlated with miRNA expression in vivo. Many of these mRNAs contain potential binding sites for the differentially expressed miRNAs in their 3′-untranslated region (UTR) and are themselves affected by smoking. We found that either increasing or decreasing the levels of mir-218 (a miRNA that is strongly affected by smoking) in both primary bronchial epithelial cells and H1299 cells was sufficient to cause a corresponding decrease or increase in the expression of predicted mir-218 mRNA targets, respectively. Further, mir-218 expression is reduced in primary bronchial epithelium exposed to cigarette smoke condensate (CSC), and alteration of mir-218 levels in these cells diminishes the induction of the predicted mir-218 target MAFG in response to CSC. These data indicate that mir-218 levels modulate the airway epithelial gene expression response to cigarette smoke and support a role for miRNAs in regulating host response to environmental toxins.
Nucleic Acids Research | 2004
Adam M. Gustafson; Edwards Allen; Scott A. Givan; Daniel P. Smith; James C. Carrington; Kristin D. Kasschau
Eukaryotes produce functionally diverse classes of small RNAs (20–25 nt). These include microRNAs (miRNAs), which act as regulatory factors during growth and development, and short-interfering RNAs (siRNAs), which function in several epigenetic and post-transcriptional silencing systems. The Arabidopsis Small RNA Project (ASRP) seeks to characterize and functionally analyze the major classes of endogenous small RNAs in plants. The ASRP database provides a repository for sequences of small RNAs cloned from various Arabidopsis genotypes and tissues. Version 3.0 of the database contains 1920 unique sequences, with tools to assist in miRNA and siRNA identification and analysis. The comprehensive database is publicly available through a web interface at http://asrp.cgrb.oregonstate.edu.
Science Translational Medicine | 2010
Adam M. Gustafson; Raffaella Soldi; Christina Anderlind; Mary Beth Scholand; Xiaohui Zhang; Kendal G Cooper; Darren Walker; Annette McWilliams; Gang Liu; Eva Szabo; Jerome S. Brody; Pierre P. Massion; Marc E. Lenburg; Stephen Lam; Andrea Bild; Avrum Spira
A cancer-associated signaling pathway is reversibly activated in the normal airways of smokers before they develop lung cancer, presenting an opportunity for preventive therapy. An Ounce of Prevention for Lung Cancer Lung cancer takes a terrific toll on humankind. Despite our understanding of the contribution of tobacco smoke, this knowledge has not been able to reverse the global increase in lung cancer incidence. New approaches are needed. Is there a way to tell whether a smoker will develop cancer and, even more important, can we see when this process starts so we can stop it? Work from Gustafson and colleagues has defined a biochemical harbinger of cancer in seemingly normal respiratory tissue that can be reversed before cancer begins. Numerous cellular signaling pathways are deregulated in cancers, such as the Ras, p53, and phosphatidylinositol 3-kinase (PI3K) pathways. A molecular understanding of lung cancer may help to develop effective drugs for deterrence. To see whether they could find a predictor of impending cancer, the authors examined normal respiratory tract tissue from smokers with lung cancer or other abnormalities. By looking for previously determined gene expression signatures for various signaling pathways, they found that one of these pathways—PI3K—was clearly activated above normal values. Moreover, the PI3K pathway was already turned on in smokers with abnormal dysplastic lesions, precursors to lung cancer. Lung cancer cells themselves showed even higher expression of the genes in the PI3K pathway. Concluding that elevated PI3K pathway activity precedes the development of lung cancer, the authors assessed gene expression in tissue from patients with dysplasias who had been successfully treated with myo-inositol, an inhibitor of PI3K, finding effective down-regulation of the PI3K pathway. Treatment of cancers with surgery, radiation, and chemotherapy—or, in some cases, targeted molecular therapies—may be the standard of care at present. But prevention should surely be the ultimate goal. The new tool reported in this article—measurement of PI3K pathway activation—and the demonstration that this is an early and reversible step in lung tumorigenesis are hopeful signs. Although only a subset of smokers develop lung cancer, we cannot determine which smokers are at highest risk for cancer development, nor do we know the signaling pathways altered early in the process of tumorigenesis in these individuals. On the basis of the concept that cigarette smoke creates a molecular field of injury throughout the respiratory tract, this study explores oncogenic pathway deregulation in cytologically normal proximal airway epithelial cells of smokers at risk for lung cancer. We observed a significant increase in a genomic signature of phosphatidylinositol 3-kinase (PI3K) pathway activation in the cytologically normal bronchial airway of smokers with lung cancer and smokers with dysplastic lesions, suggesting that PI3K is activated in the proximal airway before tumorigenesis. Further, PI3K activity is decreased in the airway of high-risk smokers who had significant regression of dysplasia after treatment with the chemopreventive agent myo-inositol, and myo-inositol inhibits the PI3K pathway in vitro. These results suggest that deregulation of the PI3K pathway in the bronchial airway epithelium of smokers is an early, measurable, and reversible event in the development of lung cancer and that genomic profiling of these relatively accessible airway cells may enable personalized approaches to chemoprevention and therapy. Our work further suggests that additional lung cancer chemoprevention trials either targeting the PI3K pathway or measuring airway PI3K activation as an intermediate endpoint are warranted.
BMC Genomics | 2006
Adam M. Gustafson; Evan S. Snitkin; Stephen C.J. Parker; Charles DeLisi; Simon Kasif
BackgroundThe identification of genes essential for survival is of theoretical importance in the understanding of the minimal requirements for cellular life, and of practical importance in the identification of potential drug targets in novel pathogens. With the great time and expense required for experimental studies aimed at constructing a catalog of essential genes in a given organism, a computational approach which could identify essential genes with high accuracy would be of great value.ResultsWe gathered numerous features which could be generated automatically from genome sequence data and assessed their relationship to essentiality, and subsequently utilized machine learning to construct an integrated classifier of essential genes in both S. cerevisiae and E. coli. When looking at single features, phyletic retention, a measure of the number of organisms an ortholog is present in, was the most predictive of essentiality. Furthermore, during construction of our phyletic retention feature we for the first time explored the evolutionary relationship among the set of organisms in which the presence of a gene is most predictive of essentiality. We found that in both E. coli and S. cerevisiae the optimal sets always contain host-associated organisms with small genomes which are closely related to the reference. Using five optimally selected organisms, we were able to improve predictive accuracy as compared to using all available sequenced organisms. We hypothesize the predictive power of these genomes is a consequence of the process of reductive evolution, by which many parasites and symbionts evolved their gene content. In addition, essentiality is measured in rich media, a condition which resembles the environments of these organisms in their hosts where many nutrients are provided. Finally, we demonstrate that integration of our most highly predictive features using a probabilistic classifier resulted in accuracies surpassing any individual feature.ConclusionUsing features obtainable directly from sequence data, we were able to construct a classifier which can predict essential genes with high accuracy. Furthermore, our analysis of the set of genomes in which the presence of a gene is most predictive of essentiality may suggest ways in which targeted sequencing can be used in the identification of essential genes. In summary, the methods presented here can aid in the reduction of time and money invested in essential gene identification by targeting those genes for experimentation which are predicted as being essential with a high probability.
Breast Cancer Research | 2009
Andrea Bild; Joel S. Parker; Adam M. Gustafson; Chaitanya R. Acharya; Katherine A. Hoadley; Carey K. Anders; P. Kelly Marcom; Lisa A. Carey; Anil Potti; Joseph R. Nevins; Charles M. Perou
IntroductionPerhaps the major challenge in developing more effective therapeutic strategies for the treatment of breast cancer patients is confronting the heterogeneity of the disease, recognizing that breast cancer is not one disease but multiple disorders with distinct underlying mechanisms. Gene-expression profiling studies have been used to dissect this complexity, and our previous studies identified a series of intrinsic subtypes of breast cancer that define distinct populations of patients with respect to survival. Additional work has also used signatures of oncogenic pathway deregulation to dissect breast cancer heterogeneity as well as to suggest therapeutic opportunities linked to pathway activation.MethodsWe used genomic analyses to identify relations between breast cancer subtypes, pathway deregulation, and drug sensitivity. For these studies, we use three independent breast cancer gene-expression data sets to measure an individual tumor phenotype. Correlation between pathway status and subtype are examined and linked to predictions for response to conventional chemotherapies.ResultsWe reveal patterns of pathway activation characteristic of each molecular breast cancer subtype, including within the more aggressive subtypes in which novel therapeutic opportunities are critically needed. Whereas some oncogenic pathways have high correlations to breast cancer subtype (RAS, CTNNB1, p53, HER1), others have high variability of activity within a specific subtype (MYC, E2F3, SRC), reflecting biology independent of common clinical factors. Additionally, we combined these analyses with predictions of sensitivity to commonly used cytotoxic chemotherapies to provide additional opportunities for therapeutics specific to the intrinsic subtype that might be better aligned with the characteristics of the individual patient.ConclusionsGenomic analyses can be used to dissect the heterogeneity of breast cancer. We use an integrated analysis of breast cancer that combines independent methods of genomic analyses to highlight the complexity of signaling pathways underlying different breast cancer phenotypes and to identify optimal therapeutic opportunities.
BMC Bioinformatics | 2006
Evan S. Snitkin; Adam M. Gustafson; Joseph C. Mellor; Jie Wu; Charles DeLisi
BackgroundThe rapidly increasing speed with which genome sequence data can be generated will be accompanied by an exponential increase in the number of sequenced eukaryotes. With the increasing number of sequenced eukaryotic genomes comes a need for bioinformatic techniques to aid in functional annotation. Ideally, genome context based techniques such as proximity, fusion, and phylogenetic profiling, which have been so successful in prokaryotes, could be utilized in eukaryotes. Here we explore the application of phylogenetic profiling, a method that exploits the evolutionary co-occurrence of genes in the assignment of functional linkages, to eukaryotic genomes.ResultsIn order to evaluate the performance of phylogenetic profiling in eukaryotes, we assessed the relative performance of commonly used profile construction techniques and genome compositions in predicting functional linkages in both prokaryotic and eukaryotic organisms. When predicting linkages in E. coli with a prokaryotic profile, the use of continuous values constructed from transformed BLAST bit-scores performed better than profiles composed of discretized E-values; the use of discretized E-values resulted in more accurate linkages when using S. cerevisiae as the query organism. Extending this analysis by incorporating several eukaryotic genomes in profiles containing a majority of prokaryotes resulted in similar overall accuracy, but with a surprising reduction in pathway diversity among the most significant linkages. Furthermore, the application of phylogenetic profiling using profiles composed of only eukaryotes resulted in the loss of the strong correlation between common KEGG pathway membership and profile similarity score. Profile construction methods, orthology definitions, ontology and domain complexity were explored as possible sources of the poor performance of eukaryotic profiles, but with no improvement in results.ConclusionGiven the current set of completely sequenced eukaryotic organisms, phylogenetic profiling using profiles generated from any of the commonly used techniques was found to yield extremely poor results. These findings imply genome-specific requirements for constructing functionally relevant phylogenetic profiles, and suggest that differences in the evolutionary history between different kingdoms might generally limit the usefulness of phylogenetic profiling in eukaryotes.
Molecular Systems Biology | 2014
Adam L. Cohen; Raffaella Soldi; Haiyu Zhang; Adam M. Gustafson; Ryan Wilcox; Bryan E. Welm; Jeffrey T. Chang; Evan Johnson; Avrum Spira; Stefanie S. Jeffrey; Andrea Bild
Identifying the best drug for each cancer patient requires an efficient individualized strategy. We present MATCH (Merging genomic and pharmacologic Analyses for Therapy CHoice), an approach using public genomic resources and drug testing of fresh tumor samples to link drugs to patients. Valproic acid (VPA) is highlighted as a proof‐of‐principle. In order to predict specific tumor types with high probability of drug sensitivity, we create drug response signatures using publically available gene expression data and assess sensitivity in a data set of >40 cancer types. Next, we evaluate drug sensitivity in matched tumor and normal tissue and exclude cancer types that are no more sensitive than normal tissue. From these analyses, breast tumors are predicted to be sensitive to VPA. A meta‐analysis across breast cancer data sets shows that aggressive subtypes are most likely to be sensitive to VPA, but all subtypes have sensitive tumors. MATCH predictions correlate significantly with growth inhibition in cancer cell lines and three‐dimensional cultures of fresh tumor samples. MATCH accurately predicts reduction in tumor growth rate following VPA treatment in patient tumor xenografts. MATCH uses genomic analysis with in vitro testing of patient tumors to select optimal drug regimens before clinical trial initiation.
BMC Bioinformatics | 2008
Bolan Linghu; Evan S. Snitkin; Dustin T. Holloway; Adam M. Gustafson; Yu Xia; Charles DeLisi
BackgroundInformation obtained from diverse data sources can be combined in a principled manner using various machine learning methods to increase the reliability and range of knowledge about protein function. The result is a weighted functional linkage network (FLN) in which linked neighbors share at least one function with high probability. Precision is, however, low. Aiming to provide precise functional annotation for as many proteins as possible, we explore and propose a two-step framework for functional annotation (1) construction of a high-coverage and reliable FLN via machine learning techniques (2) development of a decision rule for the constructed FLN to optimize functional annotation.ResultsWe first apply this framework to Saccharomyces cerevisiae. In the first step, we demonstrate that four commonly used machine learning methods, Linear SVM, Linear Discriminant Analysis, Naïve Bayes, and Neural Network, all combine heterogeneous data to produce reliable and high-coverage FLNs, in which the linkage weight more accurately estimates functional coupling of linked proteins than use individual data sources alone. In the second step, empirical tuning of an adjustable decision rule on the constructed FLN reveals that basing annotation on maximum edge weight results in the most precise annotation at high coverages. In particular at low coverage all rules evaluated perform comparably. At coverage above approximately 50%, however, they diverge rapidly. At full coverage, the maximum weight decision rule still has a precision of approximately 70%, whereas for other methods, precision ranges from a high of slightly more than 30%, down to 3%. In addition, a scoring scheme to estimate the precisions of individual predictions is also provided. Finally, tests of the robustness of the framework indicate that our framework can be successfully applied to less studied organisms.ConclusionWe provide a general two-step function-annotation framework, and show that high coverage, high precision annotations can be achieved by constructing a high-coverage and reliable FLN via data integration followed by applying a maximum weight decision rule.