Rachel Karchin
Johns Hopkins University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Rachel Karchin.
Science | 2008
D. Williams Parsons; Siân Jones; Xiaosong Zhang; Jimmy Lin; Rebecca J. Leary; Philipp Angenendt; Parminder Mankoo; Hannah Carter; I-Mei Siu; Gary L. Gallia; Alessandro Olivi; Roger E. McLendon; B. Ahmed Rasheed; Stephen T. Keir; Tatiana Nikolskaya; Yuri Nikolsky; Dana Busam; Hanna Tekleab; Luis A. Diaz; James Hartigan; Doug Smith; Robert L. Strausberg; Suely Kazue Nagahashi Marie; Sueli Mieko Oba Shinjo; Hai Yan; Gregory J. Riggins; Darell D. Bigner; Rachel Karchin; Nick Papadopoulos; Giovanni Parmigiani
Glioblastoma multiforme (GBM) is the most common and lethal type of brain cancer. To identify the genetic alterations in GBMs, we sequenced 20,661 protein coding genes, determined the presence of amplifications and deletions using high-density oligonucleotide arrays, and performed gene expression analyses using next-generation sequencing technologies in 22 human tumor samples. This comprehensive analysis led to the discovery of a variety of genes that were not known to be altered in GBMs. Most notably, we found recurrent mutations in the active site of isocitrate dehydrogenase 1 (IDH1) in 12% of GBM patients. Mutations in IDH1 occurred in a large fraction of young patients and in most patients with secondary GBMs and were associated with an increase in overall survival. These studies demonstrate the value of unbiased genomic analyses in the characterization of human brain cancer and identify a potentially useful genetic alteration for the classification and targeted therapy of GBMs.
Science | 2007
Laura D. Wood; D. Williams Parsons; Siân Jones; Jimmy Lin; Tobias Sjöblom; Rebecca J. Leary; Dong Shen; Simina M. Boca; Thomas D. Barber; Janine Ptak; Natalie Silliman; Steve Szabo; Zoltan Dezso; Vadim Ustyanksky; Tatiana Nikolskaya; Yuri Nikolsky; Rachel Karchin; Paul Wilson; Joshua S. Kaminker; Zemin Zhang; Randal Croshaw; Joseph Willis; Dawn Dawson; Michail Shipitsin; James K V Willson; Saraswati Sukumar; Kornelia Polyak; Ben Ho Park; Charit L. Pethiyagoda; P.V. Krishna Pant
Human cancer is caused by the accumulation of mutations in oncogenes and tumor suppressor genes. To catalog the genetic changes that occur during tumorigenesis, we isolated DNA from 11 breast and 11 colorectal tumors and determined the sequences of the genes in the Reference Sequence database in these samples. Based on analysis of exons representing 20,857 transcripts from 18,191 genes, we conclude that the genomic landscapes of breast and colorectal cancers are composed of a handful of commonly mutated gene “mountains” and a much larger number of gene “hills” that are mutated at low frequency. We describe statistical and bioinformatic tools that may help identify mutations with a role in tumorigenesis. These results have implications for understanding the nature and heterogeneity of human cancers and for using personal genomics for tumor diagnosis and therapy.
Nucleic Acids Research | 2004
Ursula Pieper; Narayanan Eswar; Ben Webb; David Eramian; Libusha Kelly; David T. Barkan; Hannah Carter; Parminder Mankoo; Rachel Karchin; Marc A. Marti-Renom; Fred P. Davis; Andrej Sali
ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains almost 30 million reliable models for domains in 4.7 million unique protein sequences. ModBase allows users to compute or update comparative models on demand, through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the AllosMod server for modeling ligand-induced protein dynamics (http://salilab.org/allosmod), the AllosMod-FoXS server for predicting a structural ensemble that fits an SAXS profile (http://salilab.org/allosmod-foxs), the FoXSDock server for protein–protein docking filtered by an SAXS profile (http://salilab.org/foxsdock), the SAXS Merge server for automatic merging of SAXS profiles (http://salilab.org/saxsmerge) and the Pose & Rank server for scoring protein–ligand complexes (http://salilab.org/poseandrank). In this update, we also highlight two applications of ModBase: a PSI:Biology initiative to maximize the structural coverage of the human alpha-helical transmembrane proteome and a determination of structural determinants of human immunodeficiency virus-1 protease specificity.
Science | 2011
D. Williams Parsons; Meng Li; Xiaosong Zhang; Siân Jones; Rebecca J. Leary; Jimmy Lin; Simina M. Boca; Hannah Carter; Josue Samayoa; Chetan Bettegowda; Gary L. Gallia; George I. Jallo; Zev A. Binder; Yuri Nikolsky; James Hartigan; Doug Smith; Daniela S. Gerhard; Daniel W. Fults; Scott R. VandenBerg; Mitchel S. Berger; Suely Kazue Nagahashi Marie; Sueli Mieko Oba Shinjo; Carlos Clara; Peter C. Phillips; Jane E. Minturn; Jaclyn A. Biegel; Alexander R. Judkins; Adam C. Resnick; Phillip B. Storm; Tom Curran
Genomic analysis of a childhood cancer reveals markedly fewer mutations than what is typically seen in adult cancers. Medulloblastoma (MB) is the most common malignant brain tumor of children. To identify the genetic alterations in this tumor type, we searched for copy number alterations using high-density microarrays and sequenced all known protein-coding genes and microRNA genes using Sanger sequencing in a set of 22 MBs. We found that, on average, each tumor had 11 gene alterations, fewer by a factor of 5 to 10 than in the adult solid tumors that have been sequenced to date. In addition to alterations in the Hedgehog and Wnt pathways, our analysis led to the discovery of genes not previously known to be altered in MBs. Most notably, inactivating mutations of the histone-lysine N-methyltransferase genes MLL2 or MLL3 were identified in 16% of MB patients. These results demonstrate key differences between the genetic landscapes of adult and childhood cancers, highlight dysregulation of developmental pathways as an important mechanism underlying MBs, and identify a role for a specific type of histone methylation in human tumorigenesis.
Proceedings of the National Academy of Sciences of the United States of America | 2010
Ivana Bozic; Tibor Antal; Hisashi Ohtsuki; Hannah Carter; Dewey Kim; Sining Chen; Rachel Karchin; Kenneth W. Kinzler; Bert Vogelstein; Martin A. Nowak
Major efforts to sequence cancer genomes are now occurring throughout the world. Though the emerging data from these studies are illuminating, their reconciliation with epidemiologic and clinical observations poses a major challenge. In the current study, we provide a mathematical model that begins to address this challenge. We model tumors as a discrete time branching process that starts with a single driver mutation and proceeds as each new driver mutation leads to a slightly increased rate of clonal expansion. Using the model, we observe tremendous variation in the rate of tumor development—providing an understanding of the heterogeneity in tumor sizes and development times that have been observed by epidemiologists and clinicians. Furthermore, the model provides a simple formula for the number of driver mutations as a function of the total number of mutations in the tumor. Finally, when applied to recent experimental data, the model allows us to calculate the actual selective advantage provided by typical somatic mutations in human tumors in situ. This selective advantage is surprisingly small—0.004 ± 0.0004—and has major implications for experimental cancer research.
Proceedings of the National Academy of Sciences of the United States of America | 2011
Jian Wu; Yuchen Jiao; Marco Dal Molin; Anirban Maitra; Roeland F. De Wilde; Laura D. Wood; James R. Eshleman; Michael Goggins; Christopher L. Wolfgang; Marcia I. Canto; Richard D. Schulick; Barish H. Edil; Michael A. Choti; Volkan Adsay; David S. Klimstra; G. Johan A. Offerhaus; Alison P. Klein; Levy Kopelovich; Hannah Carter; Rachel Karchin; Peter J. Allen; C. Max Schmidt; Yoshiki Naito; Luis A. Diaz; Kenneth W. Kinzler; Nickolas Papadopoulos; Ralph H. Hruban; Bert Vogelstein
More than 2% of adults harbor a pancreatic cyst, a subset of which progresses to invasive lesions with lethal consequences. To assess the genomic landscapes of neoplastic cysts of the pancreas, we determined the exomic sequences of DNA from the neoplastic epithelium of eight surgically resected cysts of each of the major neoplastic cyst types: serous cystadenomas (SCAs), intraductal papillary mucinous neoplasms (IPMNs), mucinous cystic neoplasms (MCNs), and solid pseudopapillary neoplasms (SPNs). SPNs are low-grade malignancies, and IPMNs and MCNs, but not SCAs, have the capacity to progress to cancer. We found that SCAs, IPMNs, MCNs, and SPNs contained 10 ± 4.6, 27 ± 12, 16 ± 7.6, and 2.9 ± 2.1 somatic mutations per tumor, respectively. Among the mutations identified, E3 ubiquitin ligase components were of particular note. Four of the eight SCAs contained mutations of the von Hippel–Lindau gene (VHL), a key component of the VHL ubiquitin ligase complex that has previously been associated with renal cell carcinomas, SCAs, and other neoplasms. Six of the eight IPMNs and three of the eight MCNs harbored mutations of RNF43, a gene coding for a protein with intrinsic E3 ubiquitin ligase activity that has not previously been found to be genetically altered in any human cancer. The preponderance of inactivating mutations in RNF43 unequivocally establish it as a suppressor of both IPMNs and MCNs. SPNs contained remarkably few genetic alterations but always contained mutations of CTNNB1, previously demonstrated to inhibit degradation of the encoded protein (β-catenin) by E3 ubiquitin ligases. These results highlight the essential role of ubiquitin ligases in these neoplasms and have important implications for the diagnosis and treatment of patients with cystic tumors.
Proteins | 2003
Kevin Karplus; Rachel Karchin; Jenny Draper; Jonathan Casper; Yael Mandel-Gutfreund; Mark Diekhans; Richard Hughey
This article presents an overview of the SAM‐T02 method for protein fold recognition and the UNDERTAKER program for ab initio predictions. The SAM‐T02 server is an automatic method that uses two‐track hidden Markov models (HMMS) to find and align template proteins from PDB to the target protein. The two‐track HMMs use an amino acid alphabet and one of several different local structure alphabets. The UNDERTAKER program is a new fragment‐packing program that can use short or long fragments and alignments to create protein conformations. The HMMs and fold‐recognition alignments from the SAM‐T02 method were used to generate the fragment and alignment libraries used by UNDERTAKER. We present results on a few selected targets for which this combined method worked particularly well: T0129, T0181, T0135, T0130, and T0139. Proteins 2003;53:491–496.
Bioinformatics | 2002
Rachel Karchin; Kevin Karplus; David Haussler
MOTIVATION The enormous amount of protein sequence data uncovered by genome research has increased the demand for computer software that can automate the recognition of new proteins. We discuss the relative merits of various automated methods for recognizing G-Protein Coupled Receptors (GPCRs), a superfamily of cell membrane proteins. GPCRs are found in a wide range of organisms and are central to a cellular signalling network that regulates many basic physiological processes. They are the focus of a significant amount of current pharmaceutical research because they play a key role in many diseases. However, their tertiary structures remain largely unsolved. The methods described in this paper use only primary sequence information to make their predictions. We compare a simple nearest neighbor approach (BLAST), methods based on multiple alignments generated by a statistical profile Hidden Markov Model (HMM), and methods, including Support Vector Machines (SVMs), that transform protein sequences into fixed-length feature vectors. RESULTS The last is the most computationally expensive method, but our experiments show that, for those interested in annotation-quality classification, the results are worth the effort. In two-fold cross-validation experiments testing recognition of GPCR subfamilies that bind a specific ligand (such as a histamine molecule), the errors per sequence at the Minimum Error Point (MEP) were 13.7% for multi-class SVMs, 17.1% for our SVMtree method of hierarchical multi-class SVM classification, 25.5% for BLAST, 30% for profile HMMs, and 49% for classification based on nearest neighbor feature vector Kernel Nearest Neighbor (kernNN). The percentage of true positives recognized before the first false positive was 65% for both SVM methods, 13% for BLAST, 5% for profile HMMs and 4% for kernNN.
Cancer Research | 2009
Hannah Carter; Sining Chen; Leyla Isik; Svitlana Tyekucheva; Victor E. Velculescu; Kenneth W. Kinzler; Bert Vogelstein; Rachel Karchin
Large-scale sequencing of cancer genomes has uncovered thousands of DNA alterations, but the functional relevance of the majority of these mutations to tumorigenesis is unknown. We have developed a computational method, called Cancer-specific High-throughput Annotation of Somatic Mutations (CHASM), to identify and prioritize those missense mutations most likely to generate functional changes that enhance tumor cell proliferation. The method has high sensitivity and specificity when discriminating between known driver missense mutations and randomly generated missense mutations (area under receiver operating characteristic curve, >0.91; area under Precision-Recall curve, >0.79). CHASM substantially outperformed previously described missense mutation function prediction methods at discriminating known oncogenic mutations in P53 and the tyrosine kinase epidermal growth factor receptor. We applied the method to 607 missense mutations found in a recent glioblastoma multiforme sequencing study. Based on a model that assumed the glioblastoma multiforme mutations are a mixture of drivers and passengers, we estimate that 8% of these mutations are drivers, causally contributing to tumorigenesis.
Nature Genetics | 2013
Patrick R. Sosnay; Karen R Siklosi; Fredrick Van Goor; Kyle Kaniecki; H. Yu; Neeraj Sharma; Anabela S. Ramalho; Margarida D. Amaral; Ruslan Dorfman; Julian Zielenski; David L. Masica; Rachel Karchin; Linda Millen; Philip J. Thomas; George P. Patrinos; Mary Corey; Michelle Huckaby Lewis; Johanna M. Rommens; Carlo Castellani; Christopher M. Penland; Garry R. Cutting
Allelic heterogeneity in disease-causing genes presents a substantial challenge to the translation of genomic variation into clinical practice. Few of the almost 2,000 variants in the cystic fibrosis transmembrane conductance regulator gene CFTR have empirical evidence that they cause cystic fibrosis. To address this gap, we collected both genotype and phenotype data for 39,696 individuals with cystic fibrosis in registries and clinics in North America and Europe. In these individuals, 159 CFTR variants had an allele frequency of ł0.01%. These variants were evaluated for both clinical severity and functional consequence, with 127 (80%) meeting both clinical and functional criteria consistent with disease. Assessment of disease penetrance in 2,188 fathers of individuals with cystic fibrosis enabled assignment of 12 of the remaining 32 variants as neutral, whereas the other 20 variants remained of indeterminate effect. This study illustrates that sourcing data directly from well-phenotyped subjects can address the gap in our ability to interpret clinically relevant genomic variation.