Harish Nagarajan
University of California, San Diego
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Harish Nagarajan.
Nature Biotechnology | 2011
Xun Xu; Harish Nagarajan; Nathan E. Lewis; Shengkai Pan; Zhiming Cai; Xin Liu; Wenbin Chen; Min Xie; Wenliang Wang; Stephanie Hammond; Mikael Rørdam Andersen; Norma F. Neff; Benedetto Passarelli; Winston Koh; H. Christina Fan; Jianbin Wang; Yaoting Gui; Kelvin H. Lee; Michael J. Betenbaugh; Stephen R. Quake; Iman Famili; Bernhard O. Palsson; Jun Wang
Chinese hamster ovary (CHO)–derived cell lines are the preferred host cells for the production of therapeutic proteins. Here we present a draft genomic sequence of the CHO-K1 ancestral cell line. The assembly comprises 2.45 Gb of genomic sequence, with 24,383 predicted genes. We associate most of the assembled scaffolds with 21 chromosomes isolated by microfluidics to identify chromosomal locations of genes. Furthermore, we investigate genes involved in glycosylation, which affect therapeutic protein quality, and viral susceptibility genes, which are relevant to cell engineering and regulatory concerns. Homologs of most human glycosylation-associated genes are present in the CHO-K1 genome, although 141 of these homologs are not expressed under exponential growth conditions. Many important viral entry genes are also present in the genome but not expressed, which may explain the unusual viral resistance property of CHO cell lines. We discuss how the availability of this genome sequence may facilitate genome-scale science for the optimization of biopharmaceutical protein production.
Nature Reviews Microbiology | 2012
Nathan E. Lewis; Harish Nagarajan; Bernhard O. Palsson
Reconstructed microbial metabolic networks facilitate a mechanistic description of the genotype–phenotype relationship through the deployment of constraint-based reconstruction and analysis (COBRA) methods. As reconstructed networks leverage genomic data for insight and phenotype prediction, the development of COBRA methods has accelerated following the advent of whole-genome sequencing. Here, we describe a phylogeny of COBRA methods that has rapidly evolved from the few early methods, such as flux balance analysis and elementary flux mode analysis, into a repertoire of more than 100 methods. These methods have enabled genome-scale analysis of microbial metabolism for numerous basic and applied uses, including antibiotic discovery, metabolic engineering and modelling of microbial community behaviour.
Nature Biotechnology | 2013
Nathan E. Lewis; Xin Liu; Yuxiang Li; Harish Nagarajan; George Yerganian; Edward J. O'Brien; Aarash Bordbar; Anne M Roth; Jeffrey Rosenbloom; Chao Bian; Min Xie; Wenbin Chen; Ning Li; Deniz Baycin-Hizal; Haythem Latif; Jochen Förster; Michael J. Betenbaugh; Iman Famili; Xun Xu; Jun Wang; Bernhard O. Palsson
Chinese hamster ovary (CHO) cells, first isolated in 1957, are the preferred production host for many therapeutic proteins. Although genetic heterogeneity among CHO cell lines has been well documented, a systematic, nucleotide-resolution characterization of their genotypic differences has been stymied by the lack of a unifying genomic resource for CHO cells. Here we report a 2.4-Gb draft genome sequence of a female Chinese hamster, Cricetulus griseus, harboring 24,044 genes. We also resequenced and analyzed the genomes of six CHO cell lines from the CHO-K1, DG44 and CHO-S lineages. This analysis identified hamster genes missing in different CHO cell lines, and detected >3.7 million single-nucleotide polymorphisms (SNPs), 551,240 indels and 7,063 copy number variations. Many mutations are located in genes with functions relevant to bioprocessing, such as apoptosis. The details of this genetic diversity highlight the value of the hamster genome as the reference upon which CHO cells can be studied and engineered for protein production.
Nature Communications | 2013
Teruaki Nakatsuji; Hsin-I Chiang; Shangi B. Jiang; Harish Nagarajan; Karsten Zengler; Richard L. Gallo
Commensal microbes on the skin surface influence the behavior of cells below the epidermis. We hypothesized that bacteria or their products exist below the surface epithelium and thus permit physical interaction between microbes and dermal cells. Here, to test this hypothesis, we employed multiple independent detection techniques for bacteria including qPCR, Gram-staining, immunofluorescence, and in situ hybridization. Bacteria were consistently detectable within the dermis and dermal adipose of normal human skin. Sequencing of DNA from dermis and dermal adipose tissue identified bacterial 16S rRNA reflective of a diverse and partially distinct microbial community in each skin compartment. These results show the microbiota extends within the dermis, therefore enabling physical contact between bacteria and various cells below the basement membrane. These observations show that normal commensal bacterial communities directly communicate with the host in a tissue previously thought to be sterile.
Proceedings of the National Academy of Sciences of the United States of America | 2013
Don D. Nguyen; Cheng-Hsuan Wu; Wilna J. Moree; Anne Lamsa; Marnix H. Medema; X. Zhao; Ronnie G. Gavilán; Marystella Aparicio; Librada Atencio; Chanaye Jackson; Javier Ballesteros; Joel Sanchez; Jeramie D. Watrous; Vanessa V. Phelan; Corine van de Wiel; Roland D. Kersten; Samina Mehnaz; René De Mot; Elizabeth A. Shank; Pep Charusanti; Harish Nagarajan; Brendan M. Duggan; Bradley S. Moore; Nuno Bandeira; Bernhard O. Palsson; Kit Pogliano; Marcelino Gutiérrez; Pieter C. Dorrestein
Significance The paper introduces the concepts of molecular families (MFs) and gene cluster families (GCFs). We define MFs as structurally related molecules based on their mass spectral fragmentation patterns, whereas GCFs are biosynthetic gene clusters that show similar gene cluster organization with a high degree of sequence similarity. We use MS/MS networking as a tool to map the molecular network of more than 60 organisms, most of which are unsequenced, and locate their nonribosomal peptide MFs. These MFs from unsequenced organisms are then connected to GCFs of publicly available genome sequences of closely related organisms. The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779T. The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family–gene cluster families of hundreds or more diverse organisms in one single MS/MS network.
Journal of Proteome Research | 2012
Deniz Baycin-Hizal; David L. Tabb; Raghothama Chaerkady; Lily Chen; Nathan E. Lewis; Harish Nagarajan; Vishaldeep Sarkaria; Amit Kumar; Daniel Wolozny; Joe Colao; Elena Jacobson; Yuan Tian; Robert N. O’Meally; Sharon S. Krag; Robert N. Cole; Bernhard O. Palsson; Hui Zhang; Michael J. Betenbaugh
To complement the recent genomic sequencing of Chinese hamster ovary (CHO) cells, proteomic analysis was performed on CHO cells including the cellular proteome, secretome, and glycoproteome using tandem mass spectrometry (MS/MS) of multiple fractions obtained from gel electrophoresis, multidimensional liquid chromatography, and solid phase extraction of glycopeptides (SPEG). From the 120 different mass spectrometry analyses generating 682,097 MS/MS spectra, 93,548 unique peptide sequences were identified with at most 0.02 false discovery rate (FDR). A total of 6164 grouped proteins were identified from both glycoproteome and proteome analysis, representing an 8-fold increase in the number of proteins currently identified in the CHO proteome. Furthermore, this is the first proteomic study done using the CHO genome exclusively, which provides for more accurate identification of proteins. From this analysis, the CHO codon frequency was determined and found to be distinct from humans, which will facilitate expression of human proteins in CHO cells. Analysis of the combined proteomic and mRNA data sets indicated the enrichment of a number of pathways including protein processing and apoptosis but depletion of proteins involved in steroid hormone and glycosphingolipid metabolism. Five-hundred four of the detected proteins included N-acetylation modifications, and 1292 different proteins were observed to be N-glycosylated. This first large-scale proteomic analysis will enhance the knowledge base about CHO capabilities for recombinant expression and provide information useful in cell engineering efforts aimed at modifying CHO cellular functions.
PLOS Genetics | 2012
Donghyuk Kim; Jay Sung-Joong Hong; Yu Qiu; Harish Nagarajan; Joo-Hyun Seo; Byung Kwan Cho; Shih-Feng Tsai; Bernhard O. Palsson
Genome-wide transcription start site (TSS) profiles of the enterobacteria Escherichia coli and Klebsiella pneumoniae were experimentally determined through modified 5′ RACE followed by deep sequencing of intact primary mRNA. This identified 3,746 and 3,143 TSSs for E. coli and K. pneumoniae, respectively. Experimentally determined TSSs were then used to define promoter regions and 5′ UTRs upstream of coding genes. Comparative analysis of these regulatory elements revealed the use of multiple TSSs, identical sequence motifs of promoter and Shine-Dalgarno sequence, reflecting conserved gene expression apparatuses between the two species. In both species, over 70% of primary transcripts were expressed from operons having orthologous genes during exponential growth. However, expressed orthologous genes in E. coli and K. pneumoniae showed a strikingly different organization of upstream regulatory regions with only 20% identical promoters with TSSs in both species. Over 40% of promoters had TSSs identified in only one species, despite conserved promoter sequences existing in the other species. 662 conserved promoters having TSSs in both species resulted in the same number of comparable 5′ UTR pairs, and that regulatory element was found to be the most variant region in sequence among promoter, 5′ UTR, and ORF. In K. pneumoniae, 48 sRNAs were predicted and 36 of them were expressed during exponential growth. Among them, 34 orthologous sRNAs between two species were analyzed in depth, and the analysis showed that many sRNAs of K. pneumoniae, including pleiotropic sRNAs such as rprA, arcZ, and sgrS, may work in the same way as in E. coli. These results reveal a new dimension of comparative genomics such that a comparison of two genomes needs to be comprehensive over all levels of genome organization.
Microbial Cell Factories | 2013
Harish Nagarajan; Merve Sahin; Juan Nogales; Haythem Latif; Derek R. Lovley; Ali Ebrahim; Karsten Zengler
BackgroundThe metabolic capabilities of acetogens to ferment a wide range of sugars, to grow autotrophically on H2/CO2, and more importantly on synthesis gas (H2/CO/CO2) make them very attractive candidates as production hosts for biofuels and biocommodities. Acetogenic metabolism is considered one of the earliest modes of bacterial metabolism. A thorough understanding of various factors governing the metabolism, in particular energy conservation mechanisms, is critical for metabolic engineering of acetogens for targeted production of desired chemicals.ResultsHere, we present the genome-scale metabolic network of Clostridium ljungdahlii, the first such model for an acetogen. This genome-scale model (iHN637) consisting of 637 genes, 785 reactions, and 698 metabolites captures all the major central metabolic and biosynthetic pathways, in particular pathways involved in carbon fixation and energy conservation. A combination of metabolic modeling, with physiological and transcriptomic data provided insights into autotrophic metabolism as well as aided the characterization of a nitrate reduction pathway in C. ljungdahlii. Analysis of the iHN637 metabolic model revealed that flavin based electron bifurcation played a key role in energy conservation during autotrophic growth and helped identify genes for some of the critical steps in this mechanism.ConclusionsiHN637 represents a predictive model that recapitulates experimental data, and provides valuable insights into the metabolic response of C. ljungdahlii to genetic perturbations under various growth conditions. Thus, the model will be instrumental in guiding metabolic engineering of C. ljungdahlii for the industrial production of biocommodities and biofuels.
The ISME Journal | 2014
Mallory Embree; Harish Nagarajan; Narjes S. Movahedi; Hamidreza Chitsaz; Karsten Zengler
Microbial interactions have a key role in global geochemical cycles. Although we possess significant knowledge about the general biochemical processes occurring in microbial communities, we are often unable to decipher key functions of individual microorganisms within the environment in part owing to the inability to cultivate or study them in isolation. Here, we circumvent this shortcoming through the use of single-cell genome sequencing and a novel low-input metatranscriptomics protocol to reveal the intricate metabolic capabilities and microbial interactions of an alkane-degrading methanogenic community. This methanogenic consortium oxidizes saturated hydrocarbons under anoxic conditions through a thus-far-uncharacterized biochemical process. The genome sequence of a dominant bacterial member of this community, belonging to the genus Smithella, was sequenced and served as the basis for subsequent analysis through metabolic reconstruction. Metatranscriptomic data generated from less than 500 pg of mRNA highlighted metabolically active genes during anaerobic alkane oxidation in comparison with growth on fatty acids. These data sets suggest that Smithella is not activating hexadecane by fumarate addition. Differential expression assisted in the identification of hypothetical proteins with no known homology that may be involved in hexadecane activation. Additionally, the combination of 16S rDNA sequence and metatranscriptomic data enabled the study of other prevalent organisms within the consortium and their interactions with Smithella, thus yielding a comprehensive characterization of individual constituents at the genome scale during methanogenic alkane oxidation.
Nature Communications | 2013
Harish Nagarajan; Mallory Embree; Amelia-Elena Rotaru; Pravin Malla Shrestha; Adam M. Feist; Bernhard O. Palsson; Derek R. Lovley; Karsten Zengler
Syntrophic associations are central to microbial communities and thus have a fundamental role in the global carbon cycle. Despite biochemical approaches describing the physiological activity of these communities, there has been a lack of a mechanistic understanding of the relationship between complex nutritional and energetic dependencies and their functioning. Here we apply a multi-omic modelling workflow that combines genomic, transcriptomic and physiological data with genome-scale models to investigate dynamics and electron flow mechanisms in the syntrophic association of Geobacter metallireducens and Geobacter sulfurreducens. Genome-scale modelling of direct interspecies electron transfer reveals insights into the energetics of electron transfer mechanisms. While G. sulfurreducens adapts to rapid syntrophic growth by changes at the genomic and transcriptomic level, G. metallireducens responds only at the transcriptomic level. This multi-omic approach enhances our understanding of adaptive responses and factors that shape the evolution of syntrophic communities.