Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Guang Lan Zhang is active.

Publication


Featured researches published by Guang Lan Zhang.


BMC Bioinformatics | 2008

Evaluation of MHC-II peptide binding prediction servers: applications for vaccine research.

Honghuang Lin; Guang Lan Zhang; Songsak Tongchusak; Ellis L. Reinherz; Vladimir Brusic

BackgroundInitiation and regulation of immune responses in humans involves recognition of peptides presented by human leukocyte antigen class II (HLA-II) molecules. These peptides (HLA-II T-cell epitopes) are increasingly important as research targets for the development of vaccines and immunotherapies. HLA-II peptide binding studies involve multiple overlapping peptides spanning individual antigens, as well as complete viral proteomes. Antigen variation in pathogens and tumor antigens, and extensive polymorphism of HLA molecules increase the number of targets for screening studies. Experimental screening methods are expensive and time consuming and reagents are not readily available for many of the HLA class II molecules. Computational prediction methods complement experimental studies, minimize the number of validation experiments, and significantly speed up the epitope mapping process. We collected test data from four independent studies that involved 721 peptide binding assays. Full overlapping studies of four antigens identified binding affinity of 103 peptides to seven common HLA-DR molecules (DRB1*0101, 0301, 0401, 0701, 1101, 1301, and 1501). We used these data to analyze performance of 21 HLA-II binding prediction servers accessible through the WWW.ResultsBecause not all servers have predictors for all tested HLA-II molecules, we assessed a total of 113 predictors. The length of test peptides ranged from 15 to 19 amino acids. We tried three prediction strategies – the best 9-mer within the longer peptide, the average of best three 9-mer predictions, and the average of all 9-mer predictions within the longer peptide. The best strategy was the identification of a single best 9-mer within the longer peptide. Overall, measured by the receiver operating characteristic method (AROC), 17 predictors showed good (AROC > 0.8), 41 showed marginal (AROC > 0.7), and 55 showed poor performance (AROC < 0.7). Good performance predictors included HLA-DRB1*0101 (seven), 1101 (six), 0401 (three), and 0701 (one). The best individual predictor was NETMHCIIPAN, closely followed by PROPRED, IEDB (Consensus), and MULTIPRED (SVM). None of the individual predictors was shown to be suitable for prediction of promiscuous peptides. Current predictive capabilities allow prediction of only 50% of actual T-cell epitopes using practical thresholds.ConclusionThe available HLA-II servers do not match prediction capabilities of HLA-I predictors. Currently available HLA-II prediction servers offer only a limited prediction accuracy and the development of improved predictors is needed for large-scale studies, such as proteome-wide epitope mapping. The requirements for accuracy of HLA-II binding predictions are stringent because of the substantial effect of false positives.


PLOS ONE | 2007

Evolutionarily Conserved Protein Sequences of Influenza A Viruses, Avian and Human, as Vaccine Targets

A. T. Heiny; Olivo Miotto; Kellathur N. Srinivasan; Asif M. Khan; Guang Lan Zhang; Vladimir Brusic; Tin Wee Tan; J. Thomas August

Background Influenza A viruses generate an extreme genetic diversity through point mutation and gene segment exchange, resulting in many new strains that emerge from the animal reservoirs, among which was the recent highly pathogenic H5N1 virus. This genetic diversity also endows these viruses with a dynamic adaptability to their habitats, one result being the rapid selection of genomic variants that resist the immune responses of infected hosts. With the possibility of an influenza A pandemic, a critical need is a vaccine that will recognize and protect against any influenza A pathogen. One feasible approach is a vaccine containing conserved immunogenic protein sequences that represent the genotypic diversity of all current and future avian and human influenza viruses as an alternative to current vaccines that address only the known circulating virus strains. Methodology/Principal Findings Methodologies for large-scale analysis of the evolutionary variability of the influenza A virus proteins recorded in public databases were developed and used to elucidate the amino acid sequence diversity and conservation of 36,343 sequences of the 11 viral proteins of the recorded virus isolates of the past 30 years. Technologies were also applied to identify the conserved amino acid sequences from isolates of the past decade, and to evaluate the predicted human lymphocyte antigen (HLA) supertype-restricted class I and II T-cell epitopes of the conserved sequences. Fifty-five (55) sequences of 9 or more amino acids of the polymerases (PB2, PB1, and PA), nucleoprotein (NP), and matrix 1 (M1) proteins were completely conserved in at least 80%, many in 95 to 100%, of the avian and human influenza A virus isolates despite the marked evolutionary variability of the viruses. Almost all (50) of these conserved sequences contained putative supertype HLA class I or class II epitopes as predicted by 4 peptide-HLA binding algorithms. Additionally, data of the Immune Epitope Database (IEDB) include 29 experimentally identified HLA class I and II T-cell epitopes present in 14 of the conserved sequences. Conclusions/Significance This study of all reported influenza A virus protein sequences, avian and human, has identified 55 highly conserved sequences, most of which are predicted to have immune relevance as T-cell epitopes. This is a necessary first step in the design and analysis of a polyepitope, pan-influenza A vaccine. In addition to the application described herein, these technologies can be applied to other pathogens and to other therapeutic modalities designed to attack DNA, RNA, or protein sequences critical to pathogen function.


Nucleic Acids Research | 2005

MULTIPRED: a computational system for prediction of promiscuous HLA binding peptides

Guang Lan Zhang; Asif M. Khan; Kellathur N. Srinivasan; J. Thomas August; Vladimir Brusic

MULTIPRED is a web-based computational system for the prediction of peptide binding to multiple molecules (proteins) belonging to human leukocyte antigens (HLA) class I A2, A3 and class II DR supertypes. It uses hidden Markov models and artificial neural network methods as predictive engines. A novel data representation method enables MULTIPRED to predict peptides that promiscuously bind multiple HLA alleles within one HLA supertype. Extensive testing was performed for validation of the prediction models. Testing results show that MULTIPRED is both sensitive and specific and it has good predictive ability (area under the receiver operating characteristic curve AROC > 0.80). MULTIPRED can be used for the mapping of promiscuous T-cell epitopes as well as the regions of high concentration of these targets—termed T-cell epitope hotspots. MULTIPRED is available at .


Immunology and Cell Biology | 2002

Prediction of promiscuous peptides that bind HLA class I molecules

Vladimir Brusic; Nikolai Petrovsky; Guang Lan Zhang; Vladimir B. Bajic

Promiscuous T‐cell epitopes make ideal targets for vaccine development. We report here a computational system, multipred, for the prediction of peptide binding to the HLA‐A2 supertype. It combines a novel representation of peptide/MHC interactions with a hidden Markov model as the prediction algorithm. multipred is both sensitive and specific, and demonstrates high accuracy of peptide‐binding predictions for HLA‐A∗0201, ∗0204, and ∗0205 alleles, good accuracy for ∗0206 allele, and marginal accuracy for ∗0203 allele. multipred replaces earlier requirements for individual prediction models for each HLA allelic variant and simplifies computational aspects of peptide‐binding prediction. Preliminary testing indicates that multipred can predict peptide binding to HLA‐A2 supertype molecules with high accuracy, including those allelic variants for which no experimental binding data are currently available.


Bioinformatics | 2002

Dragon Promoter Finder: recognition of vertebrate RNA polymerase II promoters

Vladimir B. Bajic; Seng Hong Seah; Allen Chong; Guang Lan Zhang; Judice L. Y. Koh; Vladimir Brusic

Dragon Promoter Finder (DPF) locates RNA polymerase II promoters in DNA sequences of vertebrates by predicting Transcription Start Site (TSS) positions. DPFs algorithm uses sensors for three functional regions (promoters, exons and introns) and an Artificial Neural Network (ANN). Results on a large and diverse evaluation set indicate that DPF exhibits a superior predicting ability for TSS location compared to three other promoter-finding programs.


Immunity | 2017

Mass Spectrometry Profiling of HLA-Associated Peptidomes in Mono-allelic Cells Enables More Accurate Epitope Prediction

Jennifer G. Abelin; Derin B. Keskin; Siranush Sarkizova; Christina R. Hartigan; Wandi Zhang; John Sidney; Jonathan Stevens; William S. Lane; Guang Lan Zhang; Thomas Eisenhaure; Karl R. Clauser; Nir Hacohen; Michael S. Rooney; Steven A. Carr; Catherine J. Wu

SUMMARY Identification of human leukocyte antigen (HLA)‐bound peptides by liquid chromatography‐tandem mass spectrometry (LC‐MS/MS) is poised to provide a deep understanding of rules underlying antigen presentation. However, a key obstacle is the ambiguity that arises from the co‐expression of multiple HLA alleles. Here, we have implemented a scalable mono‐allelic strategy for profiling the HLA peptidome. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing an application‐specific spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of subdominant binding motifs and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural‐network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a strategy for systematically learning the rules of endogenous antigen presentation. Graphical Abstract Figure. No Caption available. Highlights24,000 HLA class I peptides were identified through a scalable MS‐based pipeline.Mono‐allelic data revealed binding motifs that were validated biochemically.Comprehensive analyses provide an updated portrait of antigen processing rules.Neural networks were trained for 16 alleles and outperform standard by 2‐fold. &NA; HLA class I binding prediction has traditionally been based on biochemical binding experiments. Abelin and colleagues present an LC‐MS/MS‐based workflow and analytical framework that greatly accelerates gains in prediction performance. Key advances include the discovery of sequence motifs and improved quantification of the roles of gene expression and proteasomal processing.


PLOS Neglected Tropical Diseases | 2008

Conservation and variability of dengue virus proteins: Implications for vaccine design

Asif M. Khan; Olivo Miotto; Eduardo J. M. Nascimento; Kellathur N. Srinivasan; A. T. Heiny; Guang Lan Zhang; Ernesto E. T. Marques; Tin Wee Tan; Vladimir Brusic; Jerome Salmon; J. Thomas August

Background Genetic variation and rapid evolution are hallmarks of RNA viruses, the result of high mutation rates in RNA replication and selection of mutants that enhance viral adaptation, including the escape from host immune responses. Variability is uneven across the genome because mutations resulting in a deleterious effect on viral fitness are restricted. RNA viruses are thus marked by protein sites permissive to multiple mutations and sites critical to viral structure-function that are evolutionarily robust and highly conserved. Identification and characterization of the historical dynamics of the conserved sites have relevance to multiple applications, including potential targets for diagnosis, and prophylactic and therapeutic purposes. Methodology/Principal Findings We describe a large-scale identification and analysis of evolutionarily highly conserved amino acid sequences of the entire dengue virus (DENV) proteome, with a focus on sequences of 9 amino acids or more, and thus immune-relevant as potential T-cell determinants. DENV protein sequence data were collected from the NCBI Entrez protein database in 2005 (9,512 sequences) and again in 2007 (12,404 sequences). Forty-four (44) sequences (pan-DENV sequences), mainly those of nonstructural proteins and representing ∼15% of the DENV polyprotein length, were identical in 80% or more of all recorded DENV sequences. Of these 44 sequences, 34 (∼77%) were present in ≥95% of sequences of each DENV type, and 27 (∼61%) were conserved in other Flaviviruses. The frequencies of variants of the pan-DENV sequences were low (0 to ∼5%), as compared to variant frequencies of ∼60 to ∼85% in the non pan-DENV sequence regions. We further showed that the majority of the conserved sequences were immunologically relevant: 34 contained numerous predicted human leukocyte antigen (HLA) supertype-restricted peptide sequences, and 26 contained T-cell determinants identified by studies with HLA-transgenic mice and/or reported to be immunogenic in humans. Conclusions/Significance Forty-four (44) pan-DENV sequences of at least 9 amino acids were highly conserved and identical in 80% or more of all recorded DENV sequences, and the majority were found to be immune-relevant by their correspondence to known or putative HLA-restricted T-cell determinants. The conservation of these sequences through the entire recorded DENV genetic history supports their possible value for diagnosis, prophylactic and/or therapeutic applications. The combination of bioinformatics and experimental approaches applied herein provides a framework for large-scale and systematic analysis of conserved and variable sequences of other pathogens, in particular, for rapidly mutating viruses, such as influenza A virus and HIV.


Journal of Immunological Methods | 2011

Machine learning competition in immunology – Prediction of HLA class I binding peptides

Guang Lan Zhang; Hifzur Rahman Ansari; Phil Bradley; Gavin C. Cawley; Tomer Hertz; Xihao Hu; Nebojsa Jojic; Yohan Kim; Oliver Kohlbacher; Ole Lund; Claus Lundegaard; Craig A. Magaret; Morten Nielsen; Harris Papadopoulos; Gajendra P. S. Raghava; Vider-Shalit Tal; Li C. Xue; Chen Yanover; Shanfeng Zhu; Michael T. Rock; James E. Crowe; Christos G. Panayiotou; Marios M. Polycarpou; Włodzisław Duch; Vladimir Brusic

Experimental studies of immune system and related applications such as characterization of immune responses against pathogens, vaccine design, or optimization of therapies are combinatorially complex, time-consuming and expensive. The main methods for large-scale identification of T-cell epitopes from pathogens or cancer proteomes involve either reverse immunology or high-throughput mass spectrometry (HTMS). Reverse immunology approaches involve pre-screening of proteomes by computational algorithms, followed by experimental validation of selected targets (Mora et al., 2006; De Groot et al., 2008; Larsen et al., 2010). HTMS involves HLA typing, immunoaffinity chromatography of HLA molecules, HLA extraction, and chromatography combined with tandem mass spectrometry, followed by the application of computational algorithms for peptide characterization (Bassani-Sternberg et al., 2010). Hundreds of naturally processed HLA class I associated peptides have been identified in individual studies using HTMS in normal (Escobar et al., 2008), cancer (Antwi et al., 2009; Bassani-Sternberg et al., 2010), autoimmunity-related (Ben Dror et al., 2010), and infected samples (Wahl et al, 2010). Computational algorithms are essential steps in highthroughput identification of T-cell epitope candidates using both reverse immunology and HTMS approaches. Peptide binding to MHC molecules is the single most selective step in defining T cell epitope and the accuracy of computational algorithms for prediction of peptide binding, therefore, determines the accuracy of the overall method. Computational predictions of peptide binding to HLA, both class I and class II, use a variety of algorithms ranging from binding motifs to advanced machine learning techniques (Brusic et al., 2004; Lafuente and Reche, 2009) and standards for their


Immunome Research | 2006

PREDTAP: a system for prediction of peptide binding to the human transporter associated with antigen processing

Guang Lan Zhang; Nikolai Petrovsky; Chee Keong Kwoh; J. Thomas August; Vladimir Brusic

BackgroundThe transporter associated with antigen processing (TAP) is a critical component of the major histocompatibility complex (MHC) class I antigen processing and presentation pathway. TAP transports antigenic peptides into the endoplasmic reticulum where it loads them into the binding groove of MHC class I molecules. Because peptides must first be transported by TAP in order to be presented on MHC class I, TAP binding preferences should impact significantly on T-cell epitope selection.DescriptionPREDTAP is a computational system that predicts peptide binding to human TAP. It uses artificial neural networks and hidden Markov models as predictive engines. Extensive testing was performed to valid the prediction models. The results showed that PREDTAP was both sensitive and specific and had good predictive ability (area under the receiver operating characteristic curve Aroc>0.85).ConclusionPREDTAP can be integrated with prediction systems for MHC class I binding peptides for improved performance of in silico prediction of T-cell epitopes. PREDTAP is available for public use at [1].


Nucleic Acids Research | 2005

PREDBALB/c: a system for the prediction of peptide binding to H2d molecules, a haplotype of the BALB/c mouse

Guang Lan Zhang; Kellathur N. Srinivasan; Anitha Veeramani; J. Thomas August; Vladimir Brusic

PREDBALB/c is a computational system that predicts peptides binding to the major histocompatibility complex-2 (H2d) of the BALB/c mouse, an important laboratory model organism. The predictions include the complete set of H2d class I (H2-Kd, H2-Ld and H2-Dd) and class II (I-Ed and I-Ad) molecules. The prediction system utilizes quantitative matrices, which were rigorously validated using experimentally determined binders and non-binders and also by in vivo studies using viral proteins. The prediction performance of PREDBALB/c is of very high accuracy. To our knowledge, this is the first online server for the prediction of peptides binding to a complete set of major histocompatibility complex molecules in a model organism (H2d haplotype). PREDBALB/c is available at .

Collaboration


Dive into the Guang Lan Zhang's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

J. Thomas August

Johns Hopkins University School of Medicine

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lars Olsen

University of Copenhagen

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Asif M. Khan

National University of Singapore

View shared research outputs
Researchain Logo
Decentralizing Knowledge