Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where James M. Heather is active.

Publication


Featured researches published by James M. Heather.


Genomics | 2016

The sequence of sequencers: The history of sequencing DNA

James M. Heather; Benjamin M. Chain

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way.


Bioinformatics | 2013

Decombinator: a tool for fast, efficient gene assignment in T-cell receptor sequences using a finite state machine

Niclas Thomas; James M. Heather; Wilfred Ndifon; John Shawe-Taylor; Benjamin M. Chain

SUMMARY High-throughput sequencing provides an opportunity to analyse the repertoire of antigen-specific receptors with an unprecedented breadth and depth. However, the quantity of raw data produced by this technology requires efficient ways to categorize and store the output for subsequent analysis. To this end, we have defined a simple five-item identifier that uniquely and unambiguously defines each TcR sequence. We then describe a novel application of finite-state automaton to map Illumina short-read sequence data for individual TcRs to their respective identifier. An extension of the standard algorithm is also described, which allows for the presence of single-base pair mismatches arising from sequencing error. The software package, named Decombinator, is tested first on a set of artificial in silico sequences and then on a set of published human TcR-β sequences. Decombinator assigned sequences at a rate more than two orders of magnitude faster than that achieved by classical pairwise alignment algorithms, and with a high degree of accuracy (>88%), even after introducing up to 1% error rates in the in silico sequences. Analysis of the published sequence dataset highlighted the strong V and J usage bias observed in the human peripheral blood repertoire, which seems to be unconnected to antigen exposure. The analysis also highlighted the enormous size of the available repertoire and the challenge of obtaining a comprehensive description for it. The Decombinator package will be a valuable tool for further in-depth analysis of the T-cell repertoire. AVAILABILITY AND IMPLEMENTATION The Decombinator package is implemented in Python (v2.6) and is freely available at https://github.com/uclinfectionimmunity/Decombinator along with full documentation and examples of typical usage.


PLOS ONE | 2011

Epigenetic Control of Viral Life-Cycle by a DNA-Methylation Dependent Transcription Factor

Kirsty Flower; David Thomas; James M. Heather; Sharada Ramasubramanyan; Susan Jones; Alison J. Sinclair

Epstein-Barr virus (EBV) encoded transcription factor Zta (BZLF1, ZEBRA, EB1) is the prototype of a class of transcription factor (including C/EBPalpha) that interact with CpG-containing DNA response elements in a methylation-dependent manner. The EBV genome undergoes a biphasic methylation cycle; it is extensively methylated during viral latency but is reset to an unmethylated state following viral lytic replication. Zta is expressed transiently following infection and again during the switch between latency and lytic replication. The requirement for CpG-methylation at critical Zta response elements (ZREs) has been proposed to regulate EBV replication, specifically it could aid the activation of viral lytic gene expression from silenced promoters on the methylated genome during latency in addition to preventing full lytic reactivation from the non-methylated EBV genome immediately following infection. We developed a computational approach to predict the location of ZREs which we experimentally assessed using in vitro and in vivo DNA association assays. A remarkably different binding motif is apparent for the CpG and non-CpG ZREs. Computational prediction of the location of these binding motifs in EBV revealed that the majority of lytic cycle genes have at least one and many have multiple copies of methylation-dependent CpG ZREs within their promoters. This suggests that the abundance of Zta protein coupled with the methylation status of the EBV genome act together to co-ordinate the expression of lytic cycle genes at the majority of EBV promoters.


Journal of General Virology | 2009

The Epstein-Barr virus lytic cycle activator Zta interacts with methylated ZRE in the promoter of host target gene egr1.

James M. Heather; Kirsty Flower; Samine Isaac; Alison J. Sinclair

Activation of the host gene egr1 is essential for the lytic replication of Epstein–Barr virus (EBV). egr1 is activated by Zta (BZLF1, ZEBRA). Zta interacts directly with DNA through a series of closely related Zta-response elements (ZREs). Here we dissect the mechanism used by Zta to interact with the egr1 promoter and identify a weak interaction with egr1ZRE that is dependent on the distal part of egr1ZRE. Furthermore, we demonstrate that the ability of Zta to interact with egr1ZRE is enhanced at least tenfold by methylation. The ability of Zta to transactivate a reporter construct driven by the egr1 promoter can be enhanced by methylation. As the ability of Zta to interact with a methylated ZRE in the EBV genome correlates with its ability to activate the expression of the endogenous viral gene BRLF1, this suggests that Zta may also have the capability to overturn epigenetic control of egr1.


Scientific Reports | 2015

Computational analysis of stochastic heterogeneity in PCR amplification efficiency revealed by single molecule barcoding

Katharine Best; Theres Oakes; James M. Heather; John Shawe-Taylor; Benny Chain

The polymerase chain reaction (PCR) is one of the most widely used techniques in molecular biology. In combination with High Throughput Sequencing (HTS), PCR is widely used to quantify transcript abundance for RNA-seq, and in the context of analysis of T and B cell receptor repertoires. In this study, we combine DNA barcoding with HTS to quantify PCR output from individual target molecules. We develop computational tools that simulate both the PCR branching process itself, and the subsequent subsampling which typically occurs during HTS sequencing. We explore the influence of different types of heterogeneity on sequencing output, and compare them to experimental results where the efficiency of amplification is measured by barcodes uniquely identifying each molecule of starting template. Our results demonstrate that the PCR process introduces substantial amplification heterogeneity, independent of primer sequence and bulk experimental conditions. This heterogeneity can be attributed both to inherited differences between different template DNA molecules, and the inherent stochasticity of the PCR process. The results demonstrate that PCR heterogeneity arises even when reaction and substrate conditions are kept as constant as possible, and therefore single molecule barcoding is essential in order to derive reproducible quantitative results from any protocol combining PCR with HTS.


Frontiers in Immunology | 2015

Dynamic Perturbations of the T-Cell Receptor Repertoire in Chronic HIV Infection and following Antiretroviral Therapy.

James M. Heather; Katharine Best; Theres Oakes; Eleanor R. Gray; Jennifer Roe; Niclas Thomas; Nir Friedman; Mahdad Noursadeghi; Benjamin M. Chain

HIV infection profoundly affects many parameters of the immune system and ultimately leads to AIDS, yet which factors are most important for determining resistance, pathology, and response to antiretroviral treatment – and how best to monitor them – remain unclear. We develop a quantitative high-throughput sequencing pipeline to characterize the TCR repertoires of HIV-infected individuals before and after antiretroviral therapy, working from small, unfractionated samples of peripheral blood. This reveals the TCR repertoires of HIV+ individuals to be highly perturbed, with considerably reduced diversity as a small proportion of sequences are highly overrepresented. HIV also causes specific qualitative changes to the repertoire including an altered distribution of V gene usage, depletion of public TCR sequences, and disruption of TCR networks. Short-term antiretroviral therapy has little impact on most of the global damage to repertoire structure, but is accompanied by rapid changes in the abundance of many individual TCR sequences, decreases in abundance of the most common sequences, and decreases in the majority of HIV-associated CDR3 sequences. Thus, high-throughput repertoire sequencing of small blood samples that are easy to take, store, and process can shed light on various aspects of the T-cell immune compartment and stands to offer insights into patient stratification and immune reconstitution.


Briefings in Bioinformatics | 2017

High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities

James M. Heather; Mazlina Ismail; Theres Oakes; Benny Chain

&NA; T‐cell specificity is determined by the T‐cell receptor, a heterodimeric protein coded for by an extremely diverse set of genes produced by imprecise somatic gene recombination. Massively parallel high‐throughput sequencing allows millions of different T‐cell receptor genes to be characterized from a single sample of blood or tissue. However, the extraordinary heterogeneity of the immune repertoire poses significant challenges for subsequent analysis of the data. We outline the major steps in processing of repertoire data, considering low‐level processing of raw sequence files and high‐level algorithms, which seek to extract biological or pathological information. The latest generation of bioinformatics tools allows millions of DNA sequences to be accurately and rapidly assigned to their respective variable V and J gene segments, and to reconstruct an almost error‐free representation of the non‐templated additions and deletions that occur. High‐level processing can measure the diversity of the repertoire in different samples, quantify V and J usage and identify private and public T‐cell receptors. Finally, we discuss the major challenge of linking T‐cell receptor sequence to function, and specifically to antigen recognition. Sophisticated machine learning algorithms are being developed that can combine the paradoxical degeneracy and cross‐reactivity of individual T‐cell receptors with the specificity of the overall T‐cell immune response. Computational analysis will provide the key to unlock the potential of the T‐cell receptor repertoire to give insight into the fundamental biology of the adaptive immune system and to provide powerful biomarkers of disease.


Bioinformatics | 2017

Feature selection using a one dimensional naïve Bayes’ classifier increases the accuracy of support vector machine classification of CDR3 repertoires

Mattia Cinelli; Yuxin Sun; Katharine Best; James M. Heather; Shlomit Reich-Zeliger; Eric Shifrut; Nir Friedman; John Shawe-Taylor; Benny Chain

Motivation: Somatic DNA recombination, the hallmark of vertebrate adaptive immunity, has the potential to generate a vast diversity of antigen receptor sequences. How this diversity captures antigen specificity remains incompletely understood. In this study we use high throughput sequencing to compare the global changes in T cell receptor &bgr; chain complementarity determining region 3 (CDR3&bgr;) sequences following immunization with ovalbumin administered with complete Freunds adjuvant (CFA) or CFA alone. Results: The CDR3&bgr; sequences were deconstructed into short stretches of overlapping contiguous amino acids. The motifs were ranked according to a one‐dimensional Bayesian classifier score comparing their frequency in the repertoires of the two immunization classes. The top ranking motifs were selected and used to create feature vectors which were used to train a support vector machine. The support vector machine achieved high classification scores in a leave‐one‐out validation test reaching >90% in some cases. Summary: The study describes a novel two‐stage classification strategy combining a one‐dimensional Bayesian classifier with a support vector machine. Using this approach we demonstrate that the frequency of a small number of linear motifs three amino acids in length can accurately identify a CD4 T cell response to ovalbumin against a background response to the complex mixture of antigens which characterize Complete Freunds Adjuvant. Availability and implementation: The sequence data is available at www.ncbi.nlm.nih.gov/sra/?term¼SRP075893. The Decombinator package is available at github.com/innate2adaptive/Decombinator. The R package e1071 is available at the CRAN repository https://cran.r‐project.org/web/packages/e1071/index.html. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Frontiers in Immunology | 2017

Quantitative Characterization of the T Cell Receptor Repertoire of Naïve and Memory Subsets Using an Integrated Experimental and Computational Pipeline Which Is Robust, Economical, and Versatile

Theres Oakes; James M. Heather; Katharine Best; Rachel Byng-Maddick; Connor Husovsky; Mazlina Ismail; Kroopa Joshi; Gavin Maxwell; Mahdad Noursadeghi; Natalie E. Riddell; Tabea Ruehl; Carolin T. Turner; Imran Uddin; Benny Chain

The T cell receptor (TCR) repertoire can provide a personalized biomarker for infectious and non-infectious diseases. We describe a protocol for amplifying, sequencing, and analyzing TCRs which is robust, sensitive, and versatile. The key experimental step is ligation of a single-stranded oligonucleotide to the 3′ end of the TCR cDNA. This allows amplification of all possible rearrangements using a single set of primers per locus. It also introduces a unique molecular identifier to label each starting cDNA molecule. This molecular identifier is used to correct for sequence errors and for effects of differential PCR amplification efficiency, thus producing more accurate measures of the true TCR frequency within the sample. This integrated experimental and computational pipeline is applied to the analysis of human memory and naive subpopulations, and results in consistent measures of diversity and inequality. After error correction, the distribution of TCR sequence abundance in all subpopulations followed a power law over a wide range of values. The power law exponent differed between naïve and memory populations, but was consistent between individuals. The integrated experimental and analysis pipeline we describe is appropriate to studies of T cell responses in a broad range of physiological and pathological contexts.


Journal of Immunology | 2016

Accelerated Loss of TCR Repertoire Diversity in Common Variable Immunodeficiency

Gabriel Wong; David Millar; Sarah Penny; James M. Heather; Punam Mistry; Nico Buettner; Jane Bryon; Aarnoud Huissoon; Mark Cobbold

Although common variable immunodeficiency (CVID) has long been considered as a group of primary Ab deficiencies, growing experimental data now suggest a global disruption of the entire adaptive immune response in a segment of patients. Oligoclonality of the TCR repertoire was previously demonstrated; however, the manner in which it relates to other B cell and T cell findings reported in CVID remains unclear. Using a combination approach of high-throughput TCRβ sequencing and multiparametric flow cytometry, we compared the TCR repertoire diversity between various subgroups of CVID patients according to their B cell immunophenotypes. Our data suggest that the reduction in repertoire diversity is predominantly restricted to those patients with severely reduced class-switched memory B cells and an elevated level of CD21lo B cells (Freiburg 1a), and may be driven by a reduced number of naive T cells unmasking underlying memory clonality. Moreover, our data indicate that this loss in repertoire diversity progresses with advancing age far exceeding the expected physiological rate. Radiological evidence supports the loss in thymic volume, correlating with the decrease in repertoire diversity. Evidence now suggests that primary thymic failure along with other well-described B cell abnormalities play an important role in the pathophysiology in Freiburg group 1a patients. Clinically, our findings emphasize the integration of combined B and T cell testing to identify those patients at the greatest risk for infection. Future work should focus on investigating the link between thymic failure and the severe reduction in class-switched memory B cells, while gathering longitudinal laboratory data to examine the progressive nature of the disease.

Collaboration


Dive into the James M. Heather's collaboration.

Top Co-Authors

Avatar

Katharine Best

University College London

View shared research outputs
Top Co-Authors

Avatar

Theres Oakes

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Benny Chain

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mazlina Ismail

University College London

View shared research outputs
Top Co-Authors

Avatar

Niclas Thomas

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge