Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Michael A. Quail is active.

Publication


Featured researches published by Michael A. Quail.


BMC Genomics | 2012

A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers

Michael A. Quail; Miriam Smith; Paul Coupland; Thomas D. Otto; Simon R. Harris; Thomas Richard Connor; Anna Bertoni; Harold Swerdlow; Yong Gu

BackgroundNext generation sequencing (NGS) technology has revolutionized genomic and genetic research. The pace of change in this area is rapid with three major new sequencing platforms having been released in 2011: Ion Torrent’s PGM, Pacific Biosciences’ RS and the Illumina MiSeq. Here we compare the results obtained with those platforms to the performance of the Illumina HiSeq, the current market leader. In order to compare these platforms, and get sufficient coverage depth to allow meaningful analysis, we have sequenced a set of 4 microbial genomes with mean GC content ranging from 19.3 to 67.7%. Together, these represent a comprehensive range of genome content. Here we report our analysis of that sequence data in terms of coverage distribution, bias, GC distribution, variant detection and accuracy.ResultsSequence generated by Ion Torrent, MiSeq and Pacific Biosciences technologies displays near perfect coverage behaviour on GC-rich, neutral and moderately AT-rich genomes, but a profound bias was observed upon sequencing the extremely AT-rich genome of Plasmodium falciparum on the PGM, resulting in no coverage for approximately 30% of the genome. We analysed the ability to call variants from each platform and found that we could call slightly more variants from Ion Torrent data compared to MiSeq data, but at the expense of a higher false positive rate. Variant calling from Pacific Biosciences data was possible but higher coverage depth was required. Context specific errors were observed in both PGM and MiSeq data, but not in that from the Pacific Biosciences platform.ConclusionsAll three fast turnaround sequencers evaluated here were able to generate usable sequence. However there are key differences between the quality of that data and the applications it will support.


Nature Methods | 2008

A large genome center's improvements to the Illumina sequencing system

Michael A. Quail; Iwanka Kozarewa; Frances Smith; Aylwyn Scally; Philip Stephens; Richard Durbin; Harold Swerdlow; Daniel J. Turner

The Wellcome Trust Sanger Institute is one of the worlds largest genome centers, and a substantial amount of our sequencing is performed with next-generation massively parallel sequencing technologies: in June 2008 the quantity of purity-filtered sequence data generated by our Genome Analyzer (Illumina) platforms reached 1 terabase, and our average weekly Illumina production output is currently 64 gigabases. Here we describe a set of improvements we have made to the standard Illumina protocols to make the library preparation more reliable in a high-throughput environment, to reduce bias, tighten insert size distribution and reliably obtain high yields of data.


Nature Methods | 2009

Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes

Iwanka Kozarewa; Zemin Ning; Michael A. Quail; Mandy Sanders; Matthew Berriman; Daniel J. Turner

Amplification artifacts introduced during library preparation for the Illumina Genome Analyzer increase the likelihood that an appreciable proportion of these sequences will be duplicates and cause an uneven distribution of read coverage across the targeted sequencing regions. As a consequence, these unfavorable features result in difficulties in genome assembly and variation analysis from the short reads, particularly when the sequences are from genomes with base compositions at the extremes of high or low G+C content. Here we present an amplification-free method of library preparation, in which the cluster amplification step, rather than the PCR, enriches for fully ligated template strands, reducing the incidence of duplicate sequences, improving read mapping and single nucleotide polymorphism calling and aiding de novo assembly. We illustrate this by generating and analyzing DNA sequences from extremely (G+C)-poor (Plasmodium falciparum), (G+C)-neutral (Escherichia coli) and (G+C)-rich (Bordetella pertussis) genomes.


BMC Genomics | 2012

Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.

Samuel O. Oyola; Thomas D. Otto; Yong-ping Gu; Gareth Maslen; Magnus Manske; Susana Campino; Daniel J. Turner; Bronwyn MacInnis; Dominic P. Kwiatkowski; Harold Swerdlow; Michael A. Quail

BackgroundMassively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences.ResultsWe have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates.ConclusionWe have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.


Cell Reports | 2015

Leukemia-Associated Somatic Mutations Drive Distinct Patterns of Age-Related Clonal Hemopoiesis

Thomas McKerrell; Naomi Park; Thaidy Moreno; C Grove; Hannes Ponstingl; Jonathan Stephens; Charles Crawley; Jenny I.O. Craig; Mike A. Scott; Clare Hodkinson; Joanna Baxter; Roland Rad; Duncan R. Forsyth; Michael A. Quail; Eleftheria Zeggini; Willem H. Ouwehand; Ignacio Varela; George S. Vassiliou

Summary Clonal hemopoiesis driven by leukemia-associated gene mutations can occur without evidence of a blood disorder. To investigate this phenomenon, we interrogated 15 mutation hot spots in blood DNA from 4,219 individuals using ultra-deep sequencing. Using only the hot spots studied, we identified clonal hemopoiesis in 0.8% of individuals under 60, rising to 19.5% of those ≥90 years, thus predicting that clonal hemopoiesis is much more prevalent than previously realized. DNMT3A-R882 mutations were most common and, although their prevalence increased with age, were found in individuals as young as 25 years. By contrast, mutations affecting spliceosome genes SF3B1 and SRSF2, closely associated with the myelodysplastic syndromes, were identified only in those aged >70 years, with several individuals harboring more than one such mutation. This indicates that spliceosome gene mutations drive clonal expansion under selection pressures particular to the aging hemopoietic system and explains the high incidence of clonal disorders associated with these mutations in advanced old age.


Nature Methods | 2015

G&T-seq: parallel sequencing of single-cell genomes and transcriptomes

Iain C. Macaulay; Wilfried Haerty; Parveen Kumar; Yang I. Li; Tim Xiaoming Hu; Mabel J Teng; Mubeen Goolam; Nathalie Saurat; Paul Coupland; Lesley Shirley; Miriam Smith; Niels Van der Aa; Ruby Banerjee; Peter Ellis; Michael A. Quail; Harold Swerdlow; Magdalena Zernicka-Goetz; Frederick J. Livesey; Chris P. Ponting; Thierry Voet

The simultaneous sequencing of a single cells genome and transcriptome offers a powerful means to dissect genetic variation and its effect on gene expression. Here we describe G&T-seq, a method for separating and sequencing genomic DNA and full-length mRNA from single cells. By applying G&T-seq to over 220 single cells from mice and humans, we discovered cellular properties that could not be inferred from DNA or RNA sequencing alone.


Nature Methods | 2012

Optimal enzymes for amplifying sequencing libraries.

Michael A. Quail; Thomas D. Otto; Yong Qiang Gu; Simon R. Harris; Thomas Skelly; Jacqueline McQuillan; Harold Swerdlow; Samuel O. Oyola

PCR amplification introduces bias into Illumina sequencing libraries1. Although amplification-free library preparation solves this, micrograms of starting material are usually required. Most researchers follow standard protocols using Phusion polymerase, which has processivity and fidelity advantages over most polymerases. Yet for genomics applications, our demands on DNA amplification systems often surpass their specification. Thermostable DNA polymerases such as Phusion are used to amplify mixtures of fragments, albeit with variable efficiency. Typically, (G+C)-neutral fragments are amplified with higher efficiency than extremely (G+C)-rich or (A+T)-rich fragments. The accumulation of these slight differences in amplification over multiple cycles often results in profound bias. There have been reports of using alternative DNA polymerases for Illumina library construction2, 3, 4, but these are infrequent, and comprehensive analyses are lacking. To reduce bias, we investigated many thermostable DNA polymerases and alternate reaction conditions for amplification of adapter-ligated fragments for Illumina sequencing. We expect this comparison to be relevant to other applications that involve simultaneous amplification of complex fragment mixtures.


Current protocols in human genetics | 2009

Improved Protocols for the Illumina Genome Analyzer Sequencing System

Michael A. Quail; Harold Swerdlow; Daniel J. Turner

In this unit, we describe a set of improvements we have made to the standard Illumina Genome Analyzer protocols to make the sequencing process more reliable in a high‐throughput environment, reduce amplification bias, narrow the distribution of insert sizes, and reliably obtain high yields of data. Curr. Protoc. Hum. Genet. 62:18.2.1‐18.2.27.


BioTechniques | 2012

Direct sequencing of small genomes on the Pacific Biosciences RS without library preparation.

Paul Coupland; Tamir Chandra; Michael A. Quail; Wolf Reik; Harold Swerdlow

We have developed a sequencing method on the Pacific Biosciences RS sequencer (the PacBio) for small DNA molecules that avoids the need for a standard library preparation. To date this approach has been applied toward sequencing single-stranded and double-stranded viral genomes, bacterial plasmids, plasmid vector models for DNA-modification analysis, and linear DNA fragments covering an entire bacterial genome. Using direct sequencing it is possible to generate sequence data from as little as 1 ng of DNA, offering a significant advantage over current protocols which typically require 400-500 ng of sheared DNA for the library preparation.


BMC Genomics | 2014

SASI-Seq: sample assurance Spike-Ins, and highly differentiating 384 barcoding for Illumina sequencing.

Michael A. Quail; Miriam Smith; David K. Jackson; Steven Leonard; Thomas Skelly; Harold Swerdlow; Yong Gu; Peter Ellis

BackgroundA minor but significant fraction of samples subjected to next-generation sequencing methods are either mixed-up or cross-contaminated. These events can lead to false or inconclusive results. We have therefore developed SASI-Seq; a process whereby a set of uniquely barcoded DNA fragments are added to samples destined for sequencing. From the final sequencing data, one can verify that all the reads derive from the original sample(s) and not from contaminants or other samples.ResultsBy adding a mixture of three uniquely barcoded amplicons, of different sizes spanning the range of insert sizes one would normally use for Illumina sequencing, at a spike-in level of approximately 0.1%, we demonstrate that these fragments remain intimately associated with the sample. They can be detected following even the tightest size selection regimes or exome enrichment and can report the occurrence of sample mix-ups and cross-contamination.As a consequence of this work, we have designed a set of 384 eleven-base Illumina barcode sequences that are at least 5 changes apart from each other, allowing for single-error correction and very low levels of barcode misallocation due to sequencing error.ConclusionSASI-Seq is a simple, inexpensive and flexible tool that enables sample assurance, allows deconvolution of sample mix-ups and reports levels of cross-contamination between samples throughout NGS workflows.

Collaboration


Dive into the Michael A. Quail's collaboration.

Top Co-Authors

Avatar

Harold Swerdlow

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Daniel J. Turner

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Miriam Smith

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Paul Coupland

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Iwanka Kozarewa

Institute of Cancer Research

View shared research outputs
Top Co-Authors

Avatar

Peter Ellis

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Samuel O. Oyola

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Simon R. Harris

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Yong Gu

Wellcome Trust Sanger Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge