Patrick Marks
Pacific Biosciences
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Patrick Marks.
Science | 2009
John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald L. Cicero; Sonya Clark; Ravindra Dalal; Alex deWinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin
We present single-molecule, real-time sequencing data obtained from a DNA polymerase performing uninterrupted template-directed synthesis using four distinguishable fluorescently labeled deoxyribonucleoside triphosphates (dNTPs). We detected the temporal order of their enzymatic incorporation into a growing DNA strand with zero-mode waveguide nanostructure arrays, which provide optical observation volume confinement and enable parallel, simultaneous detection of thousands of single-molecule sequencing reactions. Conjugation of fluorophores to the terminal phosphate moiety of the dNTPs allows continuous observation of DNA synthesis over thousands of bases without steric hindrance. The data report directly on polymerase dynamics, revealing distinct polymerization states and pause sites corresponding to DNA secondary structure. Sequence data were aligned with the known reference sequence to assay biophysical parameters of polymerization for each template position. Consensus sequences were generated from the single-molecule reads at 15-fold coverage, showing a median accuracy of 99.3%, with no systematic error beyond fluorophore-dependent error rates.
Nature Methods | 2013
Chen Shan Chin; David H. Alexander; Patrick Marks; Aaron Klammer; James P Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E. Eichler; Stephen Turner; Jonas Korlach
We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph–based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
Proceedings of the National Academy of Sciences of the United States of America | 2008
Jonas Korlach; Patrick Marks; Ronald L. Cicero; Jeremy Gray; Devon Murphy; Daniel Roitman; Thang Pham; Geoff Otto; Mathieu Foquet; Stephen Turner
Optical nanostructures have enabled the creation of subdiffraction detection volumes for single-molecule fluorescence microscopy. Their applicability is extended by the ability to place molecules in the confined observation volume without interfering with their biological function. Here, we demonstrate that processive DNA synthesis thousands of bases in length was carried out by individual DNA polymerase molecules immobilized in the observation volumes of zero-mode waveguides (ZMWs) in high-density arrays. Selective immobilization of polymerase to the fused silica floor of the ZMW was achieved by passivation of the metal cladding surface using polyphosphonate chemistry, producing enzyme density contrasts of glass over aluminum in excess of 400:1. Yields of single-molecule occupancies of ≈30% were obtained for a range of ZMW diameters (70–100 nm). Results presented here support the application of immobilized single DNA polymerases in ZMW arrays for long-read-length DNA sequencing.
Nature Methods | 2016
Yulia Mostovoy; Michal Levy-Sakin; Jessica Lam; Ernest T. Lam; Alex Hastie; Patrick Marks; Joyce Lee; Catherine Chu; Chin Lin; Željko Džakula; Stephen A. Schlebusch; Kristina Giorda; Michael Schnall-Levin; Jeffrey D. Wall; Pui-Yan Kwok
Despite tremendous progress in genome sequencing, the basic goal of producing a phased (haplotype-resolved) genome sequence with end-to-end contiguity for each chromosome at reasonable cost and effort is still unrealized. In this study, we describe an approach to performing de novo genome assembly and experimental phasing by integrating the data from Illumina short-read sequencing, 10X Genomics linked-read sequencing, and BioNano Genomics genome mapping to yield a high-quality, phased, de novo assembled human genome.
PLOS ONE | 2015
Np Mayor; James Robinson; A.J. McWhinnie; Swati Ranade; Kevin Eng; William Midwinter; Will P. Bultitude; Chen-Shan Chin; Brett Bowman; Patrick Marks; Henny Braund; J. Alejandro Madrigal; Katy Latham; Steven G.E. Marsh
Allele-level resolution data at primary HLA typing is the ideal for most histocompatibility testing laboratories. Many high-throughput molecular HLA typing approaches are unable to determine the phase of observed DNA sequence polymorphisms, leading to ambiguous results. The use of higher resolution methods is often restricted due to cost and time limitations. Here we report on the feasibility of using Pacific Biosciences’ Single Molecule Real-Time (SMRT) DNA sequencing technology for high-resolution and high-throughput HLA typing. Seven DNA samples were typed for HLA-A, -B and -C. The results showed that SMRT DNA sequencing technology was able to generate sequences that spanned entire HLA Class I genes that allowed for accurate allele calling. Eight novel genomic HLA class I sequences were identified, four were novel alleles, three were confirmed as genomic sequence extensions and one corrected an existing genomic reference sequence. This method has the potential to revolutionize the field of HLA typing. The clinical impact of achieving this level of resolution HLA typing data is likely to considerable, particularly in applications such as organ and blood stem cell transplantation where matching donors and recipients for their HLA is of utmost importance.
bioRxiv | 2017
Patrick Marks; Sarah Garcia; Alvaro Martinez Barrio; Kamila Belhocine; Jorge Bernate; Rajiv Bharadwaj; Keith Bjornson; Claudia Catalanotti; Josh Delaney; Adrian Fehr; Brendan Galvin; Jill Herschleb; Christopher M. Hindson; Esty Holt; Cassandra Jabara; Susanna Jett; Nikka Keivanfar; Sofia Kyriazopoulou-Panagiotopoulou; Monkol Lek; Bill Lin; Adam J. Lowe; Shazia Mahamdallie; Shamoni Maheshwari; Tony Makarewicz; Jamie Marshall; Francesca Meschi; Chris O'keefe; Heather Ordonez; Pranav Patel; A J Price
Large-scale population based analyses coupled with advances in technology have demonstrated that the human genome is more diverse than originally thought. To date, this diversity has largely been uncovered using short read whole genome sequencing. However, standard short-read approaches, used primarily due to accuracy, throughput and costs, fail to give a complete picture of a genome. They struggle to identify large, balanced structural events, cannot access repetitive regions of the genome and fail to resolve the human genome into its two haplotypes. Here we describe an approach that retains long range information while harnessing the advantages of short reads. Starting from only ∼1ng of DNA, we produce barcoded short read libraries. The use of novel informatic approaches allows for the barcoded short reads to be associated with the long molecules of origin producing a novel datatype known as ‘Linked-Reads’. This approach allows for simultaneous detection of small and large variants from a single Linked-Read library. We have previously demonstrated the utility of whole genome Linked-Reads (lrWGS) for performing diploid, de novo assembly of individual genomes (Weisenfeld et al. 2017). In this manuscript, we show the advantages of Linked-Reads over standard short read approaches for reference based analysis. We demonstrate the ability of Linked-Reads to reconstruct megabase scale haplotypes and to recover parts of the genome that are typically inaccessible to short reads, including phenotypically important genes such as STRC, SMN1 and SMN2. We demonstrate the ability of both lrWGS and Linked-Read Whole Exome Sequencing (lrWES) to identify complex structural variations, including balanced events, single exon deletions, and single exon duplications. The data presented here show that Linked-Reads provide a scalable approach for comprehensive genome analysis that is not possible using short reads alone.
bioRxiv | 2017
Sarah Garcia; Stephen Williams; Andrew Wei Xu; Jill Herschleb; Patrick Marks; David Stafford; Deanna M. Church
Large genomic structural variants (>50bp) are important contributors to disease, yet they remain one of the most difficult types of variation to accurately ascertain, in part because they tend to cluster in duplicated and repetitive regions, but also because the various signals for these events can be challenging to detect with short reads. Clinically, aCGH and karyotype remain the most commonly used assays for genome-wide structural variant (SV) detection, though there is clear potential benefit to an NGS-based assay that accurately detects both SVs and single nucleotide variants. Linked-Read sequencing is a relatively simple, fast, and cost-effective method that is applicable to both genome and targeted assays. Linked-Reads are generated by performing haplotype-level dilution of long input DNA molecules into >1 million barcoded partitions, generating barcoded short reads within those partitions, and then performing short read sequencing in bulk. We performed 30x Linked-Read genome sequencing on a set of 23 samples with known balanced or unbalanced SVs. Twenty-seven of the 29 known events were detected and another event was called as a candidate. Sequence downsampling was performed on a subset to determine the lowest sequence depth required to detect variations. Copy-number variants can be called with as little as 1-2x sequencing depth (5-10Gb) while balanced events require on the order of 10x coverage for variant calls to be made, although specific signal is clearly present at 1-2x sequencing depth. In addition to detecting a full spectrum of variant types with a single test, Linked-Read sequencing provides base-level resolution of breakpoints, enabling complete resolution of even the most complex chromosomal rearrangements.
Cancer Research | 2016
Sofia Kyriazopoulou-Panagiotopoulou; Patrick Marks; Heather Ordonez; Kristina Giorda; Cassandra Jabara; Billy Lau; John M. Bell; Michael Schnall-Levin; Hanlee P. Ji
Studies have shown that somatic structural variation (SV) plays a key role in the oncogenic process. Traditionally SVs in the cancer genome have been detected using low resolution cytogenetic approaches, such as FISH, or microarray-based techniques. More recently, next-generation sequencing (NGS)-based technologies have been employed to detect SVs, including indels and translocations. However, both short- and long-read NGS-based approaches are limited in their ability to accurately identify SV events and delineate their breakpoints due to the limitations inherent in assembly of billions of short-read sequences across a heterogeneous cancer sample, as well as the costly and burdensome laboratory infrastructure associated with long-read sequencers. We utilized a novel technology that combines microfluidics and molecular barcoding to generate libraries that are sequenced with an Illumina system. Open-source bioinformatics software produces linked-reads that maintain long-range information and single molecule sensitivity. Cell lines and cancer samples were obtained from commercial sources, and genomic DNA was extracted. DNA sample indexing and partitioning was performed using the 10X Genomicx GemCode instrument. One ng of sample DNA was used as input for each reaction, and DNA molecules were partitioned into droplets to fragment the DNA and introduce molecular barcodes. Following barcoding, droplets were fractured, and library DNA was purified and sequenced on Illumina sequencers. The GemCode Long Ranger software suite was used to map sequencing reads back to original long molecules of DNA, generating reads linked to partition barcodes. Thus we can generate phased sequences covering many 109s to 1009s of kilobases. We first benchmarked the ability to call multiple SV types using a well-characterized germline HapMap sample (NA12878) as well as two recently characterized haploid hydatidiform moles (CHM1 and CHM13) that have been studied with multiple orthogonal technologies. Regions with evidence for structural variation were reassembled into distinct haplotypes. The barcode information allowed us to both phase the structural variants we detected and disambiguate calls within highly repetitive regions, such as segmental duplications. We demonstrated high concordance with alternative approaches across all major classes of SVs, including long insertions and deletions as well as copy-neutral events. In cancer cell lines, we detected well-annotated gene fusions, such as the EML4/ALK and ALK/PTPN3 fusions in the lung cancer cell line NCI-H2228, and the SLC26A/PRKAR2A fusion in the triple negative breast cancer cell line HCC38. Citation Format: Sofia Kyriazopoulou-Panagiotopoulou, Patrick Marks, Haynes Heaton, Heather Ordonez, Kristina Giorda, Cassandra Jabara, Billy Lau, John M. Bell, Michael Schnall-Levin, Hanlee P. Ji. Linked-Reads enable detailed, phased resolution of structural variation in the cancer genome. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 3602.
Archive | 2008
Austin B. Tomaney; Kenneth Mark Maxham; David Holden; Kevin Hester; Devon Murphy; Patrick Marks
Archive | 2009
Benjamin Flusberg; Jonas Korlach; Jeffrey Wegener; Tyson A. Clark; Igor Vilfan; Andrey Kislyuk; Stephen Turner; Jon Sorenson; Kevin Travers; Cheryl Heiner; Austin B. Tomaney; Patrick Marks; Jessica Lee; Lei Jia; Dale Webster; John Lyle; Jeremiah Hanes; Joseph Puglisi