Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Reed A. George is active.

Publication


Featured researches published by Reed A. George.


Genome Biology | 2002

Finishing a whole-genome shotgun: Release 3 of the Drosophila melanogaster euchromatic genome sequence

Susan E. Celniker; David A. Wheeler; Brent Kronmiller; Joseph W. Carlson; Aaron L. Halpern; Sandeep Patel; Mark D. Adams; Mark Champe; Shannon Dugan; Erwin Frise; Ann Hodgson; Reed A. George; Roger A. Hoskins; Todd R. Laverty; Donna M. Muzny; Catherine R. Nelson; Joanne Pacleb; Soo Park; Barret D. Pfeiffer; Stephen Richards; Erica Sodergren; Robert Svirskas; Paul E. Tabor; Kenneth H. Wan; Mark Stapleton; Granger Sutton; Craig Venter; George M. Weinstock; Steven E. Scherer; Eugene W. Myers

BackgroundThe Drosophila melanogaster genome was the first metazoan genome to have been sequenced by the whole-genome shotgun (WGS) method. Two issues relating to this achievement were widely debated in the genomics community: how correct is the sequence with respect to base-pair (bp) accuracy and frequency of assembly errors? And, how difficult is it to bring a WGS sequence to the accepted standard for finished sequence? We are now in a position to answer these questions.ResultsOur finishing process was designed to close gaps, improve sequence quality and validate the assembly. Sequence traces derived from the WGS and draft sequencing of individual bacterial artificial chromosomes (BACs) were assembled into BAC-sized segments. These segments were brought to high quality, and then joined to constitute the sequence of each chromosome arm. Overall assembly was verified by comparison to a physical map of fingerprinted BAC clones. In the current version of the 116.9 Mb euchromatic genome, called Release 3, the six euchromatic chromosome arms are represented by 13 scaffolds with a total of 37 sequence gaps. We compared Release 3 to Release 2; in autosomal regions of unique sequence, the error rate of Release 2 was one in 20,000 bp.ConclusionsThe WGS strategy can efficiently produce a high-quality sequence of a metazoan genome while generating the reagents required for sequence finishing. However, the initial method of repeat assembly was flawed. The sequence we report here, Release 3, is a reliable resource for molecular genetic experimentation and computational analysis.


Genome Biology | 2002

A Drosophila full-length cDNA resource

Mark Stapleton; Joe Carlson; Peter Brokstein; Charles Yu; Mark Champe; Reed A. George; Hannibal Guarin; Brent Kronmiller; Joanne Pacleb; Soo Park; Ken Wan; Gerald M. Rubin; Susan E. Celniker

BackgroundA collection of sequenced full-length cDNAs is an important resource both for functional genomics studies and for the determination of the intron-exon structure of genes. Providing this resource to the Drosophila melanogaster research community has been a long-term goal of the Berkeley Drosophila Genome Project. We have previously described the Drosophila Gene Collection (DGC), a set of putative full-length cDNAs that was produced by generating and analyzing over 250,000 expressed sequence tags (ESTs) derived from a variety of tissues and developmental stages.ResultsWe have generated high-quality full-insert sequence for 8,921 clones in the DGC. We compared the sequence of these clones to the annotated Release 3 genomic sequence, and identified more than 5,300 cDNAs that contain a complete and accurate protein-coding sequence. This corresponds to at least one splice form for 40% of the predicted D. melanogaster genes. We also identified potential new cases of RNA editing.ConclusionsWe show that comparison of cDNA sequences to a high-quality annotated genomic sequence is an effective approach to identifying and eliminating defective clones from a cDNA collection and ensure its utility for experimentation. Clones were eliminated either because they carry single nucleotide discrepancies, which most probably result from reverse transcriptase errors, or because they are truncated and contain only part of the protein-coding sequence.


Genome Biology | 2002

Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome

Casey M. Bergman; Barret D. Pfeiffer; Diego E. Rincon-Limas; Roger A. Hoskins; Andreas Gnirke; Chris Mungall; Adrienne M. Wang; Brent Kronmiller; Joanne Pacleb; Soo Park; Mark Stapleton; Kenneth H. Wan; Reed A. George; Pieter J. de Jong; Juan Botas; Gerald M. Rubin; Susan E. Celniker

BackgroundIt is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined.ResultsWe analyzed conservation in eight genomic regions (apterous, even-skipped, fushi tarazu, twist, and Rhodopsins 1, 2, 3 and 4) from four Drosophila species (D. erecta, D. pseudoobscura, D. willistoni, and D. littoralis) covering more than 500 kb of the D. melanogaster genome. All D. melanogaster genes (and 78-82% of coding exons) identified in divergent species such as D. pseudoobscura show evidence of functional constraint. Addition of a third species can reveal functional constraint in otherwise non-significant pairwise exon comparisons. Microsynteny is largely conserved, with rearrangement breakpoints, novel transposable element insertions, and gene transpositions occurring in similar numbers. Rates of amino-acid substitution are higher in uncharacterized genes relative to genes that have previously been studied. Conserved non-coding sequences (CNCSs) tend to be spatially clustered with conserved spacing between CNCSs, and clusters of CNCSs can be used to predict enhancer sequences.ConclusionsOur results provide the basis for choosing species whose genome sequences would be most useful in aiding the functional annotation of coding and cis-regulatory sequences in Drosophila. Furthermore, this work shows how decoding the spatial organization of conserved sequences, such as the clustering of CNCSs, can complement efforts to annotate eukaryotic genomes on the basis of sequence conservation alone.


Genome Research | 2015

The Release 6 reference sequence of the Drosophila melanogaster genome

Roger A. Hoskins; Joseph W. Carlson; Kenneth H. Wan; Soo Park; Ivonne Mendez; Samuel E. Galle; Benjamin W. Booth; Barret D. Pfeiffer; Reed A. George; Robert Svirskas; Martin Krzywinski; Jacqueline E. Schein; Maria Carmela Accardo; Elisabetta Damia; Giovanni Messina; Maria Mendez-Lago; Beatriz de Pablos; Olga V. Demakova; Evgeniya N. Andreyeva; Lidiya V. Boldyreva; Marco A. Marra; A. Bernardo Carvalho; Patrizio Dimitri; Alfredo Villasante; Igor F. Zhimulev; Gerald M. Rubin; Gary H. Karpen; Susan E. Celniker

Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.


Nucleic Acids Research | 2005

Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP)

Roger A. Hoskins; Mark Stapleton; Reed A. George; Charles Yu; Kenneth H. Wan; Joseph W. Carlson; Susan E. Celniker

cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5′- and 3′-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT–PCR approaches.


Nature Protocols | 2006

High-Throughput Plasmid cDNA Library Screening

Kenneth H. Wan; Charles Yu; Reed A. George; Joseph W. Carlson; Roger A. Hoskins; Robert Svirskas; Mark Stapleton; Susan E. Celniker

Libraries of cDNA clones are valuable resources for analyzing the expression, structure and regulation of genes, and for studying protein functions and interactions. Full-length cDNA clones provide information about intron and exon structures, splice junctions, and 5′ and 3′ untranslated regions (UTRs). Open reading frames (ORFs) derived from cDNA clones can be used to generate constructs allowing the expression of both wild-type proteins and proteins tagged at their amino or carboxy terminus. Thus, obtaining full-length cDNA clones and sequences for most or all genes in an organism is essential for understanding genome functions. EST sequencing samples cDNA libraries at random, an approach that is most useful at the beginning of large-scale screening projects. As projects progress towards completion, however, the probability of identifying unique cDNAs by EST sequencing diminishes, resulting in poor recovery of rare transcripts. Here we describe an adapted, high-throughput protocol intended for the recovery of specific, full-length clones from plasmid cDNA libraries in 5 d.


Science | 2000

Comparative genomics of the eukaryotes.

Gerald M. Rubin; Mark Yandell; Jennifer R. Wortman; George L. Gabor Miklos; Catherine R. Nelson; Iswar K. Hariharan; Mark E. Fortini; Peter Li; Rolf Apweiler; Wolfgang Fleischmann; J. Michael Cherry; Steven Henikoff; Marian Skupski; Sima Misra; Michael Ashburner; Ewan Birney; Mark S. Boguski; Thomas Brody; Peter Brokstein; Susan E. Celniker; Stephen A. Chervitz; David Coates; Anibal Cravchik; Andrei E. Gabrielian; Richard F. Galle; William M. Gelbart; Reed A. George; Lawrence S.B. Goldstein; Fangcheng Gong; Ping Guan


Genetics | 1999

An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region

Michael Ashburner; Sima Misra; John Roote; Suzanna E. Lewis; R. Blazej; Terence Davis; C. Doyle; Richard F. Galle; Reed A. George; Nomi L. Harris; G. Hartzell; D. Harvey; Ling Hong; K. Houston; Roger A. Hoskins; Glynnis Johnson; C. Martin; A. Moshrefi; M. Palazzolo; M. G. Reese; Allan C. Spradling; Garson Tsang; Kenneth H. Wan; K. Whitelaw; B. Kimmel; Susan E. Celniker; Gerald M. Rubin


Genome Research | 2002

The Drosophila gene collection: identification of putative full-length cDNAs for 70% of D. melanogaster genes.

Mark Stapleton; Guochun Liao; Peter Brokstein; Ling Hong; Piero Carninci; Toshiyuki Shiraki; Yoshihide Hayashizaki; Mark Champe; Joanne Pacleb; Ken Wan; Charles Yu; Joe Carlson; Reed A. George; Susan E. Celniker; Gerald M. Rubin


Science | 2000

A BAC-based physical map of the major autosomes of Drosophila melanogaster

Roger A. Hoskins; Catherine R. Nelson; Benjamin P. Berman; Todd R. Laverty; Reed A. George; Lisa Ciesiolka; Mohammed Naeemuddin; Andrew D. Arenson; James Durbin; Robert David; Paul E. Tabor; Michael R. Bailey; Denise R. DeShazo; Joseph J. Catanese; Aaron G. Mammoser; Kazutoyo Osoegawa; Pieter J. de; Jong; Susan E. Celniker; Richard A. Gibbs; Gerald M. Rubin; Steven E. Scherer

Collaboration


Dive into the Reed A. George's collaboration.

Top Co-Authors

Avatar

Susan E. Celniker

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Gerald M. Rubin

Howard Hughes Medical Institute

View shared research outputs
Top Co-Authors

Avatar

Roger A. Hoskins

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Kenneth H. Wan

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Mark Stapleton

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Charles Yu

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Joanne Pacleb

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Joseph W. Carlson

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Soo Park

Lawrence Berkeley National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Barret D. Pfeiffer

Howard Hughes Medical Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge