Publication


Featured research published by Alan R. Gingle.


Nature | 2009

The Sorghum bicolor genome and the diversification of grasses

Andrew H. Paterson; John E. Bowers; Rémy Bruggmann; Inna Dubchak; Jane Grimwood; Heidrun Gundlach; Georg Haberer; Uffe Hellsten; Therese Mitros; Alexander Poliakov; Jeremy Schmutz; Manuel Spannagl; Haibao Tang; Xiyin Wang; Thomas Wicker; Arvind K. Bharti; Jarrod Chapman; F. Alex Feltus; Udo Gowik; Igor V. Grigoriev; Eric Lyons; Christopher A. Maher; Mihaela Martis; Apurva Narechania; Robert Otillar; Bryan W. Penning; Asaf Salamov; Yu Wang; Lifang Zhang; Nicholas C. Carpita

Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the ∼75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization ∼70 million years ago, most duplicated gene sets lost one member before the sorghum–rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum’s drought tolerance.


Nucleic Acids Research | 2009

PlasmoDB: a functional genomic database for malaria parasites

Cristina Aurrecoechea; John Brestelli; Brian P. Brunk; Jennifer Dommer; Steve Fischer; Bindu Gajria; Xin Gao; Alan R. Gingle; Gregory R. Grant; Omar S. Harb; Mark Heiges; Frank Innamorato; John Iodice; Jessica C. Kissinger; Eileen Kraemer; Wei Li; John A. Miller; Vishal Nayak; Cary Pennington; Deborah F. Pinney; David S. Roos; Chris Ross; Christian J. Stoeckert; Charles Treatman; Haiming Wang

PlasmoDB (http://PlasmoDB.org) is a functional genomic database for Plasmodium spp. that provides a resource for data analysis and visualization on a gene-by-gene or genome-wide scale. PlasmoDB belongs to a family of genomic resources that are housed under the EuPathDB (http://EuPathDB.org) Bioinformatics Resource Center (BRC) umbrella. The latest release, PlasmoDB 5.5, contains numerous new data types from several broad categories: annotated genomes, evidence of transcription, proteomics evidence, protein function evidence, population biology and evolution. Data in PlasmoDB can be queried by selecting the data of interest from a query grid or drop-down menus. Various results can then be combined with each other on the query history page. Search results can be downloaded with associated functional data, and registered users can store their query history for future retrieval or analysis.


Nature | 2012

Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres

Andrew H. Paterson; Jonathan F. Wendel; Heidrun Gundlach; Hui Guo; Jerry Jenkins; Dianchuan Jin; Danny J. Llewellyn; Kurtis C. Showmaker; Shengqiang Shu; Mi-jeong Yoo; Robert L. Byers; Wei Chen; Adi Doron-Faigenboim; Mary V. Duke; Lei Gong; Jane Grimwood; Corrinne E. Grover; Kara Grupp; Guanjing Hu; Tae-Ho Lee; Jingping Li; Lifeng Lin; Tao Liu; Barry S. Marler; Justin T. Page; Alison W. Roberts; Elisson Romanel; William S. Sanders; Emmanuel Szadkowski; Xu Tan

Polyploidy often confers emergent properties, such as the higher fibre productivity and quality of tetraploid cottons than diploid cottons bred for the same environments. Here we show that an abrupt five- to sixfold ploidy increase approximately 60 million years (Myr) ago, and allopolyploidy reuniting divergent Gossypium genomes approximately 1–2 Myr ago, conferred about 30–36-fold duplication of ancestral angiosperm (flowering plant) genes in elite cottons (Gossypium hirsutum and Gossypium barbadense), genetic complexity equalled only by Brassica among sequenced angiosperms. Nascent fibre evolution, before allopolyploidy, is elucidated by comparison of spinnable-fibred Gossypium herbaceum A and non-spinnable Gossypium longicalyx F genomes to one another and the outgroup D genome of non-spinnable Gossypium raimondii. The sequence of a G. hirsutum AtDt (in which ‘t’ indicates tetraploid) cultivar reveals many non-reciprocal DNA exchanges between subgenomes that may have contributed to phenotypic innovation and/or other emergent properties such as ecological adaptation by polyploids. Most DNA-level novelty in G. hirsutum recombines alleles from the D-genome progenitor native to its New World habitat and the Old World A-genome progenitor in which spinnable fibre evolved. Coordinated expression changes in proximal groups of functionally distinct genes, including a nuclear mitochondrial DNA block, may account for clusters of cotton-fibre quantitative trait loci affecting diverse traits. Opportunities abound for dissecting emergent properties of other polyploids, particularly angiosperms, by comparison to diploid progenitors and outgroups.


Nucleic Acids Research | 2010

TriTrypDB: a functional genomic resource for the Trypanosomatidae

Martin Aslett; Cristina Aurrecoechea; Matthew Berriman; John Brestelli; Brian P. Brunk; Mark Carrington; Daniel P. Depledge; Steve Fischer; Bindu Gajria; Xin Gao; Malcolm J. Gardner; Alan R. Gingle; Greg Grant; Omar S. Harb; Mark Heiges; Christiane Hertz-Fowler; Robin Houston; Frank Innamorato; John Iodice; Jessica C. Kissinger; Eileen Kraemer; Wei Li; Flora J. Logan; John A. Miller; Siddhartha Mitra; Peter J. Myler; Vishal Nayak; Cary Pennington; Isabelle Phan; Deborah F. Pinney

TriTrypDB (http://tritrypdb.org) is an integrated database providing access to genome-scale datasets for kinetoplastid parasites, and supporting a variety of complex queries driven by research and development needs. TriTrypDB is a collaborative project, utilizing the GUS/WDK computational infrastructure developed by the Eukaryotic Pathogen Bioinformatics Resource Center (EuPathDB.org) to integrate genome annotation and analyses from GeneDB and elsewhere with a wide variety of functional genomics datasets made available by members of the global research community, often pre-publication. Currently, TriTrypDB integrates datasets from Leishmania braziliensis, L. infantum, L. major, L. tarentolae, Trypanosoma brucei and T. cruzi. Users may examine individual genes or chromosomal spans in their genomic context, including syntenic alignments with other kinetoplastid organisms. Data within TriTrypDB can be interrogated utilizing a sophisticated search strategy system that enables a user to construct complex queries combining multiple data types. All search strategies are stored, allowing future access and integrated searches. ‘User Comments’ may be added to any gene page, enhancing available annotation; such comments become immediately searchable via the text search, and are forwarded to curators for incorporation into the reference annotation when appropriate.


Plant Physiology | 2007

Toward Sequencing Cotton (Gossypium) Genomes

Z. Jeffrey Chen; Brian E. Scheffler; Elizabeth S. Dennis; Barbara A. Triplett; Tianzhen Zhang; Wangzhen Guo; Xiao-Ya Chen; David M. Stelly; Pablo D. Rabinowicz; Christopher D. Town; Tony Arioli; Curt L. Brubaker; Roy G. Cantrell; Jean Marc Lacape; Mauricio Ulloa; Peng Chee; Alan R. Gingle; Candace H. Haigler; Richard G. Percy; Sukumar Saha; Thea A. Wilkins; Robert J. Wright; Allen Van Deynze; Yuxian Zhu; Shuxun Yu; Ibrokhim Y. Abdurakhmonov; Ishwarappa S. Katageri; P. Ananda Kumar; Mehboob-ur-Rahman; Yusuf Zafar

Despite rapidly decreasing costs and innovative technologies, sequencing of angiosperm genomes is not yet undertaken lightly. Generating larger amounts of sequence data more quickly does not address the difficulties of sequencing and assembling complex genomes de novo. The cotton (Gossypium spp.)


Nucleic Acids Research | 2009

GiardiaDB and TrichDB: integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis

Cristina Aurrecoechea; John Brestelli; Brian P. Brunk; Jane M. Carlton; Jennifer Dommer; Steve Fischer; Bindu Gajria; Xin Gao; Alan R. Gingle; Gregory R. Grant; Omar S. Harb; Mark Heiges; Frank Innamorato; John Iodice; Jessica C. Kissinger; Eileen Kraemer; Wei Li; John A. Miller; Hilary G. Morrison; Vishal Nayak; Cary Pennington; Deborah F. Pinney; David S. Roos; Chris Ross; Christian J. Stoeckert; Steven A. Sullivan; Charles Treatman; Haiming Wang

GiardiaDB (http://GiardiaDB.org) and TrichDB (http://TrichDB.org) house the genome databases for Giardia lamblia and Trichomonas vaginalis, respectively, and represent the latest additions to the EuPathDB (http://EuPathDB.org) family of functional genomic databases. GiardiaDB and TrichDB employ the same framework as other EuPathDB sites (CryptoDB, PlasmoDB and ToxoDB), supporting fully integrated and searchable databases. Genomic-scale data available via these resources may be queried based on BLAST searches, annotation keywords and gene ID searches, GO terms, sequence motifs and other protein characteristics. Functional queries may also be formulated, based on transcript and protein expression data from a variety of platforms. Phylogenetic relationships may also be interrogated. The ability to combine the results from independent queries, and to store queries and query results for future use facilitates complex, genome-wide mining of functional genomic data.


Nucleic Acids Research | 2010

EuPathDB: a portal to eukaryotic pathogen databases

Cristina Aurrecoechea; John Brestelli; Brian P. Brunk; Steve Fischer; Bindu Gajria; Xin Gao; Alan R. Gingle; Gregory R. Grant; Omar S. Harb; Mark Heiges; Frank Innamorato; John Iodice; Jessica C. Kissinger; Eileen Kraemer; Wei Li; John A. Miller; Vishal Nayak; Cary Pennington; Deborah F. Pinney; David S. Roos; Chris Ross; Ganesh Srinivasamoorthy; Christian J. Stoeckert; Ryan Thibodeau; Charles Treatman; Haiming Wang

EuPathDB (http://EuPathDB.org; formerly ApiDB) is an integrated database covering the eukaryotic pathogens of the genera Cryptosporidium, Giardia, Leishmania, Neospora, Plasmodium, Toxoplasma, Trichomonas and Trypanosoma. While each of these groups is supported by a taxon-specific database built upon the same infrastructure, the EuPathDB portal offers an entry point to all these resources, and the opportunity to leverage orthology for searches across genera. The most recent release of EuPathDB includes updates and changes affecting data content, infrastructure and the user interface, improving data access and enhancing the user experience. EuPathDB currently supports more than 80 searches and the recently-implemented ‘search strategy’ system enables users to construct complex multi-step searches via a graphical interface. Search results are dynamically displayed as the strategy is constructed or modified, and can be downloaded, saved, revised, or shared with other database users.
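The multi-step 'search strategy' idea described in this abstract can be pictured as set operations applied to the gene lists returned by individual searches. The following is a minimal sketch under that assumption, not EuPathDB's actual implementation: the function name `run_strategy`, the operator labels, and all gene identifiers are hypothetical and chosen only for illustration.

```python
def run_strategy(first_step, later_steps):
    """Fold a list of (operator, gene_id_set) steps onto an initial result set."""
    # Assumed operator names; each maps to a standard Python set operation.
    operators = {
        "intersect": set.intersection,
        "union": set.union,
        "minus": set.difference,
    }
    result = set(first_step)
    for operator, gene_ids in later_steps:
        result = operators[operator](result, set(gene_ids))
    return result


# Hypothetical gene identifiers, for illustration only.
expressed = {"gene_1", "gene_2", "gene_3"}
has_signal_peptide = {"gene_2", "gene_3", "gene_4"}
already_characterized = {"gene_3"}

candidates = run_strategy(
    expressed,
    [("intersect", has_signal_peptide), ("minus", already_characterized)],
)
print(sorted(candidates))  # prints ['gene_2']
```

Because each step only consumes the running result and one new gene set, a step can be revised or appended without recomputing the individual searches.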


Nucleic Acids Research | 2013

EuPathDB: The Eukaryotic Pathogen database

Cristina Aurrecoechea; Ana Barreto; John Brestelli; Brian P. Brunk; Shon Cade; Ryan Doherty; Steve Fischer; Bindu Gajria; Xin Gao; Alan R. Gingle; Gregory R. Grant; Omar S. Harb; Mark Heiges; Sufen Hu; John Iodice; Jessica C. Kissinger; Eileen Kraemer; Wei Li; Deborah F. Pinney; Brian Pitts; David S. Roos; Ganesh Srinivasamoorthy; Christian J. Stoeckert; Haiming Wang; Susanne Warrenfeltz

EuPathDB (http://eupathdb.org) resources include 11 databases supporting eukaryotic pathogen genomic and functional genomic data, isolate data and phylogenomics. EuPathDB resources are built using the same infrastructure and provide a sophisticated search strategy system enabling complex interrogations of underlying data. Recent advances in EuPathDB resources include the design and implementation of a new data loading workflow, a new database supporting Piroplasmida (i.e. Babesia and Theileria), the addition of large amounts of new data and data types and the incorporation of new analysis tools. New data include genome sequences and annotation, strand-specific RNA-seq data, splice junction predictions (based on RNA-seq), phosphoproteomic data, high-throughput phenotyping data, single nucleotide polymorphism data based on high-throughput sequencing (HTS) and expression quantitative trait loci data. New analysis tools enable users to search for DNA motifs and define genes based on their genomic colocation, view results from searches graphically (i.e. genes mapped to chromosomes or isolates displayed on a map) and analyze data from columns in result tables (word cloud and histogram summaries of column content). The manuscript herein describes updates to EuPathDB since the previous report published in NAR in 2010.


Plant Physiology | 2005

Sorghum Expressed Sequence Tags Identify Signature Genes for Drought, Pathogenesis, and Skotomorphogenesis from a Milestone Set of 16,801 Unique Transcripts

Lee H. Pratt; Chun Liang; Manish Shah; Feng Sun; Haiming Wang; St Patrick Reid; Alan R. Gingle; Andrew H. Paterson; Rod A. Wing; Ralph A. Dean; Robert R. Klein; Henry T. Nguyen; Hong Mei Ma; Xin Zhao; Daryl T. Morishige; John E. Mullet; Marie Michèle Cordonnier-Pratt

Improved knowledge of the sorghum transcriptome will enhance basic understanding of how plants respond to stresses and serve as a source of genes of value to agriculture. Toward this goal, Sorghum bicolor L. Moench cDNA libraries were prepared from light- and dark-grown seedlings, drought-stressed plants, Colletotrichum-infected seedlings and plants, ovaries, embryos, and immature panicles. Other libraries were prepared with meristems from Sorghum propinquum (Kunth) Hitchc. that had been photoperiodically induced to flower, and with rhizomes from S. propinquum and johnsongrass (Sorghum halepense L. Pers.). A total of 117,682 expressed sequence tags (ESTs) were obtained representing both 3′ and 5′ sequences from about half that number of cDNA clones. A total of 16,801 unique transcripts, representing tentative UniScripts (TUs), were identified from 55,783 3′ ESTs. Of these TUs, 9,032 are represented by two or more ESTs. Collectively, these libraries were predicted to contain a total of approximately 31,000 TUs. Individual libraries, however, were predicted to contain no more than about 6,000 to 9,000, with the exception of light-grown seedlings, which yielded an estimate of close to 13,000. In addition, each library exhibits about the same level of complexity with respect to both the number of TUs preferentially expressed in that library and the frequency with which two or more ESTs are found in only that library. These results indicate that the sorghum genome is expressed in highly selective fashion in the individual organs and in response to the environmental conditions surveyed here. Close to 2,000 differentially expressed TUs were identified among the cDNA libraries examined, of which 775 were differentially expressed at a confidence level of 98%. From these 775 TUs, signature genes were identified defining drought, Colletotrichum infection, skotomorphogenesis (etiolation), ovary, immature panicle, and embryo.


G3: Genes, Genomes, Genetics | 2013

PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms

Justin T. Page; Alan R. Gingle

Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat—a pipeline for mapping and categorizing all types of next-generation sequence data produced from allopolyploid organisms. PolyCat uses GSNAP’s single-nucleotide polymorphism (SNP)-tolerant mapping to minimize the mapping efficiency bias caused by SNPs between genomes. PolyCat then uses SNPs between genomes to categorize reads according to their respective genomes. Bisulfite-treated reads have a significant reduction in nucleotide complexity because nucleotide conversion events are confounded with transition substitutions. PolyCat includes special provisions to properly handle bisulfite-treated data. We demonstrate the functionality of PolyCat on allotetraploid cotton, Gossypium hirsutum, and create a functional SNP index for efficiently mapping sequence reads to the D-genome sequence of G. raimondii. PolyCat is appropriate for all allopolyploids and all types of next-generation genome analysis, including differential expression (RNA sequencing), differential methylation (bisulfite sequencing), differential DNA-protein binding (chromatin immunoprecipitation sequencing), and population diversity.
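The categorization step described in this abstract, using SNPs between genomes to assign each mapped read to a genome, can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not PolyCat's code: the SNP index layout, the function name `categorize_read`, the category labels, and the positions are all hypothetical, reads are treated as ungapped alignments, and the bisulfite-specific handling mentioned in the abstract is left out.

```python
from collections import Counter

# Hypothetical SNP index: reference position -> (A-genome base, D-genome base).
SNP_INDEX = {105: ("A", "G"), 120: ("C", "T"), 133: ("G", "C")}


def categorize_read(start, sequence, snp_index):
    """Label an ungapped read 'A', 'D', 'X' (conflicting SNPs) or 'N' (no evidence)."""
    votes = Counter()
    for offset, base in enumerate(sequence):
        alleles = snp_index.get(start + offset)
        if alleles is None:
            continue  # this position is not a known SNP between the genomes
        a_base, d_base = alleles
        if base == a_base:
            votes["A"] += 1
        elif base == d_base:
            votes["D"] += 1
    if votes["A"] and votes["D"]:
        return "X"  # evidence for both genomes in one read
    if votes["A"]:
        return "A"
    if votes["D"]:
        return "D"
    return "N"  # read overlaps no informative SNP


# A read starting at position 100 that carries the A-genome base at both
# overlapping SNP positions (105 and 120).
read = "TTTTTA" + "T" * 14 + "C" + "TTTT"
print(categorize_read(100, read, SNP_INDEX))  # prints A
```

Once reads are labeled this way, each category can be analyzed separately with tools designed for diploid genomes, which is the motivation the abstract gives for the categorization.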

Collaboration


Top Co-Authors

Bindu Gajria
University of Pennsylvania

Brian P. Brunk
University of Pennsylvania

Deborah F. Pinney
University of Pennsylvania

John Brestelli
University of Pennsylvania

John Iodice
University of Pennsylvania