Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Joanna Collins is active.

Publication


Featured researches published by Joanna Collins.


PLOS Biology | 2011

Modernizing Reference Genome Assemblies

Deanna M. Church; Valerie Schneider; Tina Graves; Katherine Auger; Fiona Cunningham; Nathan Bouk; Hsiu Chuan Chen; Richa Agarwala; William M. McLaren; Graham R. S. Ritchie; Derek Albracht; Milinn Kremitzki; Susan Rock; Holland Kotkiewicz; Colin Kremitzki; Aye Wollam; Lee Trani; Lucinda Fulton; Robert S. Fulton; Lucy Matthews; S. Whitehead; William Chow; James Torrance; Matthew Dunn; Glenn Harden; Glen Threadgold; Jonathan Wood; Joanna Collins; Paul Heath; Guy Griffiths

I have read the journals policy and have the following conflicts: Paul Flicek is married to the deputy editor of PLoS Medicine, Melissa Norton. Evan Eichler is on the board of Pacific Biosciences. Support for this work came from the Intramural Research Program of the NIH, The National Library of Medicine, the European Molecular Biology Laboratory, the Wellcome Trust (grant number 077198), and the Howard Hughes Medical Institute (EEE). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.


Genome Research | 2017

Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

Valerie Schneider; Tina A. Graves-Lindsay; Kerstin Howe; Nathan Bouk; Hsiu-Chuan Chen; Paul Kitts; Terence Murphy; Kim D. Pruitt; Françoise Thibaud-Nissen; Derek Albracht; Robert S. Fulton; Milinn Kremitzki; Vincent Magrini; Chris Markovic; Sean McGrath; Karyn Meltz Steinberg; Kate Auger; William Chow; Joanna Collins; Glenn Harden; Tim Hubbard; Sarah Pelan; Jared T. Simpson; Glen Threadgold; James Torrance; Jonathan Wood; Laura Clarke; Sergey Koren; Matthew Boitano; Paul Peluso

The human reference genome assembly plays a central role in nearly all aspects of todays basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health.


bioRxiv | 2018

Multiple laboratory mouse reference genomes define strain specific haplotypes and novel functional loci

Jingtao Lilue; Anthony G. Doran; Ian T Fiddes; Monica Abrudan; Joel Armstrong; Ruth Bennett; William Chow; Joanna Collins; Anne Czechanski; Petr Danecek; Mark Diekhans; Dirk-Dominic Dolle; Matthew Dunn; Richard Durbin; Dent Earl; Anne C. Ferguson-Smith; Paul Flicek; Jonathan Flint; Adam Frankish; Beiyuan Fu; Mark Gerstein; James Gilbert; Leo Goodstadt; Jennifer Harrow; Kerstin Howe; Mikhail Kolmogorov; Stefanie Koenig; Chris Lelliott; Jane Loveland; Richard Mott

The most commonly employed mammalian model organism is the laboratory mouse. A wide variety of genetically diverse inbred mouse strains, representing distinct physiological states, disease susceptibilities, and biological mechanisms have been developed over the last century. We report full length draft de novo genome assemblies for 16 of the most widely used inbred strains and reveal for the first time extensive strain-specific haplotype variation. We identify and characterise 2,567 regions on the current Genome Reference Consortium mouse reference genome exhibiting the greatest sequence diversity between strains. These regions are enriched for genes involved in defence and immunity, and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. Several immune related loci, some in previously identified QTLs for disease response have novel haplotypes not present in the reference that may explain the phenotype. We used these genomes to improve the mouse reference genome resulting in the completion of 10 new gene structures, and 62 new coding loci were added to the reference genome annotation. Notably this high quality collection of genomes revealed a previously unannotated gene (Efcab3-like) encoding 5,874 amino acids, one of the largest known in the rodent lineage. Interestingly, Efcab3-like−/− mice exhibit severe size anomalies in four regions of the brain suggesting a mechanism of Efcab3-like regulating brain development.


Nature Genetics | 2018

Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci

Jingtao Lilue; Anthony G. Doran; Ian T Fiddes; Monica Abrudan; Joel Armstrong; Ruth Bennett; William Chow; Joanna Collins; Stephan C. Collins; Anne Czechanski; Petr Danecek; Mark Diekhans; Dirk-Dominik Dolle; Matthew Dunn; Richard Durbin; Dent Earl; Anne C. Ferguson-Smith; Paul Flicek; Jonathan Flint; Adam Frankish; Beiyuan Fu; Mark Gerstein; James Gilbert; Leo Goodstadt; Jennifer Harrow; Kerstin Howe; Ximena Ibarra-Soria; Mikhail Kolmogorov; Chris Lelliott; Darren W. Logan

We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.Sequence assemblies for the genomes of 16 widely used inbred laboratory mouse strains highlight considerable strain-specific haplotype variation and allow for the identification of regions with the greatest sequence diversity between strains.


Nature | 1995

A high-density YAC contig map of human chromosome 22.

Joanna Collins; Charlotte G. Cole; Luc J. Smink; Garrett Cl; M. A. Leversha; Carol Soderlund; Maslen Gl; Everett La; Rice Km; Alison J. Coffey


Nature , 377 Su pp. 367-379. (1995) | 1995

A high resolution integrated yeast artificial chromosome clone map of human chromosome 22.

Joanna Collins; Charlotte G. Cole; Luc J. Smink; Cl Garrett; M. A. Leversha; Carol Soderlund; Gl Maslen; La Everett; Km Rice; Aj Coffey; S Gregory; R Gwilliam; A Dunham; Af Davies; S Hassock; Cm Todd; Hans Lehrach; Tjm Hulsebos; J Weissenbach; B Morrow; Raju Kucherlapati; R Wadey; Peter J. Scambler; U-J Kim; Melvin I. Simon; Nigel P. Carter; Richard Durbin; Jan P. Dumanski; Bentley; Ian Dunham

Collaboration


Dive into the Joanna Collins's collaboration.

Top Co-Authors

Avatar

William Chow

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Kerstin Howe

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Matthew Dunn

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar

Richard Durbin

Wellcome Trust Sanger Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dent Earl

University of California

View shared research outputs
Top Co-Authors

Avatar

Derek Albracht

Washington University in St. Louis

View shared research outputs
Top Co-Authors

Avatar

Ian T Fiddes

University of California

View shared research outputs
Top Co-Authors

Avatar

Joel Armstrong

University of California

View shared research outputs
Top Co-Authors

Avatar

Jonathan Flint

University of California

View shared research outputs
Researchain Logo
Decentralizing Knowledge