Arian Smit
University of Washington
Publications
Featured research published by Arian Smit.
Nature | 2005
Tarjei S. Mikkelsen; LaDeana W. Hillier; Evan E. Eichler; Michael C. Zody; David B. Jaffe; Shiaw-Pyng Yang; Wolfgang Enard; Ines Hellmann; Kerstin Lindblad-Toh; Tasha K. Altheide; Nicoletta Archidiacono; Peer Bork; Jonathan Butler; Jean L. Chang; Ze Cheng; Asif T. Chinwalla; Pieter J. de Jong; Kimberley D. Delehaunty; Catrina C. Fronick; Lucinda L. Fulton; Yoav Gilad; Gustavo Glusman; Sante Gnerre; Tina Graves; Toshiyuki Hayakawa; Karen E. Hayden; Xiaoqiu Huang; Hongkai Ji; W. James Kent; Mary Claire King
Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.
Science | 2010
Jared C. Roach; Gustavo Glusman; Arian Smit; Chad D. Huff; Robert Hubley; Paul Shannon; Lee Rowen; Krishna Pant; Nathan Goodman; Michael J. Bamshad; Jay Shendure; Radoje Drmanac; Lynn B. Jorde; Leroy Hood; David J. Galas
Runs in the Family: The power to detect mutations involved in disease by genome sequencing is enhanced when combined with the ability to discover specific mutations that may have arisen between offspring and parents. Roach et al. (p. 636, published online 10 March) present the sequence of a family with two offspring affected with two genetic disorders: Miller syndrome and primary ciliary dyskinesia. Sequence analysis of the children and their parents not only showed that the intergenerational mutation rate was lower than anticipated but also revealed recombination sites and the occurrence of rare polymorphisms.
Genomic sequencing of an entire family reveals the rate of spontaneous mutations in humans and identifies disease genes.
We analyzed the whole-genome sequences of a family of four, consisting of two siblings and their parents. Family-based sequencing allowed us to delineate recombination sites precisely, identify 70% of the sequencing errors (resulting in >99.999% accuracy), and identify very rare single-nucleotide polymorphisms. We also directly estimated a human intergeneration mutation rate of ~1.1 × 10^-8 per position per haploid genome. Both offspring in this family have two recessive disorders: Miller syndrome, for which the gene was concurrently identified, and primary ciliary dyskinesia, for which causative genes have been previously identified. Family-based genome analysis enabled us to narrow the candidate genes for both of these Mendelian disorders to only four. Our results demonstrate the value of complete genome sequencing in families.
Current Opinion in Genetics & Development | 1999
Arian Smit
The bulk of the human genome is ultimately derived from transposable elements. Observations in the past year lead to some new and surprising ideas on functions and consequences of these elements and their remnants in our genome. The many new examples of human genes derived from single transposon insertions highlight the large contribution of selfish DNA to genomic evolution.
Nature | 2003
James W. Thomas; Jeffrey W. Touchman; Robert W. Blakesley; Gerard G. Bouffard; Stephen M. Beckstrom-Sternberg; Elliott H. Margulies; Mathieu Blanchette; Adam Siepel; Pamela J. Thomas; Jennifer C. McDowell; Baishali Maskeri; Nancy F. Hansen; M. Schwartz; Ryan Weber; William Kent; Donna Karolchik; T. C. Bruen; R. Bevan; David J. Cutler; Scott Schwartz; Laura Elnitski; Jacquelyn R. Idol; A. B. Prasad; S. Q. Lee-Lin; Valerie Maduro; T. J. Summers; Matthew E. Portnoy; Nicole Dietrich; N. Akhter; K. Ayele
The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.
Nature | 2010
Wesley C. Warren; David F. Clayton; Hans Ellegren; Arthur P. Arnold; LaDeana W. Hillier; Axel Künstner; Steve Searle; Simon White; Albert J. Vilella; Susan Fairley; Andreas Heger; Lesheng Kong; Chris P. Ponting; Erich D. Jarvis; Claudio V. Mello; Patrick Minx; Peter V. Lovell; Tarciso Velho; Margaret Ferris; Christopher N. Balakrishnan; Saurabh Sinha; Charles Blatti; Sarah E. London; Yun Li; Ya-Chi Lin; Julia M. George; Jonathan V. Sweedler; Bruce R. Southey; Preethi H. Gunaratne; M. G. Watson
The zebra finch is an important model organism in several fields with unique relevance to human neuroscience. Like other songbirds, the zebra finch communicates through learned vocalizations, an ability otherwise documented only in humans and a few other animals and lacking in the chicken—the only bird with a sequenced genome until now. Here we present a structural, functional and comparative analysis of the genome sequence of the zebra finch (Taeniopygia guttata), which is a songbird belonging to the large avian order Passeriformes. We find that the overall structures of the genomes are similar in zebra finch and chicken, but they differ in many intrachromosomal rearrangements, lineage-specific gene family expansions, the number of long-terminal-repeat-based retrotransposons, and mechanisms of sex chromosome dosage compensation. We show that song behaviour engages gene regulatory networks in the zebra finch brain, altering the expression of long non-coding RNAs, microRNAs, transcription factors and their targets. We also show evidence for rapid molecular evolution in the songbird lineage of genes that are regulated during song experience. These results indicate an active involvement of the genome in neural processes underlying vocal communication and identify potential genetic substrates for the evolution and regulation of this behaviour.
Current Opinion in Genetics & Development | 1996
Arian Smit
Over a third of the human genome consists of interspersed repetitive sequences which are primarily degenerate copies of transposable elements. In the past year, the identities of many of these transposable elements were revealed. The emerging concept is that only three mechanisms of amplification are responsible for the vast majority of interspersed repeats and that with each autonomous element a number of dependent non-autonomous sequences have co-amplified.
Nucleic Acids Research | 2015
Kate R. Rosenbloom; Joel Armstrong; Galt P. Barber; Jonathan Casper; Hiram Clawson; Mark Diekhans; Timothy R. Dreszer; Pauline A. Fujita; Luvina Guruvadoo; Maximilian Haeussler; Rachel A. Harte; Steven G. Heitner; Glenn Hickey; Angie S. Hinrichs; Robert Hubley; Donna Karolchik; Katrina Learned; Brian T. Lee; Chin H. Li; Karen H. Miga; Ngan Nguyen; Benedict Paten; Brian J. Raney; Arian Smit; Matthew L. Speir; Ann S. Zweig; David Haussler; Robert M. Kuhn; W. James Kent
Launched in 2001 to showcase the draft human genome assembly, the UCSC Genome Browser database (http://genome.ucsc.edu) and associated tools continue to grow, providing a comprehensive resource of genome assemblies and annotations to scientists and students worldwide. Highlights of the past year include the release of a browser for the first new human genome reference assembly in 4 years in December 2013 (GRCh38, UCSC hg38), a watershed comparative genomics annotation (100-species multiple alignment and conservation) and a novel distribution mechanism for the browser (GBiB: Genome Browser in a Box). We created browsers for new species (Chinese hamster, elephant shark, minke whale), ‘mined the web’ for DNA sequences and expanded the browser display with stacked color graphs and region highlighting. As our user community increasingly adopts the UCSC track hub and assembly hub representations for sharing large-scale genomic annotation data sets and genome sequencing projects, our menu of public data hubs has tripled.
Nature | 2011
Devin P. Locke; LaDeana W. Hillier; Wesley C. Warren; Kim C. Worley; Lynne V. Nazareth; Donna M. Muzny; Shiaw-Pyng Yang; Zhengyuan Wang; Asif T. Chinwalla; Patrick Minx; Makedonka Mitreva; Lisa Cook; Kim D. Delehaunty; Catrina C. Fronick; Heather K. Schmidt; Lucinda A. Fulton; Robert S. Fulton; Joanne O. Nelson; Vincent Magrini; Craig S. Pohl; Tina Graves; Chris Markovic; Andy Cree; Huyen Dinh; Jennifer Hume; Christie Kovar; Gerald Fowler; Gerton Lunter; Stephen Meader; Andreas Heger
‘Orang-utan’ is derived from a Malay term meaning ‘man of the forest’ and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (Ne) expanded exponentially relative to the ancestral Ne after the split, while Bornean Ne declined over the same period. 
Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.
Nature | 2014
Lucia Carbone; R. Alan Harris; Sante Gnerre; Krishna R. Veeramah; Belen Lorente-Galdos; John Huddleston; Thomas J. Meyer; Javier Herrero; Christian Roos; Bronwen Aken; Fabio Anaclerio; Nicoletta Archidiacono; Carl Baker; Daniel Barrell; Mark A. Batzer; Kathryn Beal; Antoine Blancher; Craig Bohrson; Markus Brameier; Michael S. Campbell; Claudio Casola; Giorgia Chiatante; Andrew Cree; Annette Damert; Pieter J. de Jong; Laura Dumas; Marcos Fernandez-Callejo; Paul Flicek; Nina V. Fuchs; Ivo Gut
Gibbons are small arboreal apes that display an accelerated rate of evolutionary chromosomal rearrangement and occupy a key node in the primate phylogeny between Old World monkeys and great apes. Here we present the assembly and analysis of a northern white-cheeked gibbon (Nomascus leucogenys) genome. We describe the propensity for a gibbon-specific retrotransposon (LAVA) to insert into chromosome segregation genes and alter transcription by providing a premature termination site, suggesting a possible molecular mechanism for the genome plasticity of the gibbon lineage. We further show that the gibbon genera (Nomascus, Hylobates, Hoolock and Symphalangus) experienced a near-instantaneous radiation ∼5 million years ago, coincident with major geographical changes in southeast Asia that caused cycles of habitat compression and expansion. Finally, we identify signatures of positive selection in genes important for forelimb development (TBX5) and connective tissues (COL1A1) that may have been involved in the adaptation of gibbons to their arboreal habitat.
Immunity | 2001
Gustavo Glusman; Lee Rowen; Inyoul Lee; Cecilie Boysen; Jared C. Roach; Arian Smit; Kai Wang; Ben F. Koop; Leroy Hood
The availability of the complete genomic sequences of the human and mouse T cell receptor loci opens up new opportunities for understanding T cell receptors (TCRs) and their genes. The full complement of TCR gene segments is finally known and should prove a valuable resource for supporting functional studies. A rational nomenclature system has been implemented and is widely available through IMGT and other public databases. Systematic comparisons of the genomic sequences within each locus, between loci, and across species enable precise analyses of the various diversification mechanisms and some regulatory signals. The genomic landscape of the TCR loci provides fundamental insights into TCR evolution as highly localized and tightly regulated gene families.