T. P. L. Smith | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where T. P. L. Smith is active.

Explore More

Publication

Featured researches published by T. P. L. Smith.

PLOS ONE | 2009

Design of a high density SNP genotyping assay in the pig using SNPs identified and characterized by next generation sequencing technology

A. M. Ramos; R.P.M.A. Crooijmans; Nabeel A. Affara; Andreia J. Amaral; Alan Archibald; Jonathan E. Beever; Christian Bendixen; Carol Churcher; Richard Clark; Patrick Dehais; Mark Hansen; Jakob Hedegaard; Zhi-Liang Hu; Hindrik Hd Kerstens; Andy Law; Hendrik-Jan Megens; Denis Milan; D. J. Nonneman; G. A. Rohrer; Max F. Rothschild; T. P. L. Smith; Robert D. Schnabel; Curt P. Van Tassell; Jeremy F. Taylor; Ralph T Wiedmann; Lawrence B. Schook; M.A.M. Groenen

Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illuminas Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs.

PLOS ONE | 2009

Development and Characterization of a High Density SNP Genotyping Assay for Cattle

Lakshmi K. Matukumalli; Cynthia T. Lawley; Robert D. Schnabel; Jeremy F. Taylor; Mark F. Allan; Michael P. Heaton; Jeff O'Connell; Stephen S. Moore; T. P. L. Smith; Tad S. Sonstegard; Curtis P. Van Tassell

The success of genome-wide association (GWA) studies for the detection of sequence variation affecting complex traits in human has spurred interest in the use of large-scale high-density single nucleotide polymorphism (SNP) genotyping for the identification of quantitative trait loci (QTL) and for marker-assisted selection in model and agricultural species. A cost-effective and efficient approach for the development of a custom genotyping assay interrogating 54,001 SNP loci to support GWA applications in cattle is described. A novel algorithm for achieving a compressed inter-marker interval distribution proved remarkably successful, with median interval of 37 kb and maximum predicted gap of <350 kb. The assay was tested on a panel of 576 animals from 21 cattle breeds and six outgroup species and revealed that from 39,765 to 46,492 SNP are polymorphic within individual breeds (average minor allele frequency (MAF) ranging from 0.24 to 0.27). The assay also identified 79 putative copy number variants in cattle. Utility for GWA was demonstrated by localizing known variation for coat color and the presence/absence of horns to their correct genomic locations. The combination of SNP selection and the novel spacing algorithm allows an efficient approach for the development of high-density genotyping platforms in species having full or even moderate quality draft sequence. Aspects of the approach can be exploited in species which lack an available genome sequence. The BovineSNP50 assay described here is commercially available from Illumina and provides a robust platform for mapping disease genes and QTL in cattle.

Science | 2009

Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle Breeds

Ra Gibbs; Jf Taylor; Cp Van Tassel; W. Barendse; Ka Eversole; Ca Gill; Rd Green; Dl Hamernik; Sm Kappes; Sigbjørn Lien; Lk Matukumalli; Jc Mcevan; Lv Mazareth; Rd Schnabel; Gm Weinstock; Da Wheeler; Paolo Ajmone Marsan; Pj Boettcher; Ar Caetano; Jf Garcia; Olivier Hanotte; Paola Mariani; Lc Skow; Ts Sonstegard; Jl Williams; B Diallo; L Hailemariam; Ml Martinez; Ca Morris; Lo Silva

A survey of genetic diversity of cattle suggests two domestication events in Asia and selection by husbandry. Not Just Dinner on Legs Several thousand years ago, human beings realized the virtues of domesticating wild animals as easy meat. Soon other possibilities became apparent, and as revealed in a series of papers in this issue, early pastoralists became selective about breeding for wool, leather, milk, and muscle power. In two papers, Gibbs et al. report on the bovine genome sequence (p. 522; see the cover, the Perspective by Lewin, and the Policy Forum by Roberts) and trace the diversity and genetic history of cattle (p. 528), while Chessa et al. (p. 532) survey the occurrence of endogenous retroviruses in sheep and map their distribution to historical waves of human selection and dispersal across Europe. Finally, Ludwig et al. (p. 485) note the origins of variation in the coat-color of horses and suggest that it is most likely to have been selected for by humans in need of good-looking transport. The imprints of domestication and breed development on the genomes of livestock likely differ from those of companion animals. A deep draft sequence assembly of shotgun reads from a single Hereford female and comparative sequences sampled from six additional breeds were used to develop probes to interrogate 37,470 single-nucleotide polymorphisms (SNPs) in 497 cattle from 19 geographically and biologically diverse breeds. These data show that cattle have undergone a rapid recent decrease in effective population size from a very large ancestral population, possibly due to bottlenecks associated with domestication, selection, and breed formation. Domestication and artificial selection appear to have left detectable signatures of selection within the cattle genome, yet the current levels of diversity within breeds are at least as great as exists within humans.

Nature Methods | 2008

SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries.

Curtis P. Van Tassell; T. P. L. Smith; Lakshmi K. Matukumalli; Jeremy F Taylor; Robert D. Schnabel; Cynthia T. Lawley; Christian D. Haudenschild; Stephen S. Moore; Wesley C. Warren; Tad S. Sonstegard

High-density single-nucleotide polymorphism (SNP) arrays have revolutionized the ability of genome-wide association studies to detect genomic regions harboring sequence variants that affect complex traits. Extensive numbers of validated SNPs with known allele frequencies are essential to construct genotyping assays with broad utility. We describe an economical, efficient, single-step method for SNP discovery, validation and characterization that uses deep sequencing of reduced representation libraries (RRLs) from specified target populations. Using nearly 50 million sequences generated on an Illumina Genome Analyzer from DNA of 66 cattle representing three populations, we identified 62,042 putative SNPs and predicted their allele frequencies. Genotype data for these 66 individuals validated 92% of 23,357 selected genome-wide SNPs, with a genotypic and sequence allele frequency correlation of r = 0.67. This approach for simultaneous de novo discovery of high-quality SNPs and population characterization of allele frequencies may be applied to any species with at least a partially sequenced genome.

Genome Biology | 2013

Reducing assembly complexity of microbial genomes with single-molecule sequencing.

Sergey Koren; Gregory P. Harhay; T. P. L. Smith; James L. Bono; Dayna M. Harhay; Scott McVey; Diana Radune; Nicholas H. Bergman; Adam M. Phillippy

BackgroundThe short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem.ResultsTo measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads.ConclusionsAutomated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to

Genome Research | 2010

Analysis of copy number variations among diverse cattle breeds

George E. Liu; Yali Hou; Bin Zhu; Maria Francesca Cardone; Lu Jiang; Angelo Cellamare; Apratim Mitra; L. J. Alexander; Luiz Lehmann Coutinho; Maria Elena Dell'Aquila; Lou C. Gasbarre; Gianni Lacalandra; Robert W. Li; Lakshmi K. Matukumalli; Dan J. Nonneman; Luciana Correia de Almeida Regitano; T. P. L. Smith; Jiuzhou Song; Tad S. Sonstegard; Curt P. Van Tassell; Mario Ventura; Evan E. Eichler; Tara G. McDaneld; J. W. Keele

1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.

Mammalian Genome | 2002

Selection and use of SNP markers for animal identification and paternity analysis in U.S. beef cattle

Michael P. Heaton; Gregory P. Harhay; G. L. Bennett; R. T. Stone; W. Michael Grosse; E. Casas; J. W. Keele; T. P. L. Smith; Carol G. Chitko-McKown; William W. Laegreid

Genomic structural variation is an important and abundant source of genetic and phenotypic variation. Here, we describe the first systematic and genome-wide analysis of copy number variations (CNVs) in modern domesticated cattle using array comparative genomic hybridization (array CGH), quantitative PCR (qPCR), and fluorescent in situ hybridization (FISH). The array CGH panel included 90 animals from 11 Bos taurus, three Bos indicus, and three composite breeds for beef, dairy, or dual purpose. We identified over 200 candidate CNV regions (CNVRs) in total and 177 within known chromosomes, which harbor or are adjacent to gains or losses. These 177 high-confidence CNVRs cover 28.1 megabases or approximately 1.07% of the genome. Over 50% of the CNVRs (89/177) were found in multiple animals or breeds and analysis revealed breed-specific frequency differences and reflected aspects of the known ancestry of these cattle breeds. Selected CNVs were further validated by independent methods using qPCR and FISH. Approximately 67% of the CNVRs (119/177) completely or partially span cattle genes and 61% of the CNVRs (108/177) directly overlap with segmental duplications. The CNVRs span about 400 annotated cattle genes that are significantly enriched for specific biological functions, such as immunity, lactation, reproduction, and rumination. Multiple gene families, including ULBP, have gone through ruminant lineage-specific gene amplification. We detected and confirmed marked differences in their CNV frequencies across diverse breeds, indicating that some cattle CNVs are likely to arise independently in breeds and contribute to breed differences. Our results provide a valuable resource beyond microsatellites and single nucleotide polymorphisms to explore the full dimension of genetic variability for future cattle genomic research.

Genome Biology | 2013

Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

Daniël P. Melters; Keith Bradnam; Hugh A. Young; Natalie Telis; Michael R. May; J. Graham Ruby; Robert Sebra; Paul Peluso; John Eid; David Rank; José Fernando Garcia; Joseph L. DeRisi; T. P. L. Smith; Christian M. Tobias; Jeffrey Ross-Ibarra; Ian Korf; Simon W. L. Chan

Abstract. DNA marker technology represents a promising means for determining the genetic identity and kinship of an animal. Compared with other types of DNA markers, single nucleotide polymorphisms (SNPs) are attractive because they are abundant, genetically stable, and amenable to high-throughput automated analysis. In cattle, the challenge has been to identify a minimal set of SNPs with sufficient power for use in a variety of popular breeds and crossbred populations. This report describes a set of 32 highly informative SNP markers distributed among 18 autosomes and both sex chromosomes. Informativity of these SNPs in U.S. beef cattle populations was estimated from the distribution of allele and genotype frequencies in two panels: one consisting of 96 purebred sires representing 17 popular breeds, and another with 154 purebred American Angus from six herds in four Midwestern states. Based on frequency data from these panels, the estimated probability that two randomly selected, unrelated individuals will possess identical genotypes for all 32 loci was 2.0 × 10−13 for multi-breed composite populations and 1.9 × 10−10 for purebred Angus populations. The probability that a randomly chosen candidate sire will be excluded from paternity was estimated to be 99.9% and 99.4% for the same respective populations. The DNA immediately surrounding the 32 target SNPs was sequenced in the 96 sires of the multi-breed panel and found to contain an additional 183 polymorphic sites. Knowledge of these additional sites, together with the 32 target SNPs, allows the design of robust, accurate genotype assays on a variety of high-throughput SNP genotyping platforms.

Genome Research | 2010

Detecting copy number variation with mated short reads.

Paul Medvedev; Marc Fiume; Misko Dzamba; T. P. L. Smith; Michael Brudno

BackgroundCentromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data.ResultsOur methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution.ConclusionsWhile centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

BMC Molecular Biology | 2008

The Artiodactyl APOBEC3 Innate Immune Repertoire Shows Evidence for a Multi-Functional Domain Organization that Existed in the Ancestor of Placental Mammals

Rebecca S. LaRue; Stefán R. Jónsson; Kevin A. T. Silverstein; Mathieu Lajoie; Denis Bertrand; Nadia El-Mabrouk; Isidro Hötzel; Valgerdur Andrésdóttir; T. P. L. Smith; Reuben S. Harris

The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can also be used for the reliable identification of large copy-variable regions. Such methods, however, are hindered by sequencing biases that lead certain regions of the genome to be over- or undersampled, lowering their resolution and ability to accurately identify the exact breakpoints of the variants. In this work, we develop a method for CNV detection that supplements the depth-of-coverage with paired-end mapping information, where mate pairs mapping discordantly to the reference serve to indicate the presence of variation. Our algorithm, called CNVer, combines this information within a unified computational framework called the donor graph, allowing us to better mitigate the sequencing biases that cause uneven local coverage and accurately predict CNVs. We use CNVer to detect 4879 CNVs in the recently described genome of a Yoruban individual. Most of the calls (77%) coincide with previously known variants within the Database of Genomic Variants, while 81% of deletion copy number variants previously known for this individual coincide with one of our loss calls. Furthermore, we demonstrate that CNVer can reconstruct the absolute copy counts of segments of the donor genome and evaluate the feasibility of using CNVer with low coverage datasets.

Explore More