Trevor J. Pemberton
University of Manitoba
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Trevor J. Pemberton.
American Journal of Human Genetics | 2010
Trevor J. Pemberton; Chaolong Wang; Jun Li; Noah A. Rosenberg
The International Haplotype Map Project (HapMap) has provided an essential database for studies of human population genetics and genome-wide association. Phases I and II of the HapMap project generated genotype data across ∼3 million SNP loci in 270 individuals representing four populations. Phase III provides dense genotype data on ∼1.5 million SNPs, generated by Illumina and Affymetrix platforms in a larger set of individuals. Release 3 of phase III of the HapMap contains 1397 individuals from 11 populations, including 250 of the original 270 phase I and phase II individuals and 1147 additional individuals. Although some known relationships among the phase III individuals have been described in the data release, the genotype data that are currently available provide an opportunity to empirically ascertain previously unknown relationships. We performed a systematic analysis of genetic relatedness and were able not only to confirm the reported relationships, but also to detect numerous additional, previously unidentified pairs of close relatives in the HapMap sample. The inferred relative pairs make it possible to propose standardized subsets of unrelated individuals for use in future studies in which relatedness needs to be clearly defined.
American Journal of Human Genetics | 2013
Zachary A. Szpiech; Jishu Xu; Trevor J. Pemberton; Weiping Peng; Sebastian Zöllner; Noah A. Rosenberg; Jun Li
Exome sequencing offers the potential to study the population-genomic variables that underlie patterns of deleterious variation. Runs of homozygosity (ROH) are long stretches of consecutive homozygous genotypes probably reflecting segments shared identically by descent as the result of processes such as consanguinity, population size reduction, and natural selection. The relationship between ROH and patterns of predicted deleterious variation can provide insight into the way in which these processes contribute to the maintenance of deleterious variants. Here, we use exome sequencing to examine ROH in relation to the distribution of deleterious variation in 27 individuals of varying levels of apparent inbreeding from 6 human populations. A significantly greater fraction of all genome-wide predicted damaging homozygotes fall in ROH than would be expected from the corresponding fraction of nondamaging homozygotes in ROH (p < 0.001). This pattern is strongest for long ROH (p < 0.05). ROH, and especially long ROH, harbor disproportionately more deleterious homozygotes than would be expected on the basis of the total ROH coverage of the genome and the genomic distribution of nondamaging homozygotes. The results accord with a hypothesis that recent inbreeding, which generates long ROH, enables rare deleterious variants to exist in homozygous form. Thus, just as inbreeding can elevate the occurrence of rare recessive diseases that represent homozygotes for strongly deleterious mutations, inbreeding magnifies the occurrence of mildly deleterious variants as well.
G3: Genes, Genomes, Genetics | 2013
Trevor J. Pemberton; Michael DeGiorgio; Noah A. Rosenberg
Over the past two decades, microsatellite genotypes have provided the data for landmark studies of human population-genetic variation. However, the various microsatellite data sets have been prepared with different procedures and sets of markers, so that it has been difficult to synthesize available data for a comprehensive analysis. Here, we combine eight human population-genetic data sets at the 645 microsatellite loci they share in common, accounting for procedural differences in the production of the different data sets, to assemble a single data set containing 5795 individuals from 267 worldwide populations. We perform a systematic analysis of genetic relatedness, detecting 240 intra-population and 92 inter-population pairs of previously unidentified close relatives and proposing standardized subsets of unrelated individuals for use in future studies. We then augment the human data with a data set of 84 chimpanzees at the 246 loci they share in common with the human samples. Multidimensional scaling and neighbor-joining analyses of these data sets offer new insights into the structure of human populations and enable a comparison of genetic variation patterns in chimpanzees with those in humans. Our combined data sets are the largest of their kind reported to date and provide a resource for use in human population-genetic studies.
American Journal of Medical Genetics Part A | 2007
Patrick Tarpey; Trevor J. Pemberton; David W. Stockton; Parimal Das; Vasiliki Ninis; Sarah Edkins; P. Andrew Futreal; Richard Wooster; Sushanth Kamath; Rabindra Nayak; Michael R. Stratton; Pragna Patel
Patrick Tarpey, Trevor J. Pemberton, David W. Stockton, Parimal Das, Vasiliki Ninis, Sarah Edkins, P. Andrew Futreal, Richard Wooster, Sushanth Kamath, Rabindra Nayak, Michael R. Stratton, and Pragna I. Patel* Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK Institute for Genetic Medicine, University of Southern California, Los Angeles, California Center for Craniofacial Molecular Biology, University of Southern California, Los Angeles, California Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas Department of Internal Medicine, Baylor College of Medicine, Houston, Texas Department of Ophthalmology, Baylor College of Medicine, Houston, Texas Department of Neurology, Baylor College of Medicine, Houston, Texas Kasturba Medical College, Mangalore, India Ambedkar Medical College, Kadugondanahalli, Bangalore, India Departments of Pediatrics and Internal Medicine, Wayne State University School of Medicine, Detroit, Michigan
Comparative and Functional Genomics | 2005
Trevor J. Pemberton; John E. Kay
The peptidyl-prolyl cis/trans isomerase (PPIase) class of proteins comprises three member families that are found throughout nature and are present in all the major compartments of the cell. Their numbers appear to be linked to the number of genes in their respective genomes, although we have found the human repertoire to be smaller than expected due to a reduced cyclophilin repertoire. We show here that whilst the members of the cyclophilin family (which are predominantly found in the nucleus and cytoplasm) and the parvulin family (which are predominantly nuclear) are largely conserved between different repertoires, the FKBPs (which are predominantly found in the cytoplasm and endoplasmic reticulum) are not. It therefore appears that the cyclophilins and parvulins have evolved to perform conserved functions, while the FKBPs have evolved to fill ever-changing niches within the constantly evolving organisms. Many orthologous subgroups within the different PPIase families appear to have evolved from a distinct common ancestor, whereas others, such as the mitochondrial cyclophilins, appear to have evolved independently of one another. We have also identified a novel parvulin within Drosophila melanogaster that is unique to the fruit fly, indicating a recent evolutionary emergence. Interestingly, the fission yeast repertoire, which contains no unique cyclophilins and parvulins, shares no PPIases solely with the budding yeast but it does share a majority with the higher eukaryotes in this study, unlike the budding yeast. It therefore appears that, in comparison with Schizosaccharomyces pombe, Saccharomyces cerevisiae is a poor representation of the higher eukaryotes for the study of PPIases.
BMC Genomics | 2009
Trevor J. Pemberton; Conner I. Sandefur; Mattias Jakobsson; Noah A. Rosenberg
BackgroundMicrosatellite loci are frequently used in genomic studies of DNA sequence repeats and in population studies of genetic variability. To investigate the effect of sequence properties of microsatellites on their level of variability we have analyzed genotypes at 627 microsatellite loci in 1,048 worldwide individuals from the HGDP-CEPH cell line panel together with the DNA sequences of these microsatellites in the human RefSeq database.ResultsCalibrating PCR fragment lengths in individual genotypes by using the RefSeq sequence enabled us to infer repeat number in the HGDP-CEPH dataset and to calculate the mean number of repeats (as opposed to the mean PCR fragment length), under the assumption that differences in PCR fragment length reflect differences in the numbers of repeats in the embedded repeat sequences. We find the mean and maximum numbers of repeats across individuals to be positively correlated with heterozygosity. The size and composition of the repeat unit of a microsatellite are also important factors in predicting heterozygosity, with tetra-nucleotide repeat units high in G/C content leading to higher heterozygosity. Finally, we find that microsatellites containing more separate sets of repeated motifs generally have higher heterozygosity.ConclusionsThese results suggest that sequence properties of microsatellites have a significant impact in determining the features of human microsatellite variability.
Annals of Human Genetics | 2008
Trevor J. Pemberton; Mattias Jakobsson; Donald F. Conrad; Graham Coop; Jeffrey D. Wall; Jonathan K. Pritchard; Pragna Patel; Noah A. Rosenberg
When performing association studies in populations that have not been the focus of large‐scale investigations of haplotype variation, it is often helpful to rely on genomic databases in other populations for study design and analysis – such as in the selection of tag SNPs and in the imputation of missing genotypes. One way of improving the use of these databases is to rely on a mixture of database samples that is similar to the population of interest, rather than using the single most similar database sample. We demonstrate the effectiveness of the mixture approach in the application of African, European, and East Asian HapMap samples for tag SNP selection in populations from India, a genetically intermediate region underrepresented in genomic studies of haplotype variation.
PLOS Genetics | 2014
Paul Verdu; Trevor J. Pemberton; Romain Laurent; Brian M. Kemp; Angélica González-Oliver; Clara Gorodezky; Cris E. Hughes; Milena R. Shattuck; Barbara Petzelt; Joycelynn Mitchell; Harold Harry; Theresa William; Rosita Worl; Jerome S. Cybulski; Noah A. Rosenberg; Ripan S. Malhi
The initial contact of European populations with indigenous populations of the Americas produced diverse admixture processes across North, Central, and South America. Recent studies have examined the genetic structure of indigenous populations of Latin America and the Caribbean and their admixed descendants, reporting on the genomic impact of the history of admixture with colonizing populations of European and African ancestry. However, relatively little genomic research has been conducted on admixture in indigenous North American populations. In this study, we analyze genomic data at 475,109 single-nucleotide polymorphisms sampled in indigenous peoples of the Pacific Northwest in British Columbia and Southeast Alaska, populations with a well-documented history of contact with European and Asian traders, fishermen, and contract laborers. We find that the indigenous populations of the Pacific Northwest have higher gene diversity than Latin American indigenous populations. Among the Pacific Northwest populations, interior groups provide more evidence for East Asian admixture, whereas coastal groups have higher levels of European admixture. In contrast with many Latin American indigenous populations, the variance of admixture is high in each of the Pacific Northwest indigenous populations, as expected for recent and ongoing admixture processes. The results reveal some similarities but notable differences between admixture patterns in the Pacific Northwest and those in Latin America, contributing to a more detailed understanding of the genomic consequences of European colonization events throughout the Americas.
Proceedings of the National Academy of Sciences of the United States of America | 2015
Nicole Creanza; Merritt Ruhlen; Trevor J. Pemberton; Noah A. Rosenberg; Marcus W. Feldman
Significance Linguistic data are often combined with genetic data to frame inferences about human population history. However, little is known about whether human demographic history generates patterns in linguistic data that are similar to those found in genetic data at a global scale. Here, we analyze the largest available datasets of both phonemes and genotyped populations. Similar axes of human geographic differentiation can be inferred from genetic data and phoneme inventories; however, geographic isolation does not necessarily lead to the loss of phonemes. Our results show that migration within geographic regions shapes phoneme evolution, although human expansion out of Africa has not left a strong signature on phonemes. Worldwide patterns of genetic variation are driven by human demographic history. Here, we test whether this demographic history has left similar signatures on phonemes—sound units that distinguish meaning between words in languages—to those it has left on genes. We analyze, jointly and in parallel, phoneme inventories from 2,082 worldwide languages and microsatellite polymorphisms from 246 worldwide populations. On a global scale, both genetic distance and phonemic distance between populations are significantly correlated with geographic distance. Geographically close language pairs share significantly more phonemes than distant language pairs, whether or not the languages are closely related. The regional geographic axes of greatest phonemic differentiation correspond to axes of genetic differentiation, suggesting that there is a relationship between human dispersal and linguistic variation. However, the geographic distribution of phoneme inventory sizes does not follow the predictions of a serial founder effect during human expansion out of Africa. Furthermore, although geographically isolated populations lose genetic diversity via genetic drift, phonemes are not subject to drift in the same way: within a given geographic radius, languages that are relatively isolated exhibit more variance in number of phonemes than languages with many neighbors. This finding suggests that relatively isolated languages are more susceptible to phonemic change than languages with many neighbors. Within a language family, phoneme evolution along genetic, geographic, or cognate-based linguistic trees predicts similar ancestral phoneme states to those predicted from ancient sources. More genetic sampling could further elucidate the relative roles of vertical and horizontal transmission in phoneme evolution.
Genetic Epidemiology | 2011
Lucy Huang; Mattias Jakobsson; Trevor J. Pemberton; Muntaser E. Ibrahim; Thomas B. Nyambo; Sabah A. Omar; Jonathan K. Pritchard; Sarah A. Tishkoff; Noah A. Rosenberg
Sub‐Saharan Africa has been identified as the part of the world with the greatest human genetic diversity. This high level of diversity causes difficulties for genome‐wide association (GWA) studies in African populations—for example, by reducing the accuracy of genotype imputation in African populations compared to non‐African populations. Here, we investigate haplotype variation and imputation in Africa, using 253 unrelated individuals from 15 Sub‐Saharan African populations. We identify the populations that provide the greatest potential for serving as reference panels for imputing genotypes in the remaining groups. Considering reference panels comprising samples of recent African descent in Phase 3 of the HapMap Project, we identify mixtures of reference groups that produce the maximal imputation accuracy in each of the sampled populations. We find that optimal HapMap mixtures and maximal imputation accuracies identified in detailed tests of imputation procedures can instead be predicted by using simple summary statistics that measure relationships between the pattern of genetic variation in a target population and the patterns in potential reference panels. Our results provide an empirical basis for facilitating the selection of reference panels in GWA studies of diverse human populations, especially those of African ancestry. Genet. Epidemiol. 35:766–780, 2011.