Matthew W. Hahn | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Matthew W. Hahn is active.

Explore More

Publication

Featured researches published by Matthew W. Hahn.

Nature | 2007

Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

Alexander Stark; Michael F. Lin; Pouya Kheradpour; Jakob Skou Pedersen; Leopold Parts; Joseph W. Carlson; Madeline A. Crosby; Matthew D. Rasmussen; Sushmita Roy; Ameya N. Deoras; J. Graham Ruby; Julius Brennecke; Harvard FlyBase curators; Berkeley Drosophila Genome; Emily Hodges; Angie S. Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V. Han; Morgan L. Maeder; Benjamin J. Polansky; Bryanne E. Robson; Stein Aerts; Jacques van Helden; Bassem A. Hassan; Donald G. Gilbert; Deborah A. Eastman; Michael D. Rice; Michael Weir

Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies.

PLOS Biology | 2005

Genomic Islands of Speciation in Anopheles gambiae

Thomas L. Turner; Matthew W. Hahn; Sergey V. Nuzhdin

The African malaria mosquito, Anopheles gambiae sensu stricto (A. gambiae), provides a unique opportunity to study the evolution of reproductive isolation because it is divided into two sympatric, partially isolated subtaxa known as M form and S form. With the annotated genome of this species now available, high-throughput techniques can be applied to locate and characterize the genomic regions contributing to reproductive isolation. In order to quantify patterns of differentiation within A. gambiae, we hybridized population samples of genomic DNA from each form to Affymetrix GeneChip microarrays. We found that three regions, together encompassing less than 2.8 Mb, are the only locations where the M and S forms are significantly differentiated. Two of these regions are adjacent to centromeres, on Chromosomes 2L and X, and contain 50 and 12 predicted genes, respectively. Sequenced loci in these regions contain fixed differences between forms and no shared polymorphisms, while no fixed differences were found at nearby control loci. The third region, on Chromosome 2R, contains only five predicted genes; fixed differences in this region were also verified by direct sequencing. These “speciation islands” remain differentiated despite considerable gene flow, and are therefore expected to contain the genes responsible for reproductive isolation. Much effort has recently been applied to locating the genes and genetic changes responsible for reproductive isolation between species. Though much can be inferred about speciation by studying taxa that have diverged for millions of years, studying differentiation between taxa that are in the early stages of isolation will lead to a clearer view of the number and size of regions involved in the genetics of speciation. Despite appreciable levels of gene flow between the M and S forms of A. gambiae, we were able to isolate three small regions of differentiation where genes responsible for ecological and behavioral isolation are likely to be located. We expect reproductive isolation to be due to changes at a small number of loci, as these regions together contain only 67 predicted genes. Concentrating future mapping experiments on these regions should reveal the genes responsible for reproductive isolation between forms.

Nature | 2011

Comparative and demographic analysis of orang-utan genomes

Devin P. Locke; LaDeana W. Hillier; Wesley C. Warren; Kim C. Worley; Lynne V. Nazareth; Donna M. Muzny; Shiaw-Pyng Yang; Zhengyuan Wang; Asif T. Chinwalla; Patrick Minx; Makedonka Mitreva; Lisa Cook; Kim D. Delehaunty; Catrina C. Fronick; Heather K. Schmidt; Lucinda A. Fulton; Robert S. Fulton; Joanne O. Nelson; Vincent Magrini; Craig S. Pohl; Tina Graves; Chris Markovic; Andy Cree; Huyen Dinh; Jennifer Hume; Christie Kovar; Gerald Fowler; Gerton Lunter; Stephen Meader; Andreas Heger

‘Orang-utan’ is derived from a Malay term meaning ‘man of the forest’ and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (Ne) expanded exponentially relative to the ancestral Ne after the split, while Bornean Ne declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.

Molecular Ecology | 2014

Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow

Tami Cruickshank; Matthew W. Hahn

The metaphor of ‘genomic islands of speciation’ was first used to describe heterogeneous differentiation among loci between the genomes of closely related species. The biological model proposed to explain these differences was that the regions showing high levels of differentiation were resistant to gene flow between species, while the remainder of the genome was being homogenized by gene flow and consequently showed lower levels of differentiation. However, the conditions under which such differentiation can occur at multiple unlinked loci are restrictive; additionally, essentially, all previous analyses have been carried out using relative measures of divergence, which can be misleading when regions with different levels of recombination are compared. Here, we test the model of differential gene flow by asking whether absolute divergence is also higher in the previously identified ‘islands’. Using five species pairs for which full sequence data are available, we find that absolute measures of divergence are not higher in genomic islands. Instead, in all cases examined, we find reduced diversity in these regions, a consequence of which is that relative measures of divergence are abnormally high. These data therefore do not support a model of differential gene flow among loci, although islands of relative divergence may represent loci involved in local adaptation. Simulations using the program IMa2 further suggest that inferences of any gene flow may be incorrect in many comparisons. We instead present an alternative explanation for heterogeneous patterns of differentiation, one in which postspeciation selection generates patterns consistent with multiple aspects of the data.

Bioinformatics | 2006

CAFE: a computational tool for the study of gene family evolution

Tijl De Bie; Nello Cristianini; Jeffery P. Demuth; Matthew W. Hahn

SUMMARY We present CAFE (Computational Analysis of gene Family Evolution), a tool for the statistical analysis of the evolution of the size of gene families. It uses a stochastic birth and death process to model the evolution of gene family sizes over a phylogeny. For a specified phylogenetic tree, and given the gene family sizes in the extant species, CAFE can estimate the global birth and death rate of gene families, infer the most likely gene family size at all internal nodes, identify gene families that have accelerated rates of gain and loss (quantified by a p-value) and identify which branches cause the p-value to be small for significant families. AVAILABILITY Software is available from http://www.bio.indiana.edu/~hahnlab/Software.html

PLOS Biology | 2014

Sex Determination: Why So Many Ways of Doing It?

Doris Bachtrog; Judith E. Mank; Catherine L. Peichel; Mark Kirkpatrick; Sarah P. Otto; Tia-Lynn Ashman; Matthew W. Hahn; Jun Kitano; Itay Mayrose; Ray Ming; Nicolas Perrin; Laura Ross; Nicole Valenzuela; Jana C. Vamosi

Sex is universal amongst most eukaryotes, yet a remarkable diversity of sex determining mechanisms exists. We review our current understanding of how and why sex determination evolves in animals and plants.

Proceedings of the Royal Society series B : biological sciences, 2004, Vol.271(1547), pp.1443-1450 [Peer Reviewed Journal] | 2004

Random drift and culture change

Ra Bentley; Matthew W. Hahn; Stephen Shennan

We show that the frequency distributions of cultural variants, in three different real–world examples—first names, archaeological pottery and applications for technology patents—follow power laws that can be explained by a simple model of random drift. We conclude that cultural and economic choices often reflect a decision process that is value–neutral; this result has far–reaching testable implications for social–science research.

PLOS Genetics | 2007

Gene family evolution across 12 Drosophila genomes.

Matthew W. Hahn; Mira V. Han; Sang-Gook Han

Comparison of whole genomes has revealed large and frequent changes in the size of gene families. These changes occur because of high rates of both gene gain (via duplication) and loss (via deletion or pseudogenization), as well as the evolution of entirely new genes. Here we use the genomes of 12 fully sequenced Drosophila species to study the gain and loss of genes at unprecedented resolution. We find large numbers of both gains and losses, with over 40% of all gene families differing in size among the Drosophila. Approximately 17 genes are estimated to be duplicated and fixed in a genome every million years, a rate on par with that previously found in both yeast and mammals. We find many instances of extreme expansions or contractions in the size of gene families, including the expansion of several sex- and spermatogenesis-related families in D. melanogaster that also evolve under positive selection at the nucleotide level. Newly evolved gene families in our dataset are associated with a class of testes-expressed genes known to have evolved de novo in a number of cases. Gene family comparisons also allow us to identify a number of annotated D. melanogaster genes that are unlikely to encode functional proteins, as well as to identify dozens of previously unannotated D. melanogaster genes with conserved homologs in the other Drosophila. Taken together, our results demonstrate that the apparent stasis in total gene number among species has masked rapid turnover in individual gene gain and loss. It is likely that this genomic revolving door has played a large role in shaping the morphological, physiological, and metabolic differences among species.

PLOS ONE | 2006

The Evolution of Mammalian Gene Families

Jeffery P. Demuth; Tijl De Bie; Jason E. Stajich; Nello Cristianini; Matthew W. Hahn

Gene families are groups of homologous genes that are likely to have highly similar functions. Differences in family size due to lineage-specific gene duplication and gene loss may provide clues to the evolutionary forces that have shaped mammalian genomes. Here we analyze the gene families contained within the whole genomes of human, chimpanzee, mouse, rat, and dog. In total we find that more than half of the 9,990 families present in the mammalian common ancestor have either expanded or contracted along at least one lineage. Additionally, we find that a large number of families are completely lost from one or more mammalian genomes, and a similar number of gene families have arisen subsequent to the mammalian common ancestor. Along the lineage leading to modern humans we infer the gain of 689 genes and the loss of 86 genes since the split from chimpanzees, including changes likely driven by adaptive natural selection. Our results imply that humans and chimpanzees differ by at least 6% (1,418 of 22,000 genes) in their complement of genes, which stands in stark contrast to the oft-cited 1.5% difference between orthologous nucleotide sequences. This genomic “revolving door” of gene gain and loss represents a large number of genetic differences separating humans from our closest relatives.

Journal of Heredity | 2009

Distinguishing Among Evolutionary Models for the Maintenance of Gene Duplicates

Matthew W. Hahn

Determining the evolutionary forces responsible for the maintenance of gene duplicates is key to understanding the processes leading to evolutionary adaptation and novelty. In his highly prescient book, Susumu Ohno recognized that duplicate genes are fixed and maintained within a population with 3 distinct outcomes: neofunctionalization, subfunctionalization, and conservation of function. Subsequent researchers have proposed a multitude of population genetic models that lead to these outcomes, each differing largely in the role played by adaptive natural selection. In this paper, I present a nonmathematical review of these models, their predictions, and the evidence collected in support of each of them. Though the various outcomes of gene duplication are often strictly associated with the presence or absence of adaptive natural selection, I argue that determining the outcome of duplication is orthogonal to determining whether natural selection has acted. Despite an ever-growing field of research into the fate of gene duplicates, there is not yet clear evidence for the preponderance of one outcome over the others, much less evidence for the importance of adaptive or nonadaptive forces in maintaining these duplicates.

Explore More