Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Bernhard Haubold is active.

Publication


Featured researches published by Bernhard Haubold.


german conference on bioinformatics | 2000

LIAN 3.0: detecting linkage disequilibrium in multilocus data

Bernhard Haubold; Richard R. Hudson

SUMMARY LIAN is a program to test the null hypothesis of linkage equilibrium for multilocus data. LIAN incorporates both a Monte Carlo method as well as a novel algebraic method to carry out the hypothesis test. The program further returns the genetic diversity of the sample and the pairwise distances between its members.


American Journal of Botany | 2001

Molecular systematics of the Brassicaceae: evidence from coding plastidic matK and nuclear Chs sequences

Marcus A. Koch; Bernhard Haubold; Thomas Mitchell-Olds

Phylogenetic relationships were inferred using nucleotide sequence variation of the nuclear-encoded chalcone synthase gene (Chs) and the chloroplast gene matK for members of five tribes from the family Brassicaceae to analyze tribal and subtribal structures. Phylogenetic trees from individual data sets are mostly in congruence with the results from a combined matK-Chs analysis with a total of 2721 base pairs, but with greater resolution and higher statistical support for deeper branching patterns. The analysis indicates that tribes Lepidieae, Arabideae, and Sisymbrieae are not monophyletic. Among taxa under study four different lineages each were detected in tribes Arabideae and Lepidieae, interspersed with taxa from tribes Sisymbrieae, Hesperideae, and Brassiceae. It is concluded that tribe Brassiceae might be the only monophyletic group of the traditional tribes. From our data we estimated several divergence times for different lineages among cruciferous plants: 5.8 mya (million years ago) for the Arabidopsis-Cardaminopsis split, 20 mya for the Brassica-Arabidopsis split, and ∼40 mya for the age of the deepest split between the most basal crucifer Aethionema and remaining cruciferous taxa.


Molecular Ecology | 2000

Genetic isolation by distance in Arabidopsis thaliana: biogeography and postglacial colonization of Europe

Timothy F. Sharbel; Bernhard Haubold; Thomas Mitchell-Olds

Arabidopsis thaliana provides a useful model system for functional, evolutionary and ecological studies in plant biology. We have analysed natural genetic variation in A. thaliana in order to infer its biogeographical and historical distribution across Eurasia. We analysed 79 amplified fragment length polymorphism (AFLP) markers in 142 accessions from the species’ native range, and found highly significant genetic isolation by distance among A. thaliana accessions from Eurasia and southern Europe. These spatial patterns of genetic variation suggest that A. thaliana colonized central and northern Europe from Asia and from Mediterranean Pleistocene refugia, a trend which has been identified in other species. Statistically significant levels of multilocus linkage disequilibrium suggest intermediate levels of disequilibrium among subsets of loci, and analysis of genetic relationships among accessions reveal a star or bush‐like dendrogram with low bootstrap support. Taken together, it appears that there has been sufficient historical recombination in the A. thaliana genome such that accessions do not conform to a tree‐like, bifurcating pattern of evolution – there is no ‘ecotype phylogeny.’ Nonetheless, significant isolation by distance provides a framework upon which studies of natural variation in A. thaliana may be designed and interpreted.


Genetics | 2009

The Impact of Sampling Schemes on the Site Frequency Spectrum in Nonequilibrium Subdivided Populations

Thomas Städler; Bernhard Haubold; Carlos Merino; Wolfgang Stephan; Peter Pfaffelhuber

Using coalescent simulations, we study the impact of three different sampling schemes on patterns of neutral diversity in structured populations. Specifically, we are interested in two summary statistics based on the site frequency spectrum as a function of migration rate, demographic history of the entire substructured population (including timing and magnitude of specieswide expansions), and the sampling scheme. Using simulations implementing both finite-island and two-dimensional stepping-stone spatial structure, we demonstrate strong effects of the sampling scheme on Tajimas D (DT) and Fu and Lis D (DFL) statistics, particularly under specieswide (range) expansions. Pooled samples yield average DT and DFL values that are generally intermediate between those of local and scattered samples. Local samples (and to a lesser extent, pooled samples) are influenced by local, rapid coalescence events in the underlying coalescent process. These processes result in lower proportions of external branch lengths and hence lower proportions of singletons, explaining our finding that the sampling scheme affects DFL more than it does DT. Under specieswide expansion scenarios, these effects of spatial sampling may persist up to very high levels of gene flow (Nm > 25), implying that local samples cannot be regarded as being drawn from a panmictic population. Importantly, many data sets on humans, Drosophila, and plants contain signatures of specieswide expansions and effects of sampling scheme that are predicted by our simulation results. This suggests that validating the assumption of panmixia is crucial if robust demographic inferences are to be made from local or pooled samples. However, future studies should consider adopting a framework that explicitly accounts for the genealogical effects of population subdivision and empirical sampling schemes.


PLOS Pathogens | 2013

Genomic Analysis of the Kiwifruit Pathogen Pseudomonas syringae pv. actinidiae Provides Insight into the Origins of an Emergent Plant Disease

Honour C. McCann; Erik H. A. Rikkerink; Frederic Bertels; Mark Fiers; Ashley Lu; Jonathan Rees-George; Mark T. Andersen; Andrew P. Gleave; Bernhard Haubold; Mark Wohlers; David S. Guttman; Pauline W. Wang; Christina Straub; J. L. Vanneste; Paul B. Rainey; Matthew D. Templeton

The origins of crop diseases are linked to domestication of plants. Most crops were domesticated centuries – even millennia – ago, thus limiting opportunity to understand the concomitant emergence of disease. Kiwifruit (Actinidia spp.) is an exception: domestication began in the 1930s with outbreaks of canker disease caused by P. syringae pv. actinidiae (Psa) first recorded in the 1980s. Based on SNP analyses of two circularized and 34 draft genomes, we show that Psa is comprised of distinct clades exhibiting negligible within-clade diversity, consistent with disease arising by independent samplings from a source population. Three clades correspond to their geographical source of isolation; a fourth, encompassing the Psa-V lineage responsible for the 2008 outbreak, is now globally distributed. Psa has an overall clonal population structure, however, genomes carry a marked signature of within-pathovar recombination. SNP analysis of Psa-V reveals hundreds of polymorphisms; however, most reside within PPHGI-1-like conjugative elements whose evolution is unlinked to the core genome. Removal of SNPs due to recombination yields an uninformative (star-like) phylogeny consistent with diversification of Psa-V from a single clone within the last ten years. Growth assays provide evidence of cultivar specificity, with rapid systemic movement of Psa-V in Actinidia chinensis. Genomic comparisons show a dynamic genome with evidence of positive selection on type III effectors and other candidate virulence genes. Each clade has highly varied complements of accessory genes encoding effectors and toxins with evidence of gain and loss via multiple genetic routes. Genes with orthologs in vascular pathogens were found exclusively within Psa-V. Our analyses capture a pathogen in the early stages of emergence from a predicted source population associated with wild Actinidia species. In addition to candidate genes as targets for resistance breeding programs, our findings highlight the importance of the source population as a reservoir of new disease.


Molecular Ecology | 2010

Nucleotide divergence vs. gene expression differentiation: comparative transcriptome sequencing in natural isolates from the carrion crow and its hybrid zone with the hooded crow

Jochen B. W. Wolf; Till Bayer; Bernhard Haubold; Markus Schilhabel; Philip Rosenstiel; Diethard Tautz

Recent advances in sequencing technology promise to provide new strategies for studying population differentiation and speciation phenomena in their earliest phases. We focus here on the black carrion crow (Corvus [corone] corone), which forms a zone of hybridization and overlap with the grey coated hooded crow (Corvus [corone] cornix). However, although these semispecies are taxonomically distinct, previous analyses based on several types of genetic markers did not reveal significant molecular differentiation between them. We here corroborate this result with sequence data obtained from a set of 25 nuclear intronic loci. Thus, the system represents a case of a very early phase of species divergence that requires new molecular approaches for its description. We have therefore generated RNAseq expression profiles using barcoded massively parallel pyrosequencing of brain mRNA from six individuals of the carrion crow and five individuals from a hybrid zone with the hooded crow. We obtained 856 675 reads from two runs, with average read length of 270 nt and coverage of 8.44. Reads were assembled de novo into 19 552 contigs, 70% of which could be assigned to annotated genes in chicken and zebra finch. This resulted in a total of 7637 orthologous genes and a core set of 1301 genes that could be compared across all individuals. We find a clear clustering of expression profiles for the pure carrion crow animals and disperse profiles for the animals from the hybrid zone. These results suggest that gene expression differences may indeed be a sensitive indicator of initial species divergence.


Current Opinion in Microbiology | 1998

Bacterial genomics and adaptation to life on plants: implications for the evolution of pathogenicity and symbiosis

Gail M. Preston; Bernhard Haubold; Paul B. Rainey

Many bacteria form intimate associations with plants. Despite the agricultural and biotechnological significance of these bacteria, no whole genome sequences have yet been described. Plant-associated bacteria form a phylogenetically diverse group, with representative species from many major taxons. Sequence information from genomes of closely related bacteria, in combination with technological developments in the field of functional genomics, provides new opportunities for determining the origin and evolution of traits that contribute to bacterial fitness and interactions with plant hosts.


BMC Bioinformatics | 2005

Genome comparison without alignment using shortest unique substrings

Bernhard Haubold; Nora Pierstorff; Friedrich Möller; Thomas Wiehe

BackgroundSequence comparison by alignment is a fundamental tool of molecular biology. In this paper we show how a number of sequence comparison tasks, including the detection of unique genomic regions, can be accomplished efficiently without an alignment step. Our procedure for nucleotide sequence comparison is based on shortest unique substrings. These are substrings which occur only once within the sequence or set of sequences analysed and which cannot be further reduced in length without losing the property of uniqueness. Such substrings can be detected using generalized suffix trees.ResultsWe find that the shortest unique substrings in Caenorhabditis elegans, human and mouse are no longer than 11 bp in the autosomes of these organisms. In mouse and human these unique substrings are significantly clustered in upstream regions of known genes. Moreover, the probability of finding such short unique substrings in the genomes of human or mouse by chance is extremely small. We derive an analytical expression for the null distribution of shortest unique substrings, given the GC-content of the query sequences. Furthermore, we apply our method to rapidly detect unique genomic regions in the genome of Staphylococcus aureus strain MSSA476 compared to four other staphylococcal genomes.ConclusionWe combine a method to rapidly search for shortest unique substrings in DNA sequences and a derivation of their null distribution. We show that unique regions in an arbitrary sample of genomes can be efficiently detected with this method. The corresponding programs shustring (SHortest Unique subSTRING) and shulen are written in C and available at http://adenine.biz.fh-weihenstephan.de/shustring/.


Molecular Ecology | 2010

mlRho – a program for estimating the population mutation and recombination rates from shotgun-sequenced diploid genomes

Bernhard Haubold; Peter Pfaffelhuber; Michael Lynch

Improvements in sequencing technology over the past 5 years are leading to routine application of shotgun sequencing in the fields of ecology and evolution. However, the theory to estimate evolutionary parameters from these data is still being worked out. Here we present an extension and implementation of part of this theory, mlRho. This program can efficiently compute the following three maximum likelihood estimators based on shotgun sequence data obtained from single diploid individuals: the population mutation rate (4Neμ), the sequencing error rate, and the population recombination rate (4Nec). We demonstrate the accuracy of mlRho by applying it to simulated data sets. In addition, we analyse the genomes of the sea squirt Ciona intestinalis and the water flea Daphnia pulex. Ciona intestinalis is an obligate outcrosser, while D. pulex is a cyclic parthenogen, and we discuss how these contrasting life histories are reflected in our parameter estimates. The program mlRho is freely available from http://guanine.evolbio.mpg.de/mlRho.


Bioinformatics | 2009

Efficient estimation of pairwise distances between genomes

Mirjana Domazet-Lošo; Bernhard Haubold

MOTIVATION Genome comparison is central to contemporary genomics and typically relies on sequence alignment. However, genome-wide alignments are difficult to compute. We have, therefore, recently developed an accurate alignment-free estimator of the number of substitutions per site based on the lengths of exact matches between pairs of sequences. The previous implementation of this measure requires n(n-1) suffix tree constructions and traversals, where n is the number of sequences analyzed. This does not scale well for large n. RESULTS We present an algorithm to extract pairwise distances in a single traversal of a single suffix tree containing n sequences. As a result, the run time of the suffix tree construction phase of our algorithm is reduced from O(n(2)L) to O(nL), where L is the length of each sequence. We implement this algorithm in the program kr version 2 and apply it to 825 HIV genomes, 13 genomes of enterobacteria and the complete genomes of 12 Drosophila species. We show that, depending on the input dataset, the new program is at least 10 times faster than its predecessor. AVAILABILITY Version 2 of kr can be tested via a web interface at http://guanine.evolbio.mpg.de/kr2/. It is written in standard C and its source code is available under the GNU General Public License from the same web site. CONTACT [email protected] Supplementary informations: Supplementary data are available at Bioinformatics online.

Collaboration


Dive into the Bernhard Haubold's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Michael Lynch

Arizona State University

View shared research outputs
Researchain Logo
Decentralizing Knowledge