Thomas Wiehe | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Thomas Wiehe is active.

Explore More

Publication

Featured researches published by Thomas Wiehe.

Theoretical Population Biology | 1992

The Effect of Strongly Selected Substitutions on Neutral Polymorphism: Analytical Results Based on Diffusion Theory

Wolfgang Stephan; Thomas Wiehe; Marcus W. Lenz

We derived analytical results for the reduction of the level of neutral polymorphism due to genetic hitchhiking using diffusion theory. In the case of a single strongly selected substitution, expected heterozygosity at a linked neutral locus is reduced by a factor (2cs)α−2csΓ(−2cs), 1α), where s is the selective advantage of the favored allele, c is the recombination fraction between the neutral and selected locus, and α = 2Ns, with N as the diploid population size. Γ denotes the incomplete gamma function. Using this result and assuming that at any one time at most one substitution is on its way to fixation, the effect of recurrent selected substitutions on expected heterozygosity can be approximated by 1[1−α(vϱ)λ(1α, − 2Ms)], where ν and ϱ are the expected numbers of selected substitutions and crossovers, respectively, per chromosome, per nucleotide, per generation, and M is the maximal recombination distance from the neutral locus that a selected mutation can be and still have a hitchhiking effect on the neutral locus. This expression involves a special function, λ < 0, which has a simple integral representation. The results are compared with those of deterministic population genetics and coalescent theory.

Genetics Research | 1998

Genetic hitch-hiking in a subdivided population

Montgomery Slatkin; Thomas Wiehe

The problem of genetic hitch-hiking in a geographically subdivided population is analysed under the assumption that migration rates among populations are relatively small compared with the selection coefficient for a newly arising advantageous allele. The approximate method used in the paper is valid when the number of emigrants per generation (Nm) is less than one. The approximate analysis shows that hitch-hiking can result in substantial differences among populations in the frequencies of neutral alleles closely linked to the advantageous allele. Thus, in cases for which genetic hitch-hiking is thought to be responsible for low levels of genetic variability in regions of the genome with restricted crossing over, it might be possible to find confirmatory evidence for that hypothesis by finding unusual patterns of geographic differentiation in the same regions of the genome.

Mammalian Genome | 2000

Comparative sequence analysis of the MECP2-locus in human and mouse reveals new transcribed regions

Kathrin Reichwald; Jens Thiesen; Thomas Wiehe; Joachim M. Weitzel; Wolf H. Strätling; Petra Kioschis; Annemarie Poustka; André Rosenthal; Matthias Platzer

Abstract. Comparative sequence analysis facilitates the identification of evolutionarily conserved regions, that is, gene-regulatory elements, which can not be detected by analyzing one species only. Sequencing of a 152-kb region on human Chromosome (Chr) Xq28 and of the synthenic 123 kb on mouse Chr XC identified the MECP2/Mecp2 locus, which is flanked by the gene coding for Interleukin-1 receptor associated kinase (IRAK/Il1rak) and the red opsin gene (RCP/Rsvp). By comparative sequence analysis, we identified a previously unknown, non-coding 5′ exon embedded in a CpG island associated with MECP2/Mecp2. Thus, the MECP2/Mecp2 gene is comprised of four exons instead of three. Furthermore, sequence comparison 3′ to the previously reported polyadenylation signal revealed a highly conserved region of 8.5 kb terminating in an alternative polyadenylation signal. Northern blot analysis verified the existence of two main transcripts of 1.9 kb and ∼10 kb, respectively. Both transcripts exhibit tissue-specific expression patterns and have almost identical short half-lifes. The ∼10-kb transcript corresponds to a giant 3′ UTR contained in the fourth exon of MECP2. The long 3′ UTR and the newly identified first intron of MECP2/Mecp2 are highly conserved in human and mouse. Furthermore, the human MECP2 locus is heterogeneous with respect to its DNA composition. We postulate that it represents a boundary between two H3 isochores that has not been observed previously.

Proceedings of the National Academy of Sciences of the United States of America | 2012

The chromatin insulator CTCF and the emergence of metazoan diversity

Peter Heger; Birger Marin; Marek Bartkuhn; Einhard Schierenberg; Thomas Wiehe

The great majority of metazoans belong to bilaterian phyla. They diversified during a short interval in Earth’s history known as the Cambrian explosion, ∼540 million years ago. However, the genetic basis of these events is poorly understood. Here we argue that the vertebrate genome organizer CTCF (CCCTC-binding factor) played an important role for the evolution of bilaterian animals. We provide evidence that the CTCF protein and a genome-wide abundance of CTCF-specific binding motifs are unique to bilaterian phyla, but absent in other eukaryotes. We demonstrate that CTCF-binding sites within vertebrate and Drosophila Hox gene clusters have been maintained for several hundred million years, suggesting an ancient origin of the previously known interaction between Hox gene regulation and CTCF. In addition, a close correlation between the presence of CTCF and Hox gene clusters throughout the animal kingdom suggests conservation of the Hox-CTCF link across the Bilateria. On the basis of these findings, we propose the existence of a Hox-CTCF kernel as principal organizer of bilaterian body plans. Such a kernel could explain (i) the formation of Hox clusters in Bilateria, (ii) the diversity of bilaterian body plans, and (iii) the uniqueness and time of onset of the Cambrian explosion.

BMC Bioinformatics | 2005

Genome comparison without alignment using shortest unique substrings

Bernhard Haubold; Nora Pierstorff; Friedrich Möller; Thomas Wiehe

BackgroundSequence comparison by alignment is a fundamental tool of molecular biology. In this paper we show how a number of sequence comparison tasks, including the detection of unique genomic regions, can be accomplished efficiently without an alignment step. Our procedure for nucleotide sequence comparison is based on shortest unique substrings. These are substrings which occur only once within the sequence or set of sequences analysed and which cannot be further reduced in length without losing the property of uniqueness. Such substrings can be detected using generalized suffix trees.ResultsWe find that the shortest unique substrings in Caenorhabditis elegans, human and mouse are no longer than 11 bp in the autosomes of these organisms. In mouse and human these unique substrings are significantly clustered in upstream regions of known genes. Moreover, the probability of finding such short unique substrings in the genomes of human or mouse by chance is extremely small. We derive an analytical expression for the null distribution of shortest unique substrings, given the GC-content of the query sequences. Furthermore, we apply our method to rapidly detect unique genomic regions in the genome of Staphylococcus aureus strain MSSA476 compared to four other staphylococcal genomes.ConclusionWe combine a method to rapidly search for shortest unique substrings in DNA sequences and a derivation of their null distribution. We show that unique regions in an arbitrary sample of genomes can be efficiently detected with this method. The corresponding programs shustring (SHortest Unique subSTRING) and shulen are written in C and available at http://adenine.biz.fh-weihenstephan.de/shustring/.

Bioinformatics | 1999

Post-processing long pairwise alignments

Zheng Zhang; Piotr Berman; Thomas Wiehe; Webb Miller

MOTIVATION The local alignment problem for two sequences requires determining similar regions, one from each sequence, and aligning those regions. For alignments computed by dynamic programming, current approaches for selecting similar regions may have potential flaws. For instance, the criterion of Smith and Waterman can lead to inclusion of an arbitrarily poor internal segment. Other approaches can generate an alignment scoring less than some of its internal segments. RESULTS We develop an algorithm that decomposes a long alignment into sub-alignments that avoid these potential imperfections. Our algorithm runs in time proportional to the original alignments length. Practical applications to alignments of genomic DNA sequences are described.

Genetics | 2008

Identification of Selective Sweeps in Closely Related Populations of the House Mouse Based on Microsatellite Scans

Meike Teschke; Odette Mukabayire; Thomas Wiehe; Diethard Tautz

Genome scans of polymorphisms promise to provide insights into the patterns and frequencies of positive selection under natural conditions. The use of microsatellites as markers has the potential to focus on very recent events, since in contrast to SNPs, their high mutation rates should remove signatures of older events. We assess this concept here in a large-scale study. We have analyzed two population pairs of the house mouse, one pair of the subspecies Mus musculus domesticus and the other of M. m. musculus. A total of 915 microsatellite loci chosen to cover the whole genome were assessed in a prescreening procedure, followed by individual typing of candidate loci. Schlötterers ratio statistics (lnRH) were applied to detect loci with significant deviations from patterns of neutral expectation. For eight loci from each population pair we have determined the size of the potential sweep window and applied a second statistical procedure (linked locus statistics). For the two population pairs, we find five and four significant sweep loci, respectively, with an average estimated window size of 120 kb. On the basis of the analysis of individual allele frequencies, it is possible to identify the most recent sweep, for which we estimate an onset of 400–600 years ago. Given the known population history for the French–German population pair, we infer that the average frequency of selective sweeps in these populations is higher than 1 in 100 generations across the whole genome. We discuss the implications for adaptation processes in natural populations.

Genetics | 2006

Identification of selective sweeps using a dynamically adjusted number of linked microsatellites

Thomas Wiehe; Viola Nolte; Daniel Zivkovic; Christian Schlötterer

There is currently large interest in distinguishing the signatures of genetic variation produced by demographic events from those produced by natural selection. We propose a simple multilocus statistical test to identify candidate sites of selective sweeps with high power. The test is based on the variability profile measured in an array of linked microsatellites. We also show that the analysis of flanking markers drastically reduces the number of false positives among the candidates that are identified in a genomewide survey of unlinked loci and find that this property is maintained in many population-bottleneck scenarios. However, for a certain range of intermediately severe population bottlenecks we find genomic signatures that are very similar to those produced by a selective sweep. While in these worst-case scenarios the power of the proposed test remains high, the false-positive rate reaches values close to 50%. Hence, selective sweeps may be hard to identify even if multiple linked loci are analyzed. Nevertheless, the integration of information from multiple linked loci always leads to a considerable reduction of the false-positive rate compared to a genome scan of unlinked loci. We discuss the application of this test to experimental data from Drosophila melanogaster.

Genetics | 2008

Second-Order Moments of Segregating Sites Under Variable Population Size

Daniel Zivkovic; Thomas Wiehe

The identification of genomic regions that have been exposed to positive selection is a major challenge in population genetics. Since selective sweeps are expected to occur during environmental changes or when populations are colonizing a new habitat, statistical tests constructed on the assumption of constant population size are biased by the co-occurrence of population size changes and selection. To delimit this problem and gain better insights into demographic factors, theoretical results regarding the second-order moments of segregating sites, such as the variance of segregating sites, have been derived. Driven by emerging genomewide surveys, which allow the estimation of demographic parameters, a generalized version of Tajimas D has been derived that takes into account a previously estimated demographic scenario to test single loci for traces of selection against the null hypothesis of neutral evolution under variable population size.

BMC Bioinformatics | 2006

How repetitive are genomes

Bernhard Haubold; Thomas Wiehe

BackgroundGenome sequences vary strongly in their repetitiveness and the causes for this are still debated. Here we propose a novel measure of genome repetitiveness, the index of repetitiveness, Ir, which can be computed in time proportional to the length of the sequences analyzed. We apply it to 336 genomes from all three domains of life.ResultsThe expected value of Ir is zero for random sequences of any G/C content and greater than zero for sequences with excess repeats. We find that the Ir of archaea is significantly smaller than that of eubacteria, which in turn is smaller than that of eukaryotes. Mouse chromosomes have a significantly higher Ir than human chromosomes and within each genome the Y chromosome is most repetitive. A sliding window analysis reveals that the human HOXA cluster and two surrounding genes are characterized by local minima in Ir. A program for calculating the Ir is freely available at http://adenine.biz.fh-weihenstephan.de/ir/.ConclusionThe general measure of DNA repetitiveness proposed in this paper can be efficiently computed on a genomic scale. This reveals a broad spectrum of repetitiveness among diverse genomes which agrees qualitatively with previous studies of repeat content. A sliding window analysis helps to analyze the intragenomic distribution of repeats.

Explore More