Maido Remm
University of Tartu
Publication
Featured research published by Maido Remm.
Bioinformatics | 2007
Triinu Koressaar; Maido Remm
The determination of annealing temperature is a critical step in PCR design. This parameter is typically derived from the melting temperature of the PCR primers, so for successful PCR work it is important to determine the melting temperature of the primers accurately. We introduced several enhancements in the widely used primer design program Primer3. The improvements include a formula for calculating melting temperature and a salt correction formula. Also, the new version can take into account the effects of divalent cations, which are included in most PCR buffers. Another modification enables using lowercase masked template sequences for primer design. Availability: Features described in this article have been implemented into the development code of Primer3 and will be available in future versions (version 1.1 and newer) of Primer3. Also, a modified version is compiled under the name of mPrimer3, which is distributed independently. The web-based version of mPrimer3 is available at http://bioinfo.ebc.ee/mprimer3/ and the binary code is freely downloadable from the URL http://bioinfo.ebc.ee/download/.
Nucleic Acids Research | 2004
Kevin P. O'Brien; Maido Remm; Erik L. L. Sonnhammer
The Inparanoid eukaryotic ortholog database (http://inparanoid.cgb.ki.se/) is a collection of pairwise ortholog groups between 17 whole genomes: Anopheles gambiae, Caenorhabditis briggsae, Caenorhabditis elegans, Drosophila melanogaster, Danio rerio, Takifugu rubripes, Gallus gallus, Homo sapiens, Mus musculus, Pan troglodytes, Rattus norvegicus, Oryza sativa, Plasmodium falciparum, Arabidopsis thaliana, Escherichia coli, Saccharomyces cerevisiae and Schizosaccharomyces pombe. Complete proteomes for these genomes were derived from Ensembl and UniProt and compared pairwise using Blast, followed by a clustering step using the Inparanoid program. An Inparanoid cluster is seeded by a reciprocally best-matching ortholog pair, around which inparalogs (should they exist) are gathered independently, while outparalogs are excluded. The ortholog clusters can be searched on the website using Ensembl gene/protein or UniProt identifiers, annotation text or by Blast alignment against our protein datasets. The entire dataset can be downloaded, as can the Inparanoid program itself.
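The seeding step described in this abstract, a reciprocally best-matching pair, can be illustrated with a minimal sketch. This is not the Inparanoid program itself: the input format (a dictionary of pairwise scores per search direction), the function names and the example identifiers are all assumptions made for illustration, and the later gathering of inparalogs is not shown.

```python
from typing import Dict, List, Tuple

# Hypothetical input: scores[(query, subject)] = similarity score,
# one dictionary per search direction (A against B, and B against A).
Scores = Dict[Tuple[str, str], float]


def best_hits(scores: Scores) -> Dict[str, str]:
    """Map each query protein to its single highest-scoring subject."""
    best: Dict[str, Tuple[str, float]] = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_pairs(a_vs_b: Scores, b_vs_a: Scores) -> List[Tuple[str, str]]:
    """Return (a, b) pairs where a's best hit is b and b's best hit is a."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Made-up example scores for two tiny proteomes.
a_vs_b = {("A1", "B1"): 250.0, ("A1", "B2"): 90.0,
          ("A2", "B2"): 180.0, ("A3", "B1"): 120.0}
b_vs_a = {("B1", "A1"): 245.0, ("B2", "A2"): 175.0, ("B2", "A1"): 80.0}

seed_pairs = reciprocal_best_pairs(a_vs_b, b_vs_a)
print(seed_pairs)
```

In this toy data, A3 finds B1 as its best hit, but B1's best hit is A1, so A3 does not seed a cluster.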
PLOS ONE | 2009
Mari Nelis; Tonu Esko; Reedik Mägi; Fritz Zimprich; Alexander Zimprich; Draga Toncheva; Sena Karachanak; T. Piskackova; I. Balascak; Leena Peltonen; Eveliina Jakkula; Karola Rehnström; Mark Lathrop; Simon Heath; Pilar Galan; Stefan Schreiber; Thomas Meitinger; Arne Pfeufer; H-Erich Wichmann; Béla Melegh; Noémi Polgár; Daniela Toniolo; Paolo Gasparini; Pio D'Adamo; Janis Klovins; Liene Nikitina-Zake; Vaidutis Kučinskas; Jūratė Kasnauskienė; Jan Lubinski; Tadeusz Dębniak
Using principal component (PC) analysis, we studied the genetic constitution of 3,112 individuals from Europe as portrayed by more than 270,000 single nucleotide polymorphisms (SNPs) genotyped with the Illumina Infinium platform. In cohorts where the sample size was >100, one hundred randomly chosen samples were used for analysis to minimize the sample size effect, resulting in a total of 1,564 samples. This analysis revealed that the genetic structure of the European population correlates closely with geography. The first two PCs highlight the genetic diversity corresponding to the northwest to southeast gradient and position the populations according to their approximate geographic origin. The resulting genetic map forms a triangular structure with a) Finland, b) the Baltic region, Poland and Western Russia, and c) Italy as its vertexes, and with d) Central and Western Europe in its centre. Inter- and intra-population genetic differences were quantified by the inflation factor lambda (λ) (ranging from 1.00 to 4.21), fixation index (Fst) (ranging from 0.000 to 0.023), and by the number of markers exhibiting significant allele frequency differences in pair-wise population comparisons. The estimated lambda was used to assess the real diminishing impact to association statistics when two distinct populations are merged directly in an analysis. When the PC analysis was confined to the 1,019 Estonian individuals (0.1% of the Estonian population), a fine structure emerged that correlated with the geography of individual counties. With at least two cohorts available from several countries, genetic substructures were investigated in Czech, Finnish, German, Estonian and Italian populations. Together with previously published data, our results allow the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies including genome-wide association studies (GWAS).
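The abstract does not state how the inflation factor λ was computed. A common definition, assumed here, is the median of the observed 1-degree-of-freedom chi-square association statistics divided by the median expected under the null hypothesis (about 0.4549). The sketch below follows that assumption; the function name and the example values are illustrative only.

```python
from statistics import median
from typing import Sequence

# Median of a chi-square distribution with 1 degree of freedom (approximate).
CHI2_1DF_MEDIAN = 0.4549364


def inflation_factor(chi2_stats: Sequence[float]) -> float:
    """Median observed chi-square statistic divided by its null expectation."""
    if not chi2_stats:
        raise ValueError("at least one test statistic is required")
    return median(chi2_stats) / CHI2_1DF_MEDIAN


# Made-up statistics whose median equals the null median, so lambda is about 1.
null_like = [0.1, 0.4549364, 3.0]
# Made-up statistics with a doubled median, so lambda is about 2.
inflated = [0.2, 0.9098728, 6.0]

print(round(inflation_factor(null_like), 3), round(inflation_factor(inflated), 3))
```

Under this definition, a value near 1 indicates no systematic inflation of the test statistics, while larger values indicate stratification between the compared groups.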
Genome Research | 2015
Monika Karmin; Lauri Saag; Mário Vicente; Melissa A. Wilson Sayres; Mari Järve; Ulvi Gerst Talas; Siiri Rootsi; Anne-Mai Ilumäe; Reedik Mägi; Mario Mitt; Luca Pagani; Tarmo Puurand; Zuzana Faltyskova; Florian Clemente; Alexia Cardona; Ene Metspalu; Hovhannes Sahakyan; Bayazit Yunusbayev; Georgi Hudjashov; Michael DeGiorgio; Eva-Liis Loogväli; Christina A. Eichstaedt; Mikk Eelmets; Gyaneshwer Chaubey; Kristiina Tambets; S. S. Litvinov; Maru Mormina; Yali Xue; Qasim Ayub; Grigor Zoraqi
It is commonly thought that human genetic diversity in non-African populations was shaped primarily by an out-of-Africa dispersal 50-100 thousand yr ago (kya). Here, we present a study of 456 geographically diverse high-coverage Y chromosome sequences, including 299 newly reported samples. Applying ancient DNA calibration, we date the Y-chromosomal most recent common ancestor (MRCA) in Africa at 254 (95% CI 192-307) kya and detect a cluster of major non-African founder haplogroups in a narrow time interval at 47-52 kya, consistent with a rapid initial colonization model of Eurasia and Oceania after the out-of-Africa bottleneck. In contrast to demographic reconstructions based on mtDNA, we infer a second strong bottleneck in Y-chromosome lineages dating to the last 10 ky. We hypothesize that this bottleneck is caused by cultural changes affecting variance of reproductive success among males.
European Journal of Human Genetics | 2007
Maris Kuningas; Reedik Mägi; Rudi G. J. Westendorp; P. Eline Slagboom; Maido Remm; Diana van Heemst
Recently, the Daf-16 gene has been shown to regulate the lifespan of nematodes and flies. In mammals, the Daf-16 homologues are forkhead (FOXO) transcription factors, of which specific functions have been identified for Foxo1a and Foxo3a. Despite that, their influence on human age-related trajectories and lifespan is unknown. Here, we analysed the effect of genetic variance in Foxo1a and Foxo3a on metabolic profile, age-related diseases, fertility, fecundity and mortality. This study was carried out in the prospective population-based Leiden 85-plus Study, which includes 1245 participants, aged 85 years or more. The mean follow-up time was 4.4 years. Haplotype analyses of Foxo1a revealed that carriers of haplotype 3 ‘TCA’ have higher HbA1c levels (P=0.025) and a 1.14-fold higher all-cause mortality risk (P=0.021). This increase in mortality was attributable to death from diabetes, for which a 2.43-fold increase was observed (P=0.025). The analyses with Foxo3a haplotypes revealed no differences in metabolic profile, fertility or fecundity. However, increased risks of stroke were observed for Foxo3a block-A haplotype 2 ‘GAGC’ (P=0.007) and haplotype 4 ‘AAAT’ (P=0.014) carriers. In addition, the haplotype 2 ‘GAGC’ carriers had a 1.13-fold increased risk for all-cause mortality (P=0.036) and 1.19-fold increased risk for cardiovascular mortality (P=0.052). In conclusion, this study shows that genetic variation in evolutionarily conserved Foxo1a and Foxo3a genes influences lifespan in our study population.
American Journal of Human Genetics | 2011
Mait Metspalu; Irene Gallego Romero; Bayazit Yunusbayev; Gyaneshwer Chaubey; Chandana Basu Mallick; Georgi Hudjashov; Mari Nelis; Reedik Mägi; Ene Metspalu; Maido Remm; Ramasamy Pitchappan; Lalji Singh; Kumarasamy Thangaraj; Richard Villems; Toomas Kivisild
South Asia harbors one of the highest levels of genetic diversity in Eurasia, which could be interpreted as a result of its long-term large effective population size and of admixture during its complex demographic history. In contrast to Pakistani populations, populations of Indian origin have been underrepresented in previous genomic scans of positive selection and population structure. Here we report data for more than 600,000 SNP markers genotyped in 142 samples from 30 ethnic groups in India. Combining our results with other available genome-wide data, we show that Indian populations are characterized by two major ancestry components, one of which is spread at comparable frequency and haplotype diversity in populations of South and West Asia and the Caucasus. The second component is more restricted to South Asia and accounts for more than 50% of the ancestry in Indian populations. Haplotype diversity associated with these South Asian ancestry components is significantly higher than that of the components dominating the West Eurasian ancestry palette. Modeling of the observed haplotype diversities suggests that both Indian ancestry components are older than the purported Indo-Aryan invasion 3,500 YBP. Consistent with the results of pairwise genetic distances among world regions, Indians share more ancestry signals with West than with East Eurasians. However, compared to Pakistani populations, a higher proportion of their genes show regionally specific signals of high haplotype homozygosity. Among such candidates of positive selection in India are MSTN and DOK5, both of which have potential implications in lipid metabolism and the etiology of type 2 diabetes.
BMC Molecular Biology | 2007
Vladimir Vimberg; Age Tats; Maido Remm; Tanel Tenson
Background: The mRNA translation initiation region (TIR) comprises the initiator codon, Shine-Dalgarno (SD) sequence and translational enhancers. Probably the most abundant class of enhancers contains A/U-rich sequences. We have tested the influence of SD sequence length and the presence of enhancers on the efficiency of translation initiation. Results: We found that during bacterial growth at 37°C, a six-nucleotide SD (AGGAGG) is more efficient than shorter or longer sequences. The A/U-rich enhancer contributes strongly to the efficiency of initiation, having the greatest stimulatory effect in the exponential growth phase of the bacteria. The SD sequences and the A/U-rich enhancer stimulate translation co-operatively: strong SDs are stimulated by the enhancer much more than weak SDs. The bacterial growth rate does not have a major influence on the TIR selection pattern. On the other hand, temperature affects the TIR preference pattern: shorter SD sequences are preferred at lower growth temperatures. We also performed an in silico analysis of the TIRs in all E. coli mRNAs. The base pairing potential of the SD sequences does not correlate with the codon adaptation index, which is used as an estimate of gene expression level. Conclusion: In E. coli the SD selection preferences are influenced by the growth temperature and not influenced by the growth rate. The A/U-rich enhancers stimulate translation considerably by acting co-operatively with the SD sequences.
BMC Genomics | 2007
Tõnu Margus; Maido Remm; Tanel Tenson
Background: Translational GTPases are a family of proteins in which GTPase activity is stimulated by the large ribosomal subunit. Conserved sequence features allow members of this family to be identified. Results: To achieve accurate protein identification and grouping we have developed a method combining searches with Hidden Markov Model profiles and tree-based grouping. We found all the genes for translational GTPases in 191 fully sequenced bacterial genomes. The protein sequences were grouped into nine subfamilies. Analysis of the results shows that three translational GTPases, the translation factors EF-Tu, EF-G and IF2, are present in all organisms examined. In addition, several copies of the genes encoding EF-Tu and EF-G are present in some genomes. In the case of multiple genes for EF-Tu, the gene copies are nearly identical; in the case of multiple EF-G genes, the gene copies have diverged considerably. The fourth translational GTPase, LepA, the function of which is currently unknown, is also nearly universally conserved in bacteria, being absent from only one organism out of the 191 analyzed. The translation regulator, TypA, is also present in most of the organisms examined, being absent only from bacteria with small genomes. Surprisingly, some of the well-studied translational GTPases are present only in a very small number of bacteria. The translation termination factor RF3 is absent from many groups of bacteria with both small and large genomes. The specialized translation factor for selenocysteine incorporation – SelB – was found in only 39 organisms. Similarly, the tetracycline resistance proteins (Tet) are present only in a small number of species. Proteins of the CysN/NodQ subfamily have acquired functions in sulfur metabolism and production of signaling molecules. The genes coding for CysN/NodQ proteins were found in 74 genomes.
This protein subfamily is not confined to Proteobacteria, as suggested previously, but is also present in many other groups of bacteria. Conclusion: Four of the translational GTPase subfamilies (IF2, EF-Tu, EF-G and LepA) are represented by at least one member in each bacterium studied, with one exception in LepA. This defines the set of translational GTPases essential for basic cell functions.
Bioinformatics | 2005
Lauris Kaplinski; Reidar Andreson; Tarmo Puurand; Maido Remm
MultiPLX is a new program for automatic grouping of PCR primers. It can use many different parameters to estimate the compatibility of primers, such as primer-primer interactions, primer-product interactions, difference in melting temperatures, difference in product length and the risk of generating alternative products from the template. A unique feature of MultiPLX is the ability to perform automatic grouping of large numbers (thousands) of primer pairs. Availability: Binaries for Windows, Linux and Solaris are available from http://bioinfo.ebc.ee/download/. A graphical version with limited capabilities can be used through a web interface at http://bioinfo.ebc.ee/multiplx/. The source code of the program is available on request for academic users.
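The idea of grouping primer pairs by pairwise compatibility can be sketched with a simple greedy procedure. This is not MultiPLX's actual algorithm, which the abstract does not describe. The function names, the group size limit and the single compatibility criterion (a melting temperature difference of at most 3 °C) are assumptions for illustration; a real tool would combine several of the criteria listed above.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def greedy_groups(items: Sequence[T],
                  compatible: Callable[[T, T], bool],
                  max_size: int) -> List[List[T]]:
    """Place each item in the first group where it is compatible with every member."""
    groups: List[List[T]] = []
    for item in items:
        for group in groups:
            if len(group) < max_size and all(compatible(item, other) for other in group):
                group.append(item)
                break
        else:
            # No existing group accepted the item, so start a new one.
            groups.append([item])
    return groups


# Hypothetical primer pairs: (name, melting temperature in degrees Celsius).
PrimerPair = Tuple[str, float]
pairs: List[PrimerPair] = [("p1", 60.0), ("p2", 61.5), ("p3", 66.0),
                           ("p4", 59.0), ("p5", 65.0)]


def tm_compatible(a: PrimerPair, b: PrimerPair) -> bool:
    """Assumed criterion: melting temperatures differ by at most 3 degrees."""
    return abs(a[1] - b[1]) <= 3.0


groups = greedy_groups(pairs, tm_compatible, max_size=3)
group_names = [[name for name, _ in group] for group in groups]
print(group_names)
```

A greedy pass like this is fast enough for thousands of primer pairs, but it does not guarantee the smallest possible number of groups; the result depends on the input order.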
Toxicon | 2012
Yves Terrat; Daniel Biass; Sébastien Dutertre; Philippe Favreau; Maido Remm; Reto Stöcklin; David Piquemal; Frédéric Ducancel
Although cone snail venoms have been intensively investigated in the past few decades, little is known about the whole conopeptide and protein content in venom ducts, especially at the transcriptomic level. Although most of the previous studies, which focused on a limited number of sequences, have contributed to a better understanding of conopeptide superfamilies, they did not give access to a complete panorama of a whole venom duct. Additionally, rare transcripts were usually not identified due to sampling effect. This work presents the data and analysis of a large number of sequences obtained from high-throughput 454 sequencing technology using venom ducts of Conus consors, an Indo-Pacific piscivorous cone snail. A total of 213,561 Expressed Sequence Tags (ESTs) with an average read length of 218 base pairs (bp) have been obtained. These reads were assembled into 65,536 contiguous DNA sequences (contigs), then into 5039 clusters. The data revealed 11 conopeptide superfamilies representing a total of 53 new isoforms (full-length or nearly full-length sequences). Considerable isoform diversity and major differences in transcription level could be noted between superfamilies. A, O and M superfamilies are the most diverse. The A family isoforms account for more than 70% of the conopeptide cocktail (considering all ESTs before the clustering step). In addition to traditional superfamilies and families, minor transcripts including both cysteine-free and cysteine-rich peptides could be detected, some of them representing new clades of conopeptides. Finally, several sets of transcripts corresponding to proteins commonly recruited in venom function could be identified for the first time in cone snail venom duct. This work provides one of the first large-scale EST projects for a cone snail venom duct using next-generation sequencing, allowing a detailed overview of the venom duct transcripts.
This leads to an expanded definition of the overall cone snail venom duct transcriptomic activity, which goes beyond the cysteine-rich conopeptides. For instance, this study made it possible to detect proteins involved in common post-translational maturation and folding, and to reveal compounds classically involved in hemolysis and mechanical penetration of the venom into the prey. Further comparison with proteomic and genomic data will lead to a better understanding of conopeptide diversity and the underlying mechanisms involved in conopeptide evolution.