Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Carsten Friis is active.

Publication


Featured researches published by Carsten Friis.


Journal of Clinical Microbiology | 2012

Multilocus Sequence Typing of Total Genome Sequenced Bacteria

Mette Voldby Larsen; Salvatore Cosentino; Simon Rasmussen; Carsten Friis; Henrik Hasman; Rasmus Lykke Marvig; Lars Jelsbak; Thomas Sicheritz-Pontén; David W. Ussery; Frank Møller Aarestrup; Ole Lund

ABSTRACT Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.


Microbial Ecology | 2011

The Salmonella enterica pan-genome.

Annika Jacobsen; Rene S. Hendriksen; Frank M. Aaresturp; David W. Ussery; Carsten Friis

Salmonella enterica is divided into four subspecies containing a large number of different serovars, several of which are important zoonotic pathogens and some show a high degree of host specificity or host preference. We compare 45 sequenced S. enterica genomes that are publicly available (22 complete and 23 draft genome sequences). Of these, 35 were found to be of sufficiently good quality to allow a detailed analysis, along with two Escherichia coli strains (K-12 substr. DH10B and the avian pathogenic E. coli (APEC O1) strain). All genomes were subjected to standardized gene finding, and the core and pan-genome of Salmonella were estimated to be around 2,800 and 10,000 gene families, respectively. The constructed pan-genomic dendrograms suggest that gene content is often, but not uniformly correlated to serotype. Any given Salmonella strain has a large stable core, whilst there is an abundance of accessory genes, including the Salmonella pathogenicity islands (SPIs), transposable elements, phages, and plasmid DNA. We visualize conservation in the genomes in relation to chromosomal location and DNA structural features and find that variation in gene content is localized in a selection of variable genomic regions or islands. These include the SPIs but also encompass phage insertion sites and transposable elements. The islands were typically well conserved in several, but not all, isolates—a difference which may have implications in, e.g., host specificity.


BMC Genomics | 2012

snpTree - a web-server to identify and construct SNP trees from whole genome sequence data

Pimlapas Leekitcharoenphon; Rolf Sommer Kaas; Martin Christen Frølund Thomsen; Carsten Friis; Simon Rasmussen; Frank Møller Aarestrup

BackgroundThe advances and decreasing economical cost of whole genome sequencing (WGS), will soon make this technology available for routine infectious disease epidemiology. In epidemiological studies, outbreak isolates have very little diversity and require extensive genomic analysis to differentiate and classify isolates. One of the successfully and broadly used methods is analysis of single nucletide polymorphisms (SNPs). Currently, there are different tools and methods to identify SNPs including various options and cut-off values. Furthermore, all current methods require bioinformatic skills. Thus, we lack a standard and simple automatic tool to determine SNPs and construct phylogenetic tree from WGS data.ResultsHere we introduce snpTree, a server for online-automatic SNPs analysis. This tool is composed of different SNPs analysis suites, perl and python scripts. snpTree can identify SNPs and construct phylogenetic trees from WGS as well as from assembled genomes or contigs. WGS data in fastq format are aligned to reference genomes by BWA while contigs in fasta format are processed by Nucmer. SNPs are concatenated based on position on reference genome and a tree is constructed from concatenated SNPs using FastTree and a perl script. The online server was implemented by HTML, Java and python script.The server was evaluated using four published bacterial WGS data sets (V. cholerae, S. aureus CC398, S. Typhimurium and M. tuberculosis). The evalution results for the first three cases was consistent and concordant for both raw reads and assembled genomes. In the latter case the original publication involved extensive filtering of SNPs, which could not be repeated using snpTree.ConclusionsThe snpTree server is an easy to use option for rapid standardised and automatic SNP analysis in epidemiological studies also for users with limited bioinformatic experience. The web server is freely accessible at http://www.cbs.dtu.dk/services/snpTree-1.0/.


Plant Physiology | 2009

The proteome of seed development in the model legume Lotus japonicus

Svend Secher Dam; Brian S. Laursen; Jane H. Ørnfelt; Bjarne Jochimsen; Hans Henrik Stærfeldt; Carsten Friis; Kasper Nielsen; Nicolas Goffard; Søren Besenbacher; Lene Krusell; Shusei Sato; Satoshi Tabata; Ida B. Thøgersen; Jan J. Enghild; Jens Stougaard

We have characterized the development of seeds in the model legume Lotus japonicus. Like soybean (Glycine max) and pea (Pisum sativum), Lotus develops straight seed pods and each pod contains approximately 20 seeds that reach maturity within 40 days. Histological sections show the characteristic three developmental phases of legume seeds and the presence of embryo, endosperm, and seed coat in desiccated seeds. Furthermore, protein, oil, starch, phytic acid, and ash contents were determined, and this indicates that the composition of mature Lotus seed is more similar to soybean than to pea. In a first attempt to determine the seed proteome, both a two-dimensional polyacrylamide gel electrophoresis approach and a gel-based liquid chromatography-mass spectrometry approach were used. Globulins were analyzed by two-dimensional polyacrylamide gel electrophoresis, and five legumins, LLP1 to LLP5, and two convicilins, LCP1 and LCP2, were identified by matrix-assisted laser desorption ionization quadrupole/time-of-flight mass spectrometry. For two distinct developmental phases, seed filling and desiccation, a gel-based liquid chromatography-mass spectrometry approach was used, and 665 and 181 unique proteins corresponding to gene accession numbers were identified for the two phases, respectively. All of the proteome data, including the experimental data and mass spectrometry spectra peaks, were collected in a database that is available to the scientific community via a Web interface (http://www.cbs.dtu.dk/cgi-bin/lotus/db.cgi). This database establishes the basis for relating physiology, biochemistry, and regulation of seed development in Lotus. Together with a new Web interface (http://bioinfoserver.rsbs.anu.edu.au/utils/PathExpress4legumes/) collecting all protein identifications for Lotus, Medicago, and soybean seed proteomes, this database is a valuable resource for comparative seed proteomics and pathway analysis within and beyond the legume family.


PLOS ONE | 2010

Genomic Characterization of Campylobacter jejuni Strain M1

Carsten Friis; Trudy M. Wassenaar; Muhammad Afzal Javed; Lars Snipen; Karin Lagesen; Peter F. Hallin; Diane G. Newell; Monique Toszeghy; Anne J. Ridley; Georgina Manning; David W. Ussery

Campylobacter jejuni strain M1 (laboratory designation 99/308) is a rarely documented case of direct transmission of C. jejuni from chicken to a person, resulting in enteritis. We have sequenced the genome of C. jejuni strain M1, and compared this to 12 other C. jejuni sequenced genomes currently publicly available. Compared to these, M1 is closest to strain 81116. Based on the 13 genome sequences, we have identified the C. jejuni pan-genome, as well as the core genome, the auxiliary genes, and genes unique between strains M1 and 81116. The pan-genome contains 2,427 gene families, whilst the core genome comprised 1,295 gene families, or about two-thirds of the gene content of the average of the sequenced C. jejuni genomes. Various comparison and visualization tools were applied to the 13 C. jejuni genome sequences, including a species pan- and core genome plot, a BLAST Matrix and a BLAST Atlas. Trees based on 16S rRNA sequences and on the total gene families in each genome are presented. The findings are discussed in the background of the proven virulence potential of M1.


Research in Microbiology | 1999

Three views of microbial genomes.

Lars Juhl Jensen; Carsten Friis; David W. Ussery

We describe here GenomeAtlases as a method for visualising three different aspects of complete microbial chromosomes: repeats, DNA structural characteristics, and base composition. We have applied this method to all publicly available genomes, and find a general strand preference of global repeats. The atlas for the Mycoplasma genitalium genome is presented as an example, and results from all three views are consistent with known characteristics of the genome.


Genome Biology | 2006

An environmental signature for 323 microbial genomes based on codon adaptation indices

Hanni Willenbrock; Carsten Friis; Agnieszka Sierakowska Juncker; David W. Ussery

BackgroundCodon adaptation indices (CAIs) represent an evolutionary strategy to modulate gene expression and have widely been used to predict potentially highly expressed genes within microbial genomes. Here, we evaluate and compare two very different methods for estimating CAI values, one corresponding to translational codon usage bias and the second obtained mathematically by searching for the most dominant codon bias.ResultsThe level of correlation between these two CAI methods is a simple and intuitive measure of the degree of translational bias in an organism, and from this we confirm that fast replicating bacteria are more likely to have a dominant translational codon usage bias than are slow replicating bacteria, and that this translational codon usage bias may be used for prediction of highly expressed genes. By analyzing more than 300 bacterial genomes, as well as five fungal genomes, we show that codon usage preference provides an environmental signature by which it is possible to group bacteria according to their lifestyle, for instance soil bacteria and soil symbionts, spore formers, enteric bacteria, aquatic bacteria, and intercellular and extracellular pathogens.ConclusionThe results and the approach described here may be used to acquire new knowledge regarding species lifestyle and to elucidate relationships between organisms that are far apart evolutionarily.


BMC Genomics | 2012

Genomic variation in Salmonella enterica core genes for epidemiological typing

Pimlapas Leekitcharoenphon; Oksana Lukjancenko; Carsten Friis; Frank Møller Aarestrup; David W. Ussery

BackgroundTechnological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over time. The core genes--the genes that are conserved in all (or most) members of a genus or species--are potentially good candidates for investigating genomic variation in phylogeny and epidemiology.ResultsWe identify a set of 2,882 core genes clusters based on 73 publicly available Salmonella enterica genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher confidence. The core genes can be divided into two categories: a few highly variable genes and a larger set of conserved core genes, with low variance. For the most variable core genes, the variance in amino acid sequences is higher than for the corresponding nucleotide sequences, suggesting that there is a positive selection towards mutations leading to amino acid changes.ConclusionsGenomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important especially in trend analysis.


Molecular Genetics and Genomics | 2010

A virulent parent with probiotic progeny: comparative genomics of Escherichia coli strains CFT073, Nissle 1917 and ABU 83972.

Rebecca Munk Vejborg; Carsten Friis; Viktoria Hancock; Mark A. Schembri; Per Klemm

Escherichia coli is a highly versatile species encompassing a diverse spectrum of strains, i.e. from highly virulent isolates causing serious infectious diseases to commensals and probiotic strains. Although much is known about bacterial pathogenicity in E. coli, the understanding of which genetic determinants differentiates a virulent from an avirulent strain still remains limited. In this study we designed a new comparative genomic hybridization microarray based on 31 sequenced E. coli strains and used it to compare two E. coli strains used as prophylactic agents (i.e. Nissle 1917 and 83972) with the highly virulent uropathogen CFT073. Only relatively minor genetic variations were found between the isolates, suggesting that the three strains may have originated from the same virulent ancestral parent. Interestingly, Nissle 1917 (a gut commensal strain) was more similar to CFT073 with respect to genotype and phenotype than 83972 (an asymptomatic bacteriuria strain). The study indicates that genetic variations (e.g. mutations) and expression differences, rather than genomic content per se, contribute to the divergence in disease-causing ability between these strains. This has implications for the use of virulence factors in epidemiological research, and emphasizes the need for more comparative genomic studies of closely related strains to compare their virulence potential.


Nucleic Acids Research | 2003

GenePublisher: automated analysis of DNA microarray data

Steen Knudsen; Christopher T. Workman; Thomas Sicheritz-Pontén; Carsten Friis

GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with a specification of the data. The server performs normalization, statistical analysis and visualization of the data. The results are run against databases of signal transduction pathways, metabolic pathways and promoter sequences in order to extract more information. The results of the entire analysis are summarized in report form and returned to the user.

Collaboration


Dive into the Carsten Friis's collaboration.

Top Co-Authors

Avatar

David W. Ussery

University of Arkansas for Medical Sciences

View shared research outputs
Top Co-Authors

Avatar

Frank Møller Aarestrup

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Hanni Willenbrock

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Hans Henrik Stærfeldt

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Lars Jelsbak

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kasper Nielsen

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Kristoffer Kiil

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Lars Juhl Jensen

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Lei Yang

Technical University of Denmark

View shared research outputs
Researchain Logo
Decentralizing Knowledge