Max Käller
Royal Institute of Technology
Publication
Featured research published by Max Käller.
Nature | 2013
Björn Nystedt; Nathaniel R. Street; Anna Wetterbom; Andrea Zuccolo; Yao-Cheng Lin; Douglas G. Scofield; Francesco Vezzi; Nicolas Delhomme; Stefania Giacomello; Andrey Alexeyenko; Riccardo Vicedomini; Kristoffer Sahlin; Ellen Sherwood; Malin Elfstrand; Lydia Gramzow; Kristina Holmberg; Jimmie Hällman; Olivier Keech; Lisa Klasson; Maxim Koriabine; Melis Kucukoglu; Max Käller; Johannes Luthman; Fredrik Lysholm; Totte Niittylä; Åke Olson; Nemanja Rilakovic; Carol Ritland; Josep A. Rosselló; Juliana Stival Sena
Conifers have dominated forests for more than 200 million years and are of huge ecological and economic importance. Here we present the draft assembly of the 20-gigabase genome of Norway spruce (Picea abies), the first available for any gymnosperm. The number of well-supported genes (28,354) is similar to the >100 times smaller genome of Arabidopsis thaliana, and there is no evidence of a recent whole-genome duplication in the gymnosperm lineage. Instead, the large genome size seems to result from the slow and steady accumulation of a diverse set of long-terminal repeat transposable elements, possibly owing to the lack of an efficient elimination mechanism. Comparative sequencing of Pinus sylvestris, Abies sibirica, Juniperus communis, Taxus baccata and Gnetum gnemon reveals that the transposable element diversity is shared among extant conifers. Expression of 24-nucleotide small RNAs, previously implicated in transposable element silencing, is tissue-specific and much lower than in other plants. We further identify numerous long (>10,000 base pairs) introns, gene-like fragments, uncharacterized long non-coding RNAs and short RNAs. This opens up new genomic avenues for conifer forestry and breeding.
Bioinformatics | 2016
Philip Ewels; Måns Magnusson; Sverker Lundin; Max Käller
Motivation: Fast and accurate quality control is essential for studies involving next-generation sequencing data. Whilst numerous tools exist to quantify QC metrics, there is no common approach to flexibly integrate these across tools and large sample sets. Assessing analysis results across an entire project can be time consuming and error prone; batch effects and outlier samples can easily be missed in the early stages of analysis. Results: We present MultiQC, a tool to create a single report visualising output from multiple tools across many samples, enabling global trends and biases to be quickly identified. MultiQC can plot data from many common bioinformatics tools and is built to allow easy extension and customization. Availability and implementation: MultiQC is available with a GNU GPLv3 license on GitHub, the Python Package Index and Bioconda. Documentation and example reports are available at http://multiqc.info
Bioinformatics | 2010
Henrik Stranneheim; Max Käller; Tobias Allander; Björn Andersson; Lars Arvestad; Joakim Lundeberg
Motivation: New generation sequencing technologies producing increasingly complex datasets demand new efficient and specialized sequence analysis algorithms. Often, it is only the ‘novel’ sequences in a complex dataset that are of interest and the superfluous sequences need to be removed. Results: A novel algorithm, fast and accurate classification of sequences (FACS), is introduced that can accurately and rapidly classify sequences as belonging or not belonging to a reference sequence. FACS was first optimized and validated using a synthetic metagenome dataset. An experimental metagenome dataset was then used to show that FACS achieves accuracy comparable to that of BLAT and SSAHA2 but is at least 21 times faster in classifying sequences. Availability: Source code for FACS, Bloom filters and MetaSim dataset used is available at http://facs.biotech.kth.se. The Bloom::Faster 1.6 Perl module can be downloaded from CPAN at http://search.cpan.org/~palvaro/Bloom-Faster-1.6/ Supplementary information: Supplementary data are available at Bioinformatics online.
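The abstract above names Bloom filters as the data structure behind the classifier but does not describe the procedure. The following is a minimal sketch of the general idea, not the published FACS implementation (which relies on the Bloom::Faster Perl module): the k-mers of a reference are stored in a Bloom filter, and a read is classified as belonging to the reference when a large enough fraction of its k-mers is found in the filter. The names `BloomFilter`, `build_reference_filter` and `matches_reference`, the hashing scheme, and the parameters `k`, `num_bits`, `num_hashes` and `min_fraction` are all assumptions made for illustration.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter: no false negatives, rare false positives."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive several bit positions from one SHA-256 digest
        # (double hashing); this scheme is an illustrative choice.
        digest = hashlib.sha256(item.encode("ascii")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def kmers(seq, k):
    """Yield every overlapping substring of length k."""
    return (seq[i:i + k] for i in range(len(seq) - k + 1))


def build_reference_filter(reference, k, num_bits=1 << 20, num_hashes=4):
    """Insert all k-mers of the reference sequence into a Bloom filter."""
    bloom = BloomFilter(num_bits, num_hashes)
    for word in kmers(reference, k):
        bloom.add(word)
    return bloom


def matches_reference(read, bloom, k, min_fraction=0.8):
    """Classify a read by the fraction of its k-mers found in the filter."""
    words = list(kmers(read, k))
    if not words:
        return False  # read shorter than k: cannot be classified
    hits = sum(1 for word in words if word in bloom)
    return hits / len(words) >= min_fraction
```

For example, with `bloom = build_reference_filter(reference, k=8)`, a read copied from `reference` is always reported as matching, because a Bloom filter never loses an inserted item. An unrelated read is rejected unless enough of its k-mers collide by chance, which becomes less likely as `num_bits` grows relative to the number of stored k-mers.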
Pigment Cell & Melanoma Research | 2009
Veronica Höiom; Rainer Tuominen; Max Käller; Diana Lindén; Afshin Ahmadian; Eva Månsson-Brahme; Suzanne Egyhazi; Klas Sjöberg; Joakim Lundeberg; Johan Hansson
The genetic background of cutaneous malignant melanoma (CMM) includes both germ line aberrations in high‐penetrance genes, like CDKN2A, and allelic variation in low‐penetrance genes like the melanocortin‐1 receptor gene, MC1R. Red‐hair colour associated MC1R alleles (RHC) have been associated with red hair, fair skin and risk of CMM. We investigated MC1R and CDKN2A variation in relation to phenotype, clinical factors and CMM risk in the Swedish population. The study cohort consisted of sporadic primary melanoma patients, familial melanoma patients and a control group. An allele‐dose dependent increase in melanoma risk for carriers of variant MC1R alleles (after adjusting for phenotype), with an elevated risk among familial CMM patients, was observed. This elevated risk was found to be significantly associated with an increased frequency of dysplastic nevi (DN) among familial patients compared to sporadic patients. MC1R variation was found to be less frequent among acral lentiginous melanomas (ALM) and dependent on tumour localisation. No association was found between CDKN2A gene variants and general melanoma risk. Two new variants in the POMC gene were identified in red haired individuals without RHC alleles.
Expert Review of Molecular Diagnostics | 2007
Max Käller; Joakim Lundeberg; Afshin Ahmadian
Over the last few years, several initiatives have described efforts to combine previously invented techniques in molecular biology with parallel detection principles to sequence or genotype DNA signatures. The Infinium® system from Illumina and the Affymetrix GeneChips® are two systems suitable for whole-genome scoring of variable positions. However, directed candidate-gene approaches are more cost effective and several academic groups and the private sector provide techniques with moderate typing throughput combined with large sample capacity suiting these needs. Recently, whole-genome sequencing platforms based on the sequencing-by-synthesis principle were presented by 454 Life Sciences and Solexa, showing great potential as alternatives to conventional genotyping approaches. In addition to these sequencing initiatives, many efforts are pursuing novel ideas to facilitate fast and cost-effective whole genome sequencing, such as ligation-based sequencing. Reliable methods for routine re-sequencing of human genomes as a tool for personalized medicine, however, remain to be developed.
European Respiratory Journal | 2016
Johan Grunewald; Ylva Kaiser; Mahyar Ostadkarampour; Natalia V. Rivera; Francesco Vezzi; Britta Lötstedt; Lina Sylwan; Sverker Lundin; Max Käller; Tatiana Sandalova; Kerstin M. Ahlgren; Jan Wahlström; Adnane Achour; Marcus Ronninger; Anders Eklund
In pulmonary sarcoidosis, CD4+ T-cells expressing T-cell receptor Vα2.3 accumulate in the lungs of HLA-DRB1*03+ patients. To investigate T-cell receptor-HLA-DRB1*03 interactions underlying recognition of hitherto unknown antigens, we performed detailed analyses of T-cell receptor expression on bronchoalveolar lavage fluid CD4+ T-cells from sarcoidosis patients. Pulmonary sarcoidosis patients (n=43) underwent bronchoscopy with bronchoalveolar lavage. T-cell receptor α and β chains of CD4+ T-cells were analysed by flow cytometry, DNA-sequenced, and three-dimensional molecular models of T-cell receptor-HLA-DRB1*03 complexes generated. Simultaneous expression of Vα2.3 with the Vβ22 chain was identified in the lungs of all HLA-DRB1*03+ patients. Accumulated Vα2.3/Vβ22-expressing T-cells were highly clonal, with identical or near-identical Vα2.3 chain sequences and inter-patient similarities in Vβ22 chain amino acid distribution. Molecular modelling revealed specific T-cell receptor-HLA-DRB1*03-peptide interactions, with a previously identified, sarcoidosis-associated vimentin peptide, (Vim)429–443 DSLPLVDTHSKRTLL, matching both the HLA peptide-binding cleft and distinct T-cell receptor features perfectly. We demonstrate, for the first time, the accumulation of large clonal populations of specific Vα2.3/Vβ22 T-cell receptor-expressing CD4+ T-cells in the lungs of HLA-DRB1*03+ sarcoidosis patients. Several distinct contact points between Vα2.3/Vβ22 receptors and HLA-DRB1*03 molecules suggest presentation of prototypic vimentin-derived peptides. Clonal CD4+ lung T-cells associating with HLA-DRB1*03 molecules indicate specific antigens in pulmonary sarcoidosis http://ow.ly/UB81x
Scientific Reports | 2013
Sverker Lundin; Joel Gruselius; Björn Nystedt; Preben Lexow; Max Käller; Joakim Lundeberg
Here we demonstrate the use of short-read massive sequencing systems to in effect achieve longer read lengths through hierarchical molecular tagging. We show how indexed and PCR-amplified targeted libraries are degraded, sub-sampled and arrested at timed intervals to achieve pools of differing average length, each of which is indexed with a new tag. By this process, indices of sample origin, molecular origin, and degree of degradation are incorporated in order to achieve a nested hierarchical structure, later to be utilized in the data processing to order the reads over a longer distance than the sequencing system originally allows. With this protocol we show how continuous regions beyond 3000 bp can be decoded by an Illumina sequencing system, and we illustrate the potential applications by calling variants of the lambda genome, analysing TP53 in cancer cell lines, and targeting a variable canine mitochondrial region.
BMC Genomics | 2012
Beata Werne Solnestam; Henrik Stranneheim; Jimmie Hällman; Max Käller; Emma Lundberg; Joakim Lundeberg; Pelin Akan
Background: The majority of published gene-expression studies have used RNA isolated from whole cells, overlooking the potential impact of including nuclear transcriptome in the analyses. In this study, mRNA fractions from the cytoplasm and from whole cells (total RNA) were prepared from three human cell lines and sequenced using massive parallel sequencing. Results: For all three cell lines, of about 15000 detected genes approximately 400 to 1400 genes were detected in different amounts in the cytoplasmic and total RNA fractions. Transcripts detected at higher levels in the total RNA fraction had longer coding sequences and higher number of miRNA target sites. Transcripts detected at higher levels in the cytoplasmic fraction were shorter or contained shorter untranslated regions. Nuclear retention of transcripts and mRNA degradation via miRNA pathway might contribute to this differential detection of genes. The consequence of the differential detection was further investigated by comparison to proteomics data. Interestingly, the expression profiles of cytoplasmic and total RNA correlated equally well with protein abundance levels indicating regulation at a higher level. Conclusions: We conclude that expression levels derived from the total RNA fraction can be regarded as an appropriate estimate of the amount of mRNAs present in a given cell population, independent of the coding sequence length or UTRs.
Human Mutation | 2017
Daniel Nilsson; Maria Pettersson; Peter Gustavsson; Alisa Förster; Wolfgang Hofmeister; Josephine Wincent; Vasilios Zachariadis; Britt-Marie Anderlid; Ann Nordgren; Outi Mäkitie; Valtteri Wirta; Max Käller; Francesco Vezzi; James R. Lupski; Magnus Nordenskjöld; Elisabeth Syk Lundberg; Claudia M.B. Carvalho; Anna Lindstrand
Most balanced translocations are thought to result mechanistically from nonhomologous end joining or, in rare cases of recurrent events, by nonallelic homologous recombination. Here, we use low‐coverage mate pair whole‐genome sequencing to fine map rearrangement breakpoint junctions in both phenotypically normal and affected translocation carriers. In total, 46 junctions from 22 carriers of balanced translocations were characterized. Genes were disrupted in 48% of the breakpoints; recessive genes in four normal carriers and known dominant intellectual disability genes in three affected carriers. Finally, seven candidate disease genes were disrupted in five carriers with neurocognitive disabilities (SVOPL, SUSD1, TOX, NCALD, SLC4A10) and one XX‐male carrier with Tourette syndrome (LYPD6, GPC5). Breakpoint junction analyses revealed microhomology and small templated insertions in a substantive fraction of the analyzed translocations (17.4%; n = 4); an observation that was substantiated by reanalysis of 37 previously published translocation junctions. Microhomology associated with templated insertions is a characteristic seen in the breakpoint junctions of rearrangements mediated by error‐prone replication‐based repair mechanisms. Our data suggest that a mechanism involving template switching might contribute to the formation of at least 15% of the interchromosomal translocation events.
PLOS ONE | 2014
Johanna Hasmats; Henrik Gréen; Cedric Orear; Pierre Validire; Mikael Huss; Max Käller; Joakim Lundeberg
Exome sequence capture and massively parallel sequencing can be combined to achieve inexpensive and rapid global analyses of the functional sections of the genome. The difficulties of working with relatively small quantities of genetic material, as may be necessary when sharing tumor biopsies between collaborators for instance, can be overcome using whole genome amplification. However, the potential drawbacks of using a whole genome amplification technology based on random primers in combination with sequence capture followed by massively parallel sequencing have not yet been examined in detail, especially in the context of mutation discovery in tumor material. In this work, we compare mutations detected in sequence data for unamplified DNA, whole genome amplified DNA, and RNA originating from the same tumor tissue samples from 16 patients diagnosed with non-small cell lung cancer. The results obtained provide a comprehensive overview of the merits of these techniques for mutation analysis. We evaluated the identified genetic variants, and found that most (74%) of them were observed in both the amplified and the unamplified sequence data. Eighty-nine percent of the variations found by WGA were shared with unamplified DNA. We demonstrate a strategy for avoiding allelic bias by including RNA-sequencing information.
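The two percentages quoted above are overlap statistics between variant call sets, and they use different denominators. The sketch below shows one plausible way to compute both; it is an illustration under stated assumptions, not the authors' pipeline. Variants are represented as hypothetical `(chromosome, position, ref, alt)` tuples, and the function names `fraction_shared` and `fraction_in_both` are invented for this example.

```python
def fraction_shared(calls, other_calls):
    """Fraction of `calls` that also appear in `other_calls`.

    Denominator: the variants in `calls` only.
    """
    calls = set(calls)
    if not calls:
        return 0.0
    return len(calls & set(other_calls)) / len(calls)


def fraction_in_both(calls_a, calls_b):
    """Fraction of all distinct variants that appear in both call sets.

    Denominator: the union of the two call sets.
    """
    a, b = set(calls_a), set(calls_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)
```

With made-up toy data in which the amplified sample has four variants, three of which also occur among the four variants of the unamplified sample, `fraction_shared(amplified, unamplified)` gives 0.75 while `fraction_in_both(amplified, unamplified)` gives 0.6, which shows why the choice of denominator matters when comparing such figures.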