Ketil Malde
University of Bergen
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ketil Malde.
Nature | 2011
Bastiaan Star; Sissel Jentoft; Unni Grimholt; Martin Malmstrøm; Tone F. Gregers; Trine B. Rounge; Jonas Paulsen; Monica Hongrø Solbakken; Animesh Sharma; Ola F. Wetten; Anders Lanzén; Roger Winer; James Knight; Jan-Hinnerk Vogel; Bronwen Aken; Øivind Andersen; Karin Lagesen; Ave Tooming-Klunderud; Rolf B. Edvardsen; Kirubakaran G. Tina; Mari Espelund; Chirag Nepal; Christopher Previti; Bård Ove Karlsen; Truls Moum; Morten Skage; Paul R. Berg; Tor Gjøen; Heiner Kuhl; Jim Thorsen
Atlantic cod (Gadus morhua) is a large, cold-adapted teleost that sustains long-standing commercial fisheries and incipient aquaculture. Here we present the genome sequence of Atlantic cod, showing evidence for complex thermal adaptations in its haemoglobin gene cluster and an unusual immune architecture compared to other sequenced vertebrates. The genome assembly was obtained exclusively by 454 sequencing of shotgun and paired-end libraries, and automated annotation identified 22,154 genes. The major histocompatibility complex (MHC) II is a conserved feature of the adaptive immune system of jawed vertebrates, but we show that Atlantic cod has lost the genes for MHC II, CD4 and invariant chain (Ii) that are essential for the function of this pathway. Nevertheless, Atlantic cod is not exceptionally susceptible to disease under natural conditions. We find a highly expanded number of MHC I genes and a unique composition of its Toll-like receptor (TLR) families. This indicates how the Atlantic cod immune system has evolved compensatory mechanisms in both adaptive and innate immunity in the absence of MHC II. These observations affect fundamental assumptions about the evolution of the adaptive immune system and its components in vertebrates.
Bioinformatics | 2010
Susanne Mignon Balzer; Ketil Malde; Anders Lanzén; Animesh Sharma; Inge Jonassen
Motivation: The commercial launch of 454 pyrosequencing in 2005 was a milestone in genome sequencing in terms of performance and cost. Throughout the three available releases, average read lengths have increased to ∼500 base pairs and are thus approaching read lengths obtained from traditional Sanger sequencing. Study design of sequencing projects would benefit from being able to simulate experiments. Results: We explore 454 raw data to investigate its characteristics and derive empirical distributions for the flow values generated by pyrosequencing. Based on our findings, we implement Flowsim, a simulator that generates realistic pyrosequencing data files of arbitrary size from a given set of input DNA sequences. We finally use our simulator to examine the impact of sequence lengths on the results of concrete whole-genome assemblies, and we suggest its use in planning of sequencing projects, benchmarking of assembly methods and other fields. Availability: Flowsim is freely available under the General Public License from http://blog.malde.org/index.php/flowsim/ Contact: [email protected]; [email protected]
intelligent systems in molecular biology | 2011
Susanne Mignon Balzer; Ketil Malde; Inge Jonassen
Motivation: 454 pyrosequencing, by Roche Diagnostics, has emerged as an alternative to Sanger sequencing when it comes to read lengths, performance and cost, but shows higher per-base error rates. Although there are several tools available for noise removal, targeting different application fields, data interpretation would benefit from a better understanding of the different error types. Results: By exploring 454 raw data, we quantify to what extent different factors account for sequencing errors. In addition to the well-known homopolymer length inaccuracies, we have identified errors likely to originate from other stages of the sequencing process. We use our findings to extend the flowsim pipeline with functionalities to simulate these errors, and thus enable a more realistic simulation of 454 pyrosequencing data with flowsim. Availability: The flowsim pipeline is freely available under the General Public License from http://biohaskell.org/Applications/FlowSim. Contact: [email protected]
Bioinformatics | 2003
Ketil Malde; Eivind Coward; Inge Jonassen
MOTIVATION Efficient clustering is important for handling the large amount of available EST sequences. Most contemporary methods are based on some kind of all-against-all comparison, resulting in a quadratic time complexity. A different approach is needed to keep up with the rapid growth of EST data. RESULTS A new, fast EST clustering algorithm is presented. Sub-quadratic time complexity is achieved by using an algorithm based on suffix arrays. A prototype implementation has been developed and run on a benchmark data set. The produced clusterings are validated by comparing them to clusterings produced by other methods, and the results are quite promising. AVAILABILITY The source code for the prototype implementation is available under a GPL license from http://www.ii.uib.no/~ketil/bio/.
BMC Genomics | 2014
Francois Besnier; Matthew Kent; Rasmus Skern-Mauritzen; Sigbjørn Lien; Ketil Malde; Rolf B. Edvardsen; Simon Taylor; Lina Ljungfeldt; Frank Nilsen; Kevin A. Glover
BackgroundThe salmon louse, Lepeophtheirus salmonis, is an ectoparasite of salmonids that causes huge economic losses in salmon farming, and has also been causatively linked with declines of wild salmonid populations. Lice control on farms is reliant upon a few groups of pesticides that have all shown time-limited efficiency due to resistance development. However, to date, this example of human-induced evolution is poorly documented at the population level due to the lack of molecular tools. As such, important evolutionary and management questions, linked to the development and dispersal of pesticide resistance in this parasite, remain unanswered. Here, we introduce the first Single Nucleotide Polymorphism (SNP) array for the salmon louse, which includes 6000 markers, and present a population genomic scan using this array on 576 lice from twelve farms distributed across the North Atlantic.ResultsOur results support the hypothesis of a single panmictic population of lice in the Atlantic, and importantly, revealed very strong selective sweeps on linkage groups 1 and 5. These sweeps included candidate genes potentially connected to pesticide resistance. After genotyping a further 576 lice from 12 full sibling families, a genome-wide association analysis established a highly significant association between the major sweep on linkage group 5 and resistance to emamectin benzoate, the most widely used pesticide in salmonid aquaculture for more than a decade.ConclusionsThe analysis of conserved haplotypes across samples from the Atlantic strongly suggests that emamectin benzoate resistance developed at a single source, and rapidly spread across the Atlantic within the period 1999 when the chemical was first introduced, to 2010 when samples for the present study were obtained. These results provide unique insights into the development and spread of pesticide resistance in the marine environment, and identify a small genomic region strongly linked to emamectin benzoate resistance. Finally, these results have highly significant implications for the way pesticide resistance is considered and managed within the aquaculture industry.
Nutrition & Metabolism | 2010
Marian Kjellevold Malde; Susanne Bügel; Mette Kristensen; Ketil Malde; Ingvild Eide Graff; Jan I. Pedersen
BackgroundCalcium (Ca) - fortified foods are likely to play an important role in helping the consumer achieve an adequate Ca intake, especially for persons with a low intake of dairy products. Fish bones have a high Ca content, and huge quantities of this raw material are available as a by-product from the fish industry. Previously, emphasis has been on producing high quality products from fish by-products by use of bacterial proteases. However, documentation of the nutritional value of the enzymatically rinsed Ca-rich bone fraction remains unexplored. The objective of the present study was to assess the bioavailability of calcium in bones of Atlantic salmon (oily fish) and Atlantic cod (lean fish) in a double-blinded randomised crossover design.MethodsCa absorption was measured in 10 healthy young men using 47Ca whole body counting after ingestion of a test meal extrinsically labelled with the 47Ca isotope. The three test meals contained 800 mg of Ca from three different calcium sources: cod bones, salmon bones and control (CaCO3).ResultsMean Ca absorption (± SEE) from the three different Ca sources were 21.9 ± 1.7%, 22.5 ± 1.7% and 27.4 ± 1.8% for cod bones, salmon bones, and control (CaCO3), respectively.ConclusionWe conclude that bones from Atlantic salmon and Atlantic cod are suitable as natural Ca sources in e.g. functional foods or as supplements.
Bioinformatics | 2005
Ketil Malde; Eivind Coward; Inge Jonassen
MOTIVATION EST sequences constitute an abundant, yet error prone resource for computational biology. Expressed sequences are important in gene discovery and identification, and they are also crucial for the discovery and classification of alternative splicing. An important challenge when processing EST sequences is the reconstruction of mRNA by assembling EST clusters into consensus sequences. RESULTS In contrast to the more established assembly tools, we propose an algorithm that constructs a graph over sequence fragments of fixed size, and produces consensus sequences as traversals of this graph. We provide a tool implementing this algorithm, and perform an experiment where the consensus sequences produced by our implementation, as well as by currently available tools, are compared to mRNA. The results show that our proposed algorithm in a majority of the cases produces consensus of higher quality than the established sequence assemblers and at a competitive speed. AVAILABILITY The source code for the implementation is available under a GPL license from http://www.ii.uib.no/~ketil/bioinformatics/ CONTACT [email protected].
Fish & Shellfish Immunology | 2009
Sonal Patel; Ketil Malde; Anders Lanzén; Rolf Hetlelid Olsen; Audun Helge Nerland
To identify and characterize genes and proteins of the Atlantic halibut (Hippoglossus hippoglossus) immune system, six cDNA libraries were constructed from liver, kidney, spleen, peripheral blood, and thymus. Halibut were injected with nodavirus, infectious pancreatic necrosis virus (IPNV), or vibriosis vaccine and tissue samples were collected at various time points. Leukocytes from peripheral blood and spleen from stimulated and mock-injected fish were isolated and further in vitro activated with the mitogens, concanavalin A (Con A) and phorbol myristate acetate (PMA) to facilitate activation and proliferation. A total of 5117 high quality expressed sequence tags (ESTs) were identified and assembled into 781 contigs and 2796 singletons. Amongst these ESTs, 147 different putative immune related genes were identified. Several genes involved in innate and adaptive immune responses such as complement proteins, immunoglobulins, cell surface receptors, and cytokines and chemokines were identified. Of the immune related genes identified in this study, 44% had no match against any of the publicly available sequence data for halibut and thus can be considered as novel identification in halibut species. The approach of combining in vivo antigenic with in vitro mitogen stimulation, in addition to preparation of cDNA libraries from thymus enabled identification of many of the interesting genes including those involved in T-cell receptor complex.
Bioinformatics | 2006
Ketil Malde; Korbinian Schneeberger; Eivind Coward; Inge Jonassen
MOTIVATION Repeat sequences in ESTs are a source of problems, in particular for clustering. ESTs are therefore commonly masked against a library of known repeats. High quality repeat libraries are available for the widely studied organisms, but for most other organisms the lack of such libraries is likely to compromise the quality of EST analysis. RESULTS We present a fast, flexible and library-less method for masking repeats in EST sequences, based on match statistics within the EST collection. The method is not linked to a particular clustering algorithm. Extensive testing on datasets using different clustering methods and a genomic mapping as reference shows that this method gives results that are better than or as good as those obtained using RepeatMasker with a repeat library. AVAILABILITY The implementation of RBR is available under the terms of the GPL from http://www.ii.uib.no/~ketil/bioinformatics CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Bioinformatics | 2013
Susanne Mignon Balzer; Ketil Malde; Markus A. Grohme; Inge Jonassen
Motivation: Throughout the recent years, 454 pyrosequencing has emerged as an efficient alternative to traditional Sanger sequencing and is widely used in both de novo whole-genome sequencing and metagenomics. Especially the latter application is extremely sensitive to sequencing errors and artificially duplicated reads. Both are common in 454 pyrosequencing and can create a strong bias in the estimation of diversity and composition of a sample. To date, there are several tools that aim to remove both sequencing noise and duplicates. Nevertheless, duplicate removal is often based on nucleotide sequences rather than on the underlying flow values, which contain additional information. Results: With the novel tool JATAC, we present an approach towards a more accurate duplicate removal by analysing flow values directly. Making use of previous findings on 454 flow data characteristics, we combine read clustering with Bayesian distance measures. Finally, we provide a benchmark with an existing algorithm. Availability: JATAC is freely available under the General Public License from http://malde.org/ketil/jatac/. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online