Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Jose M. G. Izarzugaza is active.

Publication


Featured researches published by Jose M. G. Izarzugaza.


Proteins | 2007

Assessment of intramolecular contact predictions for CASP7

Jose M. G. Izarzugaza; Osvaldo Graña; Michael L. Tress; Alfonso Valencia; Neil D. Clarke

Predictions of intramolecular residue–residue contacts were assessed as part of the seventh community‐wide Critical Assessment of Structure Prediction experiment (CASP7). As in past assessments, we focused on contacts that lie far apart in sequence as these are likely to be more informative in predicting protein structure. One lab did somewhat better than others according to our assessment, and there is some reason to think that this labs results represent progress over CASP6. In general, contacts inferred from 3D structural predictions are similar in accuracy to those predicted by contact prediction methods. However, contact prediction methods were more accurate for some targets. Proteins 2007.


Nature Communications | 2015

Novel variation and de novo mutation rates in population-wide de novo assembled Danish trios

Søren Besenbacher; Siyang Liu; Jose M. G. Izarzugaza; Jakob Grove; Kirstine Belling; Jette Bork-Jensen; Shujia Huang; Thomas Damm Als; Shengting Li; Rachita Yadav; Arcadio Rubio-García; Francesco Lescai; Ditte Demontis; Junhua Rao; Weijian Ye; Thomas Mailund; Rune M. Friborg; Christian N. S. Pedersen; Ruiqi Xu; Jihua Sun; Hao Liu; Ou Wang; Xiaofang Cheng; David Flores; Emil Rydza; Kristoffer Rapacki; John Damm Sørensen; Piotr Jaroslaw Chmura; David Westergaard; Piotr Dworzynski

Building a population-specific catalogue of single nucleotide variants (SNVs), indels and structural variants (SVs) with frequencies, termed a national pan-genome, is critical for further advancing clinical and public health genetics in large cohorts. Here we report a Danish pan-genome obtained from sequencing 10 trios to high depth (50 × ). We report 536k novel SNVs and 283k novel short indels from mapping approaches and develop a population-wide de novo assembly approach to identify 132k novel indels larger than 10 nucleotides with low false discovery rates. We identify a higher proportion of indels and SVs than previous efforts showing the merits of high coverage and de novo assembly approaches. In addition, we use trio information to identify de novo mutations and use a probabilistic method to provide direct estimates of 1.27e−8 and 1.5e−9 per nucleotide per generation for SNVs and indels, respectively.


Proteins | 2009

Assessment of domain boundary predictions and the prediction of intramolecular contacts in CASP8.

Iakes Ezkurdia; Osvaldo Graña; Jose M. G. Izarzugaza; Michael L. Tress

This article details the assessment process and evaluation results for two categories in the 8th Critical Assessment of Protein Structure Prediction experiment (CASP8). The domain prediction category was evaluated with a range of scores including the Normalized Domain Overlap score and a domain boundary distance measure. Residue‐residue contact predictions were evaluated with standard CASP measures, prediction accuracy, and Xd. In the domain boundary prediction category, prediction methods still make reliable predictions for targets that have structural templates, but continue to struggle to make good predictions for the few ab initio targets in CASP. There was little indication of improvement in the domain prediction category. The contact prediction category demonstrated that there was renewed interest among predictors and despite the small sample size the results suggested that there had been an increase in prediction accuracy. In contrast to CASP7 contact specialists predicted contacts more accurately than the majority of tertiary structure predictors. Despite this small success, the lack of free modeling targets makes it unlikely that either category will be included in their present form in CASP9. Proteins 2009.


PLOS ONE | 2013

Tumor Mutation Burden Forecasts Outcome in Ovarian Cancer with BRCA1 or BRCA2 Mutations

Nicolai Juul Birkbak; Bose S. Kochupurakkal; Jose M. G. Izarzugaza; Aron Charles Eklund; Yang Li; Joyce Liu; Zoltan Szallasi; Ursula A. Matulonis; Andrea L. Richardson; J. Dirk Iglehart; Zhigang C. Wang

Background Increased number of single nucleotide substitutions is seen in breast and ovarian cancer genomes carrying disease-associated mutations in BRCA1 or BRCA2. The significance of these genome-wide mutations is unknown. We hypothesize genome-wide mutation burden mirrors deficiencies in DNA repair and is associated with treatment outcome in ovarian cancer. Methods and Results The total number of synonymous and non-synonymous exome mutations (Nmut), and the presence of germline or somatic mutation in BRCA1 or BRCA2 (mBRCA) were extracted from whole-exome sequences of high-grade serous ovarian cancers from The Cancer Genome Atlas (TCGA). Cox regression and Kaplan-Meier methods were used to correlate Nmut with chemotherapy response and outcome. Higher Nmut correlated with a better response to chemotherapy after surgery. In patients with mBRCA-associated cancer, low Nmut was associated with shorter progression-free survival (PFS) and overall survival (OS), independent of other prognostic factors in multivariate analysis. Patients with mBRCA-associated cancers and a high Nmut had remarkably favorable PFS and OS. The association with survival was similar in cancers with either BRCA1 or BRCA2 mutations. In cancers with wild-type BRCA, tumor Nmut was associated with treatment response in patients with no residual disease after surgery. Conclusions Tumor Nmut was associated with treatment response and with both PFS and OS in patients with high-grade serous ovarian cancer carrying BRCA1 or BRCA2 mutations. In the TCGA cohort, low Nmut predicted resistance to chemotherapy, and for shorter PFS and OS, while high Nmut forecasts a remarkably favorable outcome in mBRCA-associated ovarian cancer. Our observations suggest that the total mutation burden coupled with BRCA1 or BRCA2 mutations in ovarian cancer is a genomic marker of prognosis and predictor of treatment response. This marker may reflect the degree of deficiency in BRCA-mediated pathways, or the extent of compensation for the deficiency by alternative mechanisms.


BMC Bioinformatics | 2009

Extraction of human kinase mutations from literature, databases and genotyping studies

Martin Krallinger; Jose M. G. Izarzugaza; Carlos Rodríguez-Penagos; Alfonso Valencia

BackgroundThere is a considerable interest in characterizing the biological role of specific protein residue substitutions through mutagenesis experiments. Additionally, recent efforts related to the detection of disease-associated SNPs motivated both the manual annotation, as well as the automatic extraction, of naturally occurring sequence variations from the literature, especially for protein families that play a significant role in signaling processes such as kinases. Systematic integration and comparison of kinase mutation information from multiple sources, covering literature, manual annotation databases and large-scale experiments can result in a more comprehensive view of functional, structural and disease associated aspects of protein sequence variants. Previously published mutation extraction approaches did not sufficiently distinguish between two fundamentally different variation origin categories, namely natural occurring and induced mutations generated through in vitro experiments.ResultsWe present a literature mining pipeline for the automatic extraction and disambiguation of single-point mutation mentions from both abstracts as well as full text articles, followed by a sequence validation check to link mutations to their corresponding kinase protein sequences. Each mutation is scored according to whether it corresponds to an induced mutation or a natural sequence variant. We were able to provide direct literature links for a considerable fraction of previously annotated kinase mutations, enabling thus more efficient interpretation of their biological characterization and experimental context. In order to test the capabilities of the presented pipeline, the mutations in the protein kinase domain of the kinase family were analyzed. Using our literature extraction system, we were able to recover a total of 643 mutations-protein associations from PubMed abstracts and 6,970 from a large collection of full text articles. When compared to state-of-the-art annotation databases and high throughput genotyping studies, the mutation mentions extracted from the literature overlap to a good extent with the existing knowledgebases, whereas the remaining mentions suggest new mutation records that were not previously annotated in the databases.ConclusionUsing the proposed residue disambiguation and classification approach, we were able to differentiate between natural variant and mutagenesis types of mutations with an accuracy of 93.88. The resulting system is useful for constructing a Gold Standard set of mutations extracted from the literature by human experts with minimal manual curation effort, providing direct pointers to relevant evidence sentences. Our system is able to recover mutations from the literature that are not present in state-of-the-art databases. Human expert manual validation of a subset of the literature extracted mutations conducted on 100 mutations from PubMed abstracts highlights that almost three quarters (72%) of the extracted mutations turned out to be correct, and more than half of these had not been previously annotated in databases.


EMBO Reports | 2009

From cancer genomes to cancer models: bridging the gaps

Anaïs Baudot; Francisco X. Real; Jose M. G. Izarzugaza; Alfonso Valencia

Cancer genome projects are now being expanded in an attempt to provide complete landscapes of the mutations that exist in tumours. Although the importance of cataloguing genome variations is well recognized, there are obvious difficulties in bridging the gaps between high‐throughput resequencing information and the molecular mechanisms of cancer evolution. Here, we describe the current status of the high‐throughput genomic technologies, and the current limitations of the associated computational analysis and experimental validation of cancer genetic variants. We emphasize how the current cancer‐evolution models will be influenced by the high‐throughput approaches, in particular through efforts devoted to monitoring tumour progression, and how, in turn, the integration of data and models will be translated into mechanistic knowledge and clinical applications.


Proteins | 2009

Cancer-associated mutations are preferentially distributed in protein kinase functional sites

Jose M. G. Izarzugaza; Oliver Redfern; Christine A. Orengo; Alfonso Valencia

Protein kinases are a superfamily involved in many crucial cellular processes, including signal transmission and regulation of cell cycle. As a consequence of this role, kinases have been reported to be associated with many types of cancer and are considered as potential therapeutic targets. We analyzed the distribution of pathogenic somatic point mutations (drivers) in the protein kinase superfamily with respect to their location in the protein, such as in structural, evolutionary, and functionally relevant regions. We find these driver mutations are more clearly associated with key protein features than other somatic mutations (passengers) that have not been directly linked to tumor progression. This observation fits well with the expected implication of the alterations in protein kinase function in cancer pathogenicity. To explain the relevance of the detected association of cancer driver mutations at the molecular level in the human kinome, we compare these with genetically inherited mutations (SNPs). We find that the subset of nonsynonymous SNPs that are associated to disease, but sufficiently mild to the point of being widespread in the population, tend to avoid those key protein regions, where they could be more detrimental for protein function. This tendency contrasts with the one detected for cancer associated‐driver‐mutations, which seems to be more directly implicated in the alteration of protein function. The detailed analysis of protein kinase groups and a number of relevant examples, confirm the relation between cancer associated‐driver‐mutations and key regions for protein kinase structure and function. Proteins 2009.


Nucleic Acids Research | 2006

TSEMA: interactive prediction of protein pairings between interacting families

Jose M. G. Izarzugaza; David Juan; Carles Pons; Juan A. G. Ranea; Alfonso Valencia; Florencio Pazos

An entire family of methodologies for predicting protein interactions is based on the observed fact that families of interacting proteins tend to have similar phylogenetic trees due to co-evolution. One application of this concept is the prediction of the mapping between the members of two interacting protein families (which protein within one family interacts with which protein within the other). The idea is that the real mapping would be the one maximizing the similarity between the trees. Since the exhaustive exploration of all possible mappings is not feasible for large families, current approaches use heuristic techniques which do not ensure the best solution to be found. This is why it is important to check the results proposed by heuristic techniques and to manually explore other solutions. Here we present TSEMA, the server for efficient mapping assessment. This system calculates an initial mapping between two families of proteins based on a Monte Carlo approach and allows the user to interactively modify it based on performance figures and/or specific biological knowledge. All the explored mappings are graphically shown over a representation of the phylogenetic trees. The system is freely available at . Standalone versions of the software behind the interface are available upon request from the authors.


Methods of Molecular Biology | 2008

Prediction of protein interaction based on similarity of phylogenetic trees.

Florencio Pazos; David Juan; Jose M. G. Izarzugaza; Eduardo Leon; Alfonso Valencia

Computational methods for predicting protein interaction partners are becoming increasingly popular. Many of them are mature enough to be widely used by molecular biologists who can look for proteins related to the protein of interest in order to infer information about its context in the cell. In this chapter we describe the use of the mirrortree set of programs and related software for predicting protein interactions. They are all based on the idea that interacting or functionally related proteins tend to show similar phylogenetic trees due to coevolution. The basic mirrortree program can be used to calculate the similarity between the phylogenetic trees implicit in the multiple sequence alignments of two protein families. The ECID database contains protein interactions and relationships from different computational and experimental sources for the model organism Escherichia coli, including the ones generated with mirrortree. Finally, the TSEMA server uses the concept of tree similarity between interacting families to look for the best mapping between two families of interacting proteins: which member in one family interacts with which member in the other.


Journal of Clinical Microbiology | 2016

Propionibacterium acnes: Disease-Causing Agent or Common Contaminant? Detection in Diverse Patient Samples by Next-Generation Sequencing

Sarah Mollerup; Jens Friis-Nielsen; Lasse Vinner; Thomas Arn Hansen; Stine Raith Richter; Helena Fridholm; Jose Alejandro Romero Herrera; Ole Lund; Søren Brunak; Jose M. G. Izarzugaza; Tobias Mourier; Lars Peter Nielsen; Anders J. Hansen

ABSTRACT Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence of P. acnes DNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions of P. acnes DNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples. P. acnes reads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show that P. acnes can be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully when P. acnes is detected in clinical samples. We advocate that detection of P. acnes always be accompanied by experiments validating the association between this bacterium and any clinical condition.

Collaboration


Dive into the Jose M. G. Izarzugaza's collaboration.

Top Co-Authors

Avatar

Søren Brunak

University of Copenhagen

View shared research outputs
Top Co-Authors

Avatar

Alfonso Valencia

Barcelona Supercomputing Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jens Friis-Nielsen

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lasse Vinner

University of Copenhagen

View shared research outputs
Top Co-Authors

Avatar

Tobias Mourier

University of Copenhagen

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Sarah Mollerup

University of Copenhagen

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge