Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Alfonso Valencia is active.

Publication


Featured researches published by Alfonso Valencia.


Nucleic Acids Research | 2007

iHOP web services

José María Fernández; Robert Hoffmann; Alfonso Valencia

iHOP provides fast, accurate, comprehensive, and up-to-date summary information on more than 80u2009000 biological molecules by automatically extracting key sentences from millions of PubMed documents. Its intuitive user interface and navigation scheme have made iHOP extremely successful among biologists, counting more than 500u2009000 visits per month (iHOP access statistics: http://www.ihop-net.org/UniPub/iHOP/info/logs/). Here we describe a public programmatic API that enables the integration of main iHOP functionalities in bioinformatic programs and workflows.


IEEE/ACM Transactions on Computational Biology and Bioinformatics | 2010

An Overview of BioCreative II.5

Florian Leitner; Scott A. Mardis; Martin Krallinger; Gianni Cesareni; Lynette Hirschman; Alfonso Valencia

We present the results of the BioCreative II.5 evaluation in association with the FEBS Letters experiment, where authors created Structured Digital Abstracts to capture information about protein-protein interactions. The BioCreative II.5 challenge evaluated automatic annotations from 15 text mining teams based on a gold standard created by reconciling annotations from curators, authors, and automated systems. The tasks were to rank articles for curation based on curatable protein-protein interactions; to identify the interacting proteins (using UniProt identifiers) in the positive articles (61); and to identify interacting protein pairs. There were 595 full-text articles in the evaluation test set, including those both with and without curatable protein interactions. The principal evaluation metrics were the interpolated area under the precision/recall curve (AUC iP/R), and (balanced) F-measure. For article classification, the best AUC iP/R was 0.70; for interacting proteins, the best system achieved good macroaveraged recall (0.73) and interpolated area under the precision/recall curve (0.58), after filtering incorrect species and mapping homonymous orthologs; for interacting protein pairs, the top (filtered, mapped) recall was 0.42 and AUC iP/R was 0.29. Ensemble systems improved performance for the interacting protein task.


Science Signaling | 2005

Text Mining for Metabolic Pathways, Signaling Cascades, and Protein Networks

Robert Hoffmann; Martin Krallinger; Eduardo Andrés; Javier Tamames; Christian Blaschke; Alfonso Valencia

The complexity of the information stored in databases and publications on metabolic and signaling pathways, the high throughput of experimental data, and the growing number of publications make it imperative to provide systems to help the researcher navigate through these interrelated information resources. Text-mining methods have started to play a key role in the creation and maintenance of links between the information stored in biological databases and its original sources in the literature. These links will be extremely useful for database updating and curation, especially if a number of technical problems can be solved satisfactorily, including the identification of protein and gene names (entities in general) and the characterization of their types of interactions. The first generation of openly accessible text-mining systems, such as iHOP (Information Hyperlinked over Proteins), provides additional functions to facilitate the reconstruction of protein interaction networks, combine database and text information, and support the scientist in the formulation of novel hypotheses. The next challenge is the generation of comprehensive information regarding the general function of signaling pathways and protein interaction networks.


Genome Biology | 2008

Text mining for biology - the way forward: opinions from leading scientists

Russ B. Altman; Casey M. Bergman; Judith A. Blake; Christian Blaschke; Aaron M. Cohen; Frank Gannon; Les Grivell; Udo Hahn; William R. Hersh; Lynette Hirschman; Lars Juhl Jensen; Martin Krallinger; Barend Mons; Seán I. O'Donoghue; Manuel C. Peitsch; Dietrich Rebholz-Schuhmann; Hagit Shatkay; Alfonso Valencia

This article collects opinions from leading scientists about how text mining can provide better access to the biological literature, how the scientific community can help with this process, what the next steps are, and what role future BioCreative evaluations can play. The responses identify several broad themes, including the possibility of fusing literature and biological databases through text mining; the need for user interfaces tailored to different classes of users and supporting community-based annotation; the importance of scaling text mining technology and inserting it into larger workflows; and suggestions for additional challenge evaluations, new applications, and additional resources needed to make progress.


BMC Bioinformatics | 2011

Overview of the BioCreative III Workshop

Cecilia N. Arighi; Zhiyong Lu; Martin Krallinger; Kevin Bretonnel Cohen; W. John Wilbur; Alfonso Valencia; Lynette Hirschman; Cathy H. Wu

BackgroundThe overall goal of the BioCreative Workshops is to promote the development of text mining and text processing tools which are useful to the communities of researchers and database curators in the biological sciences. To this end BioCreative I was held in 2004, BioCreative II in 2007, and BioCreative II.5 in 2009. Each of these workshops involved humanly annotated test data for several basic tasks in text mining applied to the biomedical literature. Participants in the workshops were invited to compete in the tasks by constructing software systems to perform the tasks automatically and were given scores based on their performance. The results of these workshops have benefited the community in several ways. They have 1) provided evidence for the most effective methods currently available to solve specific problems; 2) revealed the current state of the art for performance on those problems; 3) and provided gold standard data and results on that data by which future advances can be gauged. This special issue contains overview papers for the three tasks of BioCreative III.ResultsThe BioCreative III Workshop was held in September of 2010 and continued the tradition of a challenge evaluation on several tasks judged basic to effective text mining in biology, including a gene normalization (GN) task and two protein-protein interaction (PPI) tasks. In total the Workshop involved the work of twenty-three teams. Thirteen teams participated in the GN task which required the assignment of EntrezGene IDs to all named genes in full text papers without any species information being provided to a system. Ten teams participated in the PPI article classification task (ACT) requiring a system to classify and rank a PubMed® record as belonging to an article either having or not having “PPI relevant” information. Eight teams participated in the PPI interaction method task (IMT) where systems were given full text documents and were required to extract the experimental methods used to establish PPIs and a text segment supporting each such method. Gold standard data was compiled for each of these tasks and participants competed in developing systems to perform the tasks automatically.BioCreative III also introduced a new interactive task (IAT), run as a demonstration task. The goal was to develop an interactive system to facilitate a user’s annotation of the unique database identifiers for all the genes appearing in an article. This task included ranking genes by importance (based preferably on the amount of described experimental information regarding genes). There was also an optional task to assist the user in finding the most relevant articles about a given gene. For BioCreative III, a user advisory group (UAG) was assembled and played an important role 1) in producing some of the gold standard annotations for the GN task, 2) in critiquing IAT systems, and 3) in providing guidance for a future more rigorous evaluation of IAT systems. Six teams participated in the IAT demonstration task and received feedback on their systems from the UAG group. Besides innovations in the GN and PPI tasks making them more realistic and practical and the introduction of the IAT task, discussions were begun on community data standards to promote interoperability and on user requirements and evaluation metrics to address utility and usability of systems.ConclusionsIn this paper we give a brief history of the BioCreative Workshops and how they relate to other text mining competitions in biology. This is followed by a synopsis of the three tasks GN, PPI, and IAT in BioCreative III with figures for best participant performance on the GN and PPI tasks. These results are discussed and compared with results from previous BioCreative Workshops and we conclude that the best performing systems for GN, PPI-ACT and PPI-IMT in realistic settings are not sufficient for fully automatic use. This provides evidence for the importance of interactive systems and we present our vision of how best to construct an interactive system for a GN or PPI like task in the remainder of the paper.


Genome Biology | 2006

The success (or not) of HUGO nomenclature

Javier Tamames; Alfonso Valencia

Current usage of gene nomenclature is ambiguous and impairs the efficient handling of scientific information. Therefore it is important to propose guidelines to deal with this problem. This study attempts to evaluate the success of HUGO nomenclature for human genes. The results indicate that HUGO guidelines are not supported by the scientific community.


international acm sigir conference on research and development in information retrieval | 2004

SIGIR 2003 workshop on text analysis and search for bioinformatics

Eric W. Brown; William R. Hersh; Alfonso Valencia

Bioinformatics is generally defined as the application of information technology to help solve problems in cellular and molecular biology. This covers a broad range of topics from computational models of protein folding to the storage, search, and retrieval of gene sequence data. An emerging topic of interest in this area is automatic analysis of the bio-medical scientific literature. The goals in this area generally include providing easy access to specific textual information from a potentially very large corpus, and automatically extracting information from the text in a form amenable to further, possibly more structured analysis.


Genome Informatics | 2001

The Potential Use of SUISEKI as a Protein Interaction Discovery Tool

Christian Blaschke; Alfonso Valencia


Genome Informatics | 2002

Automatic Ontology Construction from the Literature

Christian Blaschke; Alfonso Valencia


Publisher | 2008

Introducing meta-services for biomedical information extraction

Florian Leitner; Martin Krallinger; Carlos Rodríguez-Penagos; Jörg Hakenberg; Conrad Plake; Cheng-Ju Kuo; Chun-Nan Hsu; Richard Tzong-Han Tsai; Hsi-Chuan Hung; William W. Lau; Calvin A. Johnson; Rune Sætre; Kazuhiro Yoshida; Yan-Hua Chen; Sun Kim; Soo-Yong Shin; Byoung-Tak Zhang; William A. Baumgartner; Lawrence Hunter; Barry Haddow; Michael Matthews; Xinglong Wang; Patrick Ruch; Frédéric Ehrler; Arzucan Özgür; Güneş Erkan; Dragomir R. Radev; Michael Krauthammer; Thaibinh Luong; Robert Hoffmann

Collaboration


Dive into the Alfonso Valencia's collaboration.

Top Co-Authors

Avatar

Martin Krallinger

Spanish National Research Council

View shared research outputs
Top Co-Authors

Avatar

Christian Blaschke

Spanish National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Robert Hoffmann

Spanish National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Florian Leitner

Technical University of Madrid

View shared research outputs
Top Co-Authors

Avatar

Javier Tamames

Spanish National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Cathy H. Wu

University of Delaware

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge