José Carlos Cortizo | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where José Carlos Cortizo is active.

Explore More

Publication

Featured researches published by José Carlos Cortizo.

International Journal of Electronic Commerce | 2011

Introduction to the Special Issue: Mining Social Media

José Carlos Cortizo; Francisco M. Carrero; José María Gómez

Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and should be independently verified with primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of the Content.

applications of natural language to data bases | 2004

Concept indexing for automated text categorization

José M. Gómez; José Carlos Cortizo; Enrique Puertas; Miguel E. Ruiz

In this paper we explore the potential of concept indexing with WordNet synsets for Text Categorization, in comparison with the traditional bag of words text representation model. We have performed a series of experiments in which we also test the possibility of using simple yet robust disambiguation methods for concept indexing, and the effectiveness of stoplist-filtering and stemming on the SemCor semantic concordance. Results are not conclusive yet promising.

conference on information and knowledge management | 2010

Overview of the third international workshop on search and mining user-generated contents

Iván Cantador; José Carlos Cortizo; Francisco M. Carrero; José A. Troyano; Paolo Rosso; Markus Schedl

This overview introduces the aim of the SMUC 2010 workshop, as well as the list of papers presented in the workshop.

international conference on digital information management | 2008

Testing concept indexing in crosslingual medical text classification

Francisco M. Carrero; José Carlos Cortizo; José María Gómez

MetaMap is an online application that allows mapping text to UMLS Metathesaurus concepts, which is very useful for interoperability among different languages and systems within the biomedical domain. MetaMap Transfer (MMTx) is a Java program that makes MetaMap available to biomedical researchers in controlled, configurable environment. Currently there is no Spanish version of MetaMap, which difficult the use of UMLS Metathesaurus to extract concepts from Spanish biomedical texts. Developing a Spanish version of MetaMap would be a huge task, since there has been a lot of work supporting the English version for the last sixteen years. Our ongoing research is mainly focused on using biomedical concepts for crosslingual text classification. In this context the use of concepts instead of bag of words representation allows us to face text classification tasks abstracting from the language. In this paper we show our experiments on combining automatic translation techniques with the use of biomedical ontologies to produce an English text that can be processed by MMTx in order to extract concepts for text classification.

conference on information and knowledge management | 2008

In the development of a spanish metamap

Francisco M. Carrero; José Carlos Cortizo; José María Gómez; Manuel de Buenaga

MetaMap is an online application that allows mapping text to UMLS Metathesaurus concepts, which is very useful interoperability among different languages and systems within the biomedical domain. MetaMap Transfer (MMTx) is a Java program that makes MetaMap available to biomedical researchers. Currently there is no Spanish version of MetaMap, which difficults the use of UMLS Metathesaurus to extract concepts from Spanish biomedical texts. Our ongoing research is mainly focused on using biomedical concepts for cross-lingual text classification and retrieval [3]. In this context the use of concepts instead of bag of words representation allows us to face text classification tasks abstracting from the language [4]. In this paper we evaluate the possibility of combining automatic translation techniques with the use of biomedical ontologies to produce an English text that can be processed by MMTx.

intelligent data engineering and automated learning | 2007

Wrapping the Naive Bayes classifier to relax the effect of dependences

José Carlos Cortizo; J. Ignacio Giráldez; Maria Cruz Gaya

The Naive Bayes Classifier is based on the (unrealistic) assumption of independence among the values of the attributes given the class value. Consequently, its effectiveness may decrease in the presence of interdependent attributes. In spite of this, in recent years, Naive Bayes classifier is worked for a privilege position due to several reasons [1]. We present DGW (Dependency Guided Wrapper), a wrapper that uses information about dependences to transform the data representation to improve the Naive Bayes classification. This paper presents experiments comparing the performance and execution time of 12 DGW variations against 12 previous approaches, as constructive induction of cartesian product attributes, and wrappers that perform a search for optimal subsets of attributes. Experimental results show that DGW generates a new data representation that allows the Naive Bayes to obtain better accuracy more times than any other wrapper tested. DGW variations also obtain the best possible accuracy more often than the state of the art wrappers while often spending less time in the attribute subset search process.

intelligent data engineering and automated learning | 2006

Multi criteria wrapper improvements to naive bayes learning

José Carlos Cortizo; J. Ignacio Giráldez

Feature subset selection using a wrapper means to perform a search for an optimal set of attributes using the Machine Learning Algorithm as a black box. The Naive Bayes Classifier is based on the assumption of independence among the values of the attributes given the class value. Consequently, its effectiveness may decrease when the attributes are interdependent. We present FBL, a wrapper that uses information about dependencies to guide the search for the optimal subset of features and we use the Naive Bayes Classifier as the black-box Machine Learning algorithm. Experimental results show that FBL allows the Naive Bayes Classifier to achieve greater accuracies, and that FBL performs better than other classical filters and wrappers.

ACM Transactions on Intelligent Systems and Technology | 2012

Introduction to the Special Section on Search and Mining User-Generated Content

José Carlos Cortizo; Francisco M. Carrero; Iván Cantador; José A. Troyano; Paolo Rosso

The primary goal of this special section of ACM Transactions on Intelligent Systems and Technology is to foster research in the interplay between Social Media, Data/Opinion Mining and Search, aiming to reflect the actual developments in technologies that exploit user-generated content.

intelligent data engineering and automated learning | 2008

Building a Spanish MMTx by Using Automatic Translation and Biomedical Ontologies

Francisco M. Carrero; José Carlos Cortizo; José María Gómez

The use of domain ontologies is becoming increasingly popular in Medical Natural Language Processing Systems. A wide variety of knowledge bases in multiple languages has been integrated into the Unified Medical Language System (UMLS) to create a huge knowledge source that can be accessed with diverse lexical tools. MetaMap (and its java version MMTx) is a tool that allows extracting medical concepts from free text, but currently there not exists a Spanish version. Our ongoing research is centered on the application of biomedical concepts to cross-lingual text classification, what makes it necessary to have a Spanish MMTx available. We have combined automatic translation techniques with biomedical ontologies and the existing English MMTx to produce a Spanish version of MMTx. We have evaluated different approaches and applied several types of evaluation according to different concept representations for text classification. Our results prove that the use of existing translation tools such as Google Translate produce translations with a high similarity to original texts in terms of extracted concepts.

Community-Built Databases | 2011

On the Future of Mobile Phones as the Heart of Community-Built Databases

José Carlos Cortizo; Luis I. Diaz; Francisco M. Carrero; Adrian Yanes; Borja Monsalve

In retrospect, 10 years ago, we would not have imagined ourselves uploading or consuming high-quality videos via the Web, contributing to an online encyclopedia written by millions of users around the world or instantly sharing information with our friends and colleagues using an online platform that allows us to manage our contacts. And the Web is still evolving and what seemed to be science fiction then would become reality within 5–10 years. Nowadays, the Mobile Web concept is still an immature prototype of what will be in a few years’ time, but it represents a giant industry (it is expected that some five billion people will be using mobile/cellular phones in 2010) with even greater possibilities in the future. In this paper, we examine the possible future of mobile devices as the heart of community-built databases. The mobile devices characteristics, as both current and future features, will allow them to have a very relevant role not only as interfaces to community-driven databases, but also as platforms where applications using data from community-driven databases will be running, or even as distributed databases where users can have better control of relevant data they are contributing to those databases.

Explore More