Cristóbal J. Carmona

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Cristóbal J. Carmona is active.

Explore More

Publication

Featured researches published by Cristóbal J. Carmona.

Knowledge and Information Systems | 2011

An overview on subgroup discovery: foundations and applications

Franciso Herrera; Cristóbal J. Carmona; Pedro González; María José del Jesús

Subgroup discovery is a data mining technique which extracts interesting rules with respect to a target variable. An important characteristic of this task is the combination of predictive and descriptive induction. An overview related to the task of subgroup discovery is presented. This review focuses on the foundations, algorithms, and advanced studies together with the applications of subgroup discovery presented throughout the specialised bibliography.

Knowledge Based Systems | 2012

Evolutionary-based selection of generalized instances for imbalanced classification

Salvador García; Joaquín Derrac; Isaac Triguero; Cristóbal J. Carmona; Francisco Herrera

In supervised classification, we often encounter many real world problems in which the data do not have an equitable distribution among the different classes of the problem. In such cases, we are dealing with the so-called imbalanced data sets. One of the most used techniques to deal with this problem consists of preprocessing the data previously to the learning process. This paper proposes a method belonging to the family of the nested generalized exemplar that accomplishes learning by storing objects in Euclidean n-space. Classification of new data is performed by computing their distance to the nearest generalized exemplar. The method is optimized by the selection of the most suitable generalized exemplars based on evolutionary algorithms. An experimental analysis is carried out over a wide range of highly imbalanced data sets and uses the statistical tests suggested in the specialized literature. The results obtained show that our evolutionary proposal outperforms other classic and recent models in accuracy and requires to store a lower number of generalized examples.

IEEE Transactions on Fuzzy Systems | 2010

NMEEF-SD: Non-dominated Multiobjective Evolutionary Algorithm for Extracting Fuzzy Rules in Subgroup Discovery

Cristóbal J. Carmona; Pedro González; M. J. del Jesus; Francisco Herrera

A non-dominated multiobjective evolutionary algorithm for extracting fuzzy rules in subgroup discovery (NMEEF-SD) is described and analyzed in this paper. This algorithm, which is based on the hybridization between fuzzy logic and genetic algorithms, deals with subgroup-discovery problems in order to extract novel and interpretable fuzzy rules of interest, and the evolutionary fuzzy system NMEEF-SD is based on the well-known Non-dominated Sorting Genetic Algorithm II (NSGA-II) model but is oriented toward the subgroup-discovery task using specific operators to promote the extraction of interpretable and high-quality subgroup-discovery rules. The proposal includes different mechanisms to improve diversity in the population and permits the use of different combinations of quality measures in the evolutionary process. An elaborate experimental study, which was reinforced by the use of nonparametric tests, was performed to verify the validity of the proposal, and the proposal was compared with other subgroup discovery methods. The results show that NMEEF-SD obtains the best results among several algorithms studied.

Expert Systems With Applications | 2012

Web usage mining to improve the design of an e-commerce website: OrOliveSur.com

Cristóbal J. Carmona; S. Ramírez-Gallego; F. Torres; Enrique Bernal; M. J. del Jesus; Salvador García

Web usage mining is the process of extracting useful information from users history databases associated to an e-commerce website. The extraction is usually performed by data mining techniques applied on server log data or data obtained from specific tools such as Google Analytics. This paper presents the methodology used in an e-commerce website of extra virgin olive oil sale called www.OrOliveSur.com. We will describe the set of phases carried out including data collection, data preprocessing, extraction and analysis of knowledge. The knowledge is extracted using unsupervised and supervised data mining algorithms through descriptive tasks such as clustering, association and subgroup discovery; applying classical and recent approaches. The results obtained will be discussed especially for the interests of the designer team of the website, providing some guidelines for improving its usability and user satisfaction.

soft computing | 2011

Evolutionary fuzzy rule extraction for subgroup discovery in a psychiatric emergency department

Cristóbal J. Carmona; Pedro González; M. J. del Jesus; M. Navío-Acosta; L. Jiménez-Trevino

This paper describes the application of evolutionary fuzzy systems for subgroup discovery to a medical problem, the study on the type of patients who tend to visit the psychiatric emergency department in a given period of time of the day. In this problem, the objective is to characterise subgroups of patients according to their time of arrival at the emergency department. To solve this problem, several subgroup discovery algorithms have been applied to determine which of them obtains better results. The multiobjective evolutionary algorithm MESDIF for the extraction of fuzzy rules obtains better results and so it has been used to extract interesting information regarding the rate of admission to the psychiatric emergency department.

Neurocomputing | 2014

Addressing imbalanced classification with instance generation techniques: IPADE-ID

Victoria López; Isaac Triguero; Cristóbal J. Carmona; Salvador García; Francisco Herrera

A wide number of real word applications presents a class distribution where examples belonging to one class heavily outnumber the examples in the other class. This is an arduous situation where standard classification techniques usually decrease their performance, creating a handicap to correctly identify the minority class, which is precisely the case under consideration in these applications. In this work, we propose the usage of the Iterative Instance Adjustment for Imbalanced Domains (IPADE-ID) algorithm. It is an evolutionary framework, which uses an instance generation technique, designed to face the existing imbalance modifying the original training set. The method, iteratively learns the appropriate number of examples that represent the classes and their particular positioning. The learning process contains three key operations in its design: a customized initialization procedure, an evolutionary optimization of the positioning of the examples and a selection of the most representative examples for each class. An experimental analysis is carried out with a wide range of highly imbalanced datasets over the proposal and recognized solutions to the problem. The results obtained, which have been contrasted through non-parametric statistical tests, show that our proposal outperforms previously proposed methods.

Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery | 2014

Overview on evolutionary subgroup discovery: analysis of the suitability and potential of the search performed by evolutionary algorithms

Cristóbal J. Carmona; Pedro González; María José del Jesús; Francisco Herrera

Subgroup discovery (SD) is a descriptive data mining technique using supervised learning. In this article, we review the use of evolutionary algorithms (EAs) for SD. In particular, we will focus on the suitability and potential of the search performed by EAs in the development of SD algorithms. Future directions in the use of EAs for SD are also presented in order to show the advantages and benefits that this search strategy contribute to this task.

International Journal of Computational Intelligence Systems | 2016

A View on Fuzzy Systems for Big Data: Progress and Opportunities

Alberto Fernández; Cristóbal J. Carmona; María José del Jesús; Francisco Herrera

AbstractCurrently, we are witnessing a growing trend in the study and application of problems in the framework of Big Data. This is mainly due to the great advantages which come from the knowledge extraction from a high volume of information. For this reason, we observe a migration of the standard Data Mining systems towards a new functional paradigm that allows at working with Big Data. By means of the MapReduce model and its different extensions, scalability can be successfully addressed, while maintaining a good fault tolerance during the execution of the algorithms. Among the different approaches used in Data Mining, those models based on fuzzy systems stand out for many applications. Among their advantages, we must stress the use of a representation close to the natural language. Additionally, they use an inference model that allows a good adaptation to different scenarios, especially those with a given degree of uncertainty. Despite the success of this type of systems, their migration to the Big Dat...

global engineering education conference | 2010

Evolutionary algorithms for subgroup discovery applied to e-learning data

Cristóbal J. Carmona; Pedro González; M. J. del Jesus; Cristóbal Romero; Sebastián Ventura

This work presents the application of subgroup discovery techniques to e-learning data from learning management systems (LMS) of andalusian universities. The objective is to extract rules describing relationships between the use of the different activities and modules available in the e-learning platform and the final mark obtained by the students. For this purpose, the results of different classical and evolutionary subgroup discovery algorithms are compared, showing the adequacy of the evolutionary algorithms to solve this problem. Some of the rules obtained are analyzed with the aim of extract knowledge allowing the teachers to take actions to improve the performance of their students.

Applied Soft Computing | 2014

Training algorithms for Radial Basis Function Networks to tackle learning processes with imbalanced data-sets

María Dolores Pérez-Godoy; Antonio J. Rivera; Cristóbal J. Carmona; M. J. del Jesus

Graphical abstractDisplay Omitted HighlightsWe present a study about the performance of two classical weight training methods of Radial Basis Function Networks (RBFN), Least Mean Square (LMS) and Singular Value Decomposition (SVD), applied to classification problems, when the data-sets are imbalanced.These methods are tested with representative RBFN design paradigms: Clustering, Incremental, Genetic and CO2RBFN (a cooperative-competitive method proposed by the authors).The results obtained, statistically validated, show that SVD outperforms LMS, when the imbalance ratio of data-sets is low but when the imbalance ratio of these data sets grows, LMS outperforms SVD. Nowadays, many real applications comprise data-sets where the distribution of the classes is significantly different. These data-sets are commonly known as imbalanced data-sets. Traditional classifiers are not able to deal with these kinds of data-sets because they tend to classify only majority classes, obtaining poor results for minority classes. The approaches that have been proposed to address this problem can be categorized into three types: resampling methods, algorithmic adaptations and cost sensitive techniques.Radial Basis Function Networks (RBFNs), artificial neural networks composed of local models or RBFs, have demonstrated their efficiency in different machine learning areas. Centers, widths and output weights for the RBFs must be determined when designing RBFNs.Taking into account the locally tuned response of RBFs, the objective of this paper is to study the influence of global and local paradigms on the weights training phase, within the RBFNs design methodology, for imbalanced data-sets. Least Mean Square and the Singular Value Decomposition have been chosen as representatives of local and global weights training paradigms respectively. These learning algorithms are inserted into classical RBFN design methods that are run on imbalanced data-sets and also on these data-sets preprocessed with re-balance techniques. After applying statistical tests to the results obtained, some guidelines about the RBFN design methodology for imbalanced data-sets are provided.

Explore More