Márcio de Carvalho
Universidade Federal de Minas Gerais
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Márcio de Carvalho.
european conference on principles of data mining and knowledge discovery | 2002
Adriano Veloso; Bruno Gusmão; Wagner Meira; Márcio de Carvalho; Srinivasan Parthasarathy; Mohammed Javeed Zaki
Much of the existing work in machine learning and data mining has relied on devising efficient techniques to build accurate models from the data. Research on how the accuracyof a model changes as a function of dynamic updates to the databases is very limited. In this work we show that extracting this information: knowing which aspects of the model are changing; and how theyare changing as a function of data updates; can be verye effective for interactive data mining purposes (where response time is often more important than model qualityas long as model qualityi s not too far off the best (exact) model.In this paper we consider the problem of generating approximate models within the context of association mining, a keyda ta mining task. We propose a new approach to incrementallyg enerate approximate models of associations in evolving databases. Our approach is able to detect how patterns evolve over time (an interesting result in its own right), and uses this information in generating approximate models with high accuracy at a fraction of the cost (of generating the exact model). Extensive experimental evaluation on real databases demonstrates the effectiveness and advantages of the proposed approach.
international conference on management of data | 2000
Bruno Pôssas; Márcio de Carvalho; Rodolfo F. Resende; W. Meita Jr.
The problem of mining association rules in categorical data presented in customer transactions was introduced by Agrawal, Imielinski and Swami [2]. This seminal work gave birth to several investigation efforts [4, 13] resulting in descriptions of how to extend the original concepts and how to increase the performance of the related algorithms.The original problem of mining association rules was formulated as how to find rules of the form set1 → set2. This rule is supposed to denote affinity or correlation among the two sets containing nominal or ordinal data items. More specifically, such an association rule should translate the following meaning: customers that buy the products in set1 also buy the products in set2. Statistical basis is represented in the form of minimum support and confidence measures of these rules with respect to the set of customer transactions.The original problem as proposed by Agrawal et al. [2] was extended in several directions such as adding or replacing the confidence and support by other measures, or filtering the rules during or after generation, or including quantitative attributes. Srikant e Agrawal [16] describe an new approach where quantitative data can be treated as categorical. This is very important since otherwise part of the customer transaction information is discarded. Whenever an extension is proposed it must be checked in terms of its performance. The algorithm efficiency is linked to the size of the database that is amenable to be treated. Therefore it is crucial to have efficient algorithms that enable us to examine and extract valuable decision-making information in the ever larger databases.In this paper we present an algorithm that can be used in the context of several of the extensions provided in the literature but at the same time preserves its performance, as demonstrated in a case study. The approach in our algorithm is to explore multidimensional properties of the data (provided such properties are present), allowing us to combine this additional information in a very efficient pruning phase. This results in a very flexible and efficient algorithm that was used with success in several experiments using categorical and quantitative databases.The paper is organized as follows. In the next section we describe the quantitative association rules and we present an algorithm to generate it. Section 3 presents an optimization of the pruning phase of the Apriori [4] algorithm based on quantitative information associated with the items. Section 4 presents our experimental results for mining four synthetic workloads, followed by some related work in Section 5. Finally we present some conclusions and future work in Section 6.
british national conference on databases | 2002
Adriano Veloso; Bruno Rocha; Márcio de Carvalho; Wagner Meira
Across a wide variety of fields, data are being collected and accumulated at a dramatic pace, and therefore a new generation of techniques has been proposed to assist humans in extracting usefull information from the rapidly growing volumes of data. One of these techniques is the association rule discovery, a key data mining task which has attracted tremendous interest among data mining researchers. Due to its vast applicability, many algorithms have been developed to perform the association rule mining task. However, an immediate problem facing researchers is which of these algorithms is likely to make a good match with the database to be used in the mining operation. In this paper we consider this problem, dealing with both algorithmic and data aspects of association rule mining by performing a systematic experimental evaluation of different algorithms on different databases. We observed that each algorithm has different strengths and weaknesses depending on data characteristics. This careful analysis enables us to develop an algorithm which achieves better performance than previously proposed algorithms, specially on databases obtained from actual applications.
siam international conference on data mining | 2002
Adriano Veloso; Wagner Meira; Márcio de Carvalho; Bruno Pôssas; Srinivasan Parthasarathy; Mohammed Javeed Zaki
brazilian symposium on databases | 2003
Adriano Veloso; Wagner Meira; Srinivasan Parthasarathy; Márcio de Carvalho
brazilian symposium on databases | 2002
Adriano Veloso; Wagner Meira; Márcio de Carvalho
brazilian symposium on databases | 2001
Adriano Veloso; Bruno Pôssas; Gustavo Menezes Siqueira; Wagner Meira; Márcio de Carvalho
Society for Information Technology & Teacher Education International Conference | 2013
Liamara Scortegagna; Márcio de Carvalho
WOB | 2003
Tiago Adriano de Knegt López de Prado; Wagner Meira; Marcelo Matos Santoro; Márcio de Carvalho; Rodrigo L. Carceroni; Carlos H. da Silveira; Raquel C. de Melo; Fabiano A. Fonseca
brazilian symposium on databases | 2002
Gustavo Menezes Siqueira; Tiago Adriano de Knegt López de Prado; Wagner Meira; Márcio de Carvalho