Is this you? Create Your Porfile

Márcio de Carvalho

Universidade Federal de Minas Gerais

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Márcio de Carvalho is active.

Explore More

Publication

Featured researches published by Márcio de Carvalho.

european conference on principles of data mining and knowledge discovery | 2002

Efficiently Mining Approximate Models of Associations in Evolving Databases

Adriano Veloso; Bruno Gusmão; Wagner Meira; Márcio de Carvalho; Srinivasan Parthasarathy; Mohammed Javeed Zaki

Much of the existing work in machine learning and data mining has relied on devising efficient techniques to build accurate models from the data. Research on how the accuracyof a model changes as a function of dynamic updates to the databases is very limited. In this work we show that extracting this information: knowing which aspects of the model are changing; and how theyare changing as a function of data updates; can be verye effective for interactive data mining purposes (where response time is often more important than model qualityas long as model qualityi s not too far off the best (exact) model.In this paper we consider the problem of generating approximate models within the context of association mining, a keyda ta mining task. We propose a new approach to incrementallyg enerate approximate models of associations in evolving databases. Our approach is able to detect how patterns evolve over time (an interesting result in its own right), and uses this information in generating approximate models with high accuracy at a fraction of the cost (of generating the exact model). Extensive experimental evaluation on real databases demonstrates the effectiveness and advantages of the proposed approach.

international conference on management of data | 2000

Using quantitative information for efficient association rule generation

Bruno Pôssas; Márcio de Carvalho; Rodolfo F. Resende; W. Meita Jr.

The problem of mining association rules in categorical data presented in customer transactions was introduced by Agrawal, Imielinski and Swami [2]. This seminal work gave birth to several investigation efforts [4, 13] resulting in descriptions of how to extend the original concepts and how to increase the performance of the related algorithms.The original problem of mining association rules was formulated as how to find rules of the form set1 → set2. This rule is supposed to denote affinity or correlation among the two sets containing nominal or ordinal data items. More specifically, such an association rule should translate the following meaning: customers that buy the products in set1 also buy the products in set2. Statistical basis is represented in the form of minimum support and confidence measures of these rules with respect to the set of customer transactions.The original problem as proposed by Agrawal et al. [2] was extended in several directions such as adding or replacing the confidence and support by other measures, or filtering the rules during or after generation, or including quantitative attributes. Srikant e Agrawal [16] describe an new approach where quantitative data can be treated as categorical. This is very important since otherwise part of the customer transaction information is discarded. Whenever an extension is proposed it must be checked in terms of its performance. The algorithm efficiency is linked to the size of the database that is amenable to be treated. Therefore it is crucial to have efficient algorithms that enable us to examine and extract valuable decision-making information in the ever larger databases.In this paper we present an algorithm that can be used in the context of several of the extensions provided in the literature but at the same time preserves its performance, as demonstrated in a case study. The approach in our algorithm is to explore multidimensional properties of the data (provided such properties are present), allowing us to combine this additional information in a very efficient pruning phase. This results in a very flexible and efficient algorithm that was used with success in several experiments using categorical and quantitative databases.The paper is organized as follows. In the next section we describe the quantitative association rules and we present an algorithm to generate it. Section 3 presents an optimization of the pruning phase of the Apriori [4] algorithm based on quantitative information associated with the items. Section 4 presents our experimental results for mining four synthetic workloads, followed by some related work in Section 5. Finally we present some conclusions and future work in Section 6.

british national conference on databases | 2002

Real World Association Rule Mining

Adriano Veloso; Bruno Rocha; Márcio de Carvalho; Wagner Meira

Across a wide variety of fields, data are being collected and accumulated at a dramatic pace, and therefore a new generation of techniques has been proposed to assist humans in extracting usefull information from the rapidly growing volumes of data. One of these techniques is the association rule discovery, a key data mining task which has attracted tremendous interest among data mining researchers. Due to its vast applicability, many algorithms have been developed to perform the association rule mining task. However, an immediate problem facing researchers is which of these algorithms is likely to make a good match with the database to be used in the mining operation. In this paper we consider this problem, dealing with both algorithmic and data aspects of association rule mining by performing a systematic experimental evaluation of different algorithms on different databases. We observed that each algorithm has different strengths and weaknesses depending on data characteristics. This careful analysis enables us to develop an algorithm which achieves better performance than previously proposed algorithms, specially on databases obtained from actual applications.

siam international conference on data mining | 2002

Mining Frequent Itemsets in Evolving Databases.

Adriano Veloso; Wagner Meira; Márcio de Carvalho; Bruno Pôssas; Srinivasan Parthasarathy; Mohammed Javeed Zaki

brazilian symposium on databases | 2003

Efficient, Accurate and Privacy-Preserving Data Mining for Frequent Itemsets in Distributed Databases.

Adriano Veloso; Wagner Meira; Srinivasan Parthasarathy; Márcio de Carvalho

brazilian symposium on databases | 2002

Mining Reliable Models of Associations in Dynamic Databases

Adriano Veloso; Wagner Meira; Márcio de Carvalho

brazilian symposium on databases | 2001

Mineração Incremental de Regras de Associação.

Adriano Veloso; Bruno Pôssas; Gustavo Menezes Siqueira; Wagner Meira; Márcio de Carvalho

Society for Information Technology & Teacher Education International Conference | 2013

Brazilian Government Initiatives for Teacher Education with the use of Information Technology

Liamara Scortegagna; Márcio de Carvalho

WOB | 2003

Using Structural Signatures for Identifying Globins: the Intra-Subunit Electrostatic Interactions.

Tiago Adriano de Knegt López de Prado; Wagner Meira; Marcelo Matos Santoro; Márcio de Carvalho; Rodrigo L. Carceroni; Carlos H. da Silveira; Raquel C. de Melo; Fabiano A. Fonseca

brazilian symposium on databases | 2002