Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Maguelonne Teisseire is active.

Publication


Featured researches published by Maguelonne Teisseire.


data and knowledge engineering | 2003

Incremental mining of sequential patterns in large databases

Florent Masseglia; Pascal Poncelet; Maguelonne Teisseire

In this paper, we consider the problem of the incremental mining of sequential patterns when new transactions or new customers are added to an original database. We present a new algorithm for mining frequent sequences that uses information collected during an earlier mining process to cut down the cost of finding new sequential patterns in the updated database. Our test shows that the algorithm performs significantly faster than the naive approach of mining the whole updated database from scratch. The difference is so pronounced that this algorithm could also be useful for mining sequential patterns, since in many cases it is faster to apply our algorithm than to mine sequential patterns using a standard algorithm, by breaking down the database into an original database plus an increment.


ACM Sigweb Newsletter | 1999

Using data mining techniques on Web access logs to dynamically improve hypertext structure

Florent Masseglia; Pascal Poncelet; Maguelonne Teisseire

With the growing popularity of the World Wide Web (Web), large volumes of data such as user address or URL requested are gathered automatically by Web servers and collected in access log files. Discovering relationships and global patterns that exist in such files can provide significant and useful information for performance enhancement, restructuring a Web site for increased effectiveness, and customer targeting in electronic commerce. In this paper, we propose an integrated system (WebTool) for applying data mining techniques such as association rules or sequential patterns on access log files. Once interesting patterns are discovered, we illustrate how they can be used to customize the server hypertext organization dynamically.


ACM Transactions on Knowledge Discovery From Data | 2010

Mining multidimensional and multilevel sequential patterns

Marc Plantevit; Anne Laurent; Dominique Laurent; Maguelonne Teisseire; Yeow Wei Choong

Multidimensional databases have been designed to provide decision makers with the necessary tools to help them understand their data. This framework is different from transactional data as the datasets contain huge volumes of historicized and aggregated data defined over a set of dimensions that can be arranged through multiple levels of granularities. Many tools have been proposed to query the data and navigate through the levels of granularity. However, automatic tools are still missing to mine this type of data in order to discover regular specific patterns. In this article, we present a method for mining sequential patterns from multidimensional databases, at the same time taking advantage of the different dimensions and levels of granularity, which is original compared to existing work. The necessary definitions and algorithms are extended from regular sequential patterns to this particular case. Experiments are reported, showing the significance of this approach.


Data Mining and Knowledge Discovery | 2008

Web usage mining: extracting unexpected periods from web logs

Florent Masseglia; Pascal Poncelet; Maguelonne Teisseire; Alice Marascu

Existing Web usage mining techniques are currently based on an arbitrary division of the data (e.g. “one log per month”) or guided by presumed results (e.g. “what is the customers’ behaviour for the period of Christmas purchases?”). These approaches have two main drawbacks. First, they depend on the above-mentioned arbitrary organization of data. Second, they cannot automatically extract “seasonal peaks” from among the stored data. In this paper, we propose a specific data mining process (in particular, to extract frequent behaviour patterns) in order to reveal the densest periods automatically. From the whole set of possible combinations, our method extracts the frequent sequential patterns related to the extracted periods. A period is considered to be dense if it contains at least one frequent sequential pattern for the set of users connected to the website in that period. Our experiments show that the extracted periods are relevant and our approach is able to extract both frequent sequential patterns and the associated dense periods.


european conference on machine learning | 2005

M 2 SP: mining sequential patterns among several dimensions

Marc Plantevit; Yeow Wei Choong; Anne Laurent; Dominique Laurent; Maguelonne Teisseire

Mining sequential patterns aims at discovering correlations between events through time. However, even if many works have dealt with sequential pattern mining, none of them considers frequent sequential patterns involving several dimensions in the general case. In this paper, we propose a novel approach, called M2SP, to mine multidimensional sequential patterns. The main originality of our proposition is that we obtain not only intra-pattern sequences but also inter-pattern sequences. Moreover, we consider generalized multidimensional sequential patterns, called jokerized patterns, in which some of the dimension values may not be instanciated. Experiments on synthetic data are reported and show the scalability of our approach.


Expert Systems With Applications | 2009

Efficient mining of sequential patterns with time constraints: Reducing the combinations

Florent Masseglia; Pascal Poncelet; Maguelonne Teisseire

In this paper we consider the problem of discovering sequential patterns by handling time constraints as defined in the Gsp algorithm. While sequential patterns could be seen as temporal relationships between facts embedded in the database where considered facts are merely characteristics of individuals or observations of individual behavior, generalized sequential patterns aim to provide the end user with a more flexible handling of the transactions embedded in the database. We thus propose a new efficient algorithm, called Gtc (Graph for Time Constraints) for mining such patterns in very large databases. It is based on the idea that handling time constraints in the earlier stage of the data mining process can be highly beneficial. One of the most significant new feature of our approach is that handling of time constraints can be easily taken into account in traditional levelwise approaches since it is carried out prior to and separately from the counting step of a data sequence. Our test shows that the proposed algorithm performs significantly faster than a state-of-the-art sequence mining algorithm.


intelligent information systems | 2006

Mining spatio-temporal data

Gennady L. Andrienko; Donato Malerba; Michael May; Maguelonne Teisseire

Both the temporal and spatial dimensions add substantial complexity to data mining tasks. First of all, the spatial relations, both metric (such as distance) and non-metric (such as topology, direction, shape, etc.) and the temporal relations (such as before and after) are information bearing and therefore need to be considered in the data mining methods. Secondly, some spatial and temporal relations are implicitly defined, that is, they are not explicitly encoded in a database. These relations must be extracted from the data and there is a trade-off between precomputing them before the actual mining process starts (eager approach) and computing them on-the-fly when they are actually needed (lazy approach). Moreover, despite much formalization of space and time relations available in spatio-temporal reasoning, the extraction of spatial/ temporal relations implicitly defined in the data introduces some degree of fuzziness that may have a large impact on the results of the data mining process. J Intell Inf Syst (2006) 27: 187–190 DOI 10.1007/s10844-006-9949-3


Archive | 2007

Successes and New Directions in Data Mining

Pascal Poncelet; Florent Masseglia; Maguelonne Teisseire

The problem of mining patterns is becoming a very active research area and efficient techniques have been widely applied to problems in industry, government, and science. From the initial definition and motivated by real-applications, the problem of mining patterns not only addresses the finding of itemsets but also more and more complex patterns. Successes and New Directions in Data Mining addresses existing solutions for data mining, with particular emphasis on potential real-world applications. Capturing defining research on topics such as fuzzy set theory, clustering algorithms, semi-supervised clustering, modeling and managing data mining patterns, and sequence motif mining, this book is an indispensable resource for library collections.


IEEE Transactions on Fuzzy Systems | 2007

From Crispness to Fuzziness: Three Algorithms for Soft Sequential Pattern Mining

Céline Fiot; Anne Laurent; Maguelonne Teisseire

Most real world databases consist of historical and numerical data such as sensor, scientific or even demographic data. In this context, classical algorithms extracting sequential patterns, which are well adapted to the temporal aspect of data, do not allow numerical information processing. Therefore, the data are pre-processed to be transformed into a binary representation, which leads to a loss of information. Fuzzy algorithms have been proposed to process numerical data using intervals, particularly fuzzy intervals, but none of these methods is satisfactory. Therefore this paper completely defines the concepts linked to fuzzy sequential pattern mining. Using different fuzzification levels, we propose three methods to mine fuzzy sequential patterns and detail the resulting algorithms (SpeedyFuzzy, MiniFuzzy, and TotallyFuzzy). Finally, we assess them through different experiments, thus revealing the robustness and the relevancy of this work.


international symposium on temporal representation and reasoning | 2004

Pre-processing time constraints for efficiently mining generalized sequential patterns

Florent Masseglia; Pascal Poncelet; Maguelonne Teisseire

In this paper we consider the problem of discovering sequential patterns by handling time constraints. While sequential patterns could be seen as temporal relationships between facts embedded in the database, generalized sequential patterns aim at providing the end user with a more flexible handling of the transactions embedded in the database. We propose a new efficient algorithm, called GTC (graph for time constraints) for mining such patterns in very large databases. It is based on the idea that handling time constraints in the earlier stage of the algorithm can be highly beneficial since it minimizes computational costs by preprocessing data sequences. Our test shows that the proposed algorithm performs significantly faster than a state-of-the-art sequence mining algorithm.

Collaboration


Dive into the Maguelonne Teisseire's collaboration.

Top Co-Authors

Avatar

Pascal Poncelet

University of Montpellier

View shared research outputs
Top Co-Authors

Avatar

Anne Laurent

University of Montpellier

View shared research outputs
Top Co-Authors

Avatar

Mathieu Roche

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mathieu Roche

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Céline Fiot

Centre national de la recherche scientifique

View shared research outputs
Top Co-Authors

Avatar

Paola Salle

Centre national de la recherche scientifique

View shared research outputs
Researchain Logo
Decentralizing Knowledge