Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Baptiste Jeudy is active.

Publication


Featured researches published by Baptiste Jeudy.


Genome Biology | 2002

Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data

Céline Becquet; Sylvain Blachon; Baptiste Jeudy; Jean-François Boulicaut; Olivier Gandrillon

BackgroundThe association-rules discovery (ARD) technique has yet to be applied to gene-expression data analysis. Even in the absence of previous biological knowledge, it should identify sets of genes whose expression is correlated. The first association-rule miners appeared six years ago and proved efficient at dealing with sparse and weakly correlated data. A huge international research effort has led to new algorithms for tackling difficult contexts and these are particularly suited to analysis of large gene-expression matrices. To validate the ARD technique we have applied it to freely available human serial analysis of gene expression (SAGE) data.ResultsThe approach described here enables us to designate sets of strong association rules. We normalized the SAGE data before applying our association rule miner. Depending on the discretization algorithm used, different properties of the data were highlighted. Both common and specific interpretations could be made from the extracted rules. In each and every case the extracted collections of rules indicated that a very strong co-regulation of mRNA encoding ribosomal proteins occurs in the dataset. Several rules associating proteins involved in signal transduction were obtained and analyzed, some pointing to yet-unexplored directions. Furthermore, by examining a subset of these rules, we were able both to reassign a wrongly labeled tag, and to propose a function for an expressed sequence tag encoding a protein of unknown function.ConclusionsWe show that ARD is a promising technique that turns out to be complementary to existing gene-expression clustering techniques.


Data Mining and Knowledge Discovery | 2005

Constraint-based Data Mining

Jean-François Boulicaut; Baptiste Jeudy

Knowledge Discovery in Databases (KDD) is a complex interactive process. The promising theoretical framework of inductive databases considers this is essentially a querying process. It is enabled by a query language which can deal either with raw data or patterns which hold in the data. Mining patterns turns to be the so-called inductive query evaluation process for which constraint-based Data Mining techniques have to be designed. An inductive query specifies declara-tively the desired constraints and algorithms are used to compute the patterns satisfying the constraints in the data. We survey important results of this active research domain. This chapter emphasizes a real breakthrough for hard problems concerning local pattern mining under various constraints and it points out the current directions of research as well.


international database engineering and applications symposium | 2001

Mining free itemsets under constraints

Jean-François Boulicaut; Baptiste Jeudy

Computing frequent itemsets and their frequencies from large Boolean matrices (e.g., to derive association rules) has been one of the hot topics in data mining. Levelwise algorithms (e.g., the a priori algorithm) have been proved effective for frequent itemset mining from sparse data. However, in many practical applications, the computation turns out to be intractable for the user-given frequency threshold and the lack of focus leads to huge collections of frequent itemsets. In the last three years, two promising issues have been investigated: the use of user defined constraints and closed set mining. To the best of our knowledge, combining these two frameworks has not been studied yet. The authors show that the benefit of these two approaches can be combined into levelwise algorithms. An experimental validation related to the discovery of association rules with negations is reported.


european conference on principles of data mining and knowledge discovery | 2002

Using Condensed Representations for Interactive Association Rule Mining

Baptiste Jeudy; Jean-François Boulicaut

Association rule mining is a popular data mining task. It has an interactive and iterative nature, i.e., the user has to refine his mining queries until he is satisfied with the discovered patterns. To support such an interactive process, we propose to optimize sequences of queries by means of a cache that stores information from previous queries. Unlike related works, we use condensed representations like free and closed itemsets for both data mining and caching. This results in a much more efficient mining technique in highly correlated data and a much smaller cache than in previous approaches.


flexible query answering systems | 2001

Towards the Tractable Discovery of Association Rules with Negations

Jean-François Boulicaut; Artur Bykowski; Baptiste Jeudy

Frequent association rules (e.g., A∧B⇒C to say that when properties A and B are true in a record then, C tends to be also true) have become a popular way to summarize huge datasets. The last 5 years, there has been a lot of research on association rule mining and more precisely, the tractable discovery of interesting rules among the frequent ones. We consider now the problem of mining association rules that may involve negations e.g., A∧B⇒⌝C or ⌝A∧B⇒C. Mining such rules is difficult and remains an open problem. We identify several possibilities for a tractable approach in practical cases. Among others, we discuss the active use of constraints. We propose a generic algorithm and discuss the use of constraints to mine the generalized sets from which rules with negations can be derived.


Lecture Notes in Computer Science | 2002

Constraint-Based Discovery and Inductive Queries: Application to Association Rule Mining

Baptiste Jeudy; Jean-François Boulicaut

Recently inductive databases (IDBs) have been proposed to afford the problem of knowledge discovery from huge databases. Querying these databases needs for primitives to: (1) select, manipulate and query data, (2) select, manipulate and query interesting patterns (i.e., those patterns that satisfy certain constraints), and (3) cross over patterns and data (e.g., selecting the data in which some patterns hold). Designing such query languages is a long-term goal and only preliminary approaches have been studied, mainly for the association rule mining task. Starting from a discussion on the MINE RULE operator, we identify several open issues for the design of inductive databases dedicated to these descriptive rules. These issues concern not only the offered primitives but also the availability of efficient evaluation schemes. We emphasize the need for primitives that work on more or less condensed representations for the frequent itemsets, e.g., the (frequent) ?-free and closed itemsets. It is useful not only for optimizing single association rule mining queries but also for sophisticated post-processing and interactive rule mining.


arXiv: Learning | 2004

Database transposition for constrained (closed) pattern mining

Baptiste Jeudy; François Rioult

Recently, different works proposed a new way to mine patterns in databases with pathological size. For example, experiments in genome biology usually provide databases with thousands of attributes (genes) but only tens of objects (experiments). In this case, mining the “transposed” database runs through a smaller search space, and the Galois connection allows to infer the closed patterns of the original database. We focus here on constrained pattern mining for those unusual databases and give a theoretical framework for database and constraint transposition. We discuss the properties of constraint transposition and look into classical constraints. We then address the problem of generating the closed patterns of the original database satisfying the constraint, starting from those mined in the “transposed” database. Finally, we show how to generate all the patterns satisfying the constraint from the closed ones.


intelligent data analysis | 2002

Optimization of association rule mining queries

Baptiste Jeudy; Jean-François Boulicaut


Archive | 2000

Using Constraints During Set Mining: Should We Prune or not?

Jean-Fran cois Boulicaut; Baptiste Jeudy


Archive | 2002

Optimisation de requêtes inductives : application à l'extraction sous contraintes de règles d'association

Baptiste Jeudy

Collaboration


Dive into the Baptiste Jeudy's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Artur Bykowski

Institut national des sciences Appliquées de Lyon

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge