Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hannu Toivonen is active.

Publication


Featured researches published by Hannu Toivonen.


Data Mining and Knowledge Discovery | 1997

Discovery of Frequent Episodes in Event Sequences

Heikki Mannila; Hannu Toivonen; A. Inkeri Verkamo

Sequences of events describing the behavior and actions of users or systems can be collected in several domains. An episode is a collection of events that occur relatively close to each other in a given partial order. We consider the problem of discovering frequently occurring episodes in a sequence. Once such episodes are known, one can produce rules for describing or predicting the behavior of the sequence. We give efficient algorithms for the discovery of all frequent episodes from a given class of episodes, and present detailed experimental results. The methods are in use in telecommunication alarm management.


Data Mining and Knowledge Discovery | 1997

Levelwise Search and Borders of Theories in KnowledgeDiscovery

Heikki Mannila; Hannu Toivonen

AbstractOne of the basic problems in knowledge discovery in databases (KDD) is the following: given a data set r, a class L of sentences for defining subgroups of r, and a selection predicate, find all sentences of L deemed interesting by the selection predicate. We analyze the simple levelwise algorithm for finding all such descriptions. We give bounds for the number of database accesses that the algorithm makes. For this, we introduce the concept of the border of a theory, a notion that turns out to be surprisingly powerful in analyzing the algorithm. We also consider the verification problem of a KDD process: given r and a set of sentences S


conference on information and knowledge management | 1994

Finding interesting rules from large sets of discovered association rules

Mika Klemettinen; Heikki Mannila; Pirjo Ronkainen; Hannu Toivonen; A. Inkeri Verkamo


The Computer Journal | 1999

TANE: An Efficient Algorithm for Discovering Functional and Approximate Dependencies

Yka Huhtala; Juha Kärkkäinen; Pasi Porkka; Hannu Toivonen

\subseteq


Data Mining and Knowledge Discovery | 1999

Discovery of frequent DATALOG patterns

Luc Dehaspe; Hannu Toivonen


international conference on data mining | 2001

Time series segmentation for context recognition in mobile devices

Johan Himberg; Kalle Korpiaho; Heikki Mannila; Johanna Tikanmäki; Hannu Toivonen

L determine whether S is exactly the set of interesting statements about r. We show strong connections between the verification problem and the hypergraph transversal problem. The verification problem arises in a natural way when using sampling to speed up the pattern discovery step in KDD.


Quaternary Science Reviews | 2002

Holocene temperature changes in northern Fennoscandia reconstructed from chironomids using Bayesian modelling

Atte Korhola; Kari Vasko; Hannu Toivonen; Heikki Olander

Association rules, introduced by Agrawal, Imielinski, and Swami, are rules of the form “for 90% of the rows of the relation, if the row has value 1 in the columns in set W, then it has 1 also in column B”. Efficient methods exist for discovering association rules from large collections of data. The number of discovered rules can, however, be so large that browsing the rule set and finding interesting rules from it can be quite difficult for the user. We show how a simple formalism of rule templates makes it possible to easily describe the structure of interesting rules. We also give examples of visualization of rules, and show how a visualization tool interfaces with rule templates.


international conference on data engineering | 1998

Efficient discovery of functional and approximate dependencies using partitions

Yka Huhtala; Juha Kärkkäinen; Pasi Porkka; Hannu Toivonen

The discovery of functional dependencies from relations is an important database analysis technique. We present TANE, an efficient algorithm for finding functional dependencies from large databases. TANE is based on partitioning the set of rows with respect to their attribute values, which makes testing the validity of functional dependencies fast even for a large number of tuples. The use of partitions also makes the discovery of approximate functional dependencies easy and efficient and the erroneous or exceptional rows can be identified easily. Experiments show that T ANE is fast in practice. For benchmark databases the running times are improved by several orders of magnitude over previously published results. The algorithm is also applicable to much larger datasets than the previous methods.


international conference on pervasive computing | 2004

Adaptive on-device location recognition

Kari Laasonen; Mika Raento; Hannu Toivonen

Discovery of frequent patterns has been studied in a variety of data mining settings. In its simplest form, known from association rule mining, the task is to discover all frequent itemsets, i.e., all combinations of items that are found in a sufficient number of examples. The fundamental task of association rule and frequent set discovery has been extended in various directions, allowing more useful patterns to be discovered with special purpose algorithms. We present WARMR, a general purpose inductive logic programming algorithm that addresses frequent query discovery: a very general DATALOG formulation of the frequent pattern discovery problem.The motivation for this novel approach is twofold. First, exploratory data mining is well supported: WARMR offers the flexibility required to experiment with standard and in particular novel settings not supported by special purpose algorithms. Also, application prototypes based on WARMR can be used as benchmarks in the comparison and evaluation of new special purpose algorithms. Second, the unified representation gives insight to the blurred picture of the frequent pattern discovery domain. Within the DATALOG formulation a number of dimensions appear that relink diverged settings.We demonstrate the frequent query approach and its use on two applications, one in alarm analysis, and one in a chemical toxicology domain.


international conference on data engineering | 1996

Knowledge discovery from telecommunication network alarm databases

Kimmo Hätönen; Mika Klemettinen; Heikki Mannila; Pirjo Ronkainen; Hannu Toivonen

Recognizing the context of use is important in making mobile devices as simple to use as possible. Finding out what the users situation is can help the device and underlying service in providing an adaptive and personalized user interface. The device can infer parts of the context of the user from sensor data: the mobile device can include sensors for acceleration, noise level, luminosity, humidity, etc. In this paper we consider context recognition by unsupervised segmentation of time series produced by sensors. Dynamic programming can be used to find segments that minimize the intra-segment variances. While this method produces optimal solutions, it is too slow for long sequences of data. We present and analyze randomized variations of the algorithm. One of them, global iterative replacement or GIR, gives approximately optimal results in a fraction of the time required by dynamic programming. We demonstrate the use of time series segmentation in context recognition for mobile phone applications.

Collaboration


Dive into the Hannu Toivonen's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Oskar Gross

University of Helsinki

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Antoine Doucet

University of La Rochelle

View shared research outputs
Top Co-Authors

Avatar

Jason Tsong-Li Wang

New Jersey Institute of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge