Is this you? Create Your Porfile

Toshihiro Kamishima

National Institute of Advanced Industrial Science and Technology

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Toshihiro Kamishima is active.

Explore More

Publication

Featured researches published by Toshihiro Kamishima.

international conference on data mining | 2011

Fairness-aware Learning through Regularization Approach

Toshihiro Kamishima; Shotaro Akaho; Jun Sakuma

With the spread of data mining technologies and the accumulation of social data, such technologies and data are being used for determinations that seriously affect peoples lives. For example, credit scoring is frequently determined based on the records of past credit data together with statistical prediction techniques. Needless to say, such determinations must be socially and legally fair from a viewpoint of social responsibility, namely, it must be unbiased and nondiscriminatory in sensitive features, such as race, gender, religion, and so on. Several researchers have recently begun to attempt the development of analysis techniques that are aware of social fairness or discrimination. They have shown that simply avoiding the use of sensitive features is insufficient for eliminating biases in determinations, due to the indirect influence of sensitive information. From a privacy-preserving viewpoint, this can be interpreted as hiding sensitive information when classification results are observed. In this paper, we first discuss three causes of unfairness in machine learning. We then propose a regularization approach that is applicable to any prediction algorithm with probabilistic discriminative models. We further apply this approach to logistic regression and empirically show its effectiveness and efficiency.

international conference on data mining | 2009

TrBagg: A Simple Transfer Learning Method and its Application to Personalization in Collaborative Tagging

Toshihiro Kamishima; Masahiro Hamasaki; Shotaro Akaho

The aim of transfer learning is to improve prediction accuracy on a target task by exploiting the training examples for tasks that are related to the target one. Transfer learning has received more attention in recent years, because this technique is considered to be helpful in reducing the cost of labeling. In this paper, we propose a very simple approach to transfer learning: TrBagg, which is the extension of bagging. TrBagg is composed of two stages: Many weak classifiers are first generated as in standard bagging, and these classifiers are then filtered based on their usefulness for the target task. This simplicity makes it easy to work reasonably well without severe tuning of learning parameters. Further, our algorithm equips an algorithmic scheme to avoid negative transfer. We applied TrBagg to personalized tag prediction tasks for social bookmarks Our approach has several convenient characteristics for this task such as adaptation to multiple tasks with low computational cost.

Preference Learning | 2010

A Survey and Empirical Comparison of Object Ranking Methods

Toshihiro Kamishima; Hideto Kazawa; Shotaro Akaho

Ordered lists of objects are widely used as representational forms. Such ordered objects include Web search results or bestseller lists. In spite of their importance, methods of processing orders have received little attention. However, research concerning orders has recently become common; in particular, researchers have developed various methods for the task of Object Ranking to acquire functions for object sorting from example orders. Here, we give a unified view of these methods and compare their merits and demerits.

international conference on data mining | 2005

Supervised ordering - an empirical survey

Toshihiro Kamishima; Hideto Kazawa; Shotaro Akaho

Ordered lists of objects are widely used as representational forms. Such ordered objects include Web search results or bestseller lists. In spite of their importance, methods of processing orders have received little attention. However, research concerning orders has become common; in particular, researchers have developed various methods for the task of supervised ordering to acquire functions for object sorting from example orders. Here, we give a unified view of these methods and our new one, and empirically survey their merits and demerits.

international conference on data mining | 2006

Efficient Clustering for Orders

Toshihiro Kamishima; Shotaro Akaho

Lists of ordered objects are widely used as representational forms. Such ordered objects include Web search results or best-seller lists. Clustering is a useful data analysis technique for grouping mutually similar objects. To cluster orders, hierarchical clustering methods have been used together with dissimilarities defined between pairs of orders. However, hierarchical clustering methods cannot be applied to large-scale data due to their computational cost in terms of the number of orders. To avoid this problem, we developed an k-o?means algorithm. This algorithm successfully extracted grouping structures in orders, and was computationally efficient with respect to the number of orders. However, it was not efficient in cases where there are too many possible objects yet. We therefore propose a new method (k-o?means-EBC), grounded on a theory of order statistics. We further propose several techniques to analyze acquired clusters of orders.

Machine Learning | 2003

Learning from Cluster Examples

Toshihiro Kamishima; Fumio Motoyoshi

Learning from cluster examples (LCE) is a hybrid task combining features of two common grouping tasks: learning from examples and clustering. In LCE, each training example is a partition of objects. The task is then to learn from a training set, a rule for partitioning unseen object sets. A general method for learning such partitioning rules is useful in any situation where explicit algorithms for deriving partitions are hard to formalize, while individual examples of correct partitions are easy to specify. In the past, clustering techniques have been applied to such problems, despite being essentially unsuited to the task. We present a technique that has qualitative advantages over standard clustering approaches. We demonstrate these advantages by applying our method to problems in two domains; one with dot patterns and one with more realistic vector-data images.

international conference on data mining | 2002

Learning from order examples

Toshihiro Kamishima; Shotaro Akaho

We advocate a new learning task that deals with orders of items, and we call this the learning from order examples (LOE) task. The aim of the task is to acquire the rule that is used for estimating the proper order of a given unordered item set. The rule is acquired from training examples that are ordered item sets. We present several solution methods for this task, and evaluate the performance and the characteristics of these methods based on the experimental results of tests using both artificial data and realistic data.

international conference on data mining | 2006

Dimension Reduction for Supervised Ordering

Toshihiro Kamishima; Shotaro Akaho

Ordered lists of objects are widely used as representational forms. Such ordered objects include Web search results and best-seller lists. Techniques for processing such ordinal data are being developed, particularly methods for a supervised ordering task: i.e., learning functions used to sort objects from sample orders. In this article, we propose two dimension reduction methods specifically designed to improve prediction performance in a supervised ordering task.

Artificial Intelligence Review | 2006

Supervised ordering by regression combined with Thurstone's model

Toshihiro Kamishima; Shotaro Akaho

In this paper, we advocate a learning task that deals with the orders of objects, which we call the Supervised Ordering task. The term order means a sequence of objects sorted according to a specific property, such as preference, size, cost. The aim of this task is to acquire the rule that is used for estimating an appropriate order of a given unordered object set. The rule is acquired from sample orders consisting of objects represented by attribute vectors. Developing solution methods for accomplishing this task would be useful, for example, in carrying out a questionnaire survey to predict one’s preferences. We develop a solution method based on a regression technique imposing a Thurstone’s model and evaluate the performance and characteristics of these methods based on the experimental results of tests using both artificial data and real data.

international conference on data mining | 2004

Filling-in missing objects in orders

Toshihiro Kamishima; Shotaro Akaho

Filling-in techniques are important, since missing values frequently appear in real data. Such techniques have been established for categorical or numerical values. Though lists of ordered objects are widely used as representational forms (e.g., Web search results, best-seller lists), filling-in techniques for orders have received little attention. We therefore propose a simple but effective technique to fill-in missing objects in orders. We built this technique into our collaborative filtering system.

Explore More