IEICE Trans. Inf. Syst. | 2019

Recursive Nearest Neighbor Graph Partitioning for Extreme Multi-Label Learning

 

Abstract


As the data size of Web-related multi-label classification problems continues to increase, the label space has also grown extremely large. For example, the number of labels appearing in Web page tagging and E-commerce recommendation tasks reaches hundreds of thousands or even millions. In this paper, we propose a graph partitioning tree (GPT), which is a novel approach for extreme multi-label learning. At an internal node of the tree, the GPT learns a linear separator to partition a feature space, considering approximate k-nearest neighbor graph of the label vectors. We also developed a simple sequential optimization procedure for learning the linear binary classifiers. Extensive experiments on large-scale real-world data sets showed that our method achieves better prediction accuracy than state-of-the-art tree-based methods, while maintaining fast prediction. key words: extreme multi-label classification, k-nearest neighbor graph, tree-based classifier

Volume 102-D
Pages 579-587
DOI 10.1587/TRANSINF.2018EDP7106
Language English
Journal IEICE Trans. Inf. Syst.

Full Text