Journal of Statistical Software | 2019

CoClust: A Python Package for Co-Clustering

Abstract


Co-clustering (also known as biclustering) is an important extension of cluster analysis because it simultaneously groups the objects and features of a matrix, yielding row and column clusters that are both more accurate and easier to interpret. This paper presents the theory underlying several effective diagonal and non-diagonal co-clustering algorithms and describes CoClust, a package that provides implementations of these algorithms. The quality of the results produced by the implemented algorithms is demonstrated through extensive tests on datasets of various sizes and degrees of balance. CoClust has been designed to complement and easily interface with popular Python machine learning libraries such as scikit-learn.
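
To make the notion of simultaneous row and column clusters concrete, the following minimal sketch shows what a diagonal co-clustering result looks like on a toy matrix. It does not use CoClust or any of its algorithms: the matrix, the labels, and the helper functions block_sums and reorder are all hypothetical and written by hand for illustration only.

```python
# Toy data matrix: 4 rows (objects) x 4 columns (features). Values are made up.
matrix = [
    [0, 3, 0, 2],
    [4, 0, 5, 0],
    [0, 1, 0, 4],
    [2, 0, 3, 0],
]

# Hypothetical co-clustering result, written by hand (not computed by CoClust):
# one cluster label per row and one cluster label per column.
row_labels = [0, 1, 0, 1]
col_labels = [1, 0, 1, 0]


def block_sums(matrix, row_labels, col_labels):
    """Sum the entries that fall in each (row cluster, column cluster) block."""
    n_row_clusters = max(row_labels) + 1
    n_col_clusters = max(col_labels) + 1
    sums = [[0] * n_col_clusters for _ in range(n_row_clusters)]
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            sums[row_labels[i]][col_labels[j]] += value
    return sums


def reorder(matrix, row_labels, col_labels):
    """Permute rows and columns so that members of a cluster are adjacent."""
    row_order = sorted(range(len(row_labels)), key=lambda i: row_labels[i])
    col_order = sorted(range(len(col_labels)), key=lambda j: col_labels[j])
    return [[matrix[i][j] for j in col_order] for i in row_order]


# All of the mass sits in the diagonal blocks (row cluster k, column cluster k).
print(block_sums(matrix, row_labels, col_labels))

# After reordering, the block-diagonal structure is visible directly.
for row in reorder(matrix, row_labels, col_labels):
    print(row)
```

In this toy case every nonzero entry lies in a block that pairs row cluster k with column cluster k, which is the structure a diagonal co-clustering aims to recover. A non-diagonal co-clustering would allow any block of the partition to be dense.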

Volume 88
Pages 1-29
DOI 10.18637/JSS.V088.I07
Language English
Journal Journal of Statistical Software
