2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) | 2019

An Overview of Clustering Models with an Application to Document Clustering

 
 
 

Abstract


This paper presents an overview of selected clustering models and shows an application of K-Means algorithm to document clustering. In the introductory part, the definitions of basic concepts and common characteristics of clustering models are described. Then an overview of clustering models is given. The methods of clustering, basic characteristics, visualization and possible input data for each algorithm are presented. The authors also explain the assessment of each algorithm taking into consideration measures such as Rand index, homogeneity completeness, V-measure and Silhouette coefficient. Furthermore, the paper describes the application of the K-Means algorithm to document clustering showing the final result and elaborating the procedures applied when clustering the documents.

Volume None
Pages 1659-1664
DOI 10.23919/MIPRO.2019.8756868
Language English
Journal 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

Full Text