Document clustering plays an important role in several applications. K-Medoids and CLARA are among the most notable algorithms for clustering. These algorithms together with their relatives have been employed widely in clustering problems. In this paper we present a solution to improve the original K-Medoids and CLARA by making change in the way they assign objects to clusters. Experimental results on various document datasets using three distance measures have shown that the approach helps enhance the clustering outcomes substantially as demonstrated by three quality metrics, i.e. Entropy, Purity and F-Measure.
Modification to K-medoids and CLARA for effective document clustering / Nguyen, Phuong T.; Eckert, Kai; Ragone, Azzurra; Di Noia, Tommaso. - 10352:(2017), pp. 481-491. (Intervento presentato al convegno 23rd International Symposium on Methodologies for Intelligent Systems, ISMIS 2017 tenutosi a Warsaw, Poland nel June 26-29, 2017) [10.1007/978-3-319-60438-1_47].
Modification to K-medoids and CLARA for effective document clustering
Ragone, Azzurra;Di Noia, Tommaso
2017-01-01
Abstract
Document clustering plays an important role in several applications. K-Medoids and CLARA are among the most notable algorithms for clustering. These algorithms together with their relatives have been employed widely in clustering problems. In this paper we present a solution to improve the original K-Medoids and CLARA by making change in the way they assign objects to clusters. Experimental results on various document datasets using three distance measures have shown that the approach helps enhance the clustering outcomes substantially as demonstrated by three quality metrics, i.e. Entropy, Purity and F-Measure.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.