Performance Evaluation of Clustering Algorithm Using Different Datasets

Peerzada Hamid Ahmad, Shilpa Dang

Abstract


With the advancement of technology, Cluster analysis plays an important role in analyzing text mining techniques. It divides the dataset into several meaningful clusters to reflect the dataset’s natural structure. In this paper we analyze the four major clustering algorithms namely Simple K-mean, DBSCAN, HCA and MDBCA and compare the performance of these four clustering algorithms. Performance of these four techniques are presented and compared using a clustering tool WEKA. The results are tested on different datasets namely Abalone, Bankdata, Router, SMS and Webtk dataset using WEKA interface and compute instances, attributes and the time taken to build the model. I have also highlighted the advantages, disadvantages and applications of each clustering technique.

Keywords: Density based clustering algorithm; Hierarchical clustering algorithm; Make density based clustering; Simple K-mean.


Full Text: PDF
Download the IISTE publication guideline!

To list your conference here. Please contact the administrator of this platform.

Paper submission email: JIEA@iiste.org
ISSN (Paper)2224-5782 ISSN (Online)2225-0506
Please add our address "contact@iiste.org" into your email contact list.
This journal follows ISO 9001 management standard and licensed under a Creative Commons Attribution 3.0 License.
Copyright © www.iiste.org