Implementation and Analysis of Clustering Algorithms in Data Mining

Authors

  • Prabhjot Kaur GES Polytechnic College Hoshiarpur
  • Robin Parkash Mathur Lovely Professional University Phagwara

DOI:

https://doi.org/10.24297/ijct.v6i1.4448

Abstract

Data mining plays a very important role in information industry and in society due to the presence of huge amount of data. Organizations in the whole world are already aware about data mining. Data mining is the process which uses various kinds of data analysis tools to obtain patterns which also referred to as knowledge discovery from data. Clustering is called unsupervised learning algorithm as groups are not predefined but defined by the data. There are so many research areas in data mining. This paper is focusing on performance and evaluation of clustering algorithm: K-means, SOM and HAC. Evaluations of these three algorithms are purely based on the survey based analysis. These algorithms are analyzed by applying on the data set of banking which is a very high dimensional data. Performances of these algorithms are also compared with each other. Our results indicate that SOM technique is better than k-means and as good as or better than the hierarchical clustering technique. We have also generated one code in Orange Python which is the enhanced algorithm based on the hybrid approach of SOM, K-means and HAC.

Downloads

Download data is not yet available.

Author Biographies

Prabhjot Kaur, GES Polytechnic College Hoshiarpur

CSE

Robin Parkash Mathur, Lovely Professional University Phagwara

CSE

Downloads

Published

2013-05-30

How to Cite

Kaur, P., & Mathur, R. P. (2013). Implementation and Analysis of Clustering Algorithms in Data Mining. INTERNATIONAL JOURNAL OF COMPUTERS &Amp; TECHNOLOGY, 6(1), 232–236. https://doi.org/10.24297/ijct.v6i1.4448

Issue

Section

Research Articles