Comparisons Between Data Clustering Algorithms

Comparisons Between Data Clustering Algorithms 

Osama Abu Abbas

Computer Science Department, Yarmouk University, Jordan

Abstract: Clustering is a division of data into groups of similar objects. Each group, called a cluster, consists of objects that are similar between themselves and dissimilar compared to objects of other groups. This paper is intended to study and compare different data clustering algorithms. The algorithms under investigation are: k-means algorithm, hierarchical clustering algorithm, self-organizing maps algorithm, and expectation maximization clustering algorithm. All these algorithms are compared according to the following factors: size of dataset, number of clusters, type of dataset and type of software used. Some conclusions that are extracted belong to the performance, quality, and accuracy of the clustering algorithms.  

Keywords: Clustering, k-means algorithm, hierarchical clustering algorithm, self-organizing maps algorithm, expectation maximization clustering algorithm. 

Received January 18, 2007; accepted May 2, 2007 

Full Text

 
Read 6240 times Last modified on Wednesday, 20 January 2010 01:55
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…