A Method for Finding the Appropriate Number of Clusters
Huan Doan and Dinh Nguyen
Department of Information System, University of Information Technology, Vietnam
Abstract: A drawback of almost all partition-based clustering algorithms is that the number of clusters must be specified in advance. Identifying the true number of clusters beforehand is a difficult problem. Several works have studied this issue, but no method is perfect in every case. This paper proposes a method for finding the appropriate number of clusters during the clustering process by constructing an index that indicates the appropriate number of clusters. This index is built from an intra-cluster coefficient and an inter-cluster coefficient. The intra-cluster coefficient reflects the distortion within a cluster. The inter-cluster coefficient reflects the distance between clusters. Both coefficients are computed using only the extremely marginal objects of the clusters. The search for the extremely marginal objects and the construction of the index are integrated
Keywords: Method for finding the number of clusters, appropriate number of clusters, fuzzy c-means, clustering algorithm.