EXPERT SYSTEMS, vol.29, no.1, pp.25-38, 2012 (SCI-Expanded)
Data clustering is a key task for various processes including sequence analysis and pattern recognition. This paper studies a clustering algorithm that aimed to increase accuracy and sensitivity when working with biological data such as DNA sequences. The new algorithm is a modified version of fuzzy C-means (FCM) and is based on the well-known self-organizing map (SOM). In order to show the performance of the algorithm, seven different data sets are processed. The experimental results demonstrate that the proposed algorithm has the potential to outperform SOM and FCM in terms of clustering and classification accuracy abilities. Additionally, a brief comparison is made the proposed algorithm with some previously studied 'FCM-SOM' hybrid algorithms from the literature.