UTHM Institutional Repository

Clustering for binary data sets by using genetic algorithm-incremental K-means

Saharan, S. and Baragona, R. and Nor, M. E. and Salleh, R. M. and M. Asrah, N. (2018) Clustering for binary data sets by using genetic algorithm-incremental K-means. Journal of Physics: Conference Series, 995 (012038). pp. 1-5. ISSN 17426596

[img] PDF
J6171_5b40d48fed1a6e7c1dece63eacfe7197.pdf

Download (443kB)

Abstract

This research was initially driven by the lack of clustering algorithms that specifically focus in binary data. To overcome this gap in knowledge, a promising technique for analysing this type of data became the main subject in this research, namely Genetic Algorithms (GA). For the purpose of this research, GA was combined with the Incremental Kmeans (IKM) algorithm to cluster the binary data streams. In GAIKM, the objective function was based on a few sufficient statistics that may be easily and quickly calculated on binary numbers. The implementation of IKM will give an advantage in terms of fast convergence. The results show that GAIKM is an efficient and effective new clustering algorithm compared to the clustering algorithms and to the IKM itself. In conclusion, the GAIKM outperformed other clustering algorithms such as GCUK, IKM, Scalable K-means (SKM) and K-means clustering and paves the way for future research involving missing data and outliers.

Item Type: Article
Subjects: Q Science > QA Mathematics > QA273 Probabilities. Mathematical statistics
Divisions: Faculty of Applied Science and Technology > Department of Mathematics and Statistic
Depositing User: Mr. Mohammad Shaifulrip Ithnin
Date Deposited: 23 Jun 2019 06:59
Last Modified: 23 Jun 2019 06:59
URI: http://eprints.uthm.edu.my/id/eprint/11209
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year