K-Means Clustering for Market Basket Data Segmentation

Authors

  • Eka Pandu Cynthia UIN Sultan Syarif Kasim Riau
  • Maulidania Mediawati Cynthia Politeknik Lembaga Pendidikan dan Pengembangan Profesi Indonesia
  • Dessy Nia Cynthia Universitas Terbuka

          DOI:

https://doi.org/10.62712/ijapset.v1i1.4

Keywords:

K-Means Clustering, Market Basket Analysis, Data Mining, Customer Segmentation, Retail Analytics

Abstract

The rapid growth of retail transaction data has created new opportunities for businesses to analyze customer purchasing behavior and improve decision-making strategies. Market basket data contains valuable information about product combinations purchased together within a single transaction, which can reveal hidden patterns of consumer behavior. This study aims to apply the K-Means clustering algorithm to segment market basket transaction data based on similarities in purchasing patterns. The research method involves several stages, including data preprocessing, transformation of transaction data into a binary feature matrix, determination of the optimal number of clusters, and clustering analysis using the K-Means algorithm. The results show that the clustering process successfully groups transactions into several clusters representing different purchasing characteristics. Each cluster reflects distinct consumer behavior patterns such as routine household purchases, breakfast-related items, snack-oriented transactions, and fresh product selections. These findings demonstrate that K-Means clustering can effectively identify meaningful patterns within market basket datasets. The clustering results provide useful insights that can support retail strategies such as targeted promotions, product bundling, store layout optimization, and inventory management. Overall, the application of clustering techniques in market basket analysis contributes to improving data-driven decision-making and enhancing the understanding of customer purchasing behavior in retail environments.

References

[1] Oseremi Onesi-Ozigagun, Yinka James Ololade, Nsisong Louis Eyo-Udo, and Damilola Oluwaseun Ogundipe, “Data-driven decision making: Shaping the future of business efficiency and customer engagement,” Int. J. Multidiscip. Res. Updat., vol. 7, no. 2, pp. 019–029, 2024, doi: 10.53430/ijmru.2024.7.2.0031.

[2] I. W. P. Pratama, “Exploring the Depths of Market Basket Analysis: A Comprehensive Guide to Transaction Analysis with FP-Growth and Apriori Algorithms,” INVOTEK J. Inov. Vokasional dan Teknol., vol. 23, no. 2, pp. 109–118, 2024, doi: 10.24036/invotek.v23i2.1094.

[3] D. N. Nguyen, V. H. Nguyen, T. Trinh, T. Ho, and H. S. Le, “A personalized product recommendation model in e-commerce based on retrieval strategy,” J. Open Innov. Technol. Mark. Complex., vol. 10, no. 2, p. 100303, 2024, doi: 10.1016/j.joitmc.2024.100303.

[4] S. Brandy, “Overcoming Challenges and Unlocking the Potential: Empowering Small and Medium Enterprises (SMEs) with Data Analytics Solutions,” Int. J. Inf. Technol. Comput. Sci. Appl., vol. 1, no. 3, pp. 150–160, 2023, doi: 10.58776/ijitcsa.v1i3.47.

[5] D. Dwiputra, A. Mulyo Widodo, H. Akbar, and G. Firmansyah, “Evaluating the Performance of Association Rules in Apriori and FP-Growth Algorithms: Market Basket Analysis to Discover Rules of Item Combinations,” J. World Sci., vol. 2, no. 8, pp. 1229–1248, 2023, doi: 10.58344/jws.v2i8.403.

[6] Caroline, Yuswardi, and Y. U. Rofi’i, “Analysis of E-Commerce Purchase Patterns Using Big Data: An Integrative Approach to Understanding Consumer Behavior,” Int. J. Softw. Eng. Comput. Sci., vol. 3, no. 3, pp. 352–364, 2023, doi: 10.35870/ijsecs.v3i3.1840.

[7] M. Sarkar, A. R. Puja, and F. R. Chowdhury, “Optimizing Marketing Strategies with RFM Method and K-Means Clustering-Based AI Customer Segmentation Analysis,” J. Bus. Manag. Stud., vol. 6, no. 2, pp. 54–60, 2024, doi: 10.32996/jbms.2024.6.2.5.

[8] Nnenna Ijeoma Okeke, Olufunke Anne Alabi, Abbey Ngochindo Igwe, Onyeka Chrisanctus Ofodile, and Chikezie Paul-Mikki Ewim, “Personalized customer journeys for underserved communities: Tailoring solutions to address unique needs,” World J. Adv. Res. Rev., vol. 24, no. 1, pp. 1988–2003, 2024, doi: 10.30574/wjarr.2024.24.1.3205.

[9] S. O. Fageeri, M. A. Kausar, and A. Soosaimanickam, “MBA: Market Basket Analysis Using Frequent Pattern Mining Techniques,” Int. J. Recent Innov. Trends Comput. Commun., vol. 11, pp. 15–21, 2023, doi: 10.17762/ijritcc.v11i5s.6591.

[10] Y. Januzaj, E. Beqiri, and A. Luma, “Determining the Optimal Number of Clusters using Silhouette Score as a Data Mining Technique,” Int. J. online Biomed. Eng., vol. 19, no. 4, pp. 174–182, 2023, doi: 10.3991/ijoe.v19i04.37059.

[11] R. Zaib and O. Ourlis, “Large Scale Data Using K-Means,” Mesopotamian J. Big Data, vol. 2023, pp. 36–45, 2023, doi: 10.58496/MJBD/2023/006.

[12] T. R. Noviandy et al., “Environmental and Economic Clustering of Indonesian Provinces: Insights from K-Means Analysis,” Leuser J. Environ. Stud., vol. 2, no. 1, pp. 41–51, 2024, doi: 10.60084/ljes.v2i1.181.

[13] O. R. Tiamiyu, “Unveiling Hidden Money Laundering Networks: The Application of Graph Neural Networks in Financial Transaction Analysis,” J. Comput. Anal. Appl., vol. 34, no. 9, pp. 50–74, 2025, [Online]. Available: https://orcid.org/0009-0000-3991-0683

[14] B. Dwarakanath, S. Sai Ganesh, M. Mathesh, N. Goutham Ram, and K. Suresh Kumar, “Investigating Customer Segmentation Dynamics via Market Basket Analysis: An Extensive Exploration of Market Insights,” in Lecture Notes in Networks and Systems, Springer, 2025, pp. 221–242. doi: 10.1007/978-981-96-5784-1_18.

[15] A. Aldelemy and R. A. Abd-Alhameed, “Binary Classification of Customer’s Online Purchasing Behavior Using Machine Learning,” J. Tech., vol. 5, no. 2, pp. 163–186, 2023, doi: 10.51173/jt.v5i2.1226.

[16] Rianat Abbas, Ajuwon Samuel, Kumbirai Bernard Muhwati, Sarah Mavire, Enock Katenda, and Adetomiwa Adesokan, “Machine learning and Blockchain approaches for enhancing fraud prevention in financial transactions,” Eng. Sci. Technol. J., vol. 6, no. 6, pp. 296–312, 2025, doi: 10.51594/estj.v6i6.1971.

[17] J. Tang, “Unlocking Retail Insights: Predictive Modeling and Customer Segmentation Through Data Analytics,” J. Theor. Appl. Electron. Commer. Res. , vol. 20, no. 2, p. 59, 2025, doi: 10.3390/jtaer20020059.

[18] M. I. Akazue et al., “Handling Transactional Data Features via Associative Rule Mining for Mobile Online Shopping Platforms,” Int. J. Adv. Comput. Sci. Appl., vol. 15, no. 3, pp. 530–538, 2024, doi: 10.14569/IJACSA.2024.0150354.

[19] S. Chakraborty, “a Study on Hybrid Recommender Systems for Effective Targeted Marketing in E-Commerce Platforms,” Int. J. Adv. Bus. Manag. Res., vol. 02, no. 04, pp. 54–64, 2025, doi: 10.62674/ijabmr.2025.v2i04.006.

[20] J. M. Bruhin, M. R. Fengler, W. Koeniger, and R. Rohrkemper, “Consumer spending in Switzerland: insights from a novel transactional data index,” Swiss J. Econ. Stat., vol. 161, no. 1, p. 14, 2025, doi: 10.1186/s41937-025-00146-5.

Downloads

Published

2026-03-09

How to Cite

Cynthia, E. P., Cynthia, M. M., & Cynthia, D. N. (2026). K-Means Clustering for Market Basket Data Segmentation. International Journal of Applied Science and Technology Application, 1(1), 30–38. https://doi.org/10.62712/ijapset.v1i1.4