Design and Development of a Customer Data Platform for Loyalty Programs: Data Deduplication and Personalized Marketing Infrastructure
Main Article Content
Abstract
This position paper presents the architecture and deployment of a Customer Data Platform (CDP) for the Koçtaş loyalty program to enhance data quality, unification, and personalization-based marketing strategies. The project entails bringing together disparate customer data collected across multiple channels into a single, deduplicated data store to enable advanced analytics and AI-driven personalization. By employing a combination of big data technologies, cloud infrastructure, and machine learning algorithms, the proposed system will enable real-time data processing of information, customer segmentation, and predictive modeling. Through this system, the platform will enhance marketing performance, customer satisfaction, and operational efficiency while adhering to data privacy legislations such as GDPR and KVKK compliance. The article situates the project within the contexts of customer relationship management (CRM), loyalty program studies, and personalization studies. It discusses data consolidation, deduplication, and system development processes, highlighting innovative elements such as adaptive algorithms, real-time learning processes, and secure data management. Expected gains are increased marketing ROI, additional loyal customers, and streamlined operational processes. The paper concludes with the analysis of the long-term potential contribution of the project and with future research avenues for large-scale data-driven marketing infrastructures.
Downloads
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
References
Stourm, V., Neslin, S. A., Bradlow, E. T., Breugelmans, E., Chun, S. Y., Gardete, P., ... & Venkatesan, R. (2020). Refocusing loyalty programs in the era of big data: a societal lens paradigm. Marketing Letters, 31(4), 405-418.
Morais, C. P., Ferreira, J., & Jayantilal, S. (2024). A systematic literature review on the network perspective and resource interaction: where are we now and where should we go?. Journal of Organizational Change Management, 37(7), 1559-1593.
Valdez Mendia, J. M., & Flores-Cuautle, J. D. J. A. (2022). Toward customer hyperpersonalization experience—A data-driven approach. Cogent Business & Management, 9 (1), 2041384.
Hardcastle, K., Vorster, L., & Brown, D. M. (2025). Understanding customer responses to AI-Driven personalized journeys: impacts on the customer experience. Journal of Advertising, 54(2), 176-195.
Asha, A. I., Hasan, S. K., Islam, M. A., Priya, S. A., & Islam, N. M. (2024). The Role of Big Data Analytics in Personalized Marketing: Enhancing Consumer Engagement and Business Outcomes.
Boiński, P., Sienkiewicz, M., Bębel, B., Wrembel, R., Gałęzowski, D., & Graniszewski, W. (2022). On customer data deduplication: Lessons learned from a r&d project in the financial sector. In CEUR Workshop Proceedings (Vol. 3135).
Goel, A., & Prabha, C. (2025). Leveraging Data Deduplication Approaches in the Cloud Storage: A Succinct Review. Procedia Computer Science, 259, 1640-1651.
Sehat, H., Pagnin, E., & Lucani, D. E. (2021, June). Yggdrasil: Privacy-aware dual deduplication in multi client settings. In ICC 2021-IEEE International Conference on Communications (pp. 1-6). IEEE.
Shi, H., Liu, X., Lv, F., Xue, H., Hu, J., Du, S., & Li, T. (2025). A pre-trained data deduplication model based on active learning. Expert Systems with Applications, 128628.
Mishra, D., & Patwa, S. (2019, March). A study of data De-duplication methods. In Proceedings of 2nd International Conference on Advanced Computing and Software Engineering (ICACSE).
Andrzejewski, W., Bębel, B., Boiński, P., & Wrembel, R. (2024). On tuning parameters guiding similarity computations in a data deduplication pipeline for customers records.
Ogunwole, O., Onukwulu, E. C., Sam-Bulya, N. J., Joel, M. O., & Achumie, G. O. (2022). Optimizing automated pipelines for realtime data processing in digital media and e-commerce. International Journal of Multidisciplinary Research and Growth Evaluation, 3(1), 112-120.
Ravikanth, M., Korra, S., Mamidisetti, G., Goutham, M., & Bhaskar, T. (2024). An efficient learning based approach for automatic record deduplication with benchmark datasets. Scientific Reports, 14(1), 16254.
Elouataoui, W., El Alaoui, I., El Mendili, S., & Gahi, Y. (2022). An end-to-end big data deduplication framework based on online continuous learning. International Journal of Advanced Computer Science and Applications, 13(9).
Bogart, C., Chhajer, R., Singh, B., Fontana, T., & Sakr, M. (2025). PlantD: Performance, Latency ANalysis, and Testing for Data Pipelines--An Open Source Measurement, Testing, and Simulation Framework. arXiv preprint arXiv:2504.10692.
Ahmed, S. M. M., Owais, M., Raza, M., Nadeem, Q., & Ahmed, B. (2025). The Impact of AI-Driven Personalization on Consumer Engagement and Brand Loyalty. Qlantic Journal of Social Sciences, 6(1), 311-323.
Hassan, N., Abdelraouf, M., & El-Shihy, D. (2025). The moderating role of personalized recommendations in the trust–satisfaction–loyalty relationship: an empirical study of AI-driven e-commerce. Future Business Journal, 11(1), 66.
M. S. Aktaş and M. Kapdan, “Structural Code Clone Detection Methodology Using Software Metrics,” Int. J. Softw. Eng. Knowl. Eng., vol. 26, no. 2, pp. 307–332, 2016, doi: 10.1142/S0218194016500133.
M. Uzun-Per, A. V. Gürel, A. B. Can, and M. S. Aktaş, “Scalable Recommendation Systems Based on Finding Similar Items and Sequences,” Concurrency Comput. Pract. Exper., vol. 34, no. 20, e6841, 2022.
I. Erdem, R. F. Oğuz, E. Ölmezogullari, and M. S. Aktaş, “Test Script Generation Based on Hidden Markov Models Learning from User Browsing Behaviors,” in Proc. IEEE Big Data, 2021, pp. 2998–3005.
M. Oz, C. Kaya, E. Ölmezogullari, and M. S. Aktaş, “On the Use of Generative Deep Learning Approaches for Generating Hidden Test Scripts,” Int. J. Softw. Eng. Knowl. Eng., vol. 31, no. 10, pp. 1447–1468, 2021.
M. J. Baeth and M. Aktaş, “On the Detection of Information Pollution and Violation of Copyrights in the Social Web,” in Proc. IEEE SOCA, 2015.
B. Yildiz, “Efficient Text Classification with Deep Learning on Imbalanced Data Improved with Better Distribution,” Turk. J. Sci. Technol., vol. 17, no. 1, pp. 89–98, 2022.
B. Yildiz, “Reinforcement Learning Using Fully Connected, Attention, and Transformer Models in Knapsack Problem Solving,” Concurrency Comput. Pract. Exper., vol. 34, no. 9, e6509, 2022.
D. Bakır, M. S. Aktaş, and B. Yildiz, “A Model-Based Evaluation Metric for Question Answering Systems,” Int. J. Softw. Eng. Knowl. Eng., vol. 35, no. 2, pp. 243–262, 2025, doi: 10.1142/S0218194025500032.
F. O. Çatak, M. A. Imran, Y. Dalveren, B. Yildiz, and A. Kara, “Radar Emitter Localization Based on Multipath Exploitation Using Machine Learning,” IEEE Access, https://doi.org/10.1109/ACCESS.2024.3488959, 2024.
B. Yildiz, “Enhancing Image Resolution with Generative Adversarial Networks,” in Proc. UBMK, 2022.