Design and Development of an Automated Product Categorization Software: AI-Driven Solutions for E-Commerce Platforms

Amirkia Rafiei Oskooei

Yildiz Technical University

https://orcid.org/0009-0004-3490-550X

Asli Terim

Procat Research and Development Center

https://orcid.org/0009-0003-1082-5866

Cemal Arık

Procat Research and Development Center

https://orcid.org/0009-0002-4700-4597

Engin Bıçakçı

Procat Research and Development Center

https://orcid.org/0009-0001-9398-4795

DOI: https://doi.org/10.56038/oprd.v3i1.399

Keywords: Automatic Product Categorization, E-commerce, Artificial Intelligence, Machine Learning, User Experience


Abstract

This article outlines the design and development process of an automatic product categorization software intended for use in e-commerce and online marketplace platforms. The project aims to tackle the urgent issue of effectively classifying a wide range of products within digital markets. By utilizing artificial intelligence (AI) and machine learning methodologies, the software effectively analyzes product descriptions, enabling users to seamlessly incorporate products through automated categorization. The significance of the project is rooted in its ability to guarantee the accuracy and consistency of category hierarchies, automate the process of categorizing, and improve the overall user experience. The main goals involve the process of project planning, data collecting, and preparation. Machine learning models have been built and subsequently incorporated to facilitate the study of product descriptions. Through strict evaluation and optimization processes, a high level of accuracy and efficiency is achieved, resulting in several anticipated benefits. These benefits cover automated product categorization, enhanced user experience, and the potential for online platforms to gain a competitive edge. The key elements of innovation involve AI-driven textual analysis, learning methodologies grounded in data, and the ability to adapt to diverse industry contexts. Precautions and backup strategies are implemented to tackle technical issues, including the selection of machine learning libraries and algorithms, ensuring data quality, and integrating with various platforms. The success criteria encompass the objective of achieving a minimum prediction accuracy rate of 90%, optimizing business efficiency, enhancing user pleasure, and ensuring smooth system functioning. This project is a significant contribution to the field of product categorization inside the digital marketplace, as it provides automation, accuracy, and efficiency, ultimately resulting in an enhanced user experience.


References

Lin, Yiu-Chang, et al. "A dataset and baselines for e-commerce product categorization." Proceedings of the 2019 ACM SIGIR international conference on theory of information retrieval. 2019. DOI: https://doi.org/10.1145/3341981.3344237

Pan, Hong, and Hanxun Zhou. "Study on convolutional neural network and its application in data mining and sales forecasting for E-commerce." Electronic Commerce Research 20 (2020): 297-320. DOI: https://doi.org/10.1007/s10660-020-09409-0

Cao, Zhihao, Shaomin Mu, and Mengping Dong. "Two-attribute e-commerce image classification based on a convolutional neural network." The Visual Computer 36 (2020): 1619-1634. DOI: https://doi.org/10.1007/s00371-019-01763-x

Kozareva, Zornitsa. "Everyone likes shopping! multi-class product categorization for e-commerce." Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: human language technologies. 2015. DOI: https://doi.org/10.3115/v1/N15-1147

Somvanshi, Madan, et al. "A review of machine learning techniques using decision tree and support vector machine." 2016 international conference on computing communication control and automation (ICCUBEA). IEEE, 2016. DOI: https://doi.org/10.1109/ICCUBEA.2016.7860040

Ha, Jung-Woo, Hyuna Pyo, and Jeonghee Kim. "Large-scale item categorization in e-commerce using multiple recurrent neural networks." Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. DOI: https://doi.org/10.1145/2939672.2939678

Uçar, Kemal Toprak. Multi-class Categorization of User-Generated Content in a Domain Specific Medium: Inferring Product Specifications from E-Commerce Marketplaces. Diss. Marmara Universitesi (Turkey), 2019. DOI: https://doi.org/10.1007/978-3-030-23756-1_31

Chavaltada, Chanawee, Kitsuchart Pasupa, and David R. Hardoon. "A comparative study of machine learning techniques for automatic product categorisation." Advances in Neural Networks-ISNN 2017: 14th International Symposium, ISNN 2017, Sapporo, Hakodate, and Muroran, Hokkaido, Japan, June 21–26, 2017, Proceedings, Part I 14. Springer international publishing, 2017.

Aktas, M.S., et al. "Information services for dynamically assembled semantic grids", The First International Conference on Semantics Knowledge and Grid (SKG 2005) Beijing China, 2005. DOI: https://doi.org/10.1109/SKG.2005.83

Aktas, M.S. et al., "Information services for grid/web service oriented architecture (soa) based geospatial applications", The First International Conference on Semantics Knowledge and Grid (SKG 2005) Beijing China, 2005

Aktas, M.S., Fox, G.C., Pierce, M., Managing dynamic metadata as context, The 2005 Istanbul International Computational Science and Engineering Conference (ICCSE2005), Istanbul, Turkey, 2005.

Aktas, M.S., et al., Implementing geographical information system grid services to support computational geophysics in a service-oriented environment. NASAEarth-Sun System Technology Conference, University of Maryland, Adelphi, Maryland, 2005.

Baloglu, A., Aktas, M. S., BlogMiner: Web blog mining application for classification of movie reviews, 2010 Fifth International Conference on Internet and Web Applications and Services, 2010. DOI: https://doi.org/10.1109/ICIW.2010.19

Uygun, Y., et al., On the Large-scale Graph Data Processing for User Interface Testing in Big Data Science Projects, 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA, 2020, pp. 2049-2056, doi: 10.1109/BigData50022.2020.9378153. DOI: https://doi.org/10.1109/BigData50022.2020.9378153

Olmezogullari, E.; Aktas, M. S., Pattern2Vec: Representation of clickstream data sequences for learning user navigational behavior. Concurrency and Computation: Practice and Experience 34 (9), 2022. DOI: https://doi.org/10.1002/cpe.6546

Olmezogullari, E.; Aktas, M. S., Representation of Click-Stream DataSequences for Learning User Navigational Behavior by Using Embeddings. 2020 IEEE International Conference on Big Data (Big Data), 3173-3179, 2020. DOI: https://doi.org/10.1109/BigData50022.2020.9378437

Sahinoglu, M. et al., Mobile Application Verification: A Systematic Mapping Study. In: , et al. Computational Science and Its Applications – ICCSA 2015. ICCSA 2015. Lecture Notes in Computer Science, vol 9159. Springer, Cham. https://doi.org/10.1007/978-3-319-21413-9 11 DOI: https://doi.org/10.1007/978-3-319-21413-9_11

Kapdan, M. et al., On the Structural Code Clone Detection Problem: A Survey and Software Metric Based Approach. In: , et al. Computational Science and Its Applications – ICCSA 2014. ICCSA 2014. Lecture Notes in Computer Science, vol 8583. Springer, Cham. https://doi.org/10.1007/978-3-319-09156-3 35. DOI: https://doi.org/10.1007/978-3-319-09156-3_35

A. Tufek, A. Gurbuz, O. F. Ekuklu and M. S. Aktas, Provenance Collection Platform for the Weather Research and Forecasting Model, 2018 14th International Conference on Semantics, Knowledge and Grids (SKG), Guangzhou, China, 2018, pp. 17-24, doi: 10.1109/SKG.2018.00009. DOI: https://doi.org/10.1109/SKG.2018.00009

Dundar, B. et al., A Big Data Processing Framework for Self-Healing Internet of Things Applications, 2016 12th International Conference on Semantics, Knowledge and Grids (SKG), Beijing, China, 2016, pp. 62-68, doi: 10.1109/SKG.2016.017. DOI: https://doi.org/10.1109/SKG.2016.017

Baeth, M. J. et al., Detecting Misinformation in Social Networks Using Provenance Data, 2017 13th International Conference on Semantics, Knowledge and Grids (SKG), Beijing, China, 2017, pp. 85-89, doi: 10.1109/SKG.2017.00022. DOI: https://doi.org/10.1109/SKG.2017.00022

Most read articles by the same author(s)