Digital Workforce Matching: A Machine Learning Approach for Skill-Based Job Classification and Recommendation
DOI:
https://doi.org/10.59796/jcst.V15N4.2025.137Keywords:
digital workforce matching, skill-based job, classification, recommendation, machine learning modelAbstract
This research presents an integrated machine learning approach for optimizing digital workforce matching in Thailand's evolving digital economy. The study develops a novel job recommendation system combining Natural Language Processing (NLP) with Random Forest classification to analyze job market data from Thailand's leading recruitment platforms. Using FastText for initial job classification and a Random Forest model for skill-based matching, the system achieves 75% accuracy in job recommendations across 20 digital job categories. The methodology incorporates automated skill extraction, cross-validated model comparison, and a user-friendly web interface for practical applications. Our findings reveal distinct skill clusters and job-skill relationships in Thailand's digital sector, with the Random Forest model outperforming traditional Decision Tree approaches by 4% in accuracy metrics. The system demonstrates robust performance in real-world testing, achieving 86.67% accuracy in matching previously unseen job postings. This research contributes to both theoretical understanding of skill-based job matching and practical workforce development, offering insights for curriculum development and career planning for workforce development stakeholders in Thailand's digital sector.
References
Alghamlas, M., & Alabduljabbar, R. (2019). Predicting the suitability of IT students' skills for the recruitment in Saudi labor market [Conference presentation]. The 2019 2nd International Conference on Computer Applications & Information Security (ICCAIS), Riyadh, Saudi Arabia. https://doi.org/10.1109/CAIS.2019.8769577
Anthony, G. S. (2024). Developing a framework to identify professional skills required for banking sector employee in UK using Natural Language Processing (NLP) techniques [Doctoral dissertation], The University of Salford, Manchester, United Kingdom.
Bhushan, N., Mekhilef, S., Tey, K. S., Shaaban, M., Seyedmahmoudian, M., & Stojcevski, A. (2024). Dynamic K-decay learning rate optimization for deep convolutional neural network to estimate the state of charge for electric vehicle batteries. Energies, 17(16), Article 3884. https://doi.org/10.3390/en17163884
Chaudhary, A., Kolhe, S., & Kamal, R. (2016). An improved random forest classifier for multi-class classification. Information Processing in Agriculture, 3(4), 215-222. https://doi.org/10.1016/j.inpa.2016.08.002
Dalal, A., Rayan, R., Barua, A., Vasserman, E. Y., Sarker, M. K., & Hitzler, P. (2024). On the value of labeled data and symbolic methods for hidden neuron activation analysis [Conference presentation]. International Conference on Neural-Symbolic Learning and Reasoning. Cham, Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-71170-1_12
Dehghani, M., & Manthouri, M. (2021). Semi-automatic detection of Persian stopwords using FastText library [Conference presentation]. The 2021 11th International Conference on Computer Engineering and Knowledge (ICCKE), Mashhad, Iran. https://doi.org/10.1109/ICCKE54056.2021.9721519
Fuzul, E., & Horvat, M. (2019). Formal model of student competencies in higher education and required skills in the job market [Conference presentation]. The 2019 Central European Conference on Information and Intelligent Systems (CECIIS), Varaždin, Croatia.
Giabelli, A., Malandri, L., Mercorio, F., Mezzanzanica, M., & Seveso, A. (2021). Skills2Job: A recommender system that encodes job offer embeddings on graph databases. Applied Soft Computing, 101, Article 107049. https://doi.org/10.1016/j.asoc.2020.107049
Hakim, L., Sari, Z., Aristyo, A. R., & Pangestu, S. (2024). Optimizing android program malware classification using GridSearchCV optimized random forest. Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, 9(2), 173-180. https://doi.org/10.22219/kinetik.v9i2.1944
Hardeniya, N., Perkins, J., Chopra, D., Joshi, N., & Mathur, I. (2016). Natural language processing: Python and NLTK. Birmingham, UK: Packt Publishing Ltd.
Hassan, M. U., Alaliyat, S., Sarwar, R., Nawaz, R., & Hameed, I. A. (2023). Leveraging deep learning and big data to enhance computing curriculum for industry-relevant skills: A Norwegian case study. Heliyon, 9(4). Article e15407. https://doi.org/10.1016/j.heliyon.2023.e15407
Htet, T. S., & San, K. K. (2024). Transforming tech hiring: BERT-powered neural networks for automated job title classification [Conference presentation]. The 2024 5th International Conference on Advanced Information Technologies (ICAIT), Yangon, Myanmar.
Ibadov, I., Aksenov, A., Iumanova, I., & Sozykin, A. (2020). The concept of a dynamic model of competencies for the labor market analysis [Conference presentation]. The 2020 Ural Symposium on Biomedical Engineering, Radioelectronics and Information Technology (USBEREIT), Ekaterinburg, Russia. https://doi.org/10.1109/USBEREIT48449.2020.9117691
Ingole, A., Wadurkar, A., Chaudhari, S., & Chavhan, B. (2024). Skill-based job role suggestion using machine learning. International Journal of Ingenious Research, Invention and Development, 3(2), 86-94. https://doi.org/10.5281/zenodo.11034674
Kanraweekultana, N., Waijanya, S., Promrit, N., Nopnapaporn, U., Korsanan, A., & Poolphol, S. (2024). Comparison of capability of data classification models to predict consistent results for depression analysis based on user-behaviour tracking and facial expression recognition during PHQ-9 assessment. Engineering and Applied Science Research, 51(1), 11-21. https://doi.org/10.14456/easr.2024.2
Mahlich, C., Vente, T., & Beel, J. (2024). From theory to practice: Implementing and evaluating e-fold cross-validation. arXiv preprint arXiv:2410.09463. https://arxiv.org/abs/2410.09463
Melo, G., Chaves, M., Kolter, M., & Schleifenbaum, J. H. (2023). Skills requirements of additive manufacturing-a textual analysis of job postings using natural language processing [Conference presentation]. The 2023 International Conference on Additive Manufacturing in Products and Applications, Cham, Switzerland.
Ministry of Higher Education, Science, Research and Innovation. (2024). Higher education statistics. Retrieved from https://info.mhesi.go.th (In Thai)
Office of the National Digital Economy and Society Commission. (2019). National policy and plan on digital development for the economy and society. Retrieved from https://datacatalog.bde.go.th/dataset/eb2e0e52-adc5-4f0c-aca1-96358b79ce32 (In Thai)
Office of the National Economic and Social Development Council. (2017). Thai social conditions in the first quarter of 2017. Retrieved from https://www.m-society.go.th/ewtadmin/ewt/mso_web/article_attach/21484/21278.pdf (In Thai)
Phaphuangwittayakul, A., Saranwong, S., Panyakaew, S. N., Inkeaw, P., & Chaijaruwanich, J. (2018). Analysis of skill demand in Thai labor market from online jobs recruitments websites [Conference presentation]. The 2018 15th International Joint Conference on Computer Science and Software Engineering (JCSSE), Nakhon Ratchasima, Thailand. https://doi.org/10.1109/JCSSE.2018.8457393
Pias, S. A., Hossain, M., Rahman, H., & Hossain, M. M. (2024). Enhancing job matching through natural language processing: A BERT-based approach [Conference presentation]. The 2024 International Conference on Innovations in Science, Engineering and Technology (ICISET), Dhaka, Bangladesh. https://doi.org/10.1109/ICISET62123.2024.10939860
Pillai, P., & Amin, D. (2020). Understanding the requirements of the Indian IT industry using web scrapping. Procedia Computer Science, 172, 308-313. https://doi.org/10.1016/j.procs.2020.05.050
Pundir, R. S., Dhasmana, A., Karakoti, U., Sikder, A., Sharma, S., & Manchanda, M. (2024). Enhancing resume recommendation system through skill-based similarity using deep learning models [Conference presentation]. The 2024 International Conference on Inventive Computation Technologies (ICICT), Lalitpur, Nepal. https://doi.org/10.1109/ICICT60155.2024.10544875
Qin, C., Zhu, H., Xu, T., Zhu, C., Ma, C., Chen, E., & Xiong, H. (2020). An enhanced neural network approach to person-job fit in talent recruitment. ACM Transactions on Information Systems (TOIS), 38(2), 1-33. https://doi.org/10.1145/3376927
Rahhal, I., Carley, K. M., Kassou, I., & Ghogho, M. (2023). Two stage job title identification system for online job advertisements. IEEE Access, 11, 19073-19092. https://doi.org/10.1109/ACCESS.2023.3247866
Senger, E., Zhang, M., Van Der Goot, R., & Plank, B. (2024). Deep learning-based computational job market analysis: A survey on skill extraction and classification from job postings. arXiv preprint arXiv:2402.05617. https://doi.org/10.48550/arXiv.2402.05617
Surendar, V., Sriram, E., & Subhash, S. (2024). Web-based deep learning model for zero day vulnerability detection using FastAPI [Conference presentation]. The 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), Chennai, India. https://doi.org/10.1109/ADICS58448.2024.10533540
Weichselbraun, A., Süsstrunk, N., Waldvogel, R., Glatzl, A., Braşoveanu, A. M., & Scharl, A. (2024). Anticipating job market demands-A deep learning approach to determining the future readiness of professional skills. Future Internet, 16(5), Article 144. https://doi.org/10.3390/fi16050144
Wu, X. (2024). FastAPI as a backend framework [Bachelor Thesis]. Tampere, Tampere University, Finland.
Yahya, A. E., Yafooz, W. M., & Gharbi, A. (2024). Mapping graduate skills to market demands: a holistic examination of curriculum development and employment trends. Engineering, Technology & Applied Science Research, 14(4), 14793-14800. https://doi.org/10.48084/etasr.7454

Downloads
Published
How to Cite
Issue
Section
Categories
License
Copyright (c) 2025 Journal of Current Science and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.