Automated Techniques for Detecting Healthcare Associated Infections: A Review
DOI:
https://doi.org/10.24203/4nd7vf70Keywords:
Healthcare-Associated Infections, Machine Learning, Deep Learning, Natural Language Processing, Transformer, Electronic Health RecordsAbstract
Automated detection of Healthcare-Associated Infections (HAIs) faces major obstacles due to unclear medical documentation, scarcity of well-annotated data, and multiple symptoms that overlap between HAIs. This review investigates recent advances in using classical machine learning, deep learning, transformers, and natural language processing (NLP) methods in detecting healthcare-associated infections. It examines empirical studies from 2019 to 2025, focusing on models' performance based on various metrics, data issues, and ethical considerations. The study sought to assess and compare the performance of natural language processing (NLP) approaches of detecting Healthcare-Associated Infections (HAIs). Ethical and technical concerns such as data privacy and data imbalance, are critical barriers to implementation of NLP to detection of HAIs. The review underscores the promise of NLP to detection of HAIs while emphasizing the need for standardized metrics for evaluating HAI detection model and ethical frameworks of handling the datasets.
References
[1] A. B. Scardoni, S. Federica, C. Cabitza, F. Odone and Anna, "Artificial intelligence-based tools to control healthcare associated infections: a systematic review of the literature," Journal of infection and public health, vol. 13, pp. 1061--1077, 2020.
[2] F. Setiawan and Ferdi, "Human resources strategy improving perceived performance outcome infection control in hospital: importance-performance map analysis," International Journal of Environment, Workplace and Employment, vol. 7, pp. 319--339, 2023.
[3] V. a. C. Puro, N. a. Frasca, A. a. Gentile, I. a. Luzzaro, F. a. Peghetti, A. a. Sganga and Gabriele, "Pillars for prevention and control of healthcare-associated infections: an Italian expert opinion statement," Antimicrobial Resistance & Infection Control, vol. 11, p. 87, 2022.
[4] U. a. A. Abubakar and J. Rodríguez-Baño, "Healthcare-associated infections in Africa: a systematic review and meta-analysis of point prevalence studies," Journal of pharmaceutical policy and practice, vol. 15, p. 99, 2022.
[5] W.-S. Z. Chen, W.-H. Li, Z.-J. Yang, Y. Chen, G. Fu, W. Xue-Shun, F. Ting-Rui, F. Ping, L. Cheng-Yi and o. Jing, "Evaluation of manual and electronic healthcare-associated infections surveillance: a multi-center study with 21 tertiary general hospitals in China," Annals of Translational Medicine, vol. 7, p. 444, 2019.
[6] H. R. A. Streefkerk, B. Roel PAJ, V. Wichor M and H. A, "Electronically assisted surveillance systems of healthcare-associated infections: a systematic review," Eurosurveillance, vol. 25, p. 1900321, 2020.
[7] R. Dos Santos, M. D, L. A, D. S, C. CH, G. O, G. J, P. N, V. R and TA, "Automated healthcare-associated infection surveillance using an artificial intelligence algorithm," Infection Prevention in Practice, vol. 3, p. 100167, 2021.
[8] J. Frain, "Critical Appraisal," ABC of Evidence-Based Healthcare, p. 57, 2025.
[9] M. Amir-Behghadami and A. Janati, "opulation, Intervention, Comparison, Outcomes and Study (PICOS) design as a framework to formulate eligibility criteria in systematic reviews," Emergency Medicine Journal, 2020.
[10] X. J. Liang, L. AP, X. Tao, W. YY and GT, "LR-SMOTE—An improved unbalanced data set oversampling based on K-means and SVM," Knowledge-Based System, vol. 196, p. 105845, 2020.
[11] F.-H. S{'a}nchez-Hern{'a}ndez, K. Juan Carlos, S.-B. Mohamed S and M.-G. M. N. Mercedes, "Predictive modeling of ICU healthcare-associated infections from imbalanced data. Using ensembles and a clustering-based undersampling approach," Applied Sciences, vol. 9, p. 5287, 2019.
[12] J. I. Park, C. Donna Z, D. Chih-Lin, W. Connie W and B. L, "Knowledge discovery with machine learning for hospital-acquired catheter-associated urinary tract infections," CIN: Computers, Informatics, Nursing, vol. 38, pp. 28--35, 2020.
[13] M. M. A. Mitu, S. Sadot, H. Zulkarnain, F. Md Afanur and D. Md, "Pruning-based ensemble tree for multi-class classification," 2024 6th International Conference on Electrical Engineering and Information & Communication Technology (ICEEICT), pp. 481--486, 2024.
[14] S. M. A. van Rooden, C. Olov, G. Elena, J. Sophie, J.-C. a. M. S. P.-B. Anders and Lucet, P. Zaira R, T. Elisabeth and o. Evelina, "Governance aspects of large-scale implementation of automated surveillance of healthcare-associated infections," Clinical Microbiology and Infection, vol. 27, pp. S20--S28, 2021.
[15] S. M. Van Rooden, P. Evelina, G. Miquel, K. Aina, R. Jan AJW, M. Jannie, C.-D. Gonny, B. Elodie, B. J. R. Camille and others, "A framework to develop semiautomated surveillance of surgical site infections: an international multicenter study," Infection Control & Hospital Epidemiology, vol. 41, pp. 194--201, 2020.
[16] C. P. Thirukumaran, R. Anis, C. Paul T, L. Casey, R. Yue, B. Benjamin F, K. Wajeeh R and Henry, "Natural language processing for the identification of surgical site infections in orthopaedics," JBJS, vol. 101, pp. 2167--2174, 2019.
[17] C. R. Sushama, S. Shaik Mohammad, S. Kumar A, S. A, S. R and Balambigai, "Central Line Associated Bloodstream Infection Prediction Using Deep Attention Nets in the Healthcare Field," Informing Science, vol. 28, p. 13, 2025.
[18] A. B. Babu and S. Babu, "Bert-based medical chatbot: Enhancing healthcare communication through natural language understanding," Exploratory research in clinical and social pharmacy, vol. 100419, p. 13, 2024.
[19] L. X. Rasmy, X. Yang, T. Ziqian and Z. D. Cui, "Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction," NPJ digital medicine, vol. 4, p. 86, 2021.
[20] J. Shi, P. Siru, L. Liese CC, F. Carolyn L, G. Jeffrey P, C. Adi V, B. Wendy W and B. T, "Using natural language processing to improve EHR structured data-based surgical site infection surveillance," AMIA annual symposium proceedings, vol. 2019, p. 794, 2020.
[21] H. A. Amjad, S. Mohammad Shehroz, K. Syed Zoraiz Ali, F. Saad, H. Muhammad Moazam, B. Tahir and S. A. Chan, "Attention-Based Explainability Approaches in Healthcare Natural Language Processing.," HEALTHINF, pp. 689--696, 2023.
[22] T. D.-F. Goodwin and Dina, "Deep learning from incomplete data: detecting imminent risk of hospital-acquired pneumonia in ICU patients," in AMIA Annual Symposium Proceedings, 2019, p. 467.
[23] S. J. Rabhi, M. J{'e}r{'e}mie and Marie-Helene, "Deep learning versus conventional machine learning for detection of healthcare-associated infections in French clinical narratives," Methods of information in medicine, vol. 58, pp. 031--041, 2019.
[24] F. Li, L. Yonghao, R. Weisong, C. Bhanu Pratap Singh, Y. Pengshan and o. Hong, "Fine-tuning bidirectional encoder representations from transformers (BERT)--based models on large-scale electronic health record notes: an empirical study," JMIR medical informatics, vol. 7, p. e14830, 2019.
[25] F. Fitzpatrick, L. Aaron and Gerard, "Using artificial intelligence in infection prevention," Current treatment options in infectious diseases, vol. 121, pp. 135--144, 2020.
[26] C. F. Luz, D. Marcus, N. Johan, G. Maarten W, S. Corinna and Bhanu, "Machine learning in infection management using routine electronic health records: tools, techniques, and reporting of future technologies," Clinical Microbiology and Infection, vol. 26, pp. 1291--1299, 2020.
[27] T. D. Lopes, J{'u}lio.Cardoso, M. Sara, D. Jo{~a}o, L. Ricardo, S. Jo{~a}o and M. Filipe, "Classification Models for Early Prediction of Surgical Site Infections," Procedia Computer Science, vol. 238, pp. 926--931, 2024.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Joseph Karuri Mwaura, Kennedy Malanga Ndenga, James Mwikya Reuben

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The articles published in International Journal of Computer and Information Technology (IJCIT) is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
 
						 
							