Forecasting length of stay: will it be clear or cloudy today?

Objective: Patient length of stay (LOS) is a vital metric for hospital operational efficiency, and shorter LOS is tied to better patient outcomes and improved financial performance. Models that provide accurate, real-time LOS forecasts can help hospitals effectively manage their resources and bed ca...

Full description

Saved in:
Bibliographic Details
Main Authors: Deng, Charles, Reddy, Arjun, Kavitesh, Bali Kavitesh, Babu, Myoungmee, Babu, Benson A.
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/164592
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-164592
record_format dspace
spelling sg-ntu-dr.10356-1645922023-02-06T04:07:21Z Forecasting length of stay: will it be clear or cloudy today? Deng, Charles Reddy, Arjun Kavitesh, Bali Kavitesh Babu, Myoungmee Babu, Benson A. School of Computer Science and Engineering Engineering::Computer science and engineering Artificial Intelligence Neural Networks Objective: Patient length of stay (LOS) is a vital metric for hospital operational efficiency, and shorter LOS is tied to better patient outcomes and improved financial performance. Models that provide accurate, real-time LOS forecasts can help hospitals effectively manage their resources and bed capacity. Forecasting LOS is a perfect problem for modern machine learning methods. In this paper, we conduct a descriptive literature review of studies that use machine learning methods to predict LOS. Methods: We searched Embase, PubMed, DBLP, Google Scholar, IEEE Xplore, and Cochrane databases for articles published between 2008 and 2021 that use machine learning models to forecast patient LOS. From 87 articles identified through keyword search and the two articles identified using the snowball method, we used pre-specified inclusion criteria to select the final 12 articles in the descriptive literature review. The articles are international, retrospective, and carried out during the ML development lifecycle. Results: Most studies approached the LOS forecasting problem as a classification problem, with a minority of studies opting to train regression models instead. The most frequently used models included support vector machines, random forests, gradient boosted trees, logistic regressions, and neural networks. In general, tree-based models like random forests and gradient boosted trees had the best performance – stacked methods that combined the predictions of multiple models also performed well. Several studies used natural language processing (NLP) methods and other techniques to extract features from unstructured electronic health record data and improve model performance. In addition to model and feature selection, data preprocessing decisions, such as careful handling of missing data and resampling to address the class imbalance, significantly improved model performance. Conclusion: Machine learning methods are capable of forecasting patient LOS with impressive accuracy. However, most studies were designed as pre-deployment experimental models. As AI applications advance, a systematic approach to crafting high-quality data management and monitoring during real-time clinical ML production is essential to developing a precise prediction service. While in production, factors such as data drift recognition, monitoring, and correction are required for accurate model performance. Future longitudinal studies must validate these models during production to recognize their real-world healthcare impact. Published version 2023-02-06T04:07:21Z 2023-02-06T04:07:21Z 2022 Journal Article Deng, C., Reddy, A., Kavitesh, B. K., Babu, M. & Babu, B. A. (2022). Forecasting length of stay: will it be clear or cloudy today?. Intelligence-Based Medicine, 6, 100078-. https://dx.doi.org/10.1016/j.ibmed.2022.100078 2666-5212 https://hdl.handle.net/10356/164592 10.1016/j.ibmed.2022.100078 2-s2.0-85139724974 6 100078 en Intelligence-Based Medicine © 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Artificial Intelligence
Neural Networks
spellingShingle Engineering::Computer science and engineering
Artificial Intelligence
Neural Networks
Deng, Charles
Reddy, Arjun
Kavitesh, Bali Kavitesh
Babu, Myoungmee
Babu, Benson A.
Forecasting length of stay: will it be clear or cloudy today?
description Objective: Patient length of stay (LOS) is a vital metric for hospital operational efficiency, and shorter LOS is tied to better patient outcomes and improved financial performance. Models that provide accurate, real-time LOS forecasts can help hospitals effectively manage their resources and bed capacity. Forecasting LOS is a perfect problem for modern machine learning methods. In this paper, we conduct a descriptive literature review of studies that use machine learning methods to predict LOS. Methods: We searched Embase, PubMed, DBLP, Google Scholar, IEEE Xplore, and Cochrane databases for articles published between 2008 and 2021 that use machine learning models to forecast patient LOS. From 87 articles identified through keyword search and the two articles identified using the snowball method, we used pre-specified inclusion criteria to select the final 12 articles in the descriptive literature review. The articles are international, retrospective, and carried out during the ML development lifecycle. Results: Most studies approached the LOS forecasting problem as a classification problem, with a minority of studies opting to train regression models instead. The most frequently used models included support vector machines, random forests, gradient boosted trees, logistic regressions, and neural networks. In general, tree-based models like random forests and gradient boosted trees had the best performance – stacked methods that combined the predictions of multiple models also performed well. Several studies used natural language processing (NLP) methods and other techniques to extract features from unstructured electronic health record data and improve model performance. In addition to model and feature selection, data preprocessing decisions, such as careful handling of missing data and resampling to address the class imbalance, significantly improved model performance. Conclusion: Machine learning methods are capable of forecasting patient LOS with impressive accuracy. However, most studies were designed as pre-deployment experimental models. As AI applications advance, a systematic approach to crafting high-quality data management and monitoring during real-time clinical ML production is essential to developing a precise prediction service. While in production, factors such as data drift recognition, monitoring, and correction are required for accurate model performance. Future longitudinal studies must validate these models during production to recognize their real-world healthcare impact.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Deng, Charles
Reddy, Arjun
Kavitesh, Bali Kavitesh
Babu, Myoungmee
Babu, Benson A.
format Article
author Deng, Charles
Reddy, Arjun
Kavitesh, Bali Kavitesh
Babu, Myoungmee
Babu, Benson A.
author_sort Deng, Charles
title Forecasting length of stay: will it be clear or cloudy today?
title_short Forecasting length of stay: will it be clear or cloudy today?
title_full Forecasting length of stay: will it be clear or cloudy today?
title_fullStr Forecasting length of stay: will it be clear or cloudy today?
title_full_unstemmed Forecasting length of stay: will it be clear or cloudy today?
title_sort forecasting length of stay: will it be clear or cloudy today?
publishDate 2023
url https://hdl.handle.net/10356/164592
_version_ 1759058762910400512