Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study
Objective: To predict older adults’ risk of avoidable hospitalisation related to ambulatory care sensitive conditions (ACSC) using machine learning applied to administrative health data of Ontario, Canada. Design, setting and participants: A retrospective cohort study was conducted on a large cohort...
Saved in:
Main Authors: | , , , , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/163265 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-163265 |
---|---|
record_format |
dspace |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Administrative Health Data Ambulatory Care |
spellingShingle |
Engineering::Computer science and engineering Administrative Health Data Ambulatory Care Yi, Seung Eun Harish, Vinyas Gutierrez, Jahir Ravaut, Mathieu Kornas, Kathy Watson, Tristan Poutanen, Tomi Ghassemi, Marzyeh Volkovs, Maksims Rosella, Laura C. Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
description |
Objective: To predict older adults’ risk of avoidable hospitalisation related to ambulatory care sensitive conditions (ACSC) using machine learning applied to administrative health data of Ontario, Canada. Design, setting and participants: A retrospective cohort study was conducted on a large cohort of all residents covered under a single-payer system in Ontario, Canada over the period of 10 years (2008– 2017). The study included 1.85 million Ontario residents between 65 and 74 years old at any time throughout the study period. Data sources: Administrative health data from Ontario, Canada obtained from the (ICES formerly known as the Institute for Clinical Evaluative Sciences Data Repository. Main outcome measures: Risk of hospitalisations due to ACSCs 1 year after the observation period. Results: The study used a total of 1 854 116 patients, split into train, validation and test sets. The ACSC incidence rates among the data points were 1.1% for all sets. The final XGBoost model achieved an area under the receiver operating curve of 80.5% and an area under precision–recall curve of 0.093 on the test set, and the predictions were well calibrated, including in key subgroups. When ranking the model predictions, those at the top 5% of risk as predicted by the model captured 37.4% of those presented with an ACSC-related hospitalisation. A variety of features such as the previous number of ambulatory care visits, presence of ACSC-related hospitalisations during the observation window, age, rural residence and prescription of certain medications were contributors to the prediction. Our model was also able to capture the geospatial heterogeneity of ACSC risk in Ontario, and especially the elevated risk in rural and marginalised regions. Conclusions: This study aimed to predict the 1-year risk of hospitalisation from ambulatory-care sensitive conditions in seniors aged 65–74 years old with a single, large-scale machine learning model. The model shows the potential to inform population health planning and interventions to reduce the burden of ACSC-related hospitalisations. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Yi, Seung Eun Harish, Vinyas Gutierrez, Jahir Ravaut, Mathieu Kornas, Kathy Watson, Tristan Poutanen, Tomi Ghassemi, Marzyeh Volkovs, Maksims Rosella, Laura C. |
format |
Article |
author |
Yi, Seung Eun Harish, Vinyas Gutierrez, Jahir Ravaut, Mathieu Kornas, Kathy Watson, Tristan Poutanen, Tomi Ghassemi, Marzyeh Volkovs, Maksims Rosella, Laura C. |
author_sort |
Yi, Seung Eun |
title |
Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
title_short |
Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
title_full |
Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
title_fullStr |
Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
title_full_unstemmed |
Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
title_sort |
predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/163265 |
_version_ |
1751548553823518720 |
spelling |
sg-ntu-dr.10356-1632652022-11-30T00:27:19Z Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study Yi, Seung Eun Harish, Vinyas Gutierrez, Jahir Ravaut, Mathieu Kornas, Kathy Watson, Tristan Poutanen, Tomi Ghassemi, Marzyeh Volkovs, Maksims Rosella, Laura C. School of Computer Science and Engineering Engineering::Computer science and engineering Administrative Health Data Ambulatory Care Objective: To predict older adults’ risk of avoidable hospitalisation related to ambulatory care sensitive conditions (ACSC) using machine learning applied to administrative health data of Ontario, Canada. Design, setting and participants: A retrospective cohort study was conducted on a large cohort of all residents covered under a single-payer system in Ontario, Canada over the period of 10 years (2008– 2017). The study included 1.85 million Ontario residents between 65 and 74 years old at any time throughout the study period. Data sources: Administrative health data from Ontario, Canada obtained from the (ICES formerly known as the Institute for Clinical Evaluative Sciences Data Repository. Main outcome measures: Risk of hospitalisations due to ACSCs 1 year after the observation period. Results: The study used a total of 1 854 116 patients, split into train, validation and test sets. The ACSC incidence rates among the data points were 1.1% for all sets. The final XGBoost model achieved an area under the receiver operating curve of 80.5% and an area under precision–recall curve of 0.093 on the test set, and the predictions were well calibrated, including in key subgroups. When ranking the model predictions, those at the top 5% of risk as predicted by the model captured 37.4% of those presented with an ACSC-related hospitalisation. A variety of features such as the previous number of ambulatory care visits, presence of ACSC-related hospitalisations during the observation window, age, rural residence and prescription of certain medications were contributors to the prediction. Our model was also able to capture the geospatial heterogeneity of ACSC risk in Ontario, and especially the elevated risk in rural and marginalised regions. Conclusions: This study aimed to predict the 1-year risk of hospitalisation from ambulatory-care sensitive conditions in seniors aged 65–74 years old with a single, large-scale machine learning model. The model shows the potential to inform population health planning and interventions to reduce the burden of ACSC-related hospitalisations. Published version This study was supported by ICES, which is funded by an annual grant from the Ontario Ministry of Health (MOH) and the Ministry of Long-Term Care (MLTC). This work was supported by the New Frontiers in Research Fund (NFRFE2018-00662), a Canada Research Chair in Population Health Analytics (950- 230702) (LR), Ontario Graduate Scholarship (number N/A) (VH), Canadian Institutes of Health Research Banting and Best Canada Graduate Scholarship Master’s and Doctoral awards (numbers N/A) (VH), and Vector Institute Post-graduate Fellowship (number N/A) (VH). 2022-11-30T00:27:19Z 2022-11-30T00:27:19Z 2022 Journal Article Yi, S. E., Harish, V., Gutierrez, J., Ravaut, M., Kornas, K., Watson, T., Poutanen, T., Ghassemi, M., Volkovs, M. & Rosella, L. C. (2022). Predicting hospitalisations related to ambulatory care sensitive conditions with machine learning for population health planning: derivation and validation cohort study. BMJ Open, 12(4), e051403-. https://dx.doi.org/10.1136/bmjopen-2021-051403 2044-6055 https://hdl.handle.net/10356/163265 10.1136/bmjopen-2021-051403 35365510 2-s2.0-85127388685 4 12 e051403 en BMJ Open © Author(s) (or their employer(s)) 2022. Published by BMJ. Open access. This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/. application/pdf |