TY - JOUR
T1 - PRERISK
T2 - A Personalized, Artificial Intelligence-Based and Statistically-Based Stroke Recurrence Predictor for Recurrent Stroke
AU - Colangelo, Giorgio
AU - Ribo, Marc
AU - Montiel, Estefanía
AU - Dominguez, Didier
AU - Olivé-Gadea, Marta
AU - Muchada, Marian
AU - Garcia-Tornel, Álvaro
AU - Requena, Manuel
AU - Pagola, Jorge
AU - Juega, Jesús
AU - Rodriguez-Luna, David
AU - Rodriguez-Villatoro, Noelia
AU - Rizzo, Federica
AU - Taborda, Belén
AU - Molina, Carlos A.
AU - Rubiera, Marta
N1 - Publisher Copyright:
© 2024 Wolters Kluwer Health. All rights reserved.
PY - 2024/5/1
Y1 - 2024/5/1
N2 - BACKGROUND: Predicting stroke recurrence for individual patients is difficult, but individualized prediction may improve stroke survivors' engagement in self-care. We developed PRERISK: a statistical and machine learning classifier to predict individual risk of stroke recurrence. METHODS: We analyzed clinical and socioeconomic data from a prospectively collected public health care-based data set of 41 975 patients admitted with stroke diagnosis in 88 public health centers over 6 years (2014-2020) in Catalonia-Spain. A new stroke diagnosis at least 24 hours after the index event was considered as a recurrent stroke, which was considered as our outcome of interest. We trained several supervised machine learning models to provide individualized risk over time and compared them with a Cox regression model. Models were trained to predict early, late, and long-term recurrence risk, within 90, 91 to 365, and >365 days, respectively. C statistics and area under the receiver operating characteristic curve were used to assess the accuracy of the models. RESULTS: Overall, 16.21% (5932 of 36 114) of patients had stroke recurrence during a median follow-up of 2.69 years. The most powerful predictors of stroke recurrence were time from previous stroke, Barthel Index, atrial fibrillation, dyslipidemia, age, diabetes, and sex, which were used to create a simplified model with similar performance, together with modifiable vascular risk factors (glycemia, body mass index, high blood pressure, cholesterol, tobacco dependence, and alcohol abuse). The areas under the receiver operating characteristic curve were 0.76 (95% CI, 0.74-0.77), 0.60 (95% CI, 0.58-0.61), and 0.71 (95% CI, 0.69-0.72) for early, late, and long-term recurrence risk, respectively. The areas under the receiver operating characteristic curve of the Cox risk class probability were 0.73 (95% CI, 0.72-0.75), 0.59 (95% CI, 0.57-0.61), and 0.67 (95% CI, 0.66-0.70); machine learning approaches (random forest and AdaBoost) showed statistically significant improvement (P<0.05) over the Cox model for the 3 recurrence time periods. Stroke recurrence curves can be simulated for each patient under different degrees of control of modifiable factors. CONCLUSIONS: PRERISK is a novel approach that provides a personalized and fairly accurate risk prediction of stroke recurrence over time. The model has the potential to incorporate dynamic control of risk factors.
AB - BACKGROUND: Predicting stroke recurrence for individual patients is difficult, but individualized prediction may improve stroke survivors' engagement in self-care. We developed PRERISK: a statistical and machine learning classifier to predict individual risk of stroke recurrence. METHODS: We analyzed clinical and socioeconomic data from a prospectively collected public health care-based data set of 41 975 patients admitted with stroke diagnosis in 88 public health centers over 6 years (2014-2020) in Catalonia-Spain. A new stroke diagnosis at least 24 hours after the index event was considered as a recurrent stroke, which was considered as our outcome of interest. We trained several supervised machine learning models to provide individualized risk over time and compared them with a Cox regression model. Models were trained to predict early, late, and long-term recurrence risk, within 90, 91 to 365, and >365 days, respectively. C statistics and area under the receiver operating characteristic curve were used to assess the accuracy of the models. RESULTS: Overall, 16.21% (5932 of 36 114) of patients had stroke recurrence during a median follow-up of 2.69 years. The most powerful predictors of stroke recurrence were time from previous stroke, Barthel Index, atrial fibrillation, dyslipidemia, age, diabetes, and sex, which were used to create a simplified model with similar performance, together with modifiable vascular risk factors (glycemia, body mass index, high blood pressure, cholesterol, tobacco dependence, and alcohol abuse). The areas under the receiver operating characteristic curve were 0.76 (95% CI, 0.74-0.77), 0.60 (95% CI, 0.58-0.61), and 0.71 (95% CI, 0.69-0.72) for early, late, and long-term recurrence risk, respectively. The areas under the receiver operating characteristic curve of the Cox risk class probability were 0.73 (95% CI, 0.72-0.75), 0.59 (95% CI, 0.57-0.61), and 0.67 (95% CI, 0.66-0.70); machine learning approaches (random forest and AdaBoost) showed statistically significant improvement (P<0.05) over the Cox model for the 3 recurrence time periods. Stroke recurrence curves can be simulated for each patient under different degrees of control of modifiable factors. CONCLUSIONS: PRERISK is a novel approach that provides a personalized and fairly accurate risk prediction of stroke recurrence over time. The model has the potential to incorporate dynamic control of risk factors.
KW - artificial intelligence
KW - awareness
KW - machine learning
KW - prognosis
KW - recurrence
KW - stroke
KW - survivors
UR - http://www.scopus.com/inward/record.url?scp=85191106543&partnerID=8YFLogxK
U2 - 10.1161/STROKEAHA.123.043691
DO - 10.1161/STROKEAHA.123.043691
M3 - Article
C2 - 38545798
AN - SCOPUS:85191106543
SN - 0039-2499
VL - 55
SP - 1200
EP - 1209
JO - Stroke
JF - Stroke
IS - 5
ER -