A multiple-model generalisation of updating clinical prediction models

Glen P Martin; Mamas A Mamas; Niels Peek; Iain Buchan; Matthew Sperrin

doi:10.1002/sim.7586

A multiple-model generalisation of updating clinical prediction models

Stat Med. 2018 Apr 15;37(8):1343-1358. doi: 10.1002/sim.7586. Epub 2017 Dec 18.

Authors

Glen P Martin¹, Mamas A Mamas^{1

2}, Niels Peek^{1

3}, Iain Buchan^{1

4}, Matthew Sperrin¹

Affiliations

¹ Farr Institute, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK.
² Keele Cardiovascular Research Group, Keele University, Stoke-on-Trent, UK.
³ NIHR Greater Manchester Primary Care Patient Safety Translational Research Centre, University of Manchester, Manchester, UK.
⁴ Microsoft Research, Cambridge, UK.

Abstract

There is growing interest in developing clinical prediction models (CPMs) to aid local healthcare decision-making. Frequently, these CPMs are developed in isolation across different populations, with repetitive de novo derivation a common modelling strategy. However, this fails to utilise all available information and does not respond to changes in health processes through time and space. Alternatively, model updating techniques have previously been proposed that adjust an existing CPM to suit the new population, but these techniques are restricted to a single model. Therefore, we aimed to develop a generalised method for updating and aggregating multiple CPMs. The proposed "hybrid method" re-calibrates multiple CPMs using stacked regression while concurrently revising specific covariates using individual participant data (IPD) under a penalised likelihood. The performance of the hybrid method was compared with existing methods in a clinical example of mortality risk prediction after transcatheter aortic valve implantation, and in 2 simulation studies. The simulation studies explored the effect of sample size and between-population-heterogeneity on the method, with each representing a situation of having multiple distinct CPMs and 1 set of IPD. When the sample size of the IPD was small, stacked regression and the hybrid method had comparable but highest performance across modelling methods. Conversely, in large IPD samples, development of a new model and the hybrid method gave the highest performance. Hence, the proposed strategy can inform the choice between utilising existing CPMs or developing a model de novo, thereby incorporating IPD, existing research, and prior (clinical) knowledge into the modelling strategy.

Keywords: clinical prediction models; logistic regression; model aggregation; model updating; stacked regression; validation.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Aged
Aged, 80 and over
Aortic Valve / surgery
Aortic Valve Stenosis / mortality
Aortic Valve Stenosis / surgery
Computer Simulation
Decision Support Techniques*
Female
Humans
Linear Models*
Logistic Models*
Male
Probability
Regression Analysis
Reproducibility of Results
Risk Assessment / methods*
Transcatheter Aortic Valve Replacement / adverse effects

Abstract

Publication types

MeSH terms

Grants and funding