Combining artificial intelligence and conventional statistics to predict bronchopulmonary dysplasia in very preterm infants using routinely collected clinical variables

Sara Montagna; Dalila Magno; Stefano Ferretti; Michele Stelluti; Andrea Gona; Camilla Dionisi; Giuliana Simonazzi; Silvia Martini; Luigi Corvaglia; Arianna Aceti

doi:10.1002/ppul.27216

Pediatr Pulmonol. 2024 Dec;59(12):3400-3409. doi: 10.1002/ppul.27216. Epub 2024 Aug 16.

Authors

Sara Montagna¹, Dalila Magno²,³, Stefano Ferretti¹, Michele Stelluti⁴, Andrea Gona², Camilla Dionisi²,⁵, Giuliana Simonazzi²,⁵, Silvia Martini²,³, Luigi Corvaglia²,³, Arianna Aceti²,³

Affiliations

¹ Department of Pure and Applied Sciences (DiSPeA), University of Urbino Carlo Bo, Urbino, Italy.
² Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
³ Neonatal Intensive Care Unit, IRCCS AOU BO, Bologna, Italy.
⁴ Department of Computer Science and Engineering, University of Bologna, Bologna, Italy.
⁵ Obstetric Unit, IRCCS AOU BO, Bologna, Italy.

Abstract

Background: Prematurity is the strongest predictor of bronchopulmonary dysplasia (BPD). Most previous studies investigated additional risk factors using conventional statistics, while the few studies that applied artificial intelligence, and specifically machine learning (ML), for this purpose focused mainly on the predictive ability of specific interventions. This study aimed to apply ML to identify, among routinely collected data, variables predictive of BPD, and to compare these variables with those identified through conventional statistics.

Methods: Very preterm infants were recruited; antenatal, perinatal, and postnatal clinical data were collected. A BPD prediction model was built using conventional statistics, and nine supervised ML algorithms were applied for the same purpose: the results of the best-performing model were described and compared with those of conventional statistics.

Results: Both conventional statistics and ML identified the degree of immaturity (low gestational age and/or birth weight), need for mechanical ventilation, and absent or reversed end-diastolic flow (AREDF) in the umbilical arteries as risk factors for BPD. Each of the two approaches also identified additional potentially predictive clinical variables.

Conclusion: ML algorithms might usefully complement conventional statistics in identifying novel risk factors, in addition to prematurity, for the development of BPD in very preterm infants. Specifically, the identification of AREDF status as an independent risk factor for BPD by both conventional statistics and ML highlights the opportunity to include detailed antenatal information in clinical predictive models for neonatal diseases.

Keywords: artificial intelligence; bronchopulmonary dysplasia; machine learning; preterm infant.

MeSH terms

Algorithms
Artificial Intelligence*
Bronchopulmonary Dysplasia* / epidemiology
Female
Gestational Age
Humans
Infant, Extremely Premature
Infant, Newborn
Infant, Premature*
Machine Learning
Male
Respiration, Artificial / statistics & numerical data
Risk Factors

Grants and funding

None