A practical method to quantify knowledge-based DVH prediction accuracy and uncertainty with reference cohorts

J Appl Clin Med Phys. 2021 Mar;22(3):279-284. doi: 10.1002/acm2.13199. Epub 2021 Feb 26.

Abstract

The adoption of knowledge-based dose-volume histogram (DVH) prediction models for assessing organ-at-risk (OAR) sparing in radiotherapy necessitates quantification of prediction accuracy and uncertainty. Moreover, DVH prediction error bands should be readily interpretable as confidence intervals in which to find a percentage of clinically acceptable DVHs. In the event such DVH error bands are not available, we present an independent error quantification methodology using a local reference cohort of high-quality treatment plans, and apply it to two DVH prediction models, ORBIT-RT and RapidPlan, trained on the same set of 90 volumetric modulated arc therapy (VMAT) plans. Organ-at-risk DVH predictions from each model were then generated for a separate set of 45 prostate VMAT plans. Dose-volume histogram predictions were then compared to their analogous clinical DVHs to define prediction errors V c l i n , i - V p r e d , i (ith plan), from which prediction bias μ, prediction error variation σ, and root-mean-square error R M S E pred 1 N i V c l i n , i - V p r e d , i 2 σ 2 + μ 2 could be calculated for the cohort. The empirical R M S E pred was then contrasted to the model-provided DVH error estimates. For all prostate OARs, above 50% Rx dose, ORBIT-RT μ and σ were comparable to or less than those of RapidPlan. Above 80% Rx dose, μ < 1% and σ < 3-4% for both models. As a result, above 50% Rx dose, ORBIT-RT R M S E pred was below that of RapidPlan, indicating slightly improved accuracy in this cohort. Because μ ≈ 0, R M S E pred is readily interpretable as a canonical standard deviation σ, whose error band is expected to correctly predict 68% of normally distributed clinical DVHs. By contrast, RapidPlan's provided error band, although described in literature as a standard deviation range, was slightly less predictive than R M S E pred (55-70% success), while the provided ORBIT-RT error band was confirmed to resemble an interquartile range (40-65% success) as described. Clinicians can apply this methodology using their own institutions' reference cohorts to (a) independently assess a knowledge-based model's predictive accuracy of local treatment plans, and (b) interpret from any error band whether further OAR dose sparing is likely attainable.

Keywords: DVH error; DVH estimate; ORBIT-RT; knowledge-based planning.

MeSH terms

  • Humans
  • Male
  • Organs at Risk*
  • Radiotherapy Dosage
  • Radiotherapy Planning, Computer-Assisted
  • Radiotherapy, Intensity-Modulated*
  • Uncertainty