MRI measurement of brain tumor response: comparison of visual metric and automatic segmentation

Magn Reson Imaging. 1998 Apr;16(3):271-9. doi: 10.1016/s0730-725x(97)00302-0.

Abstract

An automatic magnetic resonance imaging (MRI) multispectral segmentation method and a visual metric are compared for their effectiveness to measure tumor response to therapy. Automatic response measurements are important for multicenter clinical trials. A visual metric such as the product of the largest diameter and the largest perpendicular diameter of the tumor is a standard approach, and is currently used in the Radiation Treatment Oncology Group (RTOG) and the Eastern Cooperative Oncology Group (EGOG) clinical trials. In the standard approach, the tumor response is based on the percentage change in the visual metric and is categorized into cure, partial response, stable disease, or progression. Both visual and automatic methods are applied to six brain tumor cases (gliomas) of varying levels of segmentation difficulty. The analyzed data were serial multispectral MR images, collected using MR contrast enhancement. A fully automatic knowledge guided method (KG) was applied to the MRI multispectral data, while the visual metric was taken from the MRI films using the T1 gadolinium enhanced image, with repeat measurements done by two radiologists and two residents. Tumor measurements from both visual and automatic methods are compared to "ground truth," (GT) i.e., manually segmented tumor. The KG method was found to slightly overestimate tumor volume, but in a consistent manner, and the estimated tumor response compared very well to hand-drawn ground truth with a correlation coefficient of 0.96. In contrast, the visually estimated metric had a large variation between observers, particularly for difficult cases, where the tumor margins are not well delineated. The inter-observer variation for the measurement of the visual metric was only 16%, i.e., observers generally agreed on the lengths of the diameters. However, in 30% of the studied cases no consensus was found for the categorical tumor response measurement, indicating that the categories are very sensitive to variations in the diameter measurements. Moreover, the method failed to correctly identify the response in half of the cases. The data demonstrate that automatic 3D methods are clearly necessary for objective and clinically meaningful assessment of tumor volume in single or multicenter clinical trials.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Adult
  • Aged
  • Artifacts
  • Artificial Intelligence*
  • Brain / pathology
  • Brain Neoplasms / diagnosis
  • Brain Neoplasms / pathology
  • Brain Neoplasms / therapy*
  • Chemotherapy, Adjuvant
  • Clinical Trials as Topic
  • Combined Modality Therapy
  • Expert Systems*
  • Feasibility Studies
  • Female
  • Glioblastoma / diagnosis
  • Glioblastoma / pathology
  • Glioblastoma / therapy*
  • Humans
  • Image Enhancement / instrumentation*
  • Image Processing, Computer-Assisted / instrumentation*
  • Magnetic Resonance Imaging / instrumentation*
  • Male
  • Middle Aged
  • Radiotherapy
  • Sensitivity and Specificity
  • Treatment Outcome