The SAFE procedure: a practical stopping heuristic for active learning-based screening in systematic reviews and meta-analyses

Syst Rev. 2024 Mar 1;13(1):81. doi: 10.1186/s13643-024-02502-7.

Abstract

Active learning has become an increasingly popular method for screening large amounts of data in systematic reviews and meta-analyses. The active learning process continually improves its predictions on the remaining unlabeled records, with the goal of identifying all relevant records as early as possible. However, determining the optimal point at which to stop the active learning process is a challenge. The cost of additional labeling of records by the reviewer must be balanced against the cost of erroneous exclusions. This paper introduces the SAFE procedure, a practical and conservative set of stopping heuristics that offers a clear guideline for determining when to end the active learning process in screening software like ASReview. The eclectic mix of stopping heuristics helps to minimize the risk of missing relevant papers in the screening process. The proposed stopping heuristic balances the costs of continued screening with the risk of missing relevant records, providing a practical solution for reviewers to make informed decisions on when to stop screening. Although active learning can significantly enhance the quality and efficiency of screening, this method may be more applicable to certain types of datasets and problems. Ultimately, the decision to stop the active learning process depends on careful consideration of the trade-off between the costs of additional record labeling and the potential errors of the current model for the specific dataset and context.

Keywords: Active learning; Machine learning; Meta-analysis; Methodology; Screening prioritization; Stopping heuristic; Stopping rule; Systematic review.

MeSH terms

  • Heuristics*
  • Humans
  • Problem-Based Learning*
  • Software
  • Systematic Reviews as Topic