Lessons Learned for Identifying and Annotating Permissions in Clinical Consent Forms

Elizabeth E Umberfield; Yun Jiang; Susan H Fenton; Cooper Stansbury; Kathleen Ford; Kaycee Crist; Sharon L R Kardia; Andrea K Thomer; Marcelline R Harris

doi:10.1055/s-0041-1730032

Appl Clin Inform. 2021 May;12(3):429-435. doi: 10.1055/s-0041-1730032. Epub 2021 Jun 23.

Authors

Elizabeth E Umberfield¹ ², Yun Jiang³, Susan H Fenton⁴, Cooper Stansbury⁵ ⁶, Kathleen Ford³, Kaycee Crist⁷, Sharon L R Kardia⁸, Andrea K Thomer⁹, Marcelline R Harris³

Affiliations

¹ Health Policy & Management, Indiana University Richard M Fairbanks School of Public Health, Indianapolis, Indiana, United States.
² Center for Biomedical Informatics, Regenstrief Institute, Inc., Indianapolis, Indiana, United States.
³ Department of Systems, Populations and Leadership, University of Michigan School of Nursing, Ann Arbor, Michigan, United States.
⁴ School of Biomedical Informatics, University of Texas Health Science Center, Houston, Texas, United States.
⁵ Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan, United States.
⁶ The Michigan Institute for Computational Discovery and Engineering, University of Michigan, Ann Arbor, Michigan, United States.
⁷ Rory Meyers School of Nursing, New York University, New York, New York, United States.
⁸ Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan, United States.
⁹ University of Michigan School of Information, Ann Arbor, Michigan, United States.

Abstract

Background: The lack of machine-interpretable representations of consent permissions precludes development of tools that act upon permissions across information ecosystems, at scale.

Objectives: To report the process, results, and lessons learned while annotating permissions in clinical consent forms.

Methods: We conducted a retrospective analysis of clinical consent forms. We developed an annotation scheme following the MAMA (Model-Annotate-Model-Annotate) cycle and evaluated interannotator agreement (IAA) using observed agreement (A_o), weighted kappa (κ_w), and Krippendorff's α.

Results: The final dataset included 6,399 sentences from 134 clinical consent forms. Complete agreement was achieved for 5,871 sentences, including 211 positively identified and 5,660 negatively identified as permission-sentences across all three annotators (A_o = 0.944, Krippendorff's α = 0.599). These values reflect moderate to substantial IAA. Although permission-sentences contain a set of common words and structure, disagreements between annotators are largely explained by lexical variability and ambiguity in sentence meaning.

Conclusion: Our findings point to the complexity of identifying permission-sentences within clinical consent forms. We present our results in light of lessons learned, which may serve as a launching point for developing tools for automated permission extraction.
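
To make the agreement statistics named in the Methods concrete, the following is a minimal sketch of how observed agreement and Krippendorff's α could be computed for binary (permission / not permission) sentence labels from three annotators. It is not the authors' code. The function names and the toy labels are hypothetical, the data are not from the study, and computing A_o as the mean pairwise agreement is an assumption, since the abstract does not state which definition was used. Weighted kappa is omitted.

```python
from collections import Counter
from itertools import combinations, permutations


def observed_agreement(units):
    """Mean pairwise agreement: share of annotator pairs that gave the same label.

    Assumption: A_o is defined pairwise; the abstract does not specify this.
    """
    agree = 0
    total = 0
    for labels in units:
        for a, b in combinations(labels, 2):
            agree += int(a == b)
            total += 1
    return agree / total


def krippendorff_alpha_nominal(units):
    """Krippendorff's alpha for nominal labels, assuming no missing annotations."""
    # Coincidence matrix: every ordered pair of labels given by two different
    # annotators to the same unit contributes 1 / (m - 1), m = labels per unit.
    coincidences = Counter()
    for labels in units:
        m = len(labels)
        if m < 2:
            continue  # a unit with a single label carries no agreement information
        for a, b in permutations(labels, 2):
            coincidences[(a, b)] += 1 / (m - 1)

    # Marginal totals per label value, and the total number of pairable values.
    totals = Counter()
    for (a, _), weight in coincidences.items():
        totals[a] += weight
    n = sum(totals.values())

    observed = sum(w for (a, b), w in coincidences.items() if a != b)
    expected = sum(
        totals[a] * totals[b] for a in totals for b in totals if a != b
    ) / (n - 1)
    if expected == 0:
        # Only one label value was ever used; alpha is undefined in that case.
        raise ValueError("alpha is undefined when all labels are identical")
    return 1 - observed / expected


# Hypothetical toy data: one tuple per sentence, one label per annotator
# (1 = permission-sentence, 0 = not a permission-sentence).
toy_units = [(1, 1, 1), (0, 0, 0), (1, 0, 1), (0, 0, 0)]

a_o = observed_agreement(toy_units)
alpha = krippendorff_alpha_nominal(toy_units)
print(f"A_o = {a_o:.3f}, alpha = {alpha:.3f}")
```

Because α corrects for agreement expected by chance, it can sit well below A_o when one label dominates, as the negative class does in the reported dataset (5,660 of the 5,871 fully agreed sentences were negatives).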
