Introduction: To support long COVID research in National COVID Cohort Collaborative (N3C), the N3C Phenotype and Data Acquisition team created data designs to aid contributing sites in enhancing their data. Enhancements include: long COVID specialty clinic indicator; Admission, Discharge, and Transfer (ADT) transactions; patient-level social determinants of health; and in-hospital use of oxygen supplementation.
Methods: For each enhancement, we defined the scope and wrote guidance on how to prepare and populate the data in a standardized way.
Results: As of June 2024, 29 sites have added at least one data enhancement to their N3C pipeline.
Discussion: The use of common data models is critical to the success of N3C; however, these data models cannot account for all needs. Project-driven data enhancement is required. This should be done in a standardized way in alignment with CDM specifications. Our approach offers a useful pathway for enhancing data to improve fit for purpose.
Keywords: COVID-19; Clinical Informatics; Common data models; Data modeling; EHR Data.
© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association.