Purpose: Genetic researchers' selection of a database can have scientific, regulatory, and ethical implications. It is important to understand what is driving database selection such that database stewards can be responsive to user needs while balancing the interests of communities in equitably benefiting from advances.
Methods: We conducted 23 semistructured interviews with US academic genetic researchers working with private, government, and collaboratory data stewards to explore factors that they consider when selecting a genetic database.
Results: Interviewees used existing databases to avoid burdens of primary data collection, which was described as expensive and time-consuming. They highlighted ease of access as the most important selection factor, integrating concepts of familiarity and efficiency. Data features, such as size and available phenotype, were also important. Demographic diversity was not originally cited by any interviewee as a pivotal factor; when probed, most stated that the option to consider diversity in database selection was limited. Database features, including integrity, harmonization, and storage were also described as key components of efficient use.
Conclusion: There is a growing market and competition between genetic data stewards. Data need to be accessible, harmonized, and administratively supported for their existence to be translated into use and, in turn, result in scientific advancements across diverse communities.
Keywords: Data set; Diversity; ELSI; Ethics; Policy.
Copyright © 2022 American College of Medical Genetics and Genomics. Published by Elsevier Inc. All rights reserved.