A pipeline to extract drug-adverse event pairs from multiple data sources

BMC Med Inform Decis Mak. 2014 Feb 24:14:13. doi: 10.1186/1472-6947-14-13.

Abstract

Background: Pharmacovigilance aims to uncover and understand harmful side effects of drugs, termed adverse events (AEs). Although the current pharmacovigilance process is highly systematic, the growing volume of information on specialized health-related websites and the exponential growth of the medical literature present an opportunity to supplement traditional adverse event gathering mechanisms with newer ones.

Method: We present a semi-automated pipeline that extracts associations between drugs and side effects from traditional structured adverse event databases and enhances them with potential drug-adverse event pairs mined from user comments on health-related websites and from MEDLINE abstracts. The pipeline was tested using a set of 12 drugs representative of two previous studies of adverse event extraction from health-related websites and MEDLINE abstracts.
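
As a rough illustration of the idea in the Method paragraph, the sketch below merges drug-AE pairs from a structured source with pairs mined from non-traditional sources, then separates pairs already present in the structured source from pairs that appear only elsewhere. It is not the authors' implementation: the source names (`ae_database`, `user_comments`, `medline_abstracts`), the function names, the simple lowercase normalization, and the placeholder drugs and events are all assumptions made for this example.

```python
from collections import defaultdict


def merge_pairs(sources):
    """Map each normalized (drug, event) pair to the set of sources mentioning it."""
    merged = defaultdict(set)
    for source_name, pairs in sources.items():
        for drug, event in pairs:
            # Assumption: lowercasing and trimming stands in for real term
            # normalization (e.g. mapping to a controlled vocabulary).
            key = (drug.strip().lower(), event.strip().lower())
            merged[key].add(source_name)
    return dict(merged)


def split_known_and_candidate(merged, structured_source="ae_database"):
    """Separate pairs found in the structured source from pairs found only elsewhere."""
    known = {p: s for p, s in merged.items() if structured_source in s}
    candidates = {p: s for p, s in merged.items() if structured_source not in s}
    return known, candidates


# Placeholder records, invented purely for illustration (not real findings).
sources = {
    "ae_database": [("DrugA", "Nausea"), ("DrugA", "Headache")],
    "user_comments": [("druga", "nausea"), ("DrugA", "Insomnia")],
    "medline_abstracts": [("DrugA", "headache"), ("DrugB", "Rash")],
}

merged = merge_pairs(sources)
known, candidates = split_known_and_candidate(merged)

for pair in sorted(candidates):
    print("candidate for further review:", pair, sorted(candidates[pair]))
```

In this sketch, a pair in `known` that is also seen in other sources is treated as substantiated, while pairs in `candidates` are only suggestions that would still need expert review, in keeping with the semi-automated design.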

Results: Testing the pipeline shows that mining non-traditional sources helps substantiate the adverse event databases. The non-traditional sources not only contain the known AEs but also suggest some unreported AEs for drugs, which can then be analyzed further.

Conclusion: We present a semi-automated pipeline that extracts drug-AE pairs from adverse event databases, as well as potential drug-AE pairs from non-traditional sources such as MEDLINE abstracts and user comments on health-related websites.

MeSH terms

  • Adverse Drug Reaction Reporting Systems / standards*
  • Algorithms*
  • Data Mining / methods*
  • Drug-Related Side Effects and Adverse Reactions*
  • Humans
  • Natural Language Processing