Pathway-directed weighted testing procedures for the integrative analysis of gene expression and metabolomic data

Genomics. 2012 May;99(5):265-74. doi: 10.1016/j.ygeno.2012.03.004. Epub 2012 Apr 2.

Abstract

We explore the utility of p-value weighting for enhancing the power to detect differential metabolites in a two-sample setting. Related gene expression information is used to assign an a priori importance level to each metabolite being tested. We map the gene expression to a metabolite through pathways and then gene expression information is summarized per-pathway using gene set enrichment tests. Through simulation we explore four styles of enrichment tests and four weight functions to convert the gene information into a meaningful p-value weight. We implement the p-value weighting on a prostate cancer metabolomic dataset. Gene expression on matched samples is used to construct the weights. Under certain regulatory conditions, the use of weighted p-values does not inflate the type I error above what we see for the un-weighted tests except in high correlation situations. The power to detect differential metabolites is notably increased in situations with disjoint pathways and shows moderate improvement, relative to the proportion of enriched pathways, when pathway membership overlaps.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Computer Simulation*
  • Gene Expression Profiling / methods*
  • Humans
  • Male
  • Metabolomics / methods*
  • Prostatic Neoplasms / genetics
  • Prostatic Neoplasms / metabolism
  • Reproducibility of Results