Estimating significance level and power comparisons for testing multiple endpoints in clinical trials

J Gong; J C Pinheiro; D L DeMets

doi:10.1016/s0197-2456(00)00049-0

Control Clin Trials. 2000 Aug;21(4):313-29. doi: 10.1016/s0197-2456(00)00049-0.

Authors

J Gong¹, J C Pinheiro, D L DeMets

Affiliation

¹ Department of Biostatistics, University of Wisconsin-Madison, Madison, WI, USA.

PMID: 10913807
DOI: 10.1016/s0197-2456(00)00049-0

Abstract

Clinical trials generally include several outcome measures of interest for assessing treatment efficacy and harm. Traditionally, a single measure, the primary outcome, is selected and used as the basis for the design, including sample size and power. Secondary outcomes are then generally ordered with respect to their clinical relevance and importance. While this has become the traditional paradigm, recent trials have suggested the need for additional approaches. In such trials, two outcomes are viewed as key: either one is sufficient for proof of efficacy, but there is an ordering of preference between them. The basic question in such cases is how to control the overall significance level for the trial. We describe and compare two methods for testing primary and secondary endpoints that account for their hierarchical nature, that is, the ordering of preference. Both methods are sequential, in the sense that the secondary endpoint is tested only when the primary outcome fails to reach significance. The first method uses a global test for the combination of the primary and secondary endpoints, while the second uses a partial Bonferroni correction. Simulation results indicate that the Bonferroni adjustment method performs as well as the global test method in most cases, and even better in some cases.
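
The sequential Bonferroni-type procedure described in the abstract can be illustrated with a small Monte Carlo sketch. This is not the authors' implementation: the abstract does not state how the significance level is split, so the values alpha1 = 0.04 and alpha2 = 0.01 are hypothetical, as are the function names, the correlation parameter rho, and the simplification that the two test statistics are bivariate normal with unit variance (the paper's own simulation settings may differ). The sketch tests the primary endpoint first and tests the secondary endpoint only if the primary fails, then estimates how often efficacy is claimed.

```python
import random
from statistics import NormalDist


def estimate_rejection_rate(mean1, mean2, rho, alpha1, alpha2,
                            n_sims=50_000, seed=0):
    """Estimate how often a sequential two-endpoint test claims efficacy.

    mean1, mean2: means of the primary and secondary z-statistics
                  (0 under the null hypothesis). Hypothetical parameters.
    rho:          assumed correlation between the two z-statistics.
    alpha1/2:     two-sided levels for the primary and secondary tests
                  (a hypothetical split of the overall level).
    """
    # Two-sided critical values for each endpoint.
    crit1 = NormalDist().inv_cdf(1 - alpha1 / 2)
    crit2 = NormalDist().inv_cdf(1 - alpha2 / 2)

    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sims):
        # Draw a correlated pair of standard normal errors.
        e1 = rng.gauss(0, 1)
        e2 = rng.gauss(0, 1)
        z1 = mean1 + e1
        z2 = mean2 + rho * e1 + (1 - rho ** 2) ** 0.5 * e2

        if abs(z1) >= crit1:
            # Primary endpoint is significant: efficacy is claimed.
            rejections += 1
        elif abs(z2) >= crit2:
            # Secondary endpoint is tested only when the primary fails.
            rejections += 1
    return rejections / n_sims


if __name__ == "__main__":
    # Overall type I error when neither endpoint has a true effect.
    for rho in (0.0, 0.5, 0.9):
        rate = estimate_rejection_rate(0.0, 0.0, rho, 0.04, 0.01)
        print(f"rho={rho}: estimated overall type I error = {rate:.4f}")
```

Because alpha1 + alpha2 = 0.05 in this sketch, the union bound keeps the overall type I error at or below 0.05 for any correlation, so the estimates should stay near or under that level up to simulation noise. Setting mean1 or mean2 to a nonzero value gives a rough power estimate under the same assumptions.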

MeSH terms

Clinical Trials as Topic / methods
Clinical Trials as Topic / statistics & numerical data*
Data Interpretation, Statistical*
Humans
Probability
Survival Analysis