Objectives: To test the application of statistical methods to detect data fabrication in a clinical trial.
Setting: Data from two clinical trials: a trial of a dietary intervention for cardiovascular disease and a trial of a drug intervention for the same problem.
Outcome measures: Baseline comparisons of means and variances of cardiovascular risk factors; digit preference overall and its pattern by group.
Results: In the dietary intervention trial, variances for 16 of the 22 variables available at baseline were significantly different, and 10 significant differences were seen in means for these variables. Some of these P values were extraordinarily small. Distributions of the final recorded digit were significantly different between the intervention and the control group at baseline for 14/22 variables in the dietary trial. In the drug trial, only five variables were available, and no significant differences between the groups for baseline values in means or variances or digit preference were seen.
Conclusions: Several statistical features of the data from the dietary trial are so strongly suggestive of data fabrication that no other explanation is likely.