Assessing the effect of model specification and prior sensitivity on Bayesian tests of temporal signal

PLoS Comput Biol. 2024 Nov 6;20(11):e1012371. doi: 10.1371/journal.pcbi.1012371. eCollection 2024 Nov.

Abstract

Our understanding of the evolution of many microbes has been revolutionised by the molecular clock, a statistical tool to infer evolutionary rates and timescales from analyses of biomolecular sequences. In all molecular clock models, evolutionary rates and times are jointly unidentifiable and 'calibration' information must therefore be used. For many organisms, sequences sampled at different time points can be employed for such calibration. Before attempting to do so, it is recommended to verify that the data carry sufficient information for molecular dating, a practice referred to as evaluation of temporal signal. Recently, a fully Bayesian approach, BETS (Bayesian Evaluation of Temporal Signal), was proposed to overcome known limitations of other commonly used techniques such as root-to-tip regression or date randomisation tests. BETS requires the specification of a full Bayesian phylogenetic model, posing several considerations for untangling the impact of model choice on the detection of temporal signal. Here, we aimed to (i) explore the effect of molecular clock model and tree prior specification on the results of BETS and (ii) provide guidelines for improving our confidence in molecular clock estimates. Using microbial molecular sequence data sets and simulation experiments, we assess the impact of the tree prior and its hyperparameters on the accuracy of temporal signal detection. In particular, highly informative priors that are inconsistent with the data can result in the incorrect detection of temporal signal. In consequence, we recommend: (i) using prior predictive simulations to determine whether the prior generates a reasonable expectation of parameters of interest, such as the evolutionary rate and age of the root node, (ii) conducting prior sensitivity analyses to assess the robustness of the posterior to the choice of prior, and (iii) selecting a molecular clock model that reasonably describes the evolutionary process.

MeSH terms

  • Bayes Theorem*
  • Computational Biology* / methods
  • Computer Simulation
  • Evolution, Molecular*
  • Models, Genetic*
  • Phylogeny*

Grants and funding

This work received funding from: the Inception program (Investissement d’Avenir grant ANR-16-CONV-0005 awarded to SD), the Australian National Health and Medical Research Council (2017284 awarded to SD), and the Australian Research Council (FT220100629 awarded to SD). SD received a salary from the Inception program and the Australian Research Council. JHT received a salary from the National Health and Medical Research Council. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.