When do species-tree and concatenated estimates disagree? An empirical analysis with higher-level scincid lizard phylogeny

Mol Phylogenet Evol. 2015 Jan:82 Pt A:146-55. doi: 10.1016/j.ympev.2014.10.004. Epub 2014 Oct 12.

Abstract

Simulation studies suggest that coalescent-based species-tree methods are generally more accurate than concatenated analyses. However, these species-tree methods remain impractical for many large datasets. Thus, a critical but unresolved issue is when and why concatenated and coalescent species-tree estimates will differ. We predict such differences for branches in concatenated trees that are short, weakly supported, and have conflicting gene trees. We test these predictions in Scincidae, the largest lizard family, with data from 10 nuclear genes for 17 ingroup taxa and 44 genes for 12 taxa. We support our initial predictions, andsuggest that simply considering uncertainty in concatenated trees may sometimes encompass the differences between these methods. We also found that relaxed-clock concatenated trees can be surprisingly similar to the species-tree estimate. Remarkably, the coalescent species-tree estimates had slightly lower support values when based on many more genes (44 vs. 10) and a small (∼30%) reduction in taxon sampling. Thus, taxon sampling may be more important than gene sampling when applying species-tree methods to deep phylogenetic questions. Finally, our coalescent species-tree estimates tentatively support division of Scincidae into three monophyletic subfamilies, a result otherwise found only in concatenated analyses with extensive species sampling.

Keywords: Concatenated analysis; Phylogenetic methods; Reptile; Scincidae; Species-tree analysis; Taxon sampling.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Bayes Theorem
  • Biological Evolution*
  • Likelihood Functions
  • Lizards / classification*
  • Models, Genetic
  • Phylogeny*
  • Sequence Analysis, DNA

Associated data

  • Dryad/10.5061/dryad.331jq