Inferring phylogenetic networks from multifurcating trees via cherry picking and machine learning

Mol Phylogenet Evol. 2024 Oct:199:108137. doi: 10.1016/j.ympev.2024.108137. Epub 2024 Jul 17.

Abstract

The Hybridization problem asks to reconcile a set of conflicting phylogenetic trees into a single phylogenetic network with the smallest possible number of reticulation nodes. This problem is computationally hard and previous solutions are limited to small and/or severely restricted data sets, for example, a set of binary trees with the same taxon set or only two non-binary trees with non-equal taxon sets. Building on our previous work on binary trees, we present FHyNCH, the first algorithmic framework to heuristically solve the Hybridization problem for large sets of multifurcating trees whose sets of taxa may differ. Our heuristics combine the cherry-picking technique, recently proposed to solve the same problem for binary trees, with two carefully designed machine-learning models. We demonstrate that our methods are practical and produce qualitatively good solutions through experiments on both synthetic and real data sets.

Keywords: Cherry-picking; Heuristic; Hybrid phylogeny; Hybridization problem; Machine learning.

MeSH terms

  • Algorithms*
  • Hybridization, Genetic
  • Machine Learning*
  • Models, Genetic
  • Phylogeny*