Coccolithoviruses (Phycodnaviridae) infect and lyse the most ubiquitous and successful coccolithophorid in modern oceans, Emiliania huxleyi. So far, the genomes of 13 of these giant lytic viruses (i.e., Emiliania huxleyi viruses-EhVs) have been sequenced, assembled, and annotated. Here, we performed an in-depth comparison of their genomes to try and contextualize the ecological and evolutionary traits of these viruses. The genomes of these EhVs have from 444 to 548 coding sequences (CDSs). Presence/absence analysis of CDSs identified putative genes with particular ecological significance, namely sialidase, phosphate permease, and sphingolipid biosynthesis. The viruses clustered into distinct clades, based on their DNA polymerase gene as well as full genome comparisons. We discuss the use of such clustering and suggest that a gene-by-gene investigation approach may be more useful when the goal is to reveal differences related to functionally important genes. A multi domain "Best BLAST hit" analysis revealed that 84% of the EhV genes have closer similarities to the domain Eukarya. However, 16% of the EhV CDSs were very similar to bacterial genes, contributing to the idea that a significant portion of the gene flow in the planktonic world inter-crosses the domains of life.
Keywords: E. huxleyi; coccolithovirus; domains of life; genome comparison; horizontal gene transfer.