Background: The endosymbiotic birth of organelles is accompanied by massive transfer of endosymbiont genes to the eukaryotic host nucleus. In the centric diatom Thalassiosira pseudonana the Psb28 protein is encoded in the plastid genome while a second version is nuclear-encoded and possesses a bipartite N-terminal presequence necessary to target the protein into the diatom complex plastid. Thus it can represent a gene captured during endosymbiotic gene transfer.
Methodology/principal findings: To specify the origin of nuclear- and plastid-encoded Psb28 in T. pseudonana we have performed extensive phylogenetic analyses of both mentioned genes. We have also experimentally tested the intracellular location of the nuclear-encoded Psb28 protein (nuPsb28) through transformation of the diatom Phaeodactylum tricornutum with the gene in question fused to EYFP.
Conclusions/significance: We show here that both versions of the psb28 gene in T. pseudonana are transcribed. We also provide experimental evidence for successful targeting of the nuPsb28 fused with EYFP to the diatom complex plastid. Extensive phylogenetic analyses demonstrate that nucleotide composition of the analyzed genes deeply influences the tree topology and that appropriate methods designed to deal with a compositional bias of the sequences and the long branch attraction artefact (LBA) need to be used to overcome this obstacle. We propose that nuclear psb28 in T. pseudonana is a duplicate of a plastid localized version, and that it has been transferred from its endosymbiont.