Background: Despite a significant global burden of disease, there is still no vaccine against shigellosis widely available. One aim of the European Union funded STOPENTERICS consortium is to develop vaccine candidates against Shigella. Given the importance of translational vaccine coverage, here we aimed to characterise the Shigella strains being used by the consortium by whole genome sequencing, and report on the stability of strains cultured in different laboratories or through serial passage.
Methods: We sequenced, de novo assembled and annotated 20 Shigella strains being used by the consortium. These comprised 16 different isolates belonging to 7 serotypes, and 4 derivative strains. Derivative strains from common isolates were manipulated in different laboratories or had undergone multiple passages in the same laboratory. Strains were mapped against reference genomes to detect SNP variation and phylogenetic analysis was performed.
Results: The genomes assembled into similar total lengths (range 4.14-4.83 Mbp) and had similar numbers of predicted coding sequences (average of 4,400). Mapping analysis showed the genetic stability of strains through serial passages and culturing in different laboratories, as well as varying levels of similarity to published reference genomes. Phylogenetic analysis revealed the presence of three main clades among the strains and published references, one containing the Shigella flexneri serotype 6 strains, a second containing the remaining S. flexneri serotypes and a third comprised of Shigella sonnei strains.
Conclusions: This work increases the number of the publically available Shigella genomes available and specifically provides information on strains being used for vaccine development by STOPENTERICS. It also provides information on the variability among strains maintained in different laboratories and through serial passage. This work will guide the selection of strains for further vaccine development.
Keywords: Genome; STOPENTERICS; Shigella; Vaccine.