Identification of Staphylococcus aureus virulence-modulating RNA from transcriptomics data with machine learning

Virulence. 2023 Dec;14(1):2228657. doi: 10.1080/21505594.2023.2228657.

Abstract

The virulence factors of Staphylococcus aureus are tightly controlled by two-component systems (TCSs) and small RNA (sRNA). TCSs have been well studied over the past several decades, but our understanding of sRNA functions lags far behind that of TCS functions. Here, we studied the biological role of sRNA from 506 S. aureus RNA-seq datasets using independent component analysis (ICA). We found that a previously neglected sRNA, Sau-41, functions in the Agr system. Sau-41 is located within the PSMα operon and controlled by the Agr system. It was predicted to share 22-base complementarity with RNAIII, a major regulator of S. aureus virulence. The EMSA results demonstrated that Sau-41 directly binds to RNAIII. Furthermore, our results found that Sau-41 is capable of repressing S. aureus haemolysin activity by downregulating α-haemolysin and δ-toxin. The repression of α-haemolysin was attributed to the competition between the 5' UTR of hla and Sau-41 for binding RNAIII. We observed that Sau-41 mitigated S. aureus virulence in an orthopaedic implant infection mouse model and alleviated osteolysis. Together, our results indicate that Sau-41 is a virulence-regulating RNA and suggest that Sau-41 might be involved in a negative feedback mechanism to control the Agr system. This work is a demonstration of using ICA in sRNA identification by mining high-throughput data and could be extended to other organisms as well.

Keywords: RNA-seq; Staphylococcus aureus; independent component analysis; machine learning; sRNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Hemolysin Proteins
  • Machine Learning
  • Mice
  • RNA, Small Untranslated*
  • Staphylococcal Infections*
  • Staphylococcus aureus / genetics
  • Transcriptome
  • Virulence

Substances

  • Hemolysin Proteins
  • RNA, Small Untranslated

Grants and funding

This study was supported by the National Natural Science Foundation of China under grant numbers 82272511 and 82202727; the experimental animal study support project of the Shanghai Science and Technology Commission under grant number 21140904800; and the China Postdoctoral Science Foundation under grant number 2022M712109.