gcPathogen: a comprehensive genomic resource of human pathogens for public health

Nucleic Acids Res. 2024 Jan 5;52(D1):D714-D723. doi: 10.1093/nar/gkad875.

Abstract

Here, we present the manually curated Global Catalogue of Pathogens (gcPathogen), a genomic resource designed to support rapid and accurate pathogen analysis, epidemiological investigation, and monitoring of antibiotic resistance features and virulence factors. The catalogue integrates and analyzes genomic data and associated metadata for human pathogens isolated from infected patients, animal hosts, food and the environment. Inclusion in the pathogen list is supported by evidence from medical or government pathogen lists and publications. The current version of gcPathogen contains 1 164 974 assemblies representing 986 044 strains from 497 bacterial taxa, 4794 assemblies representing 4319 strains from 265 fungal taxa, 89 965 assemblies representing 13 687 strains from 222 viral taxa, and 646 assemblies representing 387 strains from 159 parasitic taxa. The database gives researchers a single point of access for global, long-term public health surveillance and for in-depth analysis of genomes, sequence types, antibiotic resistance genes, virulence factors and mobile genetic elements across countries, diseases and hosts. The data and statistics can be queried and explored through an interactive web interface at https://nmdc.cn/gcpathogen/.
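The per-group counts above can be tallied to get a sense of the overall scale of the catalogue. The following is a minimal sketch: the numbers are copied from the abstract, but the totals and the bacterial share are derived arithmetic rather than figures reported by the authors, and the names `COUNTS` and `tally` are illustrative only.

```python
# Counts copied from the abstract: (assemblies, strains, taxa) per pathogen group.
COUNTS = {
    "bacteria": (1_164_974, 986_044, 497),
    "fungi": (4_794, 4_319, 265),
    "viruses": (89_965, 13_687, 222),
    "parasites": (646, 387, 159),
}


def tally(counts):
    """Sum each column across groups (derived totals, not stated in the abstract)."""
    assemblies, strains, taxa = (sum(col) for col in zip(*counts.values()))
    return {"assemblies": assemblies, "strains": strains, "taxa": taxa}


totals = tally(COUNTS)

# Fraction of all assemblies that are bacterial.
bacterial_share = COUNTS["bacteria"][0] / totals["assemblies"]

print(totals)
print(f"bacterial share of assemblies: {bacterial_share:.1%}")
```

Under these figures, bacterial genomes account for more than 90% of all assemblies in the current version.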

MeSH terms

  • Animals
  • Databases, Genetic*
  • Genome, Bacterial / genetics
  • Genomics
  • Humans
  • Infections* / microbiology
  • Infections* / parasitology
  • Infections* / virology
  • Public Health*
  • Virulence Factors / genetics

Substances

  • Virulence Factors