Identification of risk loci and a polygenic risk score for lung cancer: a large-scale prospective cohort study in Chinese populations

Lancet Respir Med. 2019 Oct;7(10):881-891. doi: 10.1016/S2213-2600(19)30144-4. Epub 2019 Jul 17.

Abstract

Background: Genetic variation has an important role in the development of non-small-cell lung cancer (NSCLC). However, genetic factors for lung cancer have not been fully identified, especially in Chinese populations, which limits the use of existing polygenic risk scores (PRS) to identify subpopulations at high risk of lung cancer for prevention. We therefore aimed to identify novel loci associated with NSCLC risk, and generate a PRS and evaluate its utility and effectiveness in the prediction of lung cancer risk in Chinese populations.

Methods: To systematically identify genetic variants for NSCLC risk, we newly genotyped 19 546 samples from Chinese NSCLC cases and controls from the Nanjing Medical University Global Screening Array Project and did a meta-analysis of genome-wide association studies (GWASs) of 27 120 individuals with NSCLC and 27 355 without NSCLC (13 327 cases and 13 328 controls of Chinese descent as well as 13 793 cases and 14 027 controls of European descent). We then built a PRS for Chinese populations from all reported single-nucleotide polymorphisms that have been reported to be associated with lung cancer risk at genome-wide significance level. We evaluated the utility and effectiveness of the generated PRS in predicting subpopulations at high-risk of lung cancer in an independent prospective cohort of 95 408 individuals from the China Kadoorie Biobank (CKB) with more than 10 years' follow-up.

Findings: We identified 19 susceptibility loci to be significantly associated with NSCLC risk at p≤5·0 × 10-8, including six novel loci. When applied to the CKB cohort, the PRS of the risk loci successfully predicted lung cancer incident cases in a dose-response manner in participants at a high genetic risk (top 10%) than those at a low genetic risk (bottom 10%; adjusted hazard ratio 1·96, 95% CI 1·53-2·51; ptrend=2·02 × 10-9). Specially, we observed consistently separated curves of lung cancer events in individuals at low, intermediate, and high genetic risk, respectively, and PRS was an independent effective risk stratification indicator beyond age and smoking pack-years.

Interpretation: We have shown for the first time that GWAS-derived PRS can be effectively used in discriminating subpopulations at high risk of lung cancer, who might benefit from a practically feasible PRS-based lung cancer screening programme for precision prevention in Chinese populations.

Funding: National Natural Science Foundation of China, the Priority Academic Program for the Development of Jiangsu Higher Education Institutions, National Key R&D Program of China, Science Foundation for Distinguished Young Scholars of Jiangsu, and China's Thousand Talents Program.

Publication types

  • Evaluation Study
  • Meta-Analysis
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Asian People / genetics*
  • Carcinoma, Non-Small-Cell Lung / genetics*
  • China
  • Female
  • Genetic Loci / genetics
  • Genetic Predisposition to Disease / ethnology
  • Genetic Predisposition to Disease / genetics*
  • Genome-Wide Association Study
  • Genotype
  • Humans
  • Lung Neoplasms / genetics*
  • Male
  • Middle Aged
  • Multifactorial Inheritance / genetics
  • Polymorphism, Single Nucleotide
  • Proportional Hazards Models
  • Prospective Studies
  • Risk Assessment / methods*
  • Risk Factors