Structures, functions and adaptations of the human LINE-1 ORF2 protein

Nature. 2024 Feb;626(7997):194-206. doi: 10.1038/s41586-023-06947-z. Epub 2023 Dec 14.

Abstract

The LINE-1 (L1) retrotransposon is an ancient genetic parasite that has written around one-third of the human genome through a 'copy and paste' mechanism catalysed by its multifunctional enzyme, open reading frame 2 protein (ORF2p) [1]. ORF2p reverse transcriptase (RT) and endonuclease activities have been implicated in the pathophysiology of cancer [2,3], autoimmunity [4,5] and ageing [6,7], making ORF2p a potential therapeutic target. However, a lack of structural and mechanistic knowledge has hampered efforts to rationally exploit it. We report structures of the human ORF2p 'core' (residues 238-1061, including the RT domain) by X-ray crystallography and cryo-electron microscopy in several conformational states. Our analyses identified two previously undescribed folded domains, extensive contacts to RNA templates and associated adaptations that contribute to unique aspects of the L1 replication cycle. Computed integrative structural models of full-length ORF2p show a dynamic closed-ring conformation that appears to open during retrotransposition. We characterize ORF2p RT inhibition and reveal its underlying structural basis. Imaging and biochemistry show that non-canonical cytosolic ORF2p RT activity can produce RNA:DNA hybrids, activating innate immune signalling through cGAS/STING and resulting in interferon production [6-8]. In contrast to retroviral RTs, L1 RT is efficiently primed by short RNAs and hairpins, which probably explains cytosolic priming. Other biochemical activities including processivity, DNA-directed polymerization, non-templated base addition and template switching together allow us to propose a revised L1 insertion model. Finally, our evolutionary analysis demonstrates structural conservation between ORF2p and other RNA- and DNA-dependent polymerases. We therefore provide key mechanistic insights into L1 polymerization and insertion, shed light on the evolutionary history of L1 and enable rational drug development targeting L1.

MeSH terms

  • Cryoelectron Microscopy
  • Crystallography, X-Ray
  • DNA / biosynthesis
  • DNA / genetics
  • Endonucleases* / chemistry
  • Endonucleases* / genetics
  • Endonucleases* / metabolism
  • Humans
  • Immunity, Innate
  • Interferons / biosynthesis
  • Long Interspersed Nucleotide Elements* / genetics
  • RNA / genetics
  • RNA-Directed DNA Polymerase* / chemistry
  • RNA-Directed DNA Polymerase* / genetics
  • RNA-Directed DNA Polymerase* / metabolism
  • Reverse Transcription*

Substances

  • Endonucleases
  • RNA
  • RNA-Directed DNA Polymerase
  • ORF2 protein, human
  • DNA
  • cGAS protein, human
  • STING1 protein, human
  • Interferons