Reconstructing extrachromosomal DNA structural heterogeneity from long-read sequencing data using Decoil

Genome Res. 2024 Oct 11;34(9):1355-1364. doi: 10.1101/gr.279123.124.

Abstract

Circular extrachromosomal DNA (ecDNA) is a form of oncogene amplification found across cancer types and associated with poor outcome in patients. ecDNA can be structurally complex and can contain rearranged DNA sequences derived from multiple chromosome locations. As the structure of ecDNA can impact oncogene regulation and may indicate mechanisms of its formation, disentangling it at high resolution from sequencing data is essential. Even though methods have been developed to identify and reconstruct ecDNA in cancer genome sequencing, it remains challenging to resolve complex ecDNA structures, in particular amplicons with shared genomic footprints. We here introduce Decoil, a computational method that combines a breakpoint-graph approach with LASSO regression to reconstruct complex ecDNA and deconvolve co-occurring ecDNA elements with overlapping genomic footprints from long-read nanopore sequencing. Decoil outperforms de novo assembly and alignment-based methods in simulated long-read sequencing data for both simple and complex ecDNAs. Applying Decoil on whole-genome sequencing data uncovered different ecDNA topologies and explored ecDNA structure heterogeneity in neuroblastoma tumors and cell lines, indicating that this method may improve ecDNA structural analyses in cancer.

MeSH terms

  • Cell Line, Tumor
  • Computational Biology / methods
  • DNA, Circular* / genetics
  • Genome, Human
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Nanopore Sequencing / methods
  • Neuroblastoma / genetics
  • Sequence Analysis, DNA / methods
  • Software

Substances

  • DNA, Circular