Deconvolution of nucleic-acid length distributions: a gel electrophoresis analysis tool and applications

Nucleic Acids Res. 2019 Sep 19;47(16):e92. doi: 10.1093/nar/gkz534.

Abstract

Next-generation DNA-sequencing (NGS) technologies, which are designed to streamline the acquisition of massive amounts of sequencing data, are nonetheless dependent on various preparative steps to generate DNA fragments of required concentration, purity and average size (molecular weight). Current automated electrophoresis systems for DNA- and RNA-sample quality control, such as Agilent's Bioanalyzer® and TapeStation® products, are costly to acquire and use; they also provide limited information for samples having broad size distributions. Here, we describe a software tool that helps determine the size distribution of DNA fragments in an NGS library, or other DNA sample, based on gel-electrophoretic line profiles. The software, developed as an ImageJ plug-in, allows for straightforward processing of gel images, including lane selection and fitting of univariate functions to intensity distributions. The user chooses between fitting discrete profiles, for cases where individual gel bands are visible, and continuous profiles, in which multiple bands are buried under a single broad peak. The method requires only modest imaging capabilities and offers a cost-effective, rigorous alternative that can augment existing techniques for library quality control.
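As a rough illustration of how a lane's line profile can be turned into fragment-size estimates, the following Python sketch may help. It is not the plug-in's algorithm, which the abstract does not specify. It assumes a log-linear relation between migration distance and fragment length over the ladder's range, a common approximation for agarose gels, and it assumes stain intensity is proportional to DNA mass. All function names and example values are hypothetical.

```python
import math


def fit_ladder(distances, sizes_bp):
    """Least-squares fit of log10(size) = a + b * distance to ladder bands.

    Assumption: the log-linear model is only an approximation and is
    valid at best within the size range spanned by the ladder.
    """
    ys = [math.log10(s) for s in sizes_bp]
    n = len(distances)
    mean_x = sum(distances) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in distances)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(distances, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def distance_to_size(distance, a, b):
    """Convert a migration distance to an estimated fragment length (bp)."""
    return 10 ** (a + b * distance)


def mean_sizes(profile, a, b):
    """Return (number-average, weight-average) fragment length in bp.

    profile: list of (distance, background-subtracted intensity) pairs.
    Assumption: intensity is proportional to DNA mass, so the relative
    number of molecules at each position scales as intensity / length.
    """
    sizes = [distance_to_size(d, a, b) for d, _ in profile]
    masses = [max(i, 0.0) for _, i in profile]  # clip negative noise to zero
    total_mass = sum(masses)
    counts = [m / s for m, s in zip(masses, sizes)]
    weight_avg = sum(m * s for m, s in zip(masses, sizes)) / total_mass
    number_avg = total_mass / sum(counts)
    return number_avg, weight_avg


if __name__ == "__main__":
    # Hypothetical ladder: band positions in pixels and sizes in bp.
    a, b = fit_ladder([10.0, 20.0, 30.0], [10000, 1000, 100])
    # Hypothetical lane profile: (pixel position, intensity).
    lane = [(18.0, 0.2), (20.0, 1.0), (22.0, 0.6), (24.0, 0.1)]
    print(mean_sizes(lane, a, b))
```

For a broad smear the two averages can differ substantially, because long fragments contribute more stain signal per molecule. This is one reason a full distribution is more informative than a single reported peak size.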

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Bacteriophage lambda / genetics
  • Caenorhabditis elegans / genetics
  • DNA / analysis*
  • DNA / chemistry
  • DNA / genetics
  • DNA Fragmentation
  • Electrophoresis, Agar Gel / methods*
  • Endonucleases / chemistry
  • Gene Library
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Quality Control
  • Sequence Analysis, DNA / methods
  • Sequence Analysis, DNA / statistics & numerical data*
  • Software*

Substances

  • DNA
  • Endonucleases