Transcripts in all eukaryotes are characterized by the 5'-end specific cap structure in mRNAs. Cap Analysis Gene Expression or CAGE makes use of these caps to specifically obtain cDNA fragments from the 5'-end of RNA and sequences those at high throughput for transcript identification and genome-wide mapping of transcription start sites for coding and noncoding genes. Here, we provide an improved version of our nanoCAGE protocol that has been developed for preparing CAGE libraries from as little as 50 ng of total RNA within three standard working days. Key steps in library preparation have been improved over our previously published protocol to obtain libraries having a good 5'-end selection and a more equal size distribution for higher sequencing efficiency on Illumina MiSeq and HiSeq sequencers. We recommend nanoCAGE as the method of choice for transcriptome profiling projects even from limited amounts of RNA, and as the best approach for genome-wide mapping of transcription start sites within promoter regions.
Keywords: CAGE; CAGEscan; Cap analysis gene expression; Expression profiling; Multiplexing; RNA; TSS; Tagmentation; Template switching; Transcription Start Sites; UMI; Unique molecular identifiers; nanoCAGE.