Deep sequencing of short capped RNAs reveals novel families of noncoding RNAs

Michiel Jan Laurens De Hoon, Alessandro Bonetti, Charles Plessy, Yoshinari Ando, Chung-Chau Hon, Yuri Ishizu, Masayoshi Itoh, Sachi Katoh, Dongyan Lin, Sho Maekawa, Mitsuyoshi Murata, Hiromi Nishiyori (+10 others)
2022 biorxiv/medrxiv  
In eukaryotes, capped RNAs include long transcripts such as messenger RNAs and long noncoding RNAs, as well as shorter transcripts such as spliceosomal RNAs, small nucleolar RNAs, and enhancer RNAs. Long capped transcripts can be profiled using Cap Analysis Gene Expression (CAGE) sequencing and other methods. Here, we describe a sequencing library preparation protocol for short capped RNAs, apply it to a differentiation time course of the human cell line THP-1, and systematically compare the
more » ... dscape of short capped RNAs to that of long capped RNAs. Transcription initiation peaks associated with genes in the sense direction had a strong preference to produce either long or short capped RNAs, with 1 out of 6 peaks detected in the short capped RNA libraries only. Gene-associated short capped RNAs had highly specific 3' ends, typically overlapping splice sites. Enhancers also preferentially generated either short or long capped RNAs, with 10% of enhancers observed in the short capped RNA libraries only. Both enhancers producing short or long capped RNAs showed enrichment for GWAS-associated disease SNPs. We conclude that deep sequencing of short capped RNAs reveals new families of noncoding RNAs and elucidates the diversity of transcripts generated at known and novel promoters and enhancers.
doi:10.1101/gr.276647.122 pmid:35961773 pmcid:PMC9528987 fatcat:6imncs6yv5hgdbfnmjjuiog4sy