RNA Biol. 2020 Oct 28. 1-16
The recent discovery of long non-coding RNA as a regulatory molecule in the cellular system has altered the concept of the functional aptitude of the genome. Since our publication of the first version of LncRBase in 2014, there has been an enormous increase in the number of annotated lncRNAs of multiple species other than Human and Mouse. LncRBase V.2 hosts information of 549,648 lncRNAs corresponding to six additional species besides Human and Mouse, viz. Rat, Fruitfly, Zebrafish, Chicken, Cow and C.elegans. It provides additional distinct features such as (i) Transcription Factor Binding Site (TFBS) in the lncRNA promoter region, (ii) sub-cellular localization pattern of lncRNAs (iii) lnc-pri-miRNAs (iv) Possible small open reading frames (sORFs) within lncRNA. (v) Manually curated information of interacting target molecules and disease association of lncRNA genes (vi) Distribution of lncRNAs across multiple tissues of all species. Moreover, we have hosted ClinicLSNP within LncRBase V.2. ClinicLSNP has a comprehensive catalogue of lncRNA variants present within breast, ovarian, and cervical cancer inferred from 561 RNA-Seq data corresponding to these cancers. Further, we have checked whether these lncRNA variants overlap with (i)Repeat elements,(ii)CGI, (iii)TFBS within lncRNA loci (iv)SNP localization in trait-associated Linkage Disequilibrium(LD) region, (v)predicted the potentially pathogenic variants and (vi)effect of SNP on lncRNA secondary structure. Overall, LncRBaseV.2 is a user-friendly database to survey, search and retrieve information about multi-species lncRNAs. Further, ClinicLSNP will serve as a useful resource for cancer specific lncRNA variants and their related information. The database is freely accessible and available at http://dibresources.jcbose.ac.in/zhumur/lncrbase2/.
Keywords: Long non-coding RNA; clinicLSNP; female cancer; lncRNA variant; lncrbase; sORFs; sub-cellular localization