bims-micpro Biomed News
on Discovery and characterization of microproteins
Issue of 2021‒02‒07
three papers selected by
Thomas Martinez
Salk Institute for Biological Studies


  1. Front Oncol. 2020; 10: 622294
      Long non-coding RNA (lncRNA) was originally defined as a class of non-coding RNA that is unable to encode proteins. However, recent reports suggest that some lncRNAs actually contain open reading frames that encode peptides. These coding products play important roles in the pathogenesis of many diseases. Here, we summarize the regulatory pathways through which mammalian lncRNA-encoded peptides influence muscle function, mRNA stability, gene expression, and so on. We also address the promoting and inhibiting functions of the peptides in different cancers and other diseases. Then we introduce the computational prediction methods and data resources used to predict the coding ability of lncRNAs. The intention of this review is to provide references for further research on lncRNA coding and to help reveal the potential prospects for targeted tumor therapy.
    Keywords:  Ribo-seq; cancer; long non-coding RNA; peptide; translation
    DOI:  https://doi.org/10.3389/fonc.2020.622294
  2. Cancer Discov. 2021 Feb 05.
      Knockout of any of several noncanonical open reading frames (ORF) impaired cancer cell survival.
    DOI:  https://doi.org/10.1158/2159-8290.CD-RW2021-018
  3. Malar J. 2021 Feb 05. 20(1): 71
      BACKGROUND: Plasmodium falciparum causes the deadliest form of malaria, which remains one of the most prevalent infectious diseases. Unfortunately, the only licensed vaccine showed limited protection and resistance to anti-malarial drugs is increasing, which can be largely attributed to the biological complexity of the parasite's life cycle. The progression from one developmental stage to another in P. falciparum involves drastic changes in gene expression, and its infectivity to human hosts varies greatly depending on the stage. Approaches to identify candidate genes that are responsible for the development of infectivity to human hosts typically involve differential gene expression analysis between stages. However, the detection may be limited to annotated proteins and open reading frames (ORFs) predicted using restrictive criteria.
    METHODS: The above problem is particularly relevant for P. falciparum, whose genome annotation is relatively incomplete given its clinical significance. In this work, a systems proteogenomics approach was used to address this challenge, as it allows computational detection of unannotated, novel open reading frames (nORFs), which are neglected by conventional analyses. Two transcriptome/proteome pairs were obtained from a previous study, one collected in the mosquito-infectious oocyst sporozoite stage and the other in the salivary gland sporozoite stage with human infectivity. They were then re-analysed using the proteogenomics framework to identify nORFs in each stage.
    RESULTS: Translational products of nORFs that map to antisense, intergenic, intronic, 3' UTR and 5' UTR regions, as well as alternative reading frames of canonical proteins, were detected. Some of these nORFs also showed differential expression between the two life cycle stages studied. Their regulatory roles were explored through further bioinformatics analyses, including analysis of expression regulation of the parent reference genes, in silico structure prediction, and gene ontology term enrichment analysis.
    CONCLUSION: The identification of nORFs in P. falciparum sporozoites highlights the biological complexity of the parasite. Although the analyses are solely computational, these results provide a starting point for further experimental validation of the existence and functional roles of these nORFs.
    DOI:  https://doi.org/10.1186/s12936-021-03598-1