Ribosome decision graphs for the representation of eukaryotic RNA translation complexity

  1. Pavel V. Baranov1
  1. 1School of Biochemistry and Cell Biology, University College Cork, Cork T12 K8AF, Ireland;
  2. 2SFI Centre for Research Training in Genomics Data Science, University College Cork, Cork T12 K8AF, Ireland;
  3. 3Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, 02-106 Warsaw, Poland;
  4. 4Computational Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway;
  5. 5European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, Cambridge, United Kingdom;
  6. 6The Big Data Institute and Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom;
  7. 7Department of Biosciences, University of Oslo, 0316 Oslo, Norway
  1. 8 These authors contributed equally to this work.

  • Corresponding author: p.baranov{at}ucc.ie
  • Abstract

    The application of ribosome profiling has revealed an unexpected abundance of translation in addition to that responsible for the synthesis of previously annotated protein-coding regions. Multiple short sequences have been found to be translated within single RNA molecules, within both annotated protein-coding and noncoding regions. The biological significance of this translation is a matter of intensive investigation. However, current schematic or annotation-based representations of mRNA translation generally do not account for the apparent multitude of translated regions within the same molecules. They also do not take into account the stochasticity of the process that allows alternative translations of the same RNA molecules by different ribosomes. There is a need for formal representations of mRNA complexity that would enable the analysis of quantitative information on translation and more accurate models for predicting the phenotypic effects of genetic variants affecting translation. To address this, we developed a conceptually novel abstraction that we term ribosome decision graphs (RDGs). RDGs represent translation as multiple ribosome paths through untranslated and translated mRNA segments. We termed the latter “translons.” Nondeterministic events, such as initiation, reinitiation, selenocysteine insertion, or ribosomal frameshifting, are then represented as branching points. This representation allows for an adequate representation of eukaryotic translation complexity and focuses on locations critical for translation regulation. We show how RDGs can be used for depicting translated regions and for analyzing genetic variation and quantitative genome-wide data on translation for characterization of regulatory modulators of translation.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.278810.123.

    • Freely available online through the Genome Research Open Access option.

    • Received December 4, 2023.
    • Accepted April 1, 2024.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server