Translation initiation downstream from annotated start codons in human mRNAs coevolves with the Kozak context

  1. Pavel V. Baranov1,3
  1. 1School of Biochemistry and Cell Biology, University College Cork, Cork, T12 XF62 Ireland;
  2. 2Department of Computer Science and Artificial Intelligence, University of Granada, Granada, 18010 Spain;
  3. 3Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, 117997 Russia;
  4. 4Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992 Russia
  1. 5 These authors contributed equally to this work.

  • Corresponding author: p.baranov{at}ucc.ie
  • Abstract

    Eukaryotic translation initiation involves preinitiation ribosomal complex 5′-to-3′ directional probing of mRNA for codons suitable for starting protein synthesis. The recognition of codons as starts depends on the codon identity and on its immediate nucleotide context known as Kozak context. When the context is weak (i.e., nonoptimal), leaky scanning takes place during which a fraction of ribosomes continues the mRNA probing. We explored the relationship between the context of AUG codons annotated as starts of protein-coding sequences and the next AUG codon occurrence. We found that AUG codons downstream from weak starts occur in the same frame more frequently than downstream from strong starts. We suggest that evolutionary selection on in-frame AUGs downstream from weak start codons is driven by the advantage of the reduction of wasteful out-of-frame product synthesis and also by the advantage of producing multiple proteoforms from certain mRNAs. We confirmed translation initiation downstream from weak start codons using ribosome profiling data. We also tested translation of alternative start codons in 10 specific human genes using reporter constructs. In all tested cases, initiation at downstream start codons was more productive than at the annotated ones. In most cases, optimization of Kozak context did not completely abolish downstream initiation, and in the specific example of CMPK1 mRNA, the optimized start remained unproductive. Collectively, our work reveals previously uncharacterized forces shaping the evolution of protein-coding genes and points to the plurality of translation initiation and the existence of sequence features influencing start codon selection, other than Kozak context.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.257352.119.

    • Freely available online through the Genome Research Open Access option.

    • Received September 18, 2019.
    • Accepted June 25, 2020.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server