Codon use and aversion is largely phylogenetically conserved across the tree of life

https://doi.org/10.1016/j.ympev.2019.106697Get rights and content
Under a Creative Commons license
open access

Highlights

  • Codon usages are 1109× more likely than random to follow the Open Tree of Life.

  • We analyzed 890,814 codons from 12,337 species across all kingdoms of life.

  • We confirm turtles as sister taxa to archosaurs.

  • We provide a framework to analyze codon usages against any phylogeny.

Abstract

Using parsimony, we analyzed codon usages across 12,337 species and 25,727 orthologous genes to rank specific genes and codons according to their phylogenetic signal. We examined each codon within each ortholog to determine the codon usage for each species. In total, 890,814 codons were parsimony informative. Next, we compared species that used a codon with species that did not use the codon. We assessed each codon's congruence with species relationships provided in the Open Tree of Life (OTL) and determined the statistical probability of observing these results by random chance. We determined that 25,771 codons had no parallelisms or reversals when mapped to the OTL. Codon usages from orthologous genes spanning many species were 1109× more likely to be congruent with species relationships in the OTL than would be expected by random chance. Using the OTL as a reference, we show that codon usage is phylogenetically conserved within orthologous genes in archaea, bacteria, plants, mammals, and other vertebrates. We also show how to use our provided framework to test different tree hypotheses by confirming the placement of turtles as sister taxa to archosaurs.

Keywords

Codon aversion
Tree of life
Species classification
Maximum likelihood
Parsimony
Phylogeny

Data availability statement

All scripts, a README, and necessary test files are freely available on GitHub at https://github.com/ridgelab/codon_congruence.

Cited by (0)