Elsevier

Microbial Pathogenesis

Volume 150, January 2021, 104732
Microbial Pathogenesis

Genome-wide analysis of the synonymous codon usage pattern of Streptococcus suis

https://doi.org/10.1016/j.micpath.2021.104732Get rights and content

Abstract

Streptococcus suis (S. suis) is a gram-positive coccus that causes disease in humans and animals. The codon usage pattern of bacteria reveals a range of evolutionary changes that assist them to enhance tolerance to environments. To better understand the genetic features during the evolution of S. suis, we performed codon usage analysis. Nine pathogenic strains of different serotypes and different geographical distribution were analyzed to better understand the differences in their evolutionary process. Nucleotide compositions and relative synonymous codon usage (RSCU) analysis revealed that A/T-ending codons are dominant in S. suis. Neutrality analysis, correspondence analysis and ENC-plot results revealed that natural selection is the predominant element prompting codon usage. Cluster analysis based on RSCU was roughly consistent with the dendrogram rooted genomic BLAST analysis. Comparison of synonymous codon usage pattern between S. suis and susceptible hosts (H. sapiens and S. scrofa) revealed that the codon usage of S. suis is separated from the synonymous codon usage of susceptible hosts. The CAI values implied that S. suis includes a series of predicted highly expressed coding sequences contained in metabolism and transcriptional regulation, revealing the necessity of this pathogen to deal with various environmental conditions. The study of codon usage in S. suis may provide evidence involving the molecular evolution of bacteria and a better understanding of evolutionary relationships between S. suis and its corresponding hosts.

Introduction

Streptococcus suis (S. suis) is an emerging zoonotic pathogen, not only causing meningitis in pigs and humans with high mortality and morbidity, but also responsible for various diseases such as septicemia with sudden death, endocarditis, and arthritis [[1], [2], [3]]. Based on the capsular antigens of its capsular polysaccharides, S. suis can be classified into 33 serotypes [4]. The number of human infections with S. suis has increased over the past few years worldwide, particularly in Southeast Asia [5]. In addition, it has also been reported that S. suis can colonize the upper respiratory and gastrointestinal tracts of various animals (ruminants, cats, dogs and horses) [6]. When colonized in the respiratory tract of host, S. suis usually induce chronic or persistent infections [7], thus, S. suis seriously impacts the pig industry and human health.

Although the genetic code is redundant, synonymous codons are not used equally, which is called codon usage bias [8]. Codon usage bias (CUB) generally exists in all kinds of organisms such as viruses [[9], [10], [11], [12]], bacteria [[13], [14], [15], [16], [17], [18]], fungi [19], plants [20,21] and animals [22]. The genomes of free-living unicellular organisms such as bacteria have strong codon usage bias with different degrees of bias among genes [23]. The balance between mutation pressure and natural selection lead to translational efficiency of genes [24]. Many factors may affect the codon usage of bacteria, such as mutation pressure [25], gene length [26], natural selection [27], amount and type of available tRNAs [28] and amino acid composition [18]. CUB at the genome level can explain the molecular evolution of a single gene and help to reveal the evolutionary process of an organism [17]. CUB can also help heterologous protein expressed in certain species and molecular genetic engineering to optimize codon [31]. In addition, it aids in revealing the relationship between the host and pathogenic microorganism [32].

Until now it is indistinct whether there are variations in gene expression based upon selection pressure. Hence, we analyzed the codon usage bias of S. suis genes comprehensively, using correlation analysis and multivariate statistical techniques. In order to obtain insight into the genetic evolution of S. suis, we explored the relationship between the codon usage bias of S. suis and host adaptation to better understand the evolution of S. suis.

Section snippets

A/T versus G/C bias in S. suis

To determine the potential influence of compositional constraints on codon usage, the nucleotide compositions of the S. suis coding sequences were calculated. The average contents of the four nucleotides presented similar patterns in the 9 strains analyzed: the mean compositions (%) of nucleotides A and T were the highest, followed by G and C, indicating bias for AT content versus GC content. Furthermore, the nucleotide usage at the third codon positions of synonymous codons showed similar

Discussion

Central to understanding the emergence and evolution of emerging infectious diseases are questions related to how newly introduced pathogens make the switch to the new host, and the properties (caused by genetic mutation or recombination) that allow them to replicate in and spread efficiently to cause epidemics or pandemics in that host [[34], [35], [36], [37]]. Codon usage analysis is meaningful in genomic analysis as it indicates molecular evolutionary changes and it has been used to study

Sequences data

A total of 9 S. suis genomes were used. The coding sequences (CDS) from whole-genome sequences were obtained from National Center of Biotechnology Information (http://www.ncbi.nlm.nih.gov/Genbank). To improve the quality of sequences and minimize sampling errors, only genes with more than 100 codons in length with correct initiation were selected (Table S1).

Nucleotide content and codon usage composition

The nucleotide contents (A%, T%, G%, and C%) of each S. suis CDS sequence were calculated with the software BioEdit (version 7.0.9). Each

Author contributions

J-LC and YC conceived and designed the experiments. QMX, HC, WS, DWZ and YYZ analyzed the data. QMX and YC wrote and revised the manuscript. All authors read and approved the final manuscript.

Declaration of competing interest

The authors declare no conflict of interest.

Acknowledgments

This work was supported by National Key Research and Development Program of China (2017YFD0500203) and Natural Science Foundation of Fujian Province of China (2018J01595).

References (59)

  • F. Wright

    The 'effective number of codons' used in a gene

    Gene

    (1990)
  • Y.N. Wang

    Unique features of nucleotide and codon usage patterns in mycoplasmas revealed by information entropy

    Biosystems

    (2018)
  • A. Subramanian et al.

    Comparison of codon usage bias across Leishmania and Trypanosomatids to understand mRNA secondary structure, relative protein abundance and pathway functions

    Genomics

    (2015)
  • F. Jiang

    Human infection caused by Streptococcus suis serotype 2 in China: report of two cases and epidemic distribution based on sequence type

    BMC Infect. Dis.

    (2020)
  • L. Yi

    Antibiotic resistance related to biofilm formation in Streptococcus suis

    Appl. Microbiol. Biotechnol.

    (2020)
  • M. Gottschalk

    Streptococcus suis: a new emerging or an old neglected zoonotic pathogen?

    Future Microbiol.

    (2010)
  • M. Segura

    Streptococcus suis Research: progress and challenges

    Pathogens

    (2020)
  • M. Gajdacs

    Streptococcus suis: an underestimated emerging pathogen in Hungary?

    Microorganisms

    (2020)
  • U. Waack et al.

    Subinhibitory concentrations of amoxicillin, lincomycin, and oxytetracycline commonly used to treat swine increase Streptococcus suis biofilm formation

    Front. Microbiol.

    (2018)
  • I. Frumkin

    Codon usage of highly expressed genes affects proteome-wide translation efficiency

    Proc. Natl. Acad. Sci. U. S. A.

    (2018)
  • Y. Chen

    Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus

    PloS One

    (2017)
  • Y.P. Ma

    Analysing codon usage bias of cyprinid herpesvirus 3 and adaptation of this virus to the hosts

    (2015)
  • Y.P. Ma

    Multiple evolutionary selections involved in synonymous codon usages in the Streptococcus agalactiae genome

    Int. J. Mol. Sci.

    (2016)
  • R. Hershberg et al.

    Evidence that mutation is universally biased towards at in bacteria

    PLoS Genet.

    (2010)
  • P.M. Sharp

    Codon usage patterns in Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster and Homo sapiens; a review of the considerable within-species diversity

    Nucleic Acids Res.

    (1988)
  • J.G. Bragg

    Protein elemental sparing and codon usage bias are correlated among bacteria

    Mol. Ecol.

    (2012)
  • X.A. Gao

    Analyses of nucleotide, synonymous codon and amino acid usages at gene levels of Brucella melitensis strain QY1

    Infect. Genet. Evol.

    (2018)
  • W. Jiang

    Analysis of synonymous codon usage patterns in the edible fungus Volvariella volvacea

    Biotechnol. Appl. Biochem.

    (2017)
  • Q.P. Ma

    Analysis of synonymous codon usage in FAD7 genes from different plant species

    Genet. Mol. Res.

    (2015)
  • Cited by (6)

    • Codon usage divergence of important functional genes in Mycobacterium tuberculosis

      2022, International Journal of Biological Macromolecules
      Citation Excerpt :

      Usually, the neutrality plot (GC12 versus GC3) can be used to analyze the evolutionary direction for a certain genome [46]. GC-contents of functional genes in Mycobacterium tuberculosis genomes (Fig. 1) showed that the GC3 are obviously more than the AT3 while most GC12 are more than AT12, which is different from Streptococcus suis [47]; the genes with higher GC3 and lower ENC denote the selection pressure exist in them persistently. The neutral plot is usually used to analyze the directional mutation pressure versus natural selection of codon usage in a certain genome.

    • The cold adaption profiles of Pseudoalteromonas shioyasakiensis D1497 from Yap trench to cope with cold

      2021, Biotechnology Reports
      Citation Excerpt :

      Each nucleotide at the first, second and third position of the codons (A%, T%, G% and C%) was analyzed using the Codon W. Additionally, G + C content at the first, second, and third codon positions of each sequence was calculated. The relative synonymous codon usage (RSCU) values for all amino acids in P. shioyasakiensis D1497 were calculated to determine the characteristics of synonymous codon usage without the influence of amino acid composition and coding sequence size following a previously described method [39]. A higher RSCU value indicates that the codon is used more frequently or has a stronger codon usage bias.

    View full text