Full length article
Fish complement C4 gene evolution and gene/protein regulatory network analyses and simulated stereo conformation of C4–MASP–2 protein complex

https://doi.org/10.1016/j.fsi.2020.09.030Get rights and content

Highlights

  • Codon usage of C4 gene is much closer to genomic environment in higher vertebrates.

  • C4 genes from each species groups underwent negative selection pressures.

  • Functional complexities of two C4 isotypes tend to distinct in species from different evolutionary positions.

  • Protein structural analysis showed that C4 may have theoretical interaction with MASP–2 protein in grass carp.

Abstract

Complement C4 is a central protein by acting as pivotal molecule in the activation of the complement system. More than a decade ago, C4 gene duplication had been found in several species including fish, revealing the evolutionary origin of C4 gene. However, the evolutionary pattern and systematic function of C4 are still limited. In this study, C4 D and H types in different species groups were completely diverged. The codon usage of C4 H type in higher vertebrates were much closer to their own genome environment, in contrast to lower vertebrates, suggesting that the evolution may provide the dynamic for homogeneous codon usage between specific gene and genome. Multiple C4 sequence alignment showed that the sequences were conserved among different species. However, sequence similarity was obviously different between species C4 D and H type. Negative selection pressure was found on C4 gene evolution and it may be one of the possible reasons for the sequence broad similarity and conservation among interspecies. Proteins from C4 protein–protein interaction (PPI) network were enriched in more hematopoiesis, infections, diseases and immune–related pathways in human than zebrafish. The result suggested that the functional complexities of C4 isotypes are distinct in species from different evolutionary positions. The simulated C4 protein structures between human and grass carp shared structural similarity and the stereo structures of grass carp C4–MASP–2 protein complexes were further simulated according to a study of human. These results suggested that the interaction between C4 and MASP–2 proteins may also exist in grass carp. Our results can provide an insight for the evolutionary process of C4 and better understanding to the potential mechanism of interaction between C4 and MASP–2 in fish species.

Introduction

The complement system is a sophisticated and huge system, consists of more than 30 complement components and exert vital roles in the innate and adaptive immune responses. Three pathways, the classical, alternative and lectin pathways, are known to be activated in complement system when organism fight against exogenous microorganisms [1]. These three pathways are activated and achieved via a series of cascade and amplification reactions: from the proteolytic activation of C5 to the formation of C5β–C9 membrane attack complex (MAC) [2,3]. The formation of MAC is able to induce the pathogenic bacteria destruction by increasing their membrane permeability. Complement–mediated killing also has the possibility to trigger and connect with adaptive immunity, it occurs when complement is activated either directly by microorganisms or by antibody−antigen (Ag−Ig) complexes [4]. In general, the antibody participates and modulates complement–mediated immune response by activation of the opsonization and immune complex clearance.

Complement C4 plays an important role in the activation of the complement system. The activation and generation of C4 is the result of the activated C1s (classical pathway) and MASP−2 (Mannan−binding lectin serine protease 2) (lectin pathway). During the activation of complement, the major fragment C4b acts as an opsonin to mark the surface of pathogens and alter self−tissue for removal. It attaches to surface nucleophiles on the activator through their reaction with the thioester (TE) exposed upon C4 cleavage [5,6]. Therefore, the characterized functional importance of C4 strictly relies on its structure and regulatory mechanism. Taking human as an example, human C4 is synthesized as a prepro single chain of 1744 amino acids. It comprises 3 subunits: the α, β and γ chains. These chains have the order β−α−γ in the precursor molecule [7]. C4 can be cleaved by one activated C1s into C4a and C4b, which binds C2a and C2b to form the enzymatic complexes to activate C3 and C5 in classical pathway [8]. The generated C4b fragment is a modulatory subunit of the classical pathway C3 (C4b2a) and C5 convertases (C4bC2aC3b).

It is similar to higher vertebrate, from previous study, two isotypes of C4 were found in fish and some mammals species which are H type (active/catalytic residue is histidine H) and D type (active/catalytic histidine H residue is replaced by an aspartic acid D) [9]. Although, two types of C4 were found in most of mammalian species, it is likely to origin from same ancestor lineage according to the phylogeny. Thioester bond exists inside of the C3 and C4 molecules. Histidine (H) can cleaves thioester bond in order to allow the covalent binding of these molecules to nearby cell surfaces when C3 and C4 are converted into C3b and C4b via proteolytic activation [10]. In fish, for instance, the substrate specificity has not been analyzed for the carp C4 isotypes. However, both isotypes are functional and mediating the hemolysis of carp serum via the classical and lectin pathways [11]. In addition, the published study showed that fish complement can lyse and opsonize foreign organisms for destruction by phagocytes. There are also indications that complement fragments can participate in inflammatory reactions [4]. However, the mechanism of regulating the opsonization and its association with adaptive immunity in fish are still poorly understood. Identification of the interaction between complement protein and receptor is a promising work for understanding the body regulation of interaction between host and pathogens. It can also provide the theoretical reference for fish vaccine fight against virulent pathogens.

Many complement receptors were initially reported to have the binding function with C3 and C4b, they are C4b–binding protein (C4BP), complement receptors type 1 (CR1) and type 2 (CR2), and decay−accelerating factor (DAF) [12]. In plasma, human C4BP is an oligomeric glycoprotein which has three major isoforms: α7/β1, α7/β0, and α6/β1. C4BP, serving as cofactor, binds nascent C4b molecules in factor I−mediated proteolysis [13,14]. However, among these receptors, CR1 is not clearly recognized outside of mammals [15]. Also no direct CR2 homologue has been found in the bony fish. Decay–accelerating factor (DAF) is a glycoprotein, anchoring to the cell membrane by phosphatidylinositol [16,17], binds to activated C3b and C4b, thereby inhibiting amplification of the complement cascade on host cell membranes [[18], [19], [20]]. Mannose–binding lectin associated proteases (MASPs) are a group members of the serine protease superfamily, play a vital role in targeting recognition processes. It has been currently demonstrated to cleave C4 in a similar way to the activation of the classical pathway and trigger the following formation of the C3 convertase via binding of C4b to C2a [[21], [22], [23]]. Although the functional importance of MASP–2 in the classical pathway's activation has been noticed, the mechanism of protein interaction between MASP–2 and C4β was still unexplored.

The grass carp (Ctenopharyngodon idella) is widely distributed in Asia. It is one of the most economical and herbivorous aquaculture freshwater species for food and water weed control [24,25]. In 2015, the genomic information of grass carp had been deciphered [26]. It provides the extremely useful resource for gene structural and functional study. The significantly expanded immune–related gene family in grass carp was observed by analyzing the genome. It was consistent with the species evolutionary adaption. Furthermore, the previous study identified a total of 64 genes from grass carp complement system via blasting against draft genome and transcriptome [[26], [27], [28]].

To better understand the evolutionary process and putative function of fish complement C4, we built the C4 phylogenetic tree. Two C4 linages in teleost and birds/reptiles clusters were isolated. In the text, species C4 sequences shared high amino acid conservation. We further performed a selective pressure analysis on different species groups, similar negative selection pressures were found in all groups. The C4 putative functional exploration was performed via building regulatory network using C4 as a network hub gene/protein. We further simulated the spatial structures of C4 protein and C4–MASP–2 protein complex. Compare to other complement members, our results indicated a unique evolutionary process of C4 among species and achieved a newly putative spatial conformation of C4–MASP–2 protein complex in grass carp. These results can also provide an insight for illustrating the potential mechanism of the interaction between C4 and MASP–2.

Section snippets

Database search and sequences collection

C4 amino acid sequences were retrieved and obtained from online Ensemble (http://asia.ensembl.org/index.html) and National Center for Biotechnology Information Search database (NCBI; https://www.ncbi.nlm.nih.gov/) using species genomic protein database. Teleost C4 sequences were retrieved and acquired from the genomes of coelacanth (Latimeria chalumnae), spotted gar (Lepisosteus oculatus), medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), rainbow trout (Oncorhynchus mykiss),

Sequence collection

The species C4 amino acid sequences were retrieved from the genomes of mammals, birds/reptiles, amphibians and fishes (Table S1). No C4 hit was found in the genomes of smooth tongue sole and lamprey (Petromyzon marinus). The C4 gene numbers in different species are variable. In Table S1, single C4 gene exists in mouse, duck, turtle, xenopus, coelacanth, spotted gar, stickleback, Japanese flunder and whale shark. Gene duplications were found in most of selected species. Three C4 genes were hit

Discussion

It is different from most of the members in complement gene family, C4 gene duplication has been found in most of species. However, the evolutionary history of C4 was completely diverged and this is similar to other complement members according to the phylogenetic study. In this study, we built the C4 phylogenetic tree and the result showed that the D and H lineages of C4 gene widely existed and completely diverged in most species. Codon usage bias analysis indicated that the codon usage of C4

CRediT authorship contribution statement

Lisen Li: Data curation, Formal analysis, Investigation, Visualization, Validation. Yubang Shen: Conceptualization, Data curation, Formal analysis, Funding acquisition, Project administration, Supervision, Writing - review & editing. Xiaoyan Xu: Supervision. Weining Yang: Investigation. Jiale Li: Conceptualization, Project administration, Supervision, Writing - review & editing, Funding acquisition.

Declaration of competing interest

The authors declare they have no competing interests.

Acknowledgements

This research was supported by the China's Agricultural Research System (CARS-45-03), the Project of Shanghai Engineering and Technology Center for Promoting Ability (16DZ2281200).

References (49)

  • S. Wang et al.

    The evolutionary analysis on complement genes reveals that fishes C3 and C9 experience different evolutionary patterns

    Fish Shellfish Immunol.

    (2013)
  • S. Wang et al.

    Genomic characterization and expression pattern of Bf/C2 and C4 in miiuy croaker and molecular evolution analysis on mammals and fishes

    Fish Shellfish Immunol.

    (2014)
  • R.C. Duncan et al.

    Multiple domains of MASP-2, an initiating complement protease, are required for interaction with its substrate C4

    Mol. Immunol.

    (2012)
  • V. Rossi et al.

    Functional characterization of complement proteases C1s/mannan-binding lectin-associated serine protease-2 (MASP-2) chimeras reveals the higher C4 recognition efficacy of the MASP-2 complement control protein modules

    J. Biol. Chem.

    (2005)
  • K. Whaley et al.

    Complement and complement deficiencies

    Semin. Liver Dis.

    (1997)
  • C.T. Thai et al.

    Complement components C5 and C7: recombinant factor I modules of C7 bind to the C345C domain of C5

    J. Immunol.

    (2004)
  • S. Mortensen et al.

    Structural basis for the function of complement component C4 within the classical and lectin pathways of complement

    J. Immunol.

    (2015)
  • S.K. Law

    The covalent binding reaction of C3 and C4

    Ann. N. Y. Acad. Sci.

    (1983)
  • S.K. Law et al.

    The internal thioester and the covalent binding properties of the complement proteins C3 and C4

    Protein Sci.

    (1997)
  • J. Mutsuro et al.

    Two divergent isotypes of the fourth complement component from a bony fish, the common carp (Cyprinus carpio)

    J. Immunol.

    (2005)
  • M.C. Carroll et al.

    Organization of the genes encoding complement receptors type 1 and 2, decay-accelerating factor, and C4-binding protein in the RCA locus on human chromosome 1

    J. Exp. Med.

    (1988)
  • T. Fujita et al.

    Human C4-binding protein. II. Role in proteolysis of C4b by C3b-inactivator

    J. Exp. Med.

    (1978)
  • M.A. Davitz et al.

    Release of decay-accelerating factor (DAF) from the cell membrane by phosphatidylinositol-specific phospholipase C (PIPLC). Selective modification of a complement regulatory protein

    J. Exp. Med.

    (1986)
  • M.E. Medof et al.

    Decay accelerating factor of complement is anchored to cells by a C-terminal glycolipid

    Biochemistry

    (1986)
  • Cited by (4)

    • An overview of complement systems in teleosts

      2022, Developmental and Comparative Immunology
      Citation Excerpt :

      In Nile tilapia, MASP-1 participates in the regulation of inflammatory and migration reaction (Mu et al., 2019). In grass carp (Ctenopharyngodon idella), MASP-1 is identified and activated by NF-κB signaling and may interact with C4 and MASP-2 (Dang et al., 2017; Li et al., 2020). In mammals, the CFB is an important component in the alternative pathway (Merle et al., 2015a).

    • The SNPs of three immunity genes in striped catfish, Pangasianodon hypophthalmus are associated with resistance against infection of Aeromonas hydrophila

      2022, Aquaculture
      Citation Excerpt :

      This is the first SNP in IL4R found to be associated with immunity in fish. The complement system is also a critical part of the immune defense of teleost fish, and complement component C4 forms part of the classical and lectin pathways (Lange et al., 2019; Li et al., 2020). It has been reported that in humans, the rs2857009 SNP of C4 and the basis of hepatitis C virus were associated with the progression of hepatitis C with increasing age (Chowdhury et al., 2015).

    1

    Lisen Li and Yubang Shen contribute equally to the article.

    View full text