Revealing factors determining immunodominant responses against dominant epitopes

Ritmahan, Wannisa; Kesmir, Can; Vroomans, Renske M.A.

doi:10.1007/s00251-019-01134-9

Revealing factors determining immunodominant responses against dominant epitopes

Review
Open access
Published: 06 December 2019

Volume 72, pages 109–118, (2020)
Cite this article

Download PDF

You have full access to this open access article

Immunogenetics Aims and scope Submit manuscript

Revealing factors determining immunodominant responses against dominant epitopes

Download PDF

Wannisa Ritmahan^1,2,
Can Kesmir² &
Renske M.A. Vroomans^2,3

3879 Accesses
17 Citations
Explore all metrics

Abstract

Upon recognition of peptide-MHC complexes by T cell receptors (TCR), the cognate T cells expand and differentiate into effector T cells to generate protective immunity. Despite the fact that any immune response generates a diverse set of TCR clones against a particular epitope, only a few clones are highly expanded in any immune response. Previous studies observed that the highest frequency clones usually control viral infections better than subdominant clones, but the reasons for this dominance among T cell clones are still unclear. Here, we used publicly available TCR amino acid sequences to study which factors determine whether a response becomes immunodominance (ID) per donor; we classified the largest T cell clone as the epitope-specific dominant clone and all the other clones as subdominant responses (SD). We observed a distinctively hydrophobic CDR3 in ID responses against a dominant epitope from influenza A virus, compared to the SD responses. The common V-J combinations were shared between ID and SD responses, suggesting that the biased V-J recombination events are restricted by epitope specificity; thus, the immunodominance is not directly determined by a bias combination of V and J genetic segments. Our findings reveal a close similarity of global sequence properties between dominant and subdominant clones of epitope-specific responses but detectable distinctive amino acid enrichments in ID. Taken together, we believe this first comparative study of immunodominant and subdominant TCR sequences can guide further studies to resolve factors determining the immunodominance of antiviral as well as tumor-specific T cell responses.

Broad TCR repertoire and diverse structural solutions for recognition of an immunodominant CD8+ T cell epitope

Article 27 February 2017

InYoung Song, Anna Gil, … Lawrence J Stern

Correlative CD4 and CD8 T-cell immunodominance in humans and mice: Implications for preclinical testing

Article Open access 19 September 2023

Tertuliano Alves Pereira Neto, John Sidney, … Alessandro Sette

Cell activation-based screening of natively paired human T cell receptor repertoires

Article Open access 17 May 2023

Ahmed S. Fahad, Cheng Yu Chung, … Brandon J. DeKosky

Introduction

T cells play a crucial role in human adaptive immunity. The recognition of an antigenic peptide bound to the MHC molecule (pMHC) by T cell receptor (TCR) leads to proliferation of the antigen-specific T cell clones (Abbas et al. 2012; Osuna et al. 2014; Pennock et al. 2013). The expanded T cell subsets subsequently differentiate into effector T cells to generate specific immune responses. The effector CD4+ T cells produce several cytokines to activate the recruitment of other immune cells while the effector CD8+ T cells respond to intracellular infections by killing the infected cells. This cascade of events generates long-lived memory T cells that can rapidly initiate the specific immune responses upon re-encountering the same antigen (Abbas et al. 2012).

The TCR is a heterodimer consisting of either a pair of α and β chains or γ and δ chains. The α/β TCR is dominant in human T cell repertoires (Abbas et al. 2012). Each TCR chain is comprised of a variable and a constant extracellular domain. The variable domain is encoded by the germline V, D (only in β and δ chains) and J genetic segments. Within this domain, the antigen recognition site is formed by three complementarity-determining regions (CDR1, CDR2, and CDR3) (Abbas et al. 2012; Clements et al. 2006; Hou et al. 2016; Hughes et al. 2003). Resolved pMHC-TCR protein structures suggest that mostly CDR3 interacts with pMHC, while the CDR1 and CDR2 are involved in stabilizing the overall TCR-pMHC interaction (Borg et al. 2005; Clements et al. 2006; Dash et al. 2017; Ely et al. 2005; Glanville et al. 2017; Rudolph and Wilson 2002). Thus, the specificity of T cell clones is defined mainly by the CDR3 region of both TCR chains (Danska et al. 1990; Hughes et al. 2003; Tsuchiya et al. 2018; Yassai et al. 2009). The CDR3 region is located at the junctional sites of V(D)-J segments, where recombination of the different genetic segments with the addition and/or removal of nucleotides during imprecise joining generates a highly variable CDR3 sequence (Abbas et al. 2012; Cabaniols et al. 2001; Hou et al. 2016; Pannetier et al. 1993).

A collection of diverse TCR clonotypes generates a unique TCR repertoire in each individual and enables effective protection to a wide range of antigens (Qi et al. 2014; Yassai et al. 2009). Due to a large diversity of T cell repertoires within an individual, several T cells with different TCRs are activated when exposed to the same pMHC complex. Moreover, different individuals having the same infection and same MHC molecules can have a very different responding set of T cell clonotypes (Kedl et al. 2003; Osuna et al. 2014). However, there are usually a few clonotypes that are highly expanded and it is still not fully understood which factors play a role in the selection for large expansion. It is hypothesized that the precursor frequency of naive and memory T cells likely contributes to dominance of T cell clones (Kedl et al. 2003; Kotturi et al. 2008; La Gruta et al. 2006). In addition, the TCR affinity to pMHC (Kedl et al. 2003; Osuna et al. 2014) and antigen dose (La Gruta et al. 2006) have been reported to shape the T cell hierarchy.

In this paper, we make a direct comparison of TCR of immunodominant and subdominant T cell responses to study the role of the composition of TCR in generating immunodominancy. To this end, we analyzed the CDR3 sequences specific to known viral epitopes from the VDJ database (VDJdb) (Shugay et al. 2018). We define an immunodominant clone (ID) as the one having the highest frequency among the CDR3 sequences obtained from one individual. The other CDR3 sequences that are less frequently observed within the same individual are defined as subdominant response (SD). We mainly focused on the CDR3β in this study due to data availability and its importance in pMHC recognition. Several common characteristics of CDR3 sequences were analyzed: the CDR3 length, hydropathy, and amino acid arrangement. To our knowledge, this is the first study that makes a direct comparison CDR3 sequences of ID and SD responses.

Materials and methods

Data selection from the VDJdb and data processing

The human CDR3 AA sequences were obtained from the VDJdb (https://vdjdb.cdr3.net) (Shugay et al. 2018) on March 2018 in a tab-delimited format. Default search parameters were used, except for origin Species (set to “Human”) and selecting both TCR genes (“TRA” and “TRB”). Further filtering of the data was done using an in-house pipeline (available at GitHub repository https://github.com/ritma001/InternshipUU_2017, uses software package R version 3.4.4). First, naive T cell sequences (CD45RA+CCR7+) were removed, as well as sequences with missing frequency because their dominance could not be determined. Because very few CD4+ sequences were present, only CD8+ sequences were used for further analysis. Next, MHC allele groups (Lefranc and Lefranc 2010) and the subgroups of V and J genes (Marsh et al. 2010) were extracted to enable comparison of differently reported values across all studies. Subsequently, CD8+ T cell sequences were divided into subsets based on MHC class and the type of TCR chain (alpha or beta). Several features and physicochemical properties of CDR3 peptides were calculated including whole length, non-VJ length (length of CDR3 region that is not encoded by V and J germline genes), hydropathy, polarity, aliphatic index, bulkiness, net charge, a fraction of acidic AA (“D” and “E”), a fraction of basic AA (“H,” “K,” and “R”), a fraction of aromatic AA (“H,” “F,” “W,” and “Y”), Boman factor (Boman 2003), and 10 Kidera factors (Kidera et al. 1985). The CDR3 encoding performed with custom R scripts and predefined functions from the “alakazam” and “Peptides” R packages.

The paired α/β data used in this study also derived from the VDJdb by specifying the search parameter Gene(chain) as “only paired records.” The downloaded data was processed as previously described and the epitope-specific α/β pairs were matched using the “complex.id” column.

Defining ID and SD responses

To extract epitope-specific responses for each individual, we parsed a subject identification and CDR3 clonotype frequency which were provided in the JSON string format. The subject ID was stored in the “meta” column while the frequency can be extracted from the “method” column. Since the subject ID was often missing or insufficient to define an individual response, we combined the original subject.id, replica.id, epitope, reference, and denominator of frequency and used this combination as the subject ID, with “*” as both separator and replacement for missing data. The total clonotype frequency was recalculated if the sum of frequencies in each subject ID was higher than 100%. This was often due to a repetitive measurement of the same CDR3, so this filtering ensured that each clonotype was counted only once in each individual. After correcting for the frequency and sequence redundancy per individual and epitope, the highest CDR3 clone frequency was defined as the ID response while the rest was classified as an SD response. This resulted in a highly imbalanced dataset where for each pMHC combination in an individual, a single ID and several SD were identified. Since it is possible that an ID response of one individual can be an SD response in another individual, it is not trivial to make non-overlapping ID and SD sequence sets. As the number of ID sequences was much smaller than the number of SD sequences (due to our definition of ID and SD), we decided to remove all SD sequences if these sequences were ever found as an ID response in at least one individual in our dataset. In addition, we only selected the CDR3 sequences with a frequency over 0.01% to minimize the bias from very rare clonotypes found in a few publications.

Naive T cell data

The naive T-cell dataset (3,823 sequences) was derived from two healthy donors from an independent study (Sequence Read Archive, project SRP109035, data is analyzed in collaboration with Peter de Greef). The unique CDR3 clones with an absolute naive count > 1 and not observed among non-naive sequence reads were included. The CDR3 nucleotide sequences were available for these datasets and we implemented a Python (version 3.5.2) script to calculate the non-VJ peptide sequences containing unidentified D segment and random nucleotide insertions.

Statistical analysis

The frequency distributions were tested for similarity using the Kolmogorov–Smirnov (KS) test. The P value was not displayed if the P > 0.05 except in the supplementary figures in which “ns” was presented as “non-significant.” The two-sided Mann–Whitney statistics (two-sample Wilcoxon test in R) was used for quantitative comparisons such as length and hydropathy. The same criteria for P value was used as previously mentioned. An alternative method other than the two statistics was indicated if performed.

CDR3 amino acid enrichment analysis

The conserved V and J regions of the CDR3 sequences were removed at the terminal ends. The remaining sequence was defined as “non-VJ” region comprising of encoded D segment and randomly inserted nucleotides. The 20 naturally occurring amino acids were clustered into 3 subgroups based on hydropathic properties including hydrophobic (“A,” “C,” “F,” “I,” “L,” “M,” “W,” and “V”), neutral (“G,” “H,” “P,” “S,” “T,” and “Y”), and hydrophilic (“E,” “D,” “K,” “N,” “Q,” and “R”). These three groups were made following the Kyte–Doolittle hydropathy scale (Kyte and Doolittle 1982).

In order to compare CDR3 sequences varying in length, we mapped the CDR3 sequences to the 2D peptide representative of TCR structure (Lefranc 2014; Lefranc et al. 2003). The CDR3 region is defined as the region delimited by “C” residue at 104 position (C104) and “F”/“W”118 (F/W118) residues. So, the anchored residues (C104 and F/W118) and conserved terminal ends were aligned. According to the IMGT database, 80% of CDR3 are 13-amino acidlong and the added residues are located in the middle of CDR3 with the uniquely defined IMGT positions (Lefranc et al. 2003). To avoid multiple gap insertions in the sequence alignment, we selected CDR3 sequences varying in length from 10 to 15 amino acids which covered > 90% of our dataset. The alignment was performed separately in ID and SD subsets and the entire length of alignment was 17 amino acids due to the maximum CDR3 length (15 amino acids) and the two anchored residues. Next, the positional probability matrix (PPM) was derived for N aligned sequences at position j where j ∈ (1, …, 17):

$$ {PPM}_{k,j}=\frac{1}{N}\sum \limits_{i=1}^N\left({X}_{ij}=k\right) $$

The parameter i ∈ (1,…, N) indicates the set of sequences, while k is a set of 20 amino acids and gap (“-”) and X_ij is the amino acid at position j, sequence i of the alignment. We then calculated a log ratio of amino acid probability of ID (PMMID) to SD PMM (PMMSD), which we called the “positional log enrichment score” (pLES). A positive ratio indicated the enrichment of amino acids at a certain position in ID compared to SD. Occasional ∞ and − ∞ values appeared in the pLES matrix due to missing amino acids in SD and/or ID. These values were replaced with 5 and – 5, respectively, since the maximum |pLES| were observed around 4. The matrix was displayed as a heatmap colored based on pLES. The positive pLES was subsequently converted into a sequence logo for improved visualization.

Results

We obtained human CDR3 sequences from the VDJdb (Shugay et al. 2018), where TCR sequences with known epitope specificity from several studies are collected. We observed 99 (~ 1.7%) identical epitope-specific CDR3β detected both as ID and SD responses. This suggests that an ID response from one individual only in very rare cases (< 2%) is an SD response in another individual. In other words, immunodominance of a clone is more universal than originally thought. The redundant CDR3 sequences were removed to obtain a unique CDR3 dataset for ID and SD responses. After this processing, 5811 CDR3 sequences remained which were described as responses against nine virus species, namely HIV-1, CMV, influenza A virus (IAV), EBV, yellow fever virus (YFV), HCV, and four serotypes of dengue virus (DENV1–4) (Fig. 1a). Several epitopes from the same virus were usually presented by different MHC molecules, see, e.g., HIV-1 epitopes (Fig. 1a). As expected, most of the CDR3 sequences (56%, n = 3276) were restricted to HLA-A*02 molecules (Fig. 1b). The top three most abundant CDR3 sequences were also A2 restricted: GILGFVFTL (GIL, n = 1130) from IAV, NLVPMVATV (NLV, n = 979) from CMV, and GLCTLVAML (GLC, n = 738) from EBV.

Influence of CDR3 length on immunodominance

The CDR3β contains the D segment interspersed between V and J regions and therefore should be longer than the CDR3α, which lacks the D segment. As expected, the CDR3β chains in our dataset were significantly longer than CDR3α chains (Fig. 2a, P < 0.001 KS test). The length distributions of both chains were normally distributed (Shapiro–Wilk test, P < 0.001) as has been observed previously (Moss and Bell 1996; Ma et al. 2016; Niemi et al. 2015). The average lengths of the CDR3α and β chains were 11.6 ± 7.4 and 12.3 ± 5.5 amino acids, respectively. Surprisingly, this result was not always consistent for paired α/β CDR3, as around 30% of unique α/β pairs contained longer α chains (supplementary Fig. 1A).

Next, we divided each CDR3 sequence into 3 regions; V, J, and non-VJ segments, and performed the length comparison between the α and β chains. We used the term non-VJ region to refer to the central part of the CDR3 that is not derived from the germline-encoded V and J genes (see the “Materials and methods” section). This junctional region is composed of additional nucleotides and D segment (only in the β chain); thus, we expected the longer non-VJ region in the CDR3β chain. Interestingly, we observed significantly longer J regions in the CDR3α chains while the V and non-VJ segments were longer in CDR3β chains (supplementary Fig. 1B–D).

It has been suggested that shorter CDR3 sequences are more likely to be generated, as they closely resemble the encoded peptide and need very deletion events (Hou et al. 2016). To our knowledge, the most concrete factor described so far to explain the response is the precursor frequency (De Boer et al. 2003). Taken together, this suggests that shorter CDR3 sequences are more likely to generate a T cell response. To test this, we compared the length of CDR3β chains between ID (n = 506) and SD (n = 5305) sequences. As shown in Fig. 2b, CDR3 length was not different between ID and SD responses. Next, we focused on the non-VJ region of CDR3, as this region highly dominates the interaction (Borg et al. 2005; Clements et al. 2006; Dash et al. 2017; Ely et al. 2005; Glanville et al. 2017; Rudolph and Wilson 2002). Almost 75% of the lengths were 3 to 6 amino acids in both ID and SD responses (Fig. 2c) and the ID responses were enriched in shorter regions; however, the difference between the two distributions is not significant, probably due to the small number of ID sequences available.

The ID- and SD-associated CDR3 sequences in our dataset were derived from antigen-experienced T cells (non-naive T cells), and therefore, it might be expected that they share a similar distribution of CDR3 lengths. To examine the difference between non-naive and naive T cells in their CDR3 length distribution, we compared the sequences in our dataset (ID and SD combined) with CDR3 lengths of the naive CD8+ T cell β chain (see the “Materials and methods” section). The average length of naive CDR3 was the same as non-naive CDR3s (12.3 amino acids); however, the distribution of the CDR3 length was significantly different between the two populations (Fig. 2d): longer CDR3 were somewhat enriched in the non-naive population.

Even after grouping CDR3 sequences based on the MHC molecules to which they are restricted, or the epitopes they recognize, we did not observe a difference between the CDR3 length of ID and SD (supplementary Fig. 2A, B). Interestingly, we did observe a negative relationship between the length of epitope and the corresponding CDR3 sequences. This result might reflect a bias due to the dominance of A2 restricted T cell responses in our dataset (Fig. 1b) and the A2 epitopes were all 9 amino acid long. We removed A2 epitopes and we still found a weak but significant negative correlation (Spearman correlation coefficient, r = − 0.14, P < 0.001) between epitope length and CDR3 length (Fig. 2e). This trend was also observed in a set with only EBV and HIV-1 epitopes (8 to 11 amino acid long), restricted by different HLA types (supplementary Fig. 2C). This result suggests that longer CDR3β chains are not needed to recognize longer epitopes bulging from the MHC molecule, which was previously hypothesized/shown (Ekeruche-Makinde et al. 2013). Next, we compared epitope length with the combined CDR3 lengths of paired α/β CDR3 chains (no A2 epitopes included; supplementary Fig. 2D). The significant correlation we found between the epitope length and CDR3 length disappeared in this case, which might suggest that a shorter CDR3β chain may be compensated with a longer CDR3α chain to preserve the interaction with varying epitope lengths (supplementary Fig. 2E).

Amino acid composition in CDR3 sequences

The N- and C-terminus of CDR3 sequences are highly conserved due to the germline encoded V and J segments, respectively. However, the centrally variable region could hold the potential to diverse immune responses. Therefore, we selectively studied amino acid profiles of the non-VJ region of CDR3β sequences. The ID and SD responses differed slightly in their different amino acid distribution (Fig. 3a). In general, the frequently presented amino acids were small (based on the molecular volume) and neutral: glycine (“G”), serine (“S”), threonine (“T”), and proline (“P”). Additionally, arginine (“R”), glutamine (“Q”), glutamic acid (“D”), alanine (“A”), and leucine (“L”) were observed more than the expected 5%. Approximately 25% of amino acids detected in the non V-J region is “G” irrespective of being an ID or SD response. This observation was consistent across the HLA alleles (supplementary Fig. 3A). The highly enriched “G” residue in CDR3β sequences is possibly due to the guanine-rich nucleotide sequences of D segment which was estimated to be around 70% in both D1 and D2 genes (Freeman et al. 2009; Venturi et al. 2008). Therefore, the D segment might cause a bias in codons containing guanine (“g”) like “ggx” (where x stands for any nucleotide) for “G.” If this is the case, the dominance of the glycine residue should disappear in CDR3α sequences that contain no D segment. To test this hypothesis, we compared the amino acid distributions of all CDR3 α and β sequences in our dataset (Fig. 3b). Some hydrophobic residues, namely, cysteine (“C”), methionine (“M”), and isoleucine (“I”) are highly enriched in the non-VJ region of CDR3α relative to the β chain (Fig. 3b). However, the most predominant residue of the CDR3α was also “G.” The log enrichment ratio between the CDR3β and CDR3α chains of “G” was positive suggesting that the residue in CDR3β was more frequently observed than the CDR3α. However, the difference is rather small making it unlikely that the D segment is solely responsible for the enrichment of “G” in CDR3β sequences (Fig. 3b).

To determine if the non-VJ segment of the antigen experienced T cells are shaped by naive T cells, we also compared the amino acid composition of the naive and antigen experienced CDR3 population. The amino acid profiles of CDR3β and CDR3α sequences were similar in both T cell populations (supplementary Fig. 3B). In general, the five most frequent amino acids (S, R, P, L, and G) observed in both CDR3 chains are the ones that can be encoded by 4 or more codons, suggesting an effect of codon degeneracy on frequency of an amino acid in non-VJ regions. The nucleotide sequences available for the naive T cell data confirmed enriched guanine even without the D segment in CDR3α (Fig. 3c). Therefore, we can conclude that enrichment of the nucleotide guanine (x), which probably results in an enrichment of the amino acid glycine (due to ggx code, see above) is not influenced by D segment, TCR chain, or antigen exposure.

To test position specific enrichment of amino acids in CDR3 sequences, we aligned unique CDR3 sequences of ID and SD separately. Prior to the alignment, CDR3 sequences were mapped to the IMGT positions corresponding to the 2D peptide chains representing the functional TCR structure (Lefranc 2014; Lefranc et al. 2003). As a consequence, the conserved terminal ends were aligned and gaps were allowed in the central variable domain. We then created the positional probabilistic matrices (PPM) to present the observed frequency of each amino acid throughout the CDR3 alignment. From this PPM, we also calculated positional probability ratios of ID to SD responses. This log ratio value was used as a positional log enrichment score (see the “Materials and methods” section). In most of the positions, all amino acids were presented equally in ID and SD. However, in positions 106, 112.1, and 115 to 117, several amino acids were enriched in SD sequences, suggesting that certain amino acids promote the subdominance probably because of the weak interactions with pMHC complexes (supplementary Fig. 4).

To refine the possible motifs in analysis, we repeated the same procedure at the level (Fig. 4a). We selected three with the most CDR3 sequences (GIL, NLV, and GLC) in order to have enough data. All three were presented by the HLA-A*02 molecule. The SD-associated CDR3 sequences were abundant and only a small fraction of around 2–6% of CDR3 sequences were defined as ID for each epitope. We computed the matrices for each and displayed the matrices as heat maps and sequence logos (Fig. 4b–g). The negative (blue), inferring an enrichment in SD, was observed dominantly (Fig. 4b–d), while few highly enriched amino acids were found in ID (red) responses. The sequence logos of the positive amino acids (Fig. 4e–g) displayed very different patterns for the different, suggesting that a simple motif/pattern is very difficult to find even among CDR3 sequences recognizing different on the same HLA molecule. Although one can detect some distinctive patterns between ID and SD per epitope, a general amino acid signature that defines CDR3 being an ID response was not found.

Discussion

The recently developed VDJdb contains a vast amount of epitope-specific CDR3 sequences. These sequences were derived from multiple research groups working on shared CDR3 sequences between individuals, recognition of dominant epitopes, binding motifs in CDR3 region, V(D)-J recombinations, and repertoire diversity (Shugay et al. 2018). To our knowledge, the immunodominancy of T cell responses had not yet been explicitly addressed with these data, despite its importance in shaping immune responses (Osuna et al. 2014). We here present the first attempt to do this.

We first focused on common properties like the length and of CDR3β sequences, since they have been shown to impact immune responses (Hou et al. 2016; Ma et al. 2016; Niemi et al. 2015; Stadinski et al. 2016; Wang et al. 2017). As expected, these two properties are insufficient for explaining why some responses are while others are subdominant. However, we found other interesting correlations. For instance, longer and more hydrophobic CDR3 are enriched among non-naive T cells, and there is a negative correlation between the epitope length and CDR3 length. Furthermore, CDR3 restricted by the HLA-A locus were more hydrophobic than those restricted to the B locus. We believed that these global descriptions of CDR3 sequence properties, despite being unrelated to this study, shed light on future immunology research in several aspects.

Focusing on the CDR3 responses only for HLA-A2 restricted GIL, we found significantly higher CDR3 in ID responses compared to SD responses. We observedthe CDR3 clones responding to GIL containing more hydrophobic residues than that of subdominant clones (data not shown). Previous studies on the GIL-MHC complex discovered a plain interacting site with fewer protruding side chains of the CDR3 sequence (Chen et al. 2017). This structural constraint could limit the selection of CDR3 sequences to those that are more hydrophobic, explaining our finding of higher CDR3 in ID responses against GIL. For all the other responses, we did not find such a clear effect.

We consistently observed high enrichment of glycine in the non-VJ region regardless of antigen exposure and CDR3 chains. Apart from the highly predominant glycine, different amino acid profiles were observed in epitope-specific CDR3, making it impossible to identify a common amino acid motif/pattern indicator of immunodominancy in CDR data. This result currently suggests that immunodominancy is mostly determined by specific TCR-epitope interaction properties, although with more data a common immunodominance motif might still be identified.

Another possible contributing factor shaping immunodominance is the bias in V-J recombinations which has been observed in many studies (Chen et al. 2017; Freeman et al. 2009; Moss and Bell 1996; Lundegaard et al. 2010; Ma et al. 2016; Song et al. 2017). We observed uniquely biased V-J usage for each epitope and found that the combined V and J genes in ID clones were largely shared with the SD responses (data not shown). To predict the generation probabilities of ID and SD responses (based on their VDJ recombinations), we made use of the OLGA server (Sethna et al. 2019). A per-epitope comparison of the generation probabilities for ID and SD responses did not indicate a significant difference, suggesting that the immunodominance of a T cell clone cannot be explained by high generation probability of its TCR sequence.

We mainly performed our analysis on CDR3β and when available on CDR3α sequences. However, the binding site of TCR to pMHC is composed of CDR1, CDR 2, and CDR3 regions from both TCR chains (Abbas et al. 2012; Clements et al. 2006; Hou et al. 2016; Hughes et al. 2003). The entire complex interaction can therefore not be studied completely using the available data on CDR3 sequences, if the other CDR regions play unexpectedly pivotal role in TCR-pMHC engagement (Miyazawa and Jernigan 1996). Thus, our analysis provides only a preliminary view of the interaction between CDR3 and bounded epitopes. When more data become available on CDR2 and CDR3 sequences, our analysis should be repeated.

In conclusion, we found that the global properties of CDR3 sequences between ID and SD are highly similar. However, several features are distinctive regarding epitope specificity and, thus, can enable classification of epitope-specific CDR3 from a diverse T cell response. Most interesting findings of our study, though slightly outside of our initial research question, were differences between antigen experienced and naive T cell clones. The results raised by our analysis ask for larger sets of naive T cell repertoires for validation.

References

Abbas AK, Lichtman AH, Pillai S (2012) Cellular and molecular immunology. Saunders/Elsevier
Boman HG (2003) Antibacterial peptides: basic facts and emerging concepts. J Intern Med 254(3):197–215. https://doi.org/10.1046/j.1365-2796.2003.01228.x
Article CAS PubMed Google Scholar
Borg NA, Ely LK, Beddoe T, Macdonald WA, Reid HH, Clements CS et al (2005) The CDR3 regions of an immunodominant T cell receptor dictate the “energetic landscape” of peptide-MHC recognition. Nat Immunol 6(2):171–180. https://doi.org/10.1038/ni1155
Article CAS PubMed Google Scholar
Cabaniols J-P, Fazilleau N, Casrouge A, Kourilsky P, Kanellopoulos JM (2001) Most α/β T cell receptor diversity is due to terminal deoxynucleotidyl transferase. J Exp Med 194(9):1385. Retrieved from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2195970/–1390
Article CAS PubMed PubMed Central Google Scholar
Canada, V. A. (1986). K-sample Anderson-Darling tests of fit, for continuous and discrete cases. Department of Mathematics and Statistics, (81)
Chen G, Yang X, Ko A, Sun X, Gao M, Zhang Y, Shi A, Mariuzza RA, Weng N (2017) Sequence and structural analyses reveal distinct and highly diverse human CD8 + TCR repertoires to immunodominant viral antigens. Cell Rep 19(3):569–583. https://doi.org/10.1016/j.celrep.2017.03.072
Article CAS PubMed PubMed Central Google Scholar
Clements CS, Dunstone MA, Macdonald WA, McCluskey J, Rossjohn J (2006) Specificity on a knife-edge: the αβ T cell receptor. Curr Opin Struct Biol 16(6):787–795. https://doi.org/10.1016/j.sbi.2006.09.004
Article CAS PubMed Google Scholar
Danska JS, Livingstone AM, Paragas V, Ishihara T, Fathman CG (1990) The presumptive CDR3 regions of both T cell receptor alpha and beta chains determine T cell specificity for myoglobin peptides. J Exp Med 172(1):27–33. https://doi.org/10.1084/jem.172.1.27
Article CAS PubMed Google Scholar
Dash P, Fiore-Gartland AJ, Hertz T, Wang GC, Sharma S, Souquette A, Crawford JC, Clemens EB, Nguyen THO, Kedzierska K, la Gruta NL, Bradley P, Thomas PG (2017) Quantifiable predictive features define epitope-specific T cell receptor repertoires. Nature 547(7661):89–93. https://doi.org/10.1038/nature22383
Article CAS PubMed PubMed Central Google Scholar
De Boer RJ, Homann D, Perelson AS (2003) Different dynamics of CD4+ and CD8+ T cell responses during and after acute lymphocytic choriomeningitis virus infection. Journal of Immunology (Baltimore, Md. : 1950) 171(8):3928–3935. https://doi.org/10.4049/jimmunol.171.8.3928
Article Google Scholar
Ekeruche-Makinde J, Miles JJ, van den Berg HA, Skowera A, Cole DK, Dolton G et al (2013) Peptide length determines the outcome of TCR/peptide-MHCI engagement. Blood 121(7):1112–1123. https://doi.org/10.1182/blood-2012-06-437202
Article CAS PubMed PubMed Central Google Scholar
Ely L, Kjer-Nielsen L, McCluskey J, Rossjohn J (2005) Structural studies on the αβ T-cell receptor. IUBMB Life 57(8):575–582. https://doi.org/10.1080/15216540500215556
Article CAS PubMed Google Scholar
Freeman JD, Warren RL, Webb JR, Nelson BH, Holt RA (2009) Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing. Genome Res 19(10):1817–1824. https://doi.org/10.1101/gr.092924.109
Article CAS PubMed PubMed Central Google Scholar
Glanville J, Huang H, Nau A, Hatton O, Wagar LE, Rubelt F, Ji X, Han A, Krams SM, Pettus C, Haas N, Arlehamn CSL, Sette A, Boyd SD, Scriba TJ, Martinez OM, Davis MM (2017) Identifying specificity groups in the T cell receptor repertoire. Nature 547(7661):94–98. https://doi.org/10.1038/nature22976
Article CAS PubMed PubMed Central Google Scholar
Hou X, Wang M, Lu C, Xie Q, Cui G, Chen J, du Y, Dai Y, Diao H (2016) Analysis of the repertoire features of TCR beta chain CDR3 in human by high-throughput sequencing. Cell Physiol Biochem 39(2):651–667. https://doi.org/10.1159/000445656
Article CAS PubMed Google Scholar
Hughes MM, Yassai M, Sedy JR, Wehrly TD, Huang CY, Kanagawa O, Gorski J, Sleckman BP (2003) T cell receptor CDR3 loop length repertoire is determined primarily by features of the V(D)J recombination reaction. Eur J Immunol 33(6):1568–1575. https://doi.org/10.1002/eji.200323961
Article CAS PubMed Google Scholar
Kedl RM, Kappler JW, Marrack P (2003) Epitope dominance, competition and T cell affinity maturation. Current Opinion in Immunology 15(1):120–127 Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/12495743
Article CAS PubMed Google Scholar
Kidera A, Konishi Y, Oka M, Ooi T, Scheraga HA (1985) Statistical analysis of the physical properties of the 20 naturally occurring amino acids. J Protein Chem 4(1):23–55. https://doi.org/10.1007/BF01025492
Article CAS Google Scholar
Kotturi MF, Scott I, Wolfe T, Peters B, Sidney J, Cheroutre H et al (2008) Naive precursor frequencies and MHC binding rather than the degree of epitope diversity shape CD8⁺ T cell immunodominance. J Immunol 181(3):2124–2133. https://doi.org/10.4049/jimmunol.181.3.2124
Article CAS PubMed Google Scholar
Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157(1):105–132. https://doi.org/10.1016/0022-2836(82)90515-0
Article CAS PubMed Google Scholar
La Gruta NL, Kedzierska K, Pang K, Webby R, Davenport M, Chen W et al (2006) A virus-specific CD8+ T cell immunodominance hierarchy determined by antigen dose and precursor frequencies. Proc Natl Acad Sci 103(4):994–999. https://doi.org/10.1073/pnas.0510429103
Article CAS PubMed PubMed Central Google Scholar
Lefranc M-P, Lefranc G (2001) The T cell receptor FactsBook. Elsevier
Lefranc M-P (2014) Immunoglobulins: 25 years of immunoinformatics and IMGT-ONTOLOGY. Biomolecules 4(4):1102–1139. https://doi.org/10.3390/biom4041102
Article PubMed PubMed Central Google Scholar
Lefranc M-P, Pommié C, Ruiz M, Giudicelli V, Foulquier E, Truong L et al (2003) IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Developmental and Comparative Immunology 27(1):55–77 Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/12477501
Article CAS PubMed Google Scholar
Lundegaard C, Hoof I, Lund O, Nielsen M (2010) State of the art and challenges in sequence based T-cell epitope prediction. Immunome Research 6(Suppl 2):S3. https://doi.org/10.1186/1745-7580-6-S2-S3
Article CAS PubMed PubMed Central Google Scholar
Ma L, Yang L, Bin Shi B, He X, Peng A, Li Y et al (2016) Analyzing the CDR3 repertoire with respect to TCR–beta chain V-D-J and V-J rearrangements in peripheral T cells using HTS. Sci Rep 6(1):29544–29510. https://doi.org/10.1038/srep29544
Article CAS PubMed PubMed Central Google Scholar
Marsh SGE, Albert ED, Bodmer WF, Bontrop RE, Dupont B, Erlich HA et al (2010) Nomenclature for factors of the HLA system, 2010. Tissue Antigens 75(4):291–455. https://doi.org/10.1111/j.1399-0039.2010.01466.x
Article CAS PubMed PubMed Central Google Scholar
Miyazawa S, Jernigan RL (1996) Residue–residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol 256(3):623–644. https://doi.org/10.1006/jmbi.1996.0114
Article CAS PubMed Google Scholar
Moss PA, Bell JI (1996) Comparative sequence analysis of the human T cell receptor TCRA and TCRB CDR3 regions. Hum Immunol 48(1–2):32–38. https://doi.org/10.1016/0198-8859(96)00084-5
Article CAS PubMed Google Scholar
Niemi HJ, Laakso S, Salminen JT, Arstila TP, Tuulasvaara A (2015) A normal T cell receptor beta CDR3 length distribution in patients with APECED. Cell Immunol 295(2):99–104. https://doi.org/10.1016/j.cellimm.2015.03.005
Article CAS PubMed Google Scholar
Osuna CE, Gonzalez AM, Chang HH, Hung AS, Ehlinger E, Anasti K, Alam SM, Letvin NL (2014) TCR affinity associated with functional differences between dominant and subdominant SIV epitope-specific CD8 + T cells in Mamu-A*01 + rhesus monkeys. PLoS Pathog 10(4):e1004069. https://doi.org/10.1371/journal.ppat.1004069
Article CAS PubMed PubMed Central Google Scholar
Pannetier C, Cochet M, Darche S, Casrouge A, Zöller M, Kourilsky P (1993) The sizes of the CDR3 hypervariable regions of the murine T-cell receptor beta chains vary as a function of the recombined germ-line segments. Proc Natl Acad Sci U S A 90(9):4319–4323. https://doi.org/10.1073/pnas.90.9.4319
Article CAS PubMed PubMed Central Google Scholar
Pennock ND, White JT, Cross EW, Cheney EE, Tamburini BA, Kedl RM (2013) T cell responses: naïve to memory and everything in between. Adv Physiol Educ 37(4):273–283. https://doi.org/10.1152/advan.00066.2013
Article PubMed PubMed Central Google Scholar
Qi Q, Liu Y, Cheng Y, Glanville J, Zhang D, Lee J-Y, Olshen RA, Weyand CM, Boyd SD, Goronzy JJ (2014) Diversity and clonal selection in the human T-cell repertoire. Proc Natl Acad Sci 111(36):13139–13144. https://doi.org/10.1073/pnas.1409155111
Article CAS PubMed PubMed Central Google Scholar
Rudolph MG, Wilson IA (2002) The specificity of TCR/pMHC interaction. Curr Opin Immunol 14(1):52–65. https://doi.org/10.1016/S0952-7915(01)00298-9
Article CAS PubMed Google Scholar
Sethna Z, Elhanati Y, Callan CG, Walczak AM, Mora T (2019) OLGA: fast computation of generation probabilities of B- and T-cell receptor amino acid sequences and motifs. Bioinformatics 35(17):2974–2981. https://doi.org/10.1093/bioinformatics/btz035
Article PubMed PubMed Central Google Scholar
Shugay M, Bagaev DV, Zvyagin IV, Vroomans RM, Crawford JC, Dolton G, Komech EA, Sycheva AL, Koneva AE, Egorov ES, Eliseev AV, van Dyk E, Dash P, Attaf M, Rius C, Ladell K, McLaren J, Matthews KK, Clemens EB, Douek DC, Luciani F, van Baarle D, Kedzierska K, Kesmir C, Thomas PG, Price DA, Sewell AK, Chudakov DM (2018) VDJdb: a curated database of T-cell receptor sequences with known antigen specificity. Nucleic Acids Res 46(D1):D419–D427. https://doi.org/10.1093/nar/gkx760
Article CAS PubMed Google Scholar
Song I, Gil A, Mishra R, Ghersi D, Selin LK, Stern LJ (2017) Broad TCR repertoire and diverse structural solutions for recognition of an immunodominant CD8+ T cell epitope. Nat Struct Mol Biol 24(4):395–406. https://doi.org/10.1038/nsmb.3383
Article CAS PubMed PubMed Central Google Scholar
Stadinski BD, Shekhar K, Gómez-Touriño I, Jung J, Sasaki K, Sewell AK, Peakman M, Chakraborty AK, Huseby ES (2016) Hydrophobic CDR3 residues promote the development of self-reactive T cells. Nat Immunol 17(8):946–955. https://doi.org/10.1038/ni.3491
Article CAS PubMed PubMed Central Google Scholar
Tsuchiya Y, Namiuchi Y, Wako H, Tsurui H (2018) A study of CDR3 loop dynamics reveals distinct mechanisms of peptide recognition by T-cell receptors exhibiting different levels of cross-reactivity. Immunology 153(4):466–478. https://doi.org/10.1111/imm.12849
Article CAS PubMed Google Scholar
Venturi V, Price DA, Douek DC, Davenport MP (2008) The molecular basis for public T-cell responses? Nat Rev Immunol. https://doi.org/10.1038/nri2260
Article CAS PubMed Google Scholar
Wang C-Y, Fang Y-X, Chen G-H, Jia H-J, Zeng S, He X-B, Feng Y, Li SJ, Jin QW, Cheng WY, Jing Z-Z (2017) Analysis of the CDR3 length repertoire and the diversity of T cell receptor α and β chains in swine CD4+ and CD8+ T lymphocytes. Mol Med Rep 16(1):75–86. https://doi.org/10.3892/mmr.2017.6601
Article CAS PubMed PubMed Central Google Scholar
Yassai MB, Naumov YN, Naumova EN, Gorski J (2009) A clonotype nomenclature for T cell receptors. Immunogenetics 61(7):493–502. https://doi.org/10.1007/s00251-009-0383-x
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

We thank Peter de Greef for providing the naive T cell and Ewald van Dijk for sharing his program on motif-finding.

Author information

Authors and Affiliations

The Centre for Integrative Bioinformatics VU, Vrije Universiteit, Amsterdam, The Netherlands
Wannisa Ritmahan
Theoretical Biology, Utrecht University, Utrecht, The Netherlands
Wannisa Ritmahan, Can Kesmir & Renske M.A. Vroomans
Institute for Advanced Study, University of Amsterdam, Amsterdam, The Netherlands
Renske M.A. Vroomans

Authors

Wannisa Ritmahan
View author publications
You can also search for this author in PubMed Google Scholar
Can Kesmir
View author publications
You can also search for this author in PubMed Google Scholar
Renske M.A. Vroomans
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wannisa Ritmahan.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on “Nomenclature, databases and bioinformatics in Immunogenetics”

Electronic supplementary material

ESM 1

(DOCX 410 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Ritmahan, W., Kesmir, C. & Vroomans, R.M. Revealing factors determining immunodominant responses against dominant epitopes. Immunogenetics 72, 109–118 (2020). https://doi.org/10.1007/s00251-019-01134-9

Download citation

Received: 17 September 2019
Accepted: 04 October 2019
Published: 06 December 2019
Issue Date: February 2020
DOI: https://doi.org/10.1007/s00251-019-01134-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Revealing factors determining immunodominant responses against dominant epitopes

Abstract

Similar content being viewed by others

Broad TCR repertoire and diverse structural solutions for recognition of an immunodominant CD8+ T cell epitope

Correlative CD4 and CD8 T-cell immunodominance in humans and mice: Implications for preclinical testing

Cell activation-based screening of natively paired human T cell receptor repertoires

Introduction

Materials and methods