The evolution of relapse of adult T cell acute lymphoblastic leukemia

Sentís, Inés; Gonzalez, Santiago; Genescà, Eulalia; García-Hernández, Violeta; Muiños, Ferran; Gonzalez, Celia; López-Arribillaga, Erika; Gonzalez, Jessica; Fernandez-Ibarrondo, Lierni; Mularoni, Loris; Espinosa, Lluís; Bellosillo, Beatriz; Ribera, Josep-Maria; Bigas, Anna; Gonzalez-Perez, Abel; Lopez-Bigas, Nuria

doi:10.1186/s13059-020-02192-z

Research
Open access
Published: 23 November 2020

The evolution of relapse of adult T cell acute lymphoblastic leukemia

Inés Sentís¹^na1,
Santiago Gonzalez^1,2^na1,
Eulalia Genescà³,
Violeta García-Hernández⁴,
Ferran Muiños¹,
Celia Gonzalez³,
Erika López-Arribillaga¹,
Jessica Gonzalez⁴,
Lierni Fernandez-Ibarrondo⁵,
Loris Mularoni^1,6,
Lluís Espinosa⁴,
Beatriz Bellosillo⁵,
Josep-Maria Ribera³,
Anna Bigas⁴,
Abel Gonzalez-Perez^1,7^na2 &
…
Nuria Lopez-Bigas ORCID: orcid.org/0000-0003-4925-8988^1,7,8^na2

Genome Biology volume 21, Article number: 284 (2020) Cite this article

6685 Accesses
12 Citations
78 Altmetric
Metrics details

Abstract

Background

Adult T cell acute lymphoblastic leukemia (T-ALL) is a rare disease that affects less than 10 individuals in one million. It has been less studied than its cognate pediatric malignancy, which is more prevalent. A higher percentage of the adult patients relapse, compared to children. It is thus essential to study the mechanisms of relapse of adult T-ALL cases.

Results

We profile whole-genome somatic mutations of 19 primary T-ALLs from adult patients and the corresponding relapse malignancies and analyze their evolution upon treatment in comparison with 238 pediatric and young adult ALL cases. We compare the mutational processes and driver mutations active in primary and relapse adult T-ALLs with those of pediatric patients. A precise estimation of clock-like mutations in leukemic cells shows that the emergence of the relapse clone occurs several months before the diagnosis of the primary T-ALL. Specifically, through the doubling time of the leukemic population, we find that in at least 14 out of the 19 patients, the population of relapse leukemia present at the moment of diagnosis comprises more than one but fewer than 10⁸ blasts. Using simulations, we show that in all patients the relapse appears to be driven by genetic mutations.

Conclusions

The early appearance of a population of leukemic cells with genetic mechanisms of resistance across adult T-ALL cases constitutes a challenge for treatment. Improving early detection of the malignancy is thus key to prevent its relapse.

Background

Acute lymphoblastic leukemia (ALL) affects 3 children in 100,000 in the UK [1]. In the past 5 decades, intense research on this disease has succeeded in reducing the mortality of ALL-affected children by 82% [2]. Recently, with the development of cancer genomics, researchers have unraveled the most frequent somatic genetic alterations underlying its development [3,4,5,6,7,8,9,10,11,12,13,14], and molecular subtypes, as well as their clinical relevance [15,16,17,18,19,20,21,22]. Genetic alterations that elicit some relapse events have also been uncovered, and the potential role of therapy in the development of such relapse cases has been explored [23,24,25,26,27,28,29,30,31].

ALL is less prevalent in adults (0.7 patients in 100,000 people [1]). Not only are there differences in incidence among age groups, but also relapses after treatment appear more frequently in adults (40–75% vs 15–20% among pediatric patients) [31]. Very few studies have been dedicated to understanding the genomic roots of the emergence of adult ALL, and in particular, of T cell ALL (T-ALL) [32,33,34,35,36]. There is a larger gap in the study of the evolution of this malignancy under therapy and its relapse after treatment. Therefore, important questions regarding the genomic evolution of adult T-ALL remain unanswered. It is not entirely clear, for example, whether the same mutational processes are involved in the onset of pediatric and adult T-ALL cases, and if the chemotherapeutic drugs employed in the treatment leave a mutational footprint in relapse cells, as it has been shown for pediatric cases [37]. Furthermore, while some genetic mechanisms of resistance to treatment have been identified in pediatric ALL [26, 27], it is not known whether these also contribute to resistance of the adult malignancy.

To explore the evolution of adult T-ALL under treatment and address these specific questions, we profiled the whole-genome somatic mutations of 19 T-ALLs from adult patients who relapsed after treatment (in-house cohort; Additional file 1: Table S1). Samples were taken at the time of diagnosis (primary) and at recurrence of the malignancy after treatment (relapse). We then analyzed the genomic evolution of these adult T-ALL cases in comparison with 238 pediatric and young adult ALL cases (71 with primary and relapse samples) available in the public domain (Table 1). Known or potential resistance mutations appear in 6 patients of the cohort. Nevertheless, our results show that in the 19 cases the relapse is driven by genetic mutations, and that resistant cells appear in the population of blasts several months before the diagnosis of the primary.

Table 1 Summary of ALL cohorts analyzed

Full size table

Results

The genomics of primary adult T-ALL

Previous studies on the genomic basis of pediatric ALL have identified somatic mutations across cohorts of patients suffering from this disease [5,6,7,8, 10, 12, 13, 28,29,30, 38,39,40,41]. Therefore, we first aimed to compare the landscape of somatic alterations observed across primary adult T-ALL with that across eight other cohorts of T- and B-ALL patients of varying age, ranging from infancy to young adulthood, which we analyzed with a unified mutation calling approach (Table 1; Additional file 1: Tables S1 and S2). Among cancer types, ALL presents a relatively low somatic mutation burden [42, 43]. Nevertheless, the burden of somatic point mutations of adult ALL cases tends to be higher than that of cases of most of the subtypes of the pediatric malignancy, as has been previously observed [44] (Fig. 1a).

Mutations in human somatic cells are contributed to by different molecular mechanisms involving the interaction of endogenous (for instance, spontaneous cytosine deamination or oxidative damage) and external DNA damaging agents (such as UV-light, tobacco carcinogens, or chemotherapies) with the DNA repair machinery [42, 45,46,47]. The study of these mutational processes in tumors reveals the lifetime exposures of patients to potential carcinogenic agents and consequently contributes to shedding light on the etiology of malignancies. Thus, we first asked whether the somatic mutations observed across nine cohorts of pediatric and adult ALL (Table 1) are contributed by similar or different mutational processes. No clear differences are observed between the mutational profiles of B-ALL and T-ALL (Fig. 1b, top). However, the mutational profiles of pediatric and adult malignancies exhibit discernible, albeit slight differences (Fig. 1b, bottom). The same mutational processes appear to be active across pediatric and adult T-ALL and in pediatric B-ALL (Fig. 1c; Additional file 2: Fig. S1). In particular, mutational signature 5 (SBS5), which in blood has been demonstrated to behave in a clock-like manner [48], and has been associated with the process of hematopoietic cell divisions [49, 50], appears as one of the main contributors of mutations in the evolution of both pediatric and adult ALL.

We next asked whether the driver alterations observed across primary adult T-ALL in the in-house cohort are different from those observed across pediatric B/T-ALL (see the “Methods” section; Fig. 1d; Additional file 2: Fig. S2; Additional file 1: Tables S3 and S4). Mutations in some known ALL driver genes, such as NOTCH1 and FBXW7 (the E3-ligase charged with its recognition for ubiquitination [51]), are overrepresented among both pediatric and adult T-ALL in comparison with B-ALLs (χ² p = 1.05 × 10⁻¹⁶ and χ² p = 8.37 × 10⁻⁹, respectively). Similar overrepresentation of mutations in T-ALLs was found in JAK3 (χ² p = 0.004). In contrast, RAS activating mutations do not appear to be differently represented in both ALL types (χ² p = 0.05 and χ² p = 0.634 for KRAS and NRAS).

Genomic alterations driving primary and relapse adult T-ALL

With the goal to study the evolution of adult T-ALL, the 19 patients in the in-house cohort were selected specifically because they relapsed several months after treatment (Fig. 2a; Additional file 2: Fig. S3; Additional file 1: Table S1). Seventeen of them received the same treatment protocol (ALL-HR-11 [NCT01540812]), while the remaining two were administered very similar protocols (LAL-07OLD and ALL-HR-2003 [NCT00853008]). To uncover the genomic similarities and differences between adult and pediatric T-ALL cases at relapse, we next compared the in-house cohort with 31 relapsed cases from the T-ALL Oshima and T-ALL SJ cohorts (Table 1; Additional file 1: Tables S3 and S4). A list of potential driver events across the 19 patients in the cohort is presented in Additional file 1: Tables S5 and S6.

Many NOTCH1 and FBXW7 mutations observed in the primary leukemias were also present in the relapse samples (Fig. 2b; Additional file 2: Fig. S4). Intriguingly, mutations affecting USP7, a known deubiquitinase of NOTCH1, were detected in 3 adult and 3 pediatric patients, raising the possibility of yet another form of alteration of the NOTCH pathway in leukemogenesis [52,53,54]. Overall, NOTCH1-affecting mutations in adults are distributed along the protein-coding sequence in a very similar manner as those observed in pediatric patients (Fig. 2c). Nine patients in the cohort present multiple mutations of NOTCH1 that affect different protein domains (mostly HD and PEST), in agreement with previous reports [55]. Interestingly, in 6 patients, different NOTCH1/FBXW7 mutations were detected in the primary and relapse samples (Fig. 2d). These constitute examples of convergent evolution of mutations affecting the NOTCH1 pathway, also observed in eight pediatric patients in the cohorts analyzed. This suggests that NOTCH1 mutations tend to appear late [56] and recurrently (i.e., in several cells) during T-ALL development.

DNMT3A-affecting mutations, known to drive acute myeloid leukemias (AML), were observed in three adult patients in the in-house cohort and none of the pediatric T-ALLs. In fact, these three patients are classified as Early T-Cell Precursor (ETP), a T-ALL subtype that presents myeloid markers [33]. Similarly, PAT5 and PAT9, patients with mutations of ROBO2—a gene associated with progression of myelodysplastic syndrome [57] to AML and recently reported as mutated in pediatric ALL [58]—present the ETP phenotype. Clonal mutations of PHF6 are overrepresented (χ² p = 0.001) in adult T-ALLs with respect to their pediatric counterparts, shared between primary and relapse samples. PHF6 is a zinc-finger transcription factor that suppresses ribosomal RNA (rRNA) transcription [32]. Loss-of-function mutations of this gene have been shown to decrease sensitivity to glucocorticoids [59], which are part of the standard first-line treatment of adult T-ALL patients. Interestingly, activating mutations of the NT5C2 gene, known to elicit resistance to mercaptopurine anti-ALL treatment in pediatric cases [26, 27], are also observed across 3 adult cases exposed to this drug (Fig. 2a), with PAT16 bearing two mutations of NT5C2 (R238G, R367Q, see Additional file 1: Table S5). In the relapse samples of two patients of the in-house cohort, we observed amplifications of ABCB1, an ATP-dependent membrane transporter known to mediate multidrug resistance in tumors [60, 61] (Additional file 2: Fig. S5). Finally, SMARCA4 mutations and deletions were also detected across adult (2) and pediatric T-ALLs, but almost exclusively in relapse malignancies, suggesting a potential role in resistance to treatment.

In summary, in 6 of the 19 adult patients of the in-house cohort, we were able to identify a candidate treatment-resistance mutation.

The evolution of relapse adult T-ALL measured through mutations

We next asked how much do the mutational processes active in primary T-ALLs also contribute to the overall burden of mutations of relapse adult T-ALLs. The incorporation of new mutational processes, like the exposure to chemotherapies used in their treatment, could leave a mutational footprint that may be detectable in the relapse clone, as recently demonstrated in metastases of different solid tumors, and in relapsed pediatric ALL cases [37, 46].

The deconstruction of mutational signatures (representing mutational processes active during a person’s life) of primary and relapse samples of each patient reveals very similar scenarios for primary-private, shared, and relapse-private mutations (Fig. 3a). Signature 5 (SBS5), which represents a mutational process associated with hematopoietic cell division [46], contributes the vast majority (~ 80%) of mutations in these three groups. We did not detect the mutational footprint of mercaptopurine or any other chemotherapy in the relapse samples (Additional file 2: Fig. S6). This does not preclude that chemotherapy-related mutations exist below the level of detection of the sequencing technology, for example if the evolutionary bottleneck caused by the treatment has not sufficiently reduced the T-ALL population.

Since signature 5 has been described as a clock-like process [48] and this type of mutations are the main contribution to the burden of clonal mutations of both primary and relapse T-ALLs, we used them to infer a molecular time of divergence between the primary and relapse populations (Fig. 3b, Additional file 2: Fig. S7). To this end, we counted the number of primary-private, shared, and relapse-private signature 5 clonal mutations (Fig. 3b). In all cases, the branch that corresponds to relapse-private mutations is longer than that representing primary-private mutations, because the relapse clone has continued accumulating mutations longer after its divergence from the primary (eliminated as a consequence of the treatment). As expected, fewer relapse-private mutations accumulate in the cases with shorter time elapsed between the diagnosis of the primary and the emergence of relapse.

Time of divergence of primary and relapse clones

The number of primary-private, shared, and relapse-private signature 5 clonal mutations can also be used to estimate the precise time of the divergence of the primary and relapse clonal populations. To that end, we first needed to understand the rate of accumulation of signature 5 mutations during T-ALL development. The DNA of normal hematopoietic cells has been shown to incorporate signature 5 mutations at a rate of roughly 12 per year (Fig. 4a; Additional file 2: Fig. S7 [49]). Regressing the number of signature 5 mutations across primary and relapse T-ALLs on the age of patients in the in-house cohort in comparison with healthy hematopoietic stem cells (HSCs) yields slightly higher mutation rates and an unanticipated high (~ 400) number of mutations at the start of life of hematopoietic cells (intercept of trendline in Fig. 4a). This deviation could be explained through an acceleration in the mutation rate that occurs upon malignization of hematopoietic cells [62].

To compute the moment of time before diagnosis when this acceleration started, as well as the value of the accelerated mutation rate, we assumed that the acceleration rate is the same for the primary and relapse malignancies of a patient. We then simulated a one-time increase of the mutation rate (constant rate model) during tumor evolution and alternatively a steady increase (linear rate model) in the mutation rate for successive cell generations (Additional file 2: Fig. S8). For each patient, we assayed several trendlines of accelerated mutation rate (i.e., starting at different timepoints before diagnosis; dotted lines in Fig. 4b) approximating the observed number of signature 5 clonal mutations in the primary and relapse T-ALL clones. We computed the likelihood of each of these trends of acceleration following their accuracy to fit the observed number of mutations in the primary and relapse malignancies (Fig. 4b and Additional file 2: Fig. S8). For each trendline of accelerated mutation rate, the age of the patient at which the divergence of the two clones occurred can be computed from the number of shared mutations. The difference between this age and the age at diagnosis then yields the time elapsed between this divergence and the diagnosis of the primary T-ALL.

Upon application of this approach to each patient in the in-house cohort, we obtained a number of estimates of the number of days elapsed between the divergence of both clones and the diagnosis of the primary T-ALL, each with varying likelihood (green circles, Fig. 4c). The estimates for each patient may be summarized as their weighted (by likelihood) averages (broken lines). The time estimated for each patient was subsequently refined using the distribution of all patients (see the “Methods” section). As a result, we obtained a robust prediction of the boundaries of the most likely time elapsed between the divergence of primary and relapse clones and the diagnosis of the primary malignancy. In the majority of cases shown in the figure (13 out of 15) less than a year passed between the emergence of the relapse clone and the diagnosis of the primary disease (Additional file 1: Table S7).

The evolution of relapse of adult T-ALLs

Both the primary and resistant populations of T blasts across the adult in-house T-ALL cohort are composed of a major clone and one or more subclones detectable through sequencing (see Additional file 3). In all the patients, including four that are refractory to treatment, the major clone in the primary and relapse leukemias differ, implying that in every case, the treatment obliterates the major clone in the primary malignancy.

To understand the effect of the therapy on the clonal architecture of adult T-ALLs, we first estimated the speed of growth of the population of T-ALL cells to determine the minimum size of the relapse population at the time of diagnosis. This growth speed may be characterized through the doubling time of the population (the time needed by a population of cells to duplicate its number). This can be computed from the number of blasts estimated by the pathologist at remission and relapse, and the amount of time elapsed between both events [37] (Additional file 2: Fig. S9a; see the “Methods” section). We computed a doubling time for the T-ALL leukemic population of 10.79 days (confidence intervals, 10.1–11.36), which is slightly longer than that recently estimated for pediatric B-ALL [37] (Additional file 2: Fig. S9b). We were then able to compute, with this doubling time, the minimum time necessary for the relapse population to achieve approximately 7 × 10¹¹ cells that corresponds to a full grown leukemia [37, 63]. This minimum time to expand from a single cell upon its divergence from the primary population informs us of the likelihood that the relapse clone has arisen before the diagnosis of the primary.

In three cases (PAT7, PAT11, PAT12), it is possible that the relapse clone appeared during treatment, given the estimated doubling time. In two more (PAT9 and PAT10), it is not completely clear whether there was enough time between the start of treatment and relapse to allow the emergence of a new clone. In all other cases, the relapse clone was most likely already present at the time of diagnosis and represented by more than one cell (Fig. 5a). Indeed, for fourteen patients in the cohort, the size of the relapse clone at the time of diagnosis of the primary malignancy probably comprises more than 100 out of the 7 × 10¹¹ leukemia cells. (Note that this calculation is independent from the time elapsed between divergence of the primary and relapse clones and the diagnosis computed previously.) PAT2, PAT4, PAT5, and PAT17, with more than 0.01% minimal residual disease during treatment, show estimates of the relapse clone at the time of diagnosis which are, as expected, above 1 in 10,000 blasts. We then asked whether the relapse clone could be detected in the primary sample of ALL cases by a method with a lower limit of detection than Next Generation Sequencing technologies. Thus, we aimed to detect two non-synonymous SMARCA4 mutations (G1162S and T786I) that are private of the relapse sample of two patients in the corresponding primary samples of these patients (PAT8 and PAT14). With a limit of detection of around one in 1000 cells, a digital PCR was unable to detect this mutation in the primary sample of either patient (Fig. 5a and Additional file 2: Fig. S10a,b). The fraction of cells of the relapse clone estimated to be in the primary sample of these two patients is below this limit of detection. These results thus provide further support to the estimation of the doubling time and the size of the relapse clone in the primary samples derived from it.

Although we were able to pinpoint known or putative resistance mutations in several cases, we asked whether other cases of relapse could be explained by a failure of the treatment to kill a subset of the leukemic cells independent of any genetic mechanism [28, 58]. To answer this question, we modeled the emergence of the relapse clone following both a resistant and a non-resistant (not driven by a genetic mutation) scenario (Fig. 5b). First, a population of tumor cells with driver and passenger mutations was simulated. Then, to model the first scenario, a group of cells sharing one passenger subclonal mutation (the resistance mutation) were selected as survivors of the treatment and were expanded again for 20, 40, or 60 generations (40 generations correspond roughly to the observed times elapsed between primary and relapse diagnoses for the cohort; Additional file 2: Fig. S11). To simulate the second scenario, a group of cells with the same size as in the first case (but selected randomly and sharing no particular subclonal mutation) was selected and expanded for the same number of generations. We then compared the change in clonal composition—change of cancer cell fraction (CCF) of mutations in primary and relapse—obtained for both simulated scenarios with the distribution of CCF in the primary samples of mutations fixed in the relapse samples for all patients, represented in Fig. 5c. For example, of all mutations fixed in the relapse ALL of PAT8 (dashed brown line), approximately 59% were present at CCF 0–0.1% in the primary. In other words, in the primary sample, they appeared below the limit of detection of the sequencing and thus correspond to the red star mutations in the toy diagrams in Fig. 5b. On the other hand, 30% of the PAT8 fixed mutations were detected in the primary ALL at CCF between 0.9 and 1, with the remaining mutations at intermediate CCF bins. All patients in the cohort yield similar bimodal distributions.

Only in the results of the simulation of the resistant scenario do we observe a distribution of CCF of the mutations in the primary sample that resembles that of the patients in the in-house cohort (Additional file 2: Fig. S10). By contrast, in the results of the simulations of the non-resistant scenario, no mutations undetectable in the primary leukemia (CCF in the 0–0.1 decile) become fixed in the relapse (Fig. 5d). This holds if the simulations are run between 20 and 60 generations, and even if a much higher (unrealistic) fitness is assigned to driver mutations. These results suggest that the non-resistant scenario of evolution under treatment is not feasible given the time elapsed between primary and relapse.

In summary, in 14 cases in the cohort, the relapse population is most likely already present before the start of the treatment. Moreover, all relapse cases fit the model of genetic resistance—due to one genetic event common to all cells in this relapse population—although we are only able to identify the responsible mutation in a few of them.

Discussion

Advancing our knowledge on how tumors respond to therapies and which of their features determine their relapse after treatment is key to improving clinical oncology practice. Here, we studied the genomic features and the clonal composition of nineteen adult T-ALL cases at diagnosis and after relapse to understand their evolution and identify commonalities that may predict their likelihood to respond to current therapeutic approaches.

Our results suggest that for most adult T-ALL patients, the population of leukemia cells that dominates the relapse is already present at the moment of diagnosis, that is before the start of the treatment, and comprises more than one but fewer than 10⁸ blasts. One evidence that supports this notion comes from the fact that, in most cases, the span of time between the diagnosis and the emergence of relapse is not enough (given the doubling time estimated from the cohort) to explain the repopulation of a full leukemic population starting from a single cell. This contrasts with the results reported recently for a pediatric cohort, in which some relapse cases could be explained by resistance mutations appearing during treatment [37]. This finding is relevant for the clinical practice, since early identification of such potential resistance populations in a patient’s leukemia may support making clinical decisions regarding their treatment.

We were not able to detect the mutational footprint of chemotherapies employed in the treatment of patients of this cohort, such as mercaptopurine, which has already been characterized in pediatric T-ALL cases [37]. This does not preclude that these chemotherapies indeed cause mutations in leukemic cells that progress in the relapse. Since upon treatment chemotherapy mutations will be private to each blast, if the relapse clone does not emerge from a complete clonal expansion after the start of the treatment, the variant allele frequency of these treatment mutations will not rise above the limit of detection of the sequencing. The detection in the relapse T-ALL population [37] of these treatment mutations would require that only one or few blasts survived the treatment, guaranteeing that sufficient numbers of cells in the relapse carried the same mutations to make them detectable through sequencing. The absence of treatment footprints in the relapse is therefore another evidence that the relapse population at the time of treatment already contains a large number of cells.

One intriguing result is the detection of multiple mutations affecting the NOTCH pathway in the same T-ALL case, which do not appear to be exceptions, but rather the rule. It is possible that mutations affecting different domains of NOTCH1 increase the fitness of leukemic cells more than a single mutation and provide an advantage for relapse. Further studies comparing the pattern of NOTCH1 mutations in relapsing and non-relapsing T-ALLs are needed to clarify this.

Conclusions

All results show that, in the T-ALL patients of this cohort, the relapse is driven by genetic mutations that appear in the population of blasts several months before diagnosis, giving rise to a resistant subclone of up to several million cells at the beginning of treatment. Upon treatment thus, this subclone comes to dominate the T-ALL population at relapse.

Methods

In-house cohort selection and samples collection

Samples from adults (≥ 18 years old) with T cell acute lymphoblastic leukemia were collected in the course of 15 years under therapy protocols (LAL-07OLD, ALL-HR-03, LAL-AR-2011) as part of the PETHEMA (Programa Español de Tratamientos en Hematología) trials (with the exception of patient 16). Patients have signed the corresponding consents of the protocols. Cohort clinical data is specified in Additional file 2: Fig. S3 and Additional file 1: Table S1. There are three collected samples per patient: one taken at diagnosis (primary), a second one when the percentage of lymphoblasts is reduced during treatment (remission), and a final sample when the leukemia reappears after some months (relapse).

Whole genome sequencing

The short-insert paired-end libraries for the whole genome sequencing were prepared with KAPA HyperPrep kit (Roche Kapa Biosystems) with some modifications. In short, in function of available material 0.1 to 1.0 microgram of genomic DNA was sheared on a Covaris™ LE220-Plus (Covaris). The fragmented DNA was further size-selected for the fragment size of 220–550 bp with Agencourt AMPure XP beads (Agencourt, Beckman Coulter). The size selected genomic DNA fragments were end-repaired, adenylated, and ligated to Illumina platform compatible adaptors with Unique Dual matched indexes or Unique Dual indexes with unique molecular identifiers (Integrated DNA Technologies). The libraries were quality controlled on an Agilent 2100 Bioanalyzer with the DNA 7500 assay for size and the concentration was estimated using quantitative PCR with the KAPA Library Quantification Kit Illumina® Platforms (Roche Kapa Biosystems). To obtain sufficient amount of libraries for sequencing, it was necessary for the low input libraries (0.1–0.2 μg) to amplify the ligation product with 5 PCR cycles using 2x KAPA-HiFi HS Ready Mix and 10X KAPA primer mix (Roche Kapa Biosystems).

The libraries were sequenced on HiSeq 4000 or NovaSeq 6000 (Illumina) with a paired-end read length of 2 × 151 bp. Image analysis, base calling, and quality scoring of the run were processed using the manufacturer’s software Real Time Analysis (HiSeq 4000 RTA 2.7.7 or NovaSeq 6000 RTA 3.3.3).

Analysis of ALL cohorts in the public domain

We downloaded public whole-genome and whole-exome sequencing data from EGA and dbGap. We included samples from St. Jude Children’s Research Hospital associated with EGAD00001001052 and EGAD00001001432 EGA accession codes. We have used only samples of which we could recover clinical information from the associated publications [5, 8, 10, 38, 39]. We downloaded the DNA sequencing data of Oshima et al., 2016 [30] from dbGap under the accession code phs001072.v1.p1. The information of the cohorts with the clinical information gathered for each sample is summarized in Additional file 1: Table S2.

For some of the samples, we could not find information regarding the sex of the patient. In those cases we inferred it from the normal sample BAM of each patient. For that, we applied the following reasoning: (1) we determined that the patient is a female if the average coverage of chromosome X is greater than the minimum average coverages of the autosomal chromosomes and (2) the mean coverage of chromosome Y is 10 times smaller than the average coverage of the autosomal chromosomes of the sample.

All the samples in Additional file 1: Table S2 have been analyzed with the same pipeline (for detailed information see the following section: “Alignment and variant calling”). However, in order to compare the T-ALL Adult cohort with other T-ALL cohorts with pre- and post-treatment samples, we added the mutations reported in the supplementary materials in Li et al. [37] only in Fig. 2a and b.

Alignment and variant calling

Alignment, SNV, small InDels

We performed the alignment and calling of mutations (SNVs and small InDels) using Sarek pipeline v2.2.1 [64]. This workflow performs the alignment from raw FASTQ applying the steps referred to as “best practices” according to GATK. We converted the downloaded BAMs from public repositories to FASTQ with biobambam v2.0.72 and used them as input for the pipeline. We used the Strelka caller implemented in Sarek to generate mutation calls. Only the T-ALL adult cohort was aligned with GEM-mapper v3.6 by the CNAG but the calls were done with Strelka. The mutation calls were performed using primary and relapse as tumor samples and the remission as “normal” sample. Variants have been annotated with VEP v.92 run locally with the canonical flag and using gnomAD r2.0.1 to get population frequencies of the potential polymorphisms.

CNV

We have used FACETS v0.5.6 [65] to call copy number changes in WGS and WES samples. Following FACETS documentation, we first created its input with snp-pileup which imputed common SNPs and made the reference and alternative read counts at nucleotide resolution. We have run snp-pileup with the recommended parameters except for the --min-read-counts that was set to 10,0. We run FACETS for WES as mentioned in the documentation but setting preProcSample function parameters to cval = 15, ndepth = 5, snp.nbhd = 500 and procSample function parameters to cval = 80, min.nhet = 20. Similarly, we run FACETS for the WGS data as preProcSample (snp.nbhd = 5000, ndepth = 5, cval = 75) and procSample (cval = 800, min.nhet = 25).

SV

We ran Delly v0.7.9 [66] to detect duplications, inversions, and translocations. First we ran the call function and then the filter function of Delly for each one of the alterations mentioned. The map-quality parameter of the call function was set to 20 and we also passed a file provided in the github of Delly with regions to exclude through the --exclude argument. The filter function was run with the following parameters: --filter somatic --minsize 0 (expect for duplications which was set to 100) --qual-tra 0.75 --altaf 0.1.

Filtering steps

SNVs and InDels

From the VCF output from Strelka, we retained the calls labeled as PASS and DP from the FILTER column. We recovered the shared mutations between primary and relapse that are not PASS or DP but are present in the original VCF. This was not possible for patients with only paired samples (primary and remission) in some cohorts. In addition, we checked for miss-called DNVs (dinucleotide variants) by inspecting consecutive SNV positions with Samtools v1.4.1 and changed the reference and alternative if needed. Once the variants were annotated with VEP, we took the variants in the canonical transcript. In case of more than one consequence type predicted for the same variant, we took the most damaging (more impact) one according to VEP. We also filtered out mutations with population frequency greater than 0.01 according to the gnomADg_AF column added. Finally, we discarded low coverage variants as the ones with a total depth of 5 reads. Further details regarding filters applied to called SNVs are provided in Additional file 3.

CNV

We discarded the variants that were called with low reliability. Those are the segments reported with NAs in the cellular fraction and minor allele copy number columns of FACETS output which, to our knowledge, indicate that the region does not have sufficient numbers of heterozygous SNPs to guide good estimates (Additional file 2: Fig. S5).

SV

We converted the VCFs into bedpe format with bcftobedpe function from svtools v0.4.0 and kept the variants with the flag PASS in the FILTER column. We manually check recurrent SV that have not been described before in the literature by performing BLAT of the breakend points (BND) and their flanking regions in the UCSC and discarded those that were Alu regions or mappable to many parts of the genome.

Purity and clonality estimations

We inferred the purity of the samples from the variant allele frequency (VAF) distribution of the mutations as follows. Since the overall ploidy of the samples was mostly around 2 (diploid), we computed density plots of the VAF multiplied by the CNV of each mutation as a rough proxy of the CCF and determined the purity as the maximum point. We recomputed the CCF with the inferred purity and fitted a beta binomial distribution (betabinom function from scipy v1.4.1 python package). For each mutation, we derived a probability from it and categorized them as clonal or subclonal according to a threshold of 0.01 (above or below it respectively). Exceptionally for PAT16, upon inspection of the CCF distributions in primary and relapse samples, we detected a more complex clonal structure and thus used a threshold of 0.05 for a clearer categorization of the clonality of the mutations.

Signatures analysis

Several runs of deconstructSigs v.1.8.0 [67] were carried out depending on the context of the analysis. Firstly, following the guidelines proposed by Maura et al. [50], we have included all hematological meaningful described signatures for the fitting of primary samples (see Additional file 2: Fig. S1). From those, we selected the signatures that we believed had a substantial activity in the primary leukemias in at least one patient of the cohort analyzed and re-run deconstructSigs with them (see Fig. 1c). Secondly, we re-fitted the T-ALL adult samples with only those signatures that presented activity (SBS1, SBS5, SBS18) to better estimate their contribution in Fig. 3a. Lastly, we have fitted known-treatment signatures for the primary and relapse samples to see whether there is any contribution of those in the mutational profile of the relapse. In this case, we have included Signature 32 (SBS32) which the proposed etiology in COSMIC [68] suggests prior treatment with azathioprine. The adult T-ALL patients have not been treated directly with this compound but it is known that azathioprine is metabolized to 6-mercaptopurine which is used in the maintenance phase of received therapy (see Additional file 2: Figs. S3 and S6). Apart from SBS32, we have also included two treatment signatures recently extracted in Li et al. [37] as SBSA_new and SBSB_new. They assigned the usage of thiopurines to SBSB_new signature so that is why we have decided to include it. There is not much said about SBSA_new but since pediatric and adult ALL patients receive similar treatment we decided to give it a try in the fitting analysis. In all cases, we set the signature cutoff parameter of deconstructSigs to 0.1.

Clustering of driver genes of ALL subtypes

The distances computed to build the dendrogram on Fig. 1d were based on Jensen-Shannon divergence measures between the distributions of the number of patients per mutated gene of each cohort. We only took into account genes with mutations in at least three patients.

Dimensionality reduction

We used a Uniform Manifold Approximation and Projection (UMAP) implemented in the python package umap-learn v0.3.10 to simplify the mutational profiles (96 dimensions that represent each trinucleotide channel) into two dimensions with the size of the local neighborhood (n_neighbors) to 20 and minimal distance (min_dist) of 0.2.

Identification of ALL driver variants

Driver gene discovery

We have run the IntOGen pipeline [69] for SNVs and small InDels (https://www.intogen.org/search) locally for each of the defined cohorts (see above). For each one of the outputs, we have proceeded as follows. First, we have discarded all genes in Tier 3 and 4 that are not in the Cancer Gene Census (CGC) [70]. Second, we have discarded all genes in all tiers that have been defined as potential artifacts (see this list of genes in https://bitbucket.org/intogen/intogen-plus/src/master/extra/data/artifacts.json). Third, we have manually inspected the remaining genes and defined a list of potential false positives (FP). From this list of suspicious genes, we have discarded those not present in the CancerMine. With the rest of the FP candidates that were present in the CancerMine, we have decided their level of credibility as driver genes of leukemia according to the publications reported. Apart from that, we have also manually searched in PubMed for any other missed relation by CancerMine of the gene and hematopoietic neoplasms (see Additional file 1: Table S3).

Literature lists of cancer genes of ALL

We have defined 3 lists of known driver genes in ALL:

Genes with SNVs/InDels mutations
Genes affected by CNV
Genes affected by SV that are known to drive ALL

The genes and their sources to build these lists are listed in Additional file 1: Tables S4.a,b,c respectively.

Annotation of alterations

For SNVs and InDels, we have defined as potential driver all the mutations with a predicted protein affecting consequence type (in the canonical transcript) according to VEP (transcript_ablation, splice_acceptor_variant, splice_donor_variant, stop_gained, frameshift_variant, stop_lost, start_lost, transcript_amplification, inframe_insertion, inframe_deletion, missense_variant, protein_altering_variant, splice_region_variant, incomplete_terminal_codon_variant, start_retained_variant, stop_retained_variant) in a cancer gene from the list defined as the combination of the results from the Driver Gene Discovery and the curated literature list of SNVs and InDels. Results from that are summarized in Fig. 1d, Additional file 2: Fig. S2, and Additional file 1: Table S5.

For CNV and SV, we have flagged the alterations we have found as “known driver” (contained in the curated literature lists respectively) or with “alteration in gene of interest” if it affects any cancer gene related to leukemia of all the lists. In the case of CNV affecting genes of interest, we consider as candidate drivers those oncogenes that are fully amplified and tumor suppressors affected by any deletion. Results are reported with the annotated “classic” Giemsa cytobands by mapping where the BND genomic coordinates fall within them (see Additional file 1: Table S6 a and b).

We have also annotated the genes affected grouping them by some meaningful information such as their protein family, biological process, or pathway (see Additional file 2: Figs. S2, S4 and Additional file 1: Table S4). We created those groups with information from the sources in Additional file 1: Table S4.

Estimations of divergence time

Considering the differences between the mutational burden of T-ALL samples compared with the expected number of mutations of healthy hematopoietic cells seems clear that some acceleration on the mutation rate has occurred (Fig. 4a). Additionally, the regression between age and signature 5 of healthy cells and T-ALL show close slope (12.21 ∓ 1.24 vs 20.61 ∓ 6.58, see Fig. 4a and Additional file 2: Fig. S7) but a much higher intercept (22.35 ∓ 45.53 vs 397.4 ∓ 251.81, see Fig. 4a and Additional file 2: Fig. S7). We hypothesize these similarities on slope and differences on intersect can be explained by a late-stage acceleration during tumorigenesis that affects in a similar way the different T-ALL samples.

Based on this hypothesis of tumorigenesis acceleration of signature 5, we have built 2 different models which represent the upper and lower boundary of the estimations: (I) the change of mutation rate is a one-time, discontinuous event, shared between primary and relapse, and (II) the change on the mutation rate grows linearly during all lifetime of the tumor. In both scenarios, the mutation rate can only increase and both primary and relapse clones are under the same mutational process. In terms of divergence time, the constant model is the most conservative showing the earliest times of divergence between clones, while the linear model is the one generating larger divergences times. The rest of the models based on N acceleration steps will generate estimates within the previous described.

We established 120 different timepoints t_n evenly spaced along the 10-year period immediately preceding diagnosis: we refer to them as “acceleration times,” since they are bound to represent the time-points when the mutation rate first deviates from neutral, clock-like behavior. For each acceleration time, we first computed a function assigning a plausible mutation rate for each time point, consistently with either the constant or linear model. To this end, we fitted the mutation curve to go through the average number of mutations of primary and relapse N(t*) at the middle timepoint t* between these two events. More specifically, the following conditions must hold:

$$ \mathrm{Constant}:N\left({t}^{\ast}\right)=N\left({t}_n\right)+\mu \cdotp \left({t}^{\ast }-{t}_n\right) $$

$$ \mathrm{Linear}:N\left({t}^{\ast}\right)=N\left({t}_n\right)\cdotp {\left(1+r\right)}^{t^{\ast }-{t}_n} $$

where the values of μ and r have to be determined, depending on the model used. Now we did 100 stochastic simulations of the mutation curve by randomly sampling 0 or 1 mutations from a beta binomial distribution with a 1-day granularity, only in cases the mutation rate per day exceeds one a smaller granularity has been used. Thus, mean parameter μ(t) may change with time (linear model) while correlation parameter ρ = 0.0002, estimated with the dispersion observed on healthy hematopoietic stem cells described on Osorio et al. [49], remains constant. Therefore, the number of mutations simulated at time t is defined recursively as:

$$ N\left({t}_m\right)\sim N\left({t}_{m-1}\right)+ BetaBinom\left(\mu \left({t}_m\right),\rho, 1\right) $$

where μ(t_m) is either μ (constant model) or log(1 + r) · N(t_m − 1) (linear model). As the 100 stochastic curves generated for each hypothesis (determined by the acceleration time and mutation rate model) cut the time levels at primary and relapse, they cast a distribution of the possible number of mutations about the observed that yields a likelihood that the hypothesis explains well the observed number of mutations at primary and relapse. Thus, each combination of acceleration time and mutation rate model has an associated prior likelihood. We calculated the Bayes posterior distribution using the combinations of parameters with a higher success (likelihood) on the cohort which is then used to select the most plausible models underlying the observation, then provide a plausible set of divergence times weighted by the likelihood. In order to avoid the deviation of the divergence time estimation due to a long tail of low likelihood simulations, only the more likely scenarios have been selected (10% percentile).

Doubling time and lymphoblast population estimates

The doubling time of the T cell lymphoblast population was estimated following a similar approach as in Li et al. [37]. We assumed that blast cell growth is consistent with a logistic model, i.e., the population fraction represented by the T-lymphoblast population as a function of time t fits a logistic function of the form:

$$ \sigma \left(t,a\right)={\left(1+{e}^{- at}\right)}^{-1} $$

where a is the parameter of the logistic model and t is assumed to be given in standard time units such that the T-lymphoblast subpopulation reaches 50% of the total population at time t = 0.

Assuming the parameter a is known, the doubling time is given by the following expression:

$$ {T}_D=\mathit{\log}(2)/a $$

Therefore, the doubling time estimate resorts to fitting a logistic model to our data, i.e., provide an estimate for the parameter a.

Our approach intends to provide an estimate of a that corrects for the likely inconsistencies between time annotations provided in the patients’ data. We make the general assumption that some error Δt_i has been introduced for each patient P_i when associating a standard time to the T-lymphoblast population measurements—mainly due to the difficulty to estimate the initial time for paired data points with a low initial T-lymphoblast population fraction. A standard goodness-of-fit criterion for logistic models is given by the cross-entropy loss:

$$ C\left(y,\hat{y}\right)=-\frac{1}{n}{\sum}_{i=1}^n{y}_i\mathit{\log}{\hat{y}}_i+\left(1-{y}_i\right)\mathit{\log}\left(1-{\hat{y}}_i\right) $$

where y and $ \hat{y} $ are the observed (resp. predicted) data samples.

Our approach intends to simultaneously estimate the errors Δt_i and the parameter a by minimizing the following cross-entropy loss:

$$ L\left(a,\varDelta {t}_1,\dots, \varDelta {t}_n\right)=-\left({\sum}_{i=1}^nC\left({y}_{i,0};{t}_{i,0};\varDelta {t}_i\right)+C\left({y}_{i,1};{t}_{i,1};\varDelta {t}_i\right)\right) $$

where C(y; t; Δt) = ylogσ(t − Δt, a) + (1 − y) log (1 − σ(t − Δt, a))

where for each patient P_i the values y_{i, 0} and y_{i, 1} are the initial (resp. final) population fractions and the values t_{i, 0} and t_{i, 1} are the initial (resp. final) times.

Minimization of the cross-entropy L was implemented in Python with the function “minimize” of the scipy.optimize module. For a more robust minimization, we ran it several times with different randomly generated initial values (see Additional file 2: Fig. S9).

Upon estimation of the doubling time T_D, we proceed to compute the number of cells N_d at the time of diagnosis as a function of the time Δt elapsed between diagnosis and relapse:

$$ {N}_{\mathrm{d}}={N}_{\mathrm{B}}\cdotp f\cdotp {2}^{-\varDelta t/{T}_{\mathrm{D}}} $$

where N_B is an estimate of the total number of bone marrow cells in adults (~ 7.5 × 10¹¹ cells according to [37, 63]) and f is the frequency of lymphoblasts of the biopsy.

Digital PCR analysis of SMARCA4 mutations

The dPCR analysis was performed on a QuantStudio 3D dPCR System using the manufacturer’s procedure and reagents (ThermoFisher Scientific). Data analysis and chip quality were assessed using the QuantStudio 3D Analysis Suite software online.

Simulations of relapse scenarios

In order to understand how likely our observations at primary and relapse can be obtained under a non-therapy selective scenario, we have performed several simulations using a Wright-Fisher model (https://github.com/gerstung-lab/clonex).

Firstly, we have established a set of parameters based on our observations of primary samples using a mutation rate of 10⁻⁸ and a total number of driver and passenger positions of 100 (0.01 fitness effect) and 150,000 respectively on a population of 10⁶ cells. As a result, after 5000 generations, the population has fixed a number of driver mutations ranging from 3 to 8 (mean 5.2) and 122 to 753 (mean 505.8) passengers.

Secondly, from the primary population we randomly removed between 9 × 10⁴ and 10⁶ cells to simulate a bottleneck effect. The resulting population has grown for 20, 40, and 60 generations which covers our estimations about the observed dataset (10% CI 10.83–37.89 generations).

Finally, we have compared the VAF distribution at primary of those variants with a VAF at relapse higher than 90%, considered as fixed mutations, between the observed and simulated non-resistant scenario.

Due to the lack of fixation of low VAF variants in our simulations, two additional scenarios were performed under the previously described strategy: (I) A non-resistant simulation increasing the fitness up to 0.1 (considered as high fitness, [71]) to allow for faster fixation rates and (II) a resistant scenario where the bottleneck consists of the selection of all cells sharing a low population frequency passenger mutation, defined as resistant mutation.

Availability of data and materials

The raw data of the genomic sequencing of the 45 samples (primary-remission-relapse) of the patients of the in-house cohort is deposited in the EGA repository (accession code EGAS00001004750 [72]). To facilitate reproducibility, the code of the analysis is available here: https://github.com/bbglab/evolution_TALL_adults under Apache Software License 2.0 (doi:https://doi.org/10.5281/zenodo.4120326 [73];). Raw sequencing data of public datasets produced by St. Jude Children’s Research Hospital-Washington University Pediatric Cancer Genome Project (see Table 1) was obtained from the EGA repository (accession codes EGAD00001001052 and EGAD00001001432; some BAMS corresponding to published projects somewhere else [5, 6, 8, 10, 14, 38]). Raw sequencing data of patients included in the study by Oshima et al. [30] (Table 1) was obtained from dbGap (phs001072.v1.p1). The somatic mutations identified in the patients included in the study by Li et al. [37] were obtained from the Supplementary Data of the original paper.

References

Acute lymphoblastic leukaemia (ALL) incidence statistics | Cancer Research UK [Internet]. [cited 2020 Mar 16]. Available from: https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/leukaemia-all/incidence?_ga=2.138922035.1884636715.1584377747-1833693179.1584377747#heading-Four.
Acute lymphoblastic leukaemia (ALL) mortality statistics | Cancer Research UK [Internet]. [cited 2020 Mar 16]. Available from: https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/leukaemia-all/mortality#heading-Two.
Bhojwani D, Pei D, Sandlund JT, Jeha S, Ribeiro RC, Rubnitz JE, et al. ETV6-RUNX1-positive childhood acute lymphoblastic leukemia: improved outcome with contemporary therapy. Leukemia. 2012;26:265–70 Nature Publishing Group.
Article CAS PubMed Google Scholar
Mullighan CG, Su X, Zhang J, Radtke I, Phillips LAA, Miller CB, et al. Deletion of IKZF1 and prognosis in acute lymphoblastic leukemia. 2009.
Google Scholar
Zhang J, Ding L, Holmfeldt L, Wu G, Heatley SL, Payne-Turner D, et al. The genetic basis of early T-cell precursor acute lymphoblastic leukaemia. Nature. 2012;481:157–63 Nature Research.
Article CAS PubMed PubMed Central Google Scholar
Roberts KG, Morin RD, Zhang J, Hirst M, Zhao Y, Su X, et al. Genetic alterations activating kinase and cytokine receptor signaling in high-risk acute lymphoblastic leukemia. Cancer Cell. 2012;22:153–66.
Article CAS PubMed PubMed Central Google Scholar
Lilljebjörn H, Rissler M, Lassen C, Heldrup J, Behrendtz M, Mitelman F, et al. Whole-exome sequencing of pediatric acute lymphoblastic leukemia. Leukemia. 2012;26:1602–7.
Article PubMed CAS Google Scholar
Holmfeldt L, Wei L, Diaz-Flores E, Walsh M, Zhang J, Ding L, et al. The genomic landscape of hypodiploid acute lymphoblastic leukemia. Nat Genet. 2013;45:242–52 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Shah S, Schrader KA, Waanders E, Timms AE, Vijai J, Miething C, et al. A recurrent germline PAX5 mutation confers susceptibility to pre-B cell acute lymphoblastic leukemia. Nat Genet. 2013;45:1226–31 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Roberts KG, Li Y, Payne-Turner D, Harvey RC, Yang Y-L, Pei D, et al. Targetable kinase-activating lesions in Ph-like acute lymphoblastic leukemia. N Engl J Med. 2014;371:1005–15.
Article PubMed PubMed Central CAS Google Scholar
Lindqvist CM, Nordlund J, Ekman D, Johansson A, Moghadam BT, Raine A, et al. The mutational landscape in pediatric acute lymphoblastic leukemia deciphered by whole genome sequencing. Hum Mutat. 2015;36:118–28.
Article CAS PubMed Google Scholar
Zhang J, McCastlain K, Yoshihara H, Xu B, Chang Y, Churchman et al. Deregulation of DUX4 and ERG in acute lymphoblastic leukemia. Nat Genet. 2016;48(12):1481–9.
Ma X, Liu Y, Liu Y, Alexandrov LB, Edmonson MN, Gawad C, et al. Pan-cancer genome and transcriptome analyses of 1,699 paediatric leukaemias and solid tumours. Nature. 2018; Available from: http://www.nature.com/doifinder/10.1038/nature25795. Nature Publishing Group.
Gu Z, Churchman ML, Roberts KG, Moore I, Zhou X, Nakitandwe J, et al. PAX5-driven subtypes of B-progenitor acute lymphoblastic leukemia. Nat Genet. 2019;51:296–307.
Article CAS PubMed PubMed Central Google Scholar
Mullighan CG, Downing JR. Global genomic characterization of acute lymphoblastic. Semin Hematol. 2009;46:3–15.
Article CAS PubMed PubMed Central Google Scholar
Inaba H, Greaves M, Mullighan CG. Acute lymphoblastic leukaemia. The Lancet. 2013;381:1943–55 Elsevier Ltd.
Article Google Scholar
Hunger SP, Mullighan CG. Redefining ALL classification : toward detecting high-risk ALL and implementing precision medicine. Blood. 2015;125:3977–88.
Article CAS PubMed PubMed Central Google Scholar
Pui CH, Pei D, Coustan-Smith E, Jeha S, Cheng C, Bowman WP, et al. Clinical utility of sequential minimal residual disease measurements in the context of risk-based therapy in childhood acute lymphoblastic leukaemia: a prospective study. Lancet Oncol. 2015;16:465–74.
Article PubMed PubMed Central Google Scholar
Belver L, Ferrando A. The genetics and mechanisms of T cell acute lymphoblastic leukaemia. Nat Rev Cancer. 2016;16:494–507 Nature Publishing Group.
Article CAS PubMed Google Scholar
Inaba H, Azzato EM, Mullighan CG. Integration of next-generation sequencing to treat acute lymphoblastic leukemia with targetable lesions: the St. Jude Children's Research Hospital approach. Front Pediatr. 2017;5:258.
Iacobucci I, Mullighan CG. Genetic basis of acute lymphoblastic leukemia. J Clin Oncol. 2017;35:975–83.
Article CAS PubMed PubMed Central Google Scholar
Genescà E, Morgades M, Montesinos P, Barba P, Gil C, Guàrdia R, et al. Unique clinico-biological, genetic and prognostic features of adult early T-cell precursor acute lymphoblastic leukemia. Haematologica. 2020;105(6):e294–7.
Mullighan CG, Phillips LA, Su X, Ma J, Miller CB, Shurtleff SA, et al. Genomic analysis of the clonal origins of relapsed acute lymphoblastic leukemia. Science. 2008;322:1377–80.
Article CAS PubMed PubMed Central Google Scholar
Yang J, Bhojwani D, Yang W. Genome-wide copy number profiling reveals molecular evolution from diagnosis to relapse in childhood acute lymphoblastic leukemia. ldots. 2008;112:4178–83.
CAS Google Scholar
Mullighan CG, Zhang J, Kasper LH, Lerach S, Payne-Turner D, Phillips LA, et al. CREBBP mutations in relapsed acute lymphoblastic leukaemia. Nature. 2011;471:235–9 NIH Public Access.
Article CAS PubMed PubMed Central Google Scholar
Meyer JA, Wang J, Hogan LE, Yang JJ, Dandekar S, Patel JP, et al. Relapse-specific mutations in NT5C2 in childhood acute lymphoblastic leukemia. Nat Genet. 2013;45:290–4 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Tzoneva G, Perez-Garcia A, Carpenter Z, Khiabanian H, Tosello V, Allegretta M, et al. Activating mutations in the NT5C2 nucleotidase gene drive chemotherapy resistance in relapsed ALL. Nat Med. 2013;19:368–71 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Kunz JB, Rausch T, Bandapalli OR, Eilers J, Pechanska P, Schuessele S, et al. Pediatric T-cell lymphoblastic leukemia evolves into relapse by clonal selection, acquisition of mutations and promoter hypomethylation. Haematologica. 2015;100:1442–50.
Article CAS PubMed PubMed Central Google Scholar
Ma X, Edmonson M, Yergeau D, Muzny DM, Hampton OA, Rusch M, et al. Rise and fall of subclones from diagnosis to relapse in pediatric B-acute lymphoblastic leukaemia. Nat Commun. 2015;6:1–12 Nature Publishing Group.
Google Scholar
Oshima K, Khiabanian H, da Silva-Almeida AC, Tzoneva G, Abate F, Ambesi-Impiombato A, et al. Mutational landscape, clonal evolution patterns, and role of RAS mutations in relapsed acute lymphoblastic leukemia. Proc Natl Acad Sci U S A. 2016;113:11306–11 National Academy of Sciences.
Article CAS PubMed PubMed Central Google Scholar
Dobson SM, García-Prat L, Vanner RJ, Wintersinger J, Waanders E, Gu Z, et al. Relapse-fated latent diagnosis subclones in acute B lineage leukemia are drug tolerant and possess distinct metabolic programs. Cancer Discov. 2020;10(4):568–87.
Van Vlierberghe P, Palomero T, Khiabanian H, Van der Meulen J, Castillo M, Van Roy N, et al. PHF6 mutations in T-cell acute lymphoblastic leukemia. Nat Genet. 2010;42:338–42 Nature Publishing Group.
Article PubMed PubMed Central CAS Google Scholar
Neumann M, Heesch S, Schlee C, Schwartz S, Gökbuget N, Hoelzer D, et al. Whole-exome sequencing in adult ETP-ALL reveals a high rate of DNMT3A mutations. Blood. 2013;121:4749–52.
Article CAS PubMed Google Scholar
De Keersmaecker K, Atak ZK, Li N, Vicente C, Patchett S, Girardi T, et al. Exome sequencing identifies mutation in CNOT3 and ribosomal genes RPL5 and RPL10 in T-cell acute lymphoblastic leukemia. Nat Genet. 2013;45:186–90 Nature Publishing Group.
Article PubMed CAS Google Scholar
Neumann M, Vosberg S, Schlee C, Heesch S, Schwartz S, Gökbuget N, et al. Mutational spectrum of adult T-ALL. Oncotarget. 2015;6:2754–66.
Article PubMed Google Scholar
Liu Y, Easton J, Shao Y, Maciaszek J, Wang Z, Wilkinson MR, et al. The genomic landscape of pediatric and young adult T-lineage acute lymphoblastic leukemia. Nat Genet. 2017;49:1211–8.
Article CAS PubMed PubMed Central Google Scholar
Li B, Brady SW, Ma X, Shen S, Zhang Y, Li Y, et al. Therapy-induced mutations drive the genomic landscape of relapsed acute lymphoblastic leukemia. Blood. 2020;135:41–55.
Article PubMed PubMed Central Google Scholar
Zhang J, Walsh MF, Wu G, Edmonson MN, Gruber TA, Easton J, et al. Germline mutations in predisposition genes in pediatric Cancer. N Engl J Med. 2015;373:2336–46.
Article CAS PubMed PubMed Central Google Scholar
Paulsson K, Lilljebjörn H, Biloglav A, Olsson L, Rissler M, Castor A, et al. The genomic landscape of high hyperdiploid childhood acute lymphoblastic leukemia. Nat Genet. 2015;47:672–7 Nature Publishing Group.
Article CAS PubMed Google Scholar
Spinella J-F, Cassart P, Richer C, Saillour V, Ouimet M, Langlois S, et al. Genomic characterization of pediatric T-cell acute lymphoblastic leukemia reveals novel recurrent driver mutations. Oncotarget. 2016;7:65485–503.
Article PubMed PubMed Central Google Scholar
Papaemmanuil E, Rapado I, Li Y, Potter NE, Wedge DC, Tubio J, et al. RAG-mediated recombination is the predominant driver of oncogenic rearrangement in ETV6-RUNX1 acute lymphoblastic leukemia. Nat Genet. 2014;46:116–25 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SAJR, Behjati S, Biankin AV, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–21 Nature Research.
Article CAS PubMed PubMed Central Google Scholar
Gröbner SN, Worst BC, Weischenfeldt J, Buchhalter I, Kleinheinz K, Rudneva VA, et al. The landscape of genomic alterations across childhood cancers. Nature. 2018;555:321–7.
Article PubMed CAS Google Scholar
Liu Y-F, Wang B-Y, Zhang W-N, Huang J-Y, Li B-S, Zhang M, et al. Genomic profiling of adult and pediatric B-cell acute lymphoblastic leukemia. EBioMedicine. 2016;8:173–83.
Article PubMed PubMed Central Google Scholar
Alexandrov LB, Kim J, Haradhvala NJ, Huang MN, Tian Ng AW, Wu Y, et al. The repertoire of mutational signatures in human cancer. Nature. 2020;578:94–101.
Article CAS PubMed PubMed Central Google Scholar
Pich O, Muiños F, Lolkema MP, Steeghs N, Gonzalez-Perez A, Lopez-Bigas N. The mutational footprints of cancer therapies. Nat Genet. 2019;51:1732–40.
Article CAS PubMed PubMed Central Google Scholar
Gonzalez-Perez A, Sabarinathan R, Lopez-Bigas N. Local determinants of the mutational landscape of the human genome. Cell. 2019;177:101–14.
Article CAS PubMed Google Scholar
Alexandrov LB, Jones PH, Wedge DC, Sale JE, Campbell PJ, Nik-Zainal S, et al. Clock-like mutational processes in human somatic cells. Nat Genet. 2015;47:1402–7 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Osorio FG, Rosendahl Huber A, Oka R, Verheul M, Patel SH, Hasaart K, et al. Somatic mutations reveal lineage relationships and age-related mutagenesis in human hematopoiesis. Cell Rep. 2018;25:2308–2316.e4 Elsevier Company.
Article CAS PubMed PubMed Central Google Scholar
Maura F, Degasperi A, Nadeu F, Leongamornlert D, Davies H, Moore L, et al. A practical guide for mutational signature analysis in hematological malignancies. Nat Commun. 2019;10 Available from: https://doi.org/10.1038/s41467-019-11037-8. Springer US.
Mészáros B, Kumar M, Gibson TJ, Uyar B, Dosztányi Z. Degrons in cancer. Sci Signal. 2017;10:eaak9982.
Article PubMed CAS Google Scholar
Richter-Pechańska P, Kunz JB, Hof J, et al. Identification of a genetically defined ultra-high-risk group in relapsed pediatric T-lymphoblastic leukemia. Blood Cancer J. 2017;7(2):e523.
Shan H, Li X, Xiao X, Dai Y, Huang J, Song J, et al. USP7 deubiquitinates and stabilizes NOTCH1 in T-cell acute lymphoblastic leukemia. Signal Transduct Target Ther. 2018;3:29.
Article PubMed PubMed Central CAS Google Scholar
Jin Q, Ca M, Km A, Zhu Y, Bt G-D, Kk W, et al. USP7 cooperates with NOTCH1 to drive the oncogenic transcriptional program in T-cell leukemia. Clin Cancer Res. 2018;25:222–39.
Article PubMed PubMed Central Google Scholar
Saito Y, Koya J, Araki M, Kogure Y, Shingaki S, Tabata M, et al. Landscape and function of multiple mutations within individual oncogenes. Nature. 2020; [cited 2020 May 26]; Available from: http://www.nature.com/articles/s41586-020-2175-2.
Mansour MR, Duke V, Foroni L, Patel B, Allen CG, Ancliff PJ, et al. NOTCH1 mutations are secondary events in some patients with T-cell acute lymphoblastic leukemia. Clin Cancer Res. 2007;13:6964–9.
Article CAS PubMed Google Scholar
Xu F, Wu LY, Chang CK, et al. Whole-exome and targeted sequencing identify ROBO1 and ROBO2 mutations as progression-related drivers in myelodysplastic syndromes. Nat Commun. 2015;6:8806.
Waanders E, Gu Z, Dobson SM, Antić Ž, Crawford JC, Ma X, et al. Mutational landscape and patterns of clonal evolution in relapsed pediatric acute lymphoblastic leukemia. Blood Cancer Discov. 2020;1(1):96-111.
Xiang J, Wang G, Xia T, Chen Z. The depletion of PHF6 decreases the drug sensitivity of T-cell acute lymphoblastic leukemia to prednisolone. Biomed Pharmacother. 2019;109:2210–7 Elsevier.
Article CAS PubMed Google Scholar
Kosztyu P, Bukvova R, Dolezel P, Mlejnek P. Resistance to daunorubicin, imatinib, or nilotinib depends on expression levels of ABCB1 and ABCG2 in human leukemia cells. Chem Biol Interact. 2014;219:203–10 Elsevier Ireland Ltd.
Article CAS PubMed Google Scholar
Ankathil R. ABCB1 genetic variants in leukemias: current insights into treatment outcomes. Pharmacogenomics Pers Med. 2017;10:169–81 Dove Press.
Article CAS Google Scholar
Gerstung M, Jolly C, Leshchiner I, Dentro SC, Gonzalez S, Rosebrock D, et al. The evolutionary history of 2,658 cancers. Nature. 2020;578:122–8 Nature Publishing Group.
Article CAS PubMed PubMed Central Google Scholar
Bianconi E, Piovesan A, Facchin F, Beraudi A, Casadei R, Frabetti F, et al. An estimation of the number of cells in the human body. Ann Hum Biol. 2013;40:463–71.
Article PubMed Google Scholar
Garcia M, Juhos S, Larsson M, Olason PI, Martin M, Eisfeldt J, et al. Sarek: A portable workflow for whole-genome sequencing analysis of germline and somatic variants. F1000Research. 2020;9:63.
Article PubMed PubMed Central Google Scholar
Shen R, Seshan VE. FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing. Nucleic Acids Res. 2016;44(16):e131.
Rausch T, Zichner T, Schlattl A, Stütz AM, Benes V, Korbel JO. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28:333–9.
Article CAS Google Scholar
Rosenthal R, McGranahan N, Herrero J, Taylor BS, Swanton C. deconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome biol. Genome Biol; 2016;17:1–11.
COSMIC. https://cancer.sanger.ac.uk/cosmic/signatures/SBS/. Accessed June 2020.
Martínez-Jiménez F, Muiños F, Sentís I, Deu-Pons J, Reyes-Salazar I, Arnedo-Pac C, et al. A compendium of mutational cancer driver genes. Nat Rev Cancer. 2020;20(10):555–72.
Sondka Z, Bamford S, Cole CG, Ward SA, Dunham I, Forbes SA. The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers. Nat Rev Cancer. 2018;18:696.
Article CAS PubMed PubMed Central Google Scholar
Watson CJ, Papula AL, Poon GYP, Wong WH, Young AL, Druley TE, et al. The evolutionary dynamics and fitness landscape of clonal hematopoiesis. Science. 2020;367:1449–54.
Article CAS PubMed Google Scholar
Sentís I , Gonzalez S , Genescà E, Garcia-Hernández V , Muiños F , Gonzalez C, Lopez-Arribillaga E, Gonzalez J, Fernandez-Ibarrondo L, Mularoni L , Espinosa L , Bellosillo B Ribera JM , Bigas A , Gonzalez-Perez A , Lopez-Bigas N. The evolution of adult T-cell acute lymphoblastic leukemia. European Genome-phenome Archive. https://ega-archive.org/search-results.php?query=EGAS00001004750 EGAS00001004750.
Sentís I , Gonzalez S , Genescà E, Garcia-Hernández V , Muiños F , Gonzalez C, Lopez-Arribillaga E, Gonzalez J, Fernandez-Ibarrondo L, Mularoni L , Espinosa L , Bellosillo B Ribera JM , Bigas A , Gonzalez-Perez A , Lopez-Bigas N. Code of the analysis performed in the T-ALL relapse evolution in adult patients project. 2020. https://github.com/bbglab/evolution_TALL_adults / https://doi.org/10.5281/zenodo.4120326.

Download references

Acknowledgements

We would like to acknowledge the contribution of Jordi Deu-Pons and Iker Reyes to the mutation calling and general technical support of the project. We also want to mention Francisco Martínez-Jimenez for his contribution to the analysis of drivers and Oriol Pich for his help on mutational signature analysis. We are grateful to the St. Jude Children’s Research Hospital-Washington University Pediatric Cancer Genome Project (PCGP) for permitted access to pediatric data. Also, we would like to thank the data from Columbia University Medical Center Institutional published in Oshima et al. [30] used in this study.

Review history

The review history is available as Additional file 4.

Peer review information

Yixin Yao was the primary editor of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.

Funding

The authors would like to thank the Asociación Española Contra el Cáncer (AECC) for financially supporting this project (GC16173697BIGA). N.L.-B. acknowledges funding from the European Research Council (consolidator grant 682398) and the ERDF/Spanish Ministry of Science, Innovation and Universities–Spanish State Research Agency/DamReMap Project (RTI2018-094095-B-I00). S. G work is supported by the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 754510. I. S is supported by FPI fellowship from Spanish Ministry of Economy and Competitiveness (project reference SAF2015-66084-R). V.G-H. is supported by the AECC (project reference GC16173697BIGA-9). IRB Barcelona is a recipient of a Severo Ochoa Centre of Excellence Award from the Spanish Ministry of Economy and Competitiveness (MINECO; Government of Spain) and is supported by CERCA (Generalitat de Catalunya).

Author information

Inés Sentís and Santiago Gonzalez contributed equally to this work.
Abel Gonzalez-Perez and Nuria Lopez-Bigas are co-senior authors.

Authors and Affiliations

Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
Inés Sentís, Santiago Gonzalez, Ferran Muiños, Erika López-Arribillaga, Loris Mularoni, Abel Gonzalez-Perez & Nuria Lopez-Bigas
Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 10, 08028, Barcelona, Spain
Santiago Gonzalez
Hematology Departments, ICO-Hospital Germans Trias i Pujol, Josep Carreras Research Institute, Universitat Autònoma de Barcelona, Badalona, Spain
Eulalia Genescà, Celia Gonzalez & Josep-Maria Ribera
Program in Cancer Research, Institut Hospital del Mar d’Investigacions Mèdiques, CIBERONC, Barcelona, Spain
Violeta García-Hernández, Jessica Gonzalez, Lluís Espinosa & Anna Bigas
Pathology Department, CIBERONC, Hospital del Mar, IMIM, Barcelona, Spain
Lierni Fernandez-Ibarrondo & Beatriz Bellosillo
CMR[B] Center of Regenerative Medicine, Barcelona, Spain
Loris Mularoni
Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
Abel Gonzalez-Perez & Nuria Lopez-Bigas
Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Nuria Lopez-Bigas

Authors

Inés Sentís
View author publications
You can also search for this author in PubMed Google Scholar
Santiago Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Eulalia Genescà
View author publications
You can also search for this author in PubMed Google Scholar
Violeta García-Hernández
View author publications
You can also search for this author in PubMed Google Scholar
Ferran Muiños
View author publications
You can also search for this author in PubMed Google Scholar
Celia Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Erika López-Arribillaga
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Lierni Fernandez-Ibarrondo
View author publications
You can also search for this author in PubMed Google Scholar
Loris Mularoni
View author publications
You can also search for this author in PubMed Google Scholar
Lluís Espinosa
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz Bellosillo
View author publications
You can also search for this author in PubMed Google Scholar
Josep-Maria Ribera
View author publications
You can also search for this author in PubMed Google Scholar
Anna Bigas
View author publications
You can also search for this author in PubMed Google Scholar
Abel Gonzalez-Perez
View author publications
You can also search for this author in PubMed Google Scholar
Nuria Lopez-Bigas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.B, JM.R., and N.L-B coordinated the project. I. S and S.G. carried out the analyses and prepared the figures. I.S. collected public sequencing data and re-analyzed them to call mutations systematically. I.S. also did mutation calling of the 19 ALL patient samples of the project and performed the analysis of driver and resistance mutations. S.G. conceived and carried out the analyses of mutation rate acceleration and the development of resistance models in different scenarios, presented in Figs. 4c and 5c and d. F.M. contributed in the design of the statistical model to compute the doubling time. I. S, S. G, A.G.-P. and N.L.-B. participated in the design of computational analyses and in the interpretation of the results. L. M. contributed to the mutation calling. JM.R. and E.G. collected the samples of the adult ALL patients and provided clinical information. I.S., E.G., E.L-A, Ll.E., A.G-P, A.B., JM.R., and N.L-B. participated in discussions of project design, patient data, and sample selection. V.G-H., L. F-I, and B. B contributed to digital PCR experiments and data analysis. J.G. and C.G. provided technical support to the project. A.G.-P. drafted the manuscript. I.S., S.G., A.G.-P., and N.L.-B. edited the manuscript. A.G.-P. and N.L.-B. supervised the analyses. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Josep-Maria Ribera, Anna Bigas or Nuria Lopez-Bigas.

Ethics declarations

Ethics approval and consent to participate

All patients were included in protocols (LAL-07OLD, ALL-HR-03, LAL-AR-2011) from the PETHEMA group, except PAT16. These protocols were approved by the Institutional Research Board (IRB) of the participating centers and patients provided informed consent before entering into the trials. The study was approved by the Comitè d’Ètica de la Investigació (Research Ethics Committee: PI-16-146) of the Hospital Germans Trias y Pujol (code approval AEC143). The study complies fully with the Helsinki declaration.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Additional tables. This file contains the supplementary tables referenced in the main text. Table S1 contains clinical information on the adult T-ALL cohort. Table S2 contains clinical information of the public pediatric cohorts. Table S3 contains the detected cancer genes by IntOGen. Table S4 contains the lists of ALL cancer genes of interest found in the literature separated in 3 subtables according to the type of alterations: SNVs and InDels (Table S4.a), CNV (Table S4.b), SV (Table S4.c). Table S5 contains the mutations (SNVs and InDels) that we consider as candidate drivers. Table S6 has the candidate driver CNVs (Table S6.a) and SVs (Table S6.b) of the cohorts analyzed. Table S7 has the time of divergence estimates between primary and relapse estimated as days pre-diagnosis of each patient.

Additional file 2.

Additional figures. This file presents all supplementary figures referenced in the main text.

Additional file 3.

Additional methods. Some of the filtering steps have been extended for clarification in this file.

Additional file 4.

Review history.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Sentís, I., Gonzalez, S., Genescà, E. et al. The evolution of relapse of adult T cell acute lymphoblastic leukemia. Genome Biol 21, 284 (2020). https://doi.org/10.1186/s13059-020-02192-z

Download citation

Received: 29 May 2020
Accepted: 03 November 2020
Published: 23 November 2020
DOI: https://doi.org/10.1186/s13059-020-02192-z

The evolution of relapse of adult T cell acute lymphoblastic leukemia

Abstract

Background

Results

Conclusions

Background

Results

The genomics of primary adult T-ALL

Genomic alterations driving primary and relapse adult T-ALL

The evolution of relapse adult T-ALL measured through mutations

Time of divergence of primary and relapse clones

The evolution of relapse of adult T-ALLs

Discussion

Conclusions

Methods

In-house cohort selection and samples collection

Whole genome sequencing

Analysis of ALL cohorts in the public domain

Alignment and variant calling

Alignment, SNV, small InDels

CNV

SV

Filtering steps

SNVs and InDels

CNV

SV

Purity and clonality estimations

Signatures analysis

Clustering of driver genes of ALL subtypes

Dimensionality reduction

Identification of ALL driver variants

Driver gene discovery

Literature lists of cancer genes of ALL

Annotation of alterations

Estimations of divergence time

Doubling time and lymphoblast population estimates

Digital PCR analysis of SMARCA4 mutations

Simulations of relapse scenarios

Availability of data and materials

References

Acknowledgements

Review history

Peer review information

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Competing interests

Additional information

Publisher’s Note

Supplementary information

Additional file 1.

Additional file 2.

Additional file 3.

Additional file 4.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Genome Biology

Contact us