Research article
Exploring glucosinolates diversity in Brassicaceae: a genomic and chemical assessment for deciphering abiotic stress tolerance

https://doi.org/10.1016/j.plaphy.2020.02.032Get rights and content

Highlights

  • Brassicaceae genes involved in GLS biosynthesis were identified using a multi-database approach.

  • UPGMA and PCA separation between genes in GLS core structure and CYP450/MYB gene families.

  • Phylogenetics revealed a recent diversification of aliphatic genes and an earliest for indolic.

  • Distinct GLS chemo-profiles between Brassica crops and Diplotaxis species, wild relatives.

  • GLS-rich species as a new source of taxa with great agronomic potential for abiotic stress tolerance.

Abstract

Brassica is one of the most economically important genus of the Brassicaceae family, encompassing several key crops like Brassica napus (cabbage) and broccoli (Brassica oleraceae var. italica). This family is well known for their high content of characteristic secondary metabolites such as glucosinolates (GLS) compounds, recognize for their beneficial health properties and role in plants defense. In this work, we have looked through gene clusters involved in the biosynthesis of GLS, by combining genomic analysis with biochemical pathways and chemical diversity assessment. A total of 101 Brassicaceae genes involved in GLS biosynthesis were identified, using a multi-database approach. Through a UPGMA and PCA analysis on the 101 GLS genes recorded, revealed a separation between the genes mainly involved in GLS core structure synthesis and genes belonging to the CYP450s and MYBs gene families. After, a detailed phylogenetic analysis was conducted to better understand the disjunction of the aliphatic and indolic genes, by focusing on CYP79F1–F2 and CYP81F1–F4, respectively. Our results point to a recent diversification of the aliphatic CYP79F1 and F2 genes in Brassica crops, while for indolic genes an earliest diversification is observed for CYP81F1–F4 genes. Chemical diversity revealed that Brassica crops have distinct GLS chemo-profiles from other Brassicaceae genera; being highlighted the high contents of GLS found among the Diplotaxis species. Also, we have explored GLS-rich species as a new source of taxa with great agronomic potential, particularly in abiotic stress tolerance, namely Diplotaxis, the closest wild relatives of Brassica crops.

Introduction

The Brassicaceae is one of the world's most economically important plant family (Ishida et al., 2014). It includes important crop species such as Brassica oleracea (e.g., cauliflower, Brussels sprouts, cabbage, broccoli, and Kai Lan), Brassica rapa (e.g., pakchoi, choy sum, and Chinese cabbage), Nasturtium officinale (e.g., watercress), and Raphanus sativus (e.g., daikon radish and red cherry radish). Other species such as Diplotaxis tenuifolia and Eruca vesicaria, commonly referred as ‘rocket salads', have also attracted a considerable interest as culinary vegetables because of their strong flavor and content of putative health-promoting compounds (Verkerk et al., 2010). These species and their crop wild relatives (CWR – taxa closely related to crops) grown primarily in the Euro-Mediterranean region, which contains the highest proportion of agronomically important plants representing an important reservoir of genetic resources for crop improvement (Kell et al., 2008). CWR are likely to contain a great genetic diversity necessary to combat climate change because of the diversity of habitats in which they grow and the wide range of conditions they are adapted to (Ford-Lloyd et al., 2011).

Among the most important chemical compounds produced by Brassicaceae species are the Glucosinolates (GLS), which proved to have health promoting effects and importance in abiotic stress tolerance (Cartea and Velasco, 2007). They are constituted by a common structure comprising a β-D-thioglucose group, a sulfonated oxime moiety and a variable side-chain derived either from methionine, tryptophan, phenylalanine, or from other branched chain amino acids. GLS are found in 16 dicotyledonous plant families where, at least, 130 different structures have been identified so far (Fahey et al., 2001; Collett et al., 2014).

GLS are present at different concentrations throughout the plant organs. They can reach 1% of the dry weight in some tissues of Brassica (Fahey et al., 2001). Within a single species, up to 4 different GLSs dominate the GLS occurrence in the plant (Verkerk et al., 2008). The type, concentration and distribution of the GLS in the plants of Brassicaceae family vary according to a high number of factors, namely species (Bellostas et al., 2004), variety (Choi et al., 2014), plant organ (Brown et al., 2003; Bellostas et al., 2004) or plant age (Fahey et al., 1997; Brown et al., 2003) and developmental cycle. Moreover, environmental conditions such as season (Cartea and Velasco, 2007), biotic (Verkerk et al., 2008) or abiotic stress factors such as salinity or drought, are also known to play a role on the production and content of these compounds (Khan et al., 2011; Martínez-Ballesta et al., 2015).

Recent studies have revealed that GLS and their derivatives have beneficial effects on humans. They can help in suppressing tumor growth of various types of cancers namely: breast, brain, blood, bone, colon, gastric, liver, lung, oral, pancreatic and prostate (Zhang et al., 2003; Soundararajan and Kim, 2018). Significant reduction in plasma LDL-C levels has also been reported as being directly linked to consumption of GLS-rich broccoli (Armah et al., 2015). Some GLS derived products are reported to have antimicrobial effects and well documented health benefits (Cavaiuolo and Ferrante, 2014; Bischoff, 2016). Exclusive or excessive feeding of vegetables and/or seeds from the Brassica plants have been associated with toxic effects in livestock (VanEtten and Tookey, 1983; Tripathi and Mishra, 2007) and strategies have been explored to reduce GLS content in Brassica vegetables to increase their palatability for animal consumption (Verker et al., 2008).

The GLS biosynthetic pathway has been partially elucidated by studies on Arabidopsis (e.g. reviewed in Grubb and Abel, 2006; Halkier and Gershenzon, 2006). The GLS, synthesized from amino acids, are grouped in three subtypes according to their corresponding precursors: i) aliphatic GLS, derived from alanine, leucine, isoleucine, valine, and methionine; ii) indole GLS, derived from tryptophan; and iii) aromatic GLS, derived from phenylalanine and tyrosine (Fahey et al., 2001; Halkier and Gershenzon, 2006). Different authors have reported on aliphatic GLS accounting for 70–97% of the total GLS content in leaves of Brassica oleracea (Cartea and Velasco, 2007), leaves and stems of Brassica napus (Cleemput and Becker, 2011), leaves and seeds of Brassica juncea (Gupta et al., 2012), and sprouts and mature leaves of Brassica rapa (Wiesner et al., 2013). The formation of the GLS core structure involves the action of enzymes from different families, namely the CYP79 (Hansen et al., 2001; Chen et al., 2003), CYP83 (Bak and Feyereisen, 2001), UGT74 (Grubb et al., 2014), C–S-lyases (Mikkelsen et al., 2004) and of sulfotransferases (SOTs or STs) (Piotrowski et al., 2004). These enzymes are involved in the biosynthesis of basic GLS structures from elongated and non-elongated amino acids. The basic GLS structures are subjected to a range of secondary side chain modification and transformation pathways catalyzed by enzymes such as flavin monooxygenase (FMOOXs) (Hansen et al., 2007), GLS-AOPs (Mithen et al., 1995), GLS-OH (Hansen et al., 2008) and CYP81Fs (Pfalz et al., 2009, 2011) to generate different types of GLS structures, that are the last finalizing gene family involved in the indolic biosynthetic pathway (Clarke, 2010; Fahey et al., 2001).

The most important mechanism for the wide production of secondary metabolites as glucosinolates relies on whole-genome events, which occurred in Brassicaceae evolution history (Kliebenstein et al., 2001a,b; Kroymann, 2011). The availability of the whole-genome sequences gives an opportunity for using comparative genomics, which, in turn, can lead to a better understanding of the genome evolution in this family. Whole-genome sequences are available for more than 100 plant species (Tohge et al., 2014). The massive contribution, resulting from next-generation technologies, cannot be currently matched by metabolomics, especially if high-quality and species-optimized approaches are adopted (Fukushima et al., 2014). With the increasing number of whole-genome sequences and the freely available genomic resources, the opportunities for conducting an analysis based on comparative genomics is foreseen.

In this paper, we investigated gene clusters involved on the biosynthesis of GLS, by combining genome analysis with biochemical pathways and compound structure assessment. Considering the high diversity in GLS content in Brassicaceae species, we aim to: i) contribute to the global GLS gene inventory in Brassicaceae; ii) compare gene diversity within the three GLS sub-pathways; iii) assess a potential genetic basis for GLS divergence using 6 CYP genes (CYP79F1–F2 and CYP81F1–F4), known to be key genes of indolic and aliphatic GLS biosynthetic pathways, respectively; and iv) increase the knowledge on the chemical diversity of GLS compounds in major Brassica crops compared to the CWR of the genus Diplotaxis. By combining chemical data with genomic sequences, we expect to provide information of interest for promoting the use of the neglected Diplotaxis genus as a potential viable CWR of economically important Brassica crops.

Section snippets

GLS biosynthetic genes: compilation and gene ontology annotation

QuickGO (https://www.ebi.ac.uk/QuickGO/, Binns et al., 2009), AmiGO (http://amigo.geneontology.org/amigo, Carbon et al., 2008) and MetaCyc (https://metacyc.org/, Caspi et al., 2017) databases were used to filter genes involved in GLS biosynthetic process (GBP) by searching the specific GO term (GO:0019761). Sequences representing the complete set of GLS biosynthetic genes in Arabidopsis thaliana were acquired from The Arabidopsis Information Resource (TAIR, www.arabidopsis.org, accessed on July

Genomic information on GLS genes

The species diversity assessment carried out on GLS genes available at public databases enabled the identification of 101 Arabidopsis genes that were blasted using Blastn (NCBI) restricted to Brassicaceae. From the results obtained, 36 species contain information on orthologous genes belonging to the GLS metabolic pathway. As expected, the most represented species was Arabidopsis thaliana, which accounted for 32% of the total GLS available genes. Other species, in particular the major crop

GLS biosynthetic pathway: gene signature of aliphatic and indolic vias

In this study, we have performed a comprehensive assessment of the GLS biosynthetic pathways in Brassicaceae family. A reconstruction analysis of GLS pathway and a global assessment using genes described for Arabidopsis and for Brassica species were established, using a multi-database approach (i.e. TAIR, NCBI, Brassicadb, MetaCyc). From a total of 101 genes identified, about 78 previously identified genes in Arabidopsis were classified into the three sub-pathways of the GLS biosynthetic

Conclusions

Overall, we have analyzed gene clusters involved in the biosynthesis of GLS, by combining genome analysis with biochemical pathways and chemical diversity assessment. An integrated approach was performed by assessing a global GLS gene inventory in Brassicaceae and its diversity, analysing a potential genetic basis for GLS divergence using 6 CYP genes (CYP79F1–F2 and CYP81F1–F4), known to be key genes of indolic and aliphatic GLS biosynthetic pathways, linked to a chemical diversity evaluation

Authors contributions

Conceptualization, F.M. and M.M.R.; methodology, F.M., A.P.E., A.R.P., M.M.R.; Bioinformatic analysis, A.P.E., F.M and A.R.P.; Results analysis, A.P.E., F.M., A.R.P., M.M., M.M.R.; writing iginal draft preparation, A.P.E., F.M., A.R.P., M.M.R.; writing—review and editing, A.P.E., F.M., A.R.P., M.S.P., M.M.R. and M.M. All authors have approved the submitted version of this manuscript.

Funding

This research was funded by Fundação para a Ciência e Tecnologia (FCT) and Aga Khan Development Network (AKDN) under the project CVAgrobiodiversity/333111699. Fellowships SFRH/BPD/114664/2016, SFRH/BD/135362/2017 to FM and APE, respectively, and research units: UID/AGR/04129/2019 (LEAF) and UID/BIA/00329/2019 (cE3c) were funded by Portuguese National Funds through FCT, Portugal.

Declaration of competing interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors would like to acknowledge the supported provided by Fundação para a Ciência e Tecnologia (FCT) and Aga Khan Development Network (AKDN).

References (90)

  • T. Tohge et al.

    Genome-enabled plant metabolomics

    J. Chromatogr. B

    (2014)
  • Y. Tong et al.

    Topsoil drying combined with increased sulfur supply leads to enhanced aliphatic glucosinolates in Brassica juncea leaves and roots

    Food Chem.

    (2014)
  • M.K. Tripathi et al.

    Glucosinolates in animal nutrition: a review

    Anim. Feed Sci. Technol.

    (2007)
  • P.S. Variyar et al.

    Role of glucosinolates in plant stress tolerance

    Emerg. Technol. Manag. Crop Stress Toler.

    (2014)
  • H. Wang et al.

    Glucosinolate biosynthetic genes in Brassica rapa

    Gene

    (2011)
  • A.J. Windsor et al.

    Geographic and evolutionary diversification of glucosinolates among near relatives of Arabidopsis thaliana (Brassicaceae)

    Phytochemistry

    (2005)
  • U. Wittstock et al.

    Glucosinolate research in the Arabidopsis era

    Trends Plant Sci.

    (2002)
  • G. Yuan et al.

    Effect of salt stress on phenolic compounds, glucosinolates, myrosinase and antioxidant activity in radish sprouts

    Food Chem.

    (2010)
  • T. Arias et al.

    A fully resolved chloroplast phylogeny of the brassica crops and wild relatives (Brassicaceae: Brassiceae): novel clades and potential taxonomic implications

    Taxon

    (2012)
  • C.N. Armah et al.

    Diet rich in high glucoraphanin broccoli reduces plasma LDL cholesterol: evidence from randomised controlled trials

    Mol. Nutr. Food Res.

    (2015)
  • R. Augustine et al.

    Targeted silencing myb28transcription factor gene directs development of low glucosinolate lines in oilseed (Brassica juncea)

    Plant Biotechnol. Jo.

    (2013)
  • S. Bak et al.

    The involvement of two p450 enzymes, CYP83B1 and CYP83A1, in auxin homeostasis and glucosinolate biosynthesis

    Plant Physiol.

    (2001)
  • B. Barco et al.

    Evolution of glucosinolate diversity via whole-genome duplications, gene rearrangements, and substrate promiscuity

    Annu. Rev. Plant Biol.

    (2019)
  • J. Barillari et al.

    Direct antioxidant activity of purified glucoerucin, the dietary secondary metabolite contained in rocket (Eruca sativa Mill.) seeds and sprouts

    Food Chem.

    (2005)
  • N. Bellostas et al.

    Qualitative and quantitative evaluation of glucosinolates in cruciferous plants during their life cycles

    Agroindustria

    (2004)
  • T.Z. Berardini et al.

    The Arabidopsis information resource: making and mining the ‘gold standard’ annotated reference plant genome

    Genesis

    (2015)
  • E. van den Bergh et al.

    Flower power and the mustard bomb: comparative analysis of gene and genome duplications in glucosinolate biosynthetic pathway evolution in Cleomaceae and Brassicaceae

    Am. J. Bot.

    (2016)
  • S. Bhandari et al.

    Comparison of glucosinolate profiles in different tissues of nine Brassica crops

    Molecules

    (2015)
  • D. Binns et al.

    QuickGO: a web-based tool for gene ontology searching

    Bioinformatics

    (2009)
  • A. Bonasia et al.

    Nutritional, biophysical and physiological characteristics of wild rocket genotypes as affected by soilless cultivation system, salinity level of nutrient solution and growing period

    Front. Plant Sci.

    (2017)
  • S. Carbon et al.

    AmiGO: online access to ontology and annotation data

    Bioinformatics

    (2008)
  • M.E. Cartea et al.

    Glucosinolates in Brassica foods: bioavailability in food and significance for human health

    Phytochemistry Rev.

    (2007)
  • M.E. Cartea et al.

    Glucosinolate variation in leaves of Brassica rapa crops

    Plant Foods Hum. Nutr.

    (2012)
  • R. Caspi et al.

    The MetaCyc database of metabolic pathways and enzymes

    Nucleic Acids Res.

    (2017)
  • M. Cavaiuolo et al.

    Nitrates and glucosinolates as strong determinants of the nutritional quality in rocket leafy salads

    Nutrients

    (2014)
  • M. Cavaiuolo et al.

    Gene expression analysis of rocket salad under preharvest and postharvest stresses: a transcriptomic resource for Diplotaxis tenuifolia

    PloS One

    (2017)
  • S. Chen et al.

    CYP79F1 and CYP79F2 have distinct functions in the biosynthesis of aliphatic glucosinolates in Arabidopsis

    Plant J.

    (2003)
  • S. Choi et al.

    Metabolite profiles of glucosinolates in cabbage varieties (Brassica oleracea var. capitata) by season, color, and tissue position

    Hortic. Environ. Biotechnol.

    (2014)
  • D. Clarke

    Glucosinolates, structures and analysis in food

    Anal. Methods

    (2010)
  • S. Cleemput et al.

    Genetic variation in leaf and stem glucosinolates in resynthesized lines of winter rapeseed (Brassica napus L.)

    Genet. Resour. Crop Evol.

    (2011)
  • M. Collett et al.

    Could nitrile derivatives of turnip (Brassica rapa) glucosinolates be hepato- or cholangiotoxic in cattle?

    J. Agric. Food Chem.

    (2014)
  • J.W. Fahey et al.

    Broccoli sprouts: an exceptionally rich source of inducers of enzymes that protect against chemical carcinogens

    Proc. Natl. Acad. Sci. Unit. States Am.

    (1997)
  • B.V. Ford-Lloyd et al.

    Crop wild relatives—undervalued, underutilized and under threat?

    Bioscience

    (2011)
  • A. Fukushima et al.

    Metabolomic characterization of knockout mutants in Arabidopsis: development of a metabolite profiling database for knockout mutants in Arabidopsis

    Plant Physiol.

    (2014)
  • C. Gómez-Campo et al.

    The taxonomic and evolutionary significance of some juvenile characters in the Brassiceae

    Bot. J. Linn. Soc.

    (1974)
  • Cited by (0)

    1

    authors contributed equally to this work.

    View full text