Introduction

With a rapidly evolving incidence, breast cancer is currently the first female cancer in sub-Saharan Africa followed by cervical cancer.1 Recent reviews have reported that most breast cancers in Sub-Saharan Africa are triple negative, prognostic stage III tumours, with average age at diagnosis at late 40 s, resulting in a high-mortality rate.2,3 Overall, 5–10% of breast cancers are inherited and could be associated with ovarian cancers. The risk is linked to two high-penetrance susceptibility genes: BRCA1 (17q21) and BRCA2 (13q12). Both are tumour suppressor genes involved in double strand break DNA repair. Women who have inherited mutations in BRCA1 or BRCA2 are at higher risk of developing breast and/or ovarian cancers.4 Risk increased with the number of affected women within the family, early age at diagnosis and the degree of relationship with other affected women.5,6,7 The cumulative risk of breast cancer by age 80 years was estimated to 72% for BRCA1 carriers and 69% for BRCA2 carriers. For ovarian cancer, cumulative risk at age 80 years was estimated to 44% for BRCA1 carriers and 17% for BRCA2 carriers.5,6,7

Many studies in different populations have identified pathogenic variants that have been stored in BRCA Consortia databases (CIMBA http://cimba.ccge.medschl.cam.ac.uk/, UMD http://www.umd.be/BRCA1/, BIC http://research.nhgri.nih.gov/bic/, ENIGMA https://enigmaconsortium.org/, BRIDGE https://bridges-research.eu/). Some variants are at very high frequencies in specific ethnic groups suggesting their founder effect. In Ashkenazi Jewish women, BRCA1 c.66_67delAG (p.Glu23Valfs) and c.5266dupC (p.Gln1756Profs) founder mutations conferred a life time risk to breast/ovarian cancer, ten times higher compared with general population.8,9 In Africa few studies have reported specific founder mutations of BRCA1: c.5335delC (p.Gln1779Asnfs) identified in Egypt10; c.5309G>T (p.Gly1770Val) identified in five unrelated families from Morocco11; c.303T>G (p.Tyr101Ter) reported in Yoruban population12 from Nigeria and c.2641G>T (p.Glu881Ter) in Afrikaner population from South Africa.13,14 In addition to these founder mutations, other mutations with African origin have been described in African Americans in the US, in African diaspora and in Hispanics from Peru, Mexico and the Bahamas: c.815_824dupAGCCATGTGG (p.Thr276Alafs); c.1713_1717delAGAAT (p.Glu572Thrfs), and c.5173_5176delGAAA (p.Arg1726Lysfs).15,16,17,18,19 Among these mutations c.815_824dup10 is of particular interest. It has been reported as originated from West Africa during the slavery period.20 In the CIMBA database this mutation has been identified in 65 people from France, Spain and the US. They are mostly labelled as being of African or Hispanic descent. Haplotypes analyses have shown that the shared BRCA1 region flanking c.815_824dup10 is shorter than those flanking European founder mutations.20,21,22 Therefore c.815_824dup10 African mutation may probably be older than European mutations. In West Africa very few studies have screened for BRCA1 founder mutations.12,13 Here we report the highest occurrence of c.815_824dup10 of BRCA1 gene in Senegalese patients with inherited breast cancer and confirm its West African origin.

Results

Age at diagnosis and tumor characteristics

Mean age at breast cancer diagnosis was 39.5 years (range from 21 to 67 years). Overall, 88.8% of recruited patients were diagnosed before age 50. Of patients with information on tumor stage (94.1%), most were diagnosed at advance stage (stage II = 58.9% or stage III = 35.2%). Tumour hormone receptors and HER2 status showed that 42.8% had triple negative breast cancer followed by HER2 enriched tumors, 28.5 % (Table 1).

Table 1 Age at diagnosis and tumor characteristics of studied population.

Identification of a recurrent pathogenic variant of BRCA1 gene

Mutation screening identified a recurrent pathogenic variant at heterozygous state of the BRCA1 gene in 15 probands out of 27 recruited. This is a duplication of ten nucleotides (c.815_824dupAGCCATGTGG, p.Thr276Afs) located in exon 11 of BRCA1 according to the HGVS nomenclature (Fig. 1). This pathogenic variant leads to a frameshift and a spurious stop codon 14 amino acids further down. It was detected in six index cases of a first group of 15 index cases by mutation screening of all coding exons of BRCA1 gene, and later in nine additional index cases from a second group of 12 patients by PCR genotyping (Supplementary Fig. 1). The allelic frequency was then estimated at 27.7% in hereditary breast cancer cases. The pathogenic variant was also detected by genotyping in a control population of sporadic breast cancer cases and healthy controls free from any cancer (allelic frequency estimated at 5% and 0.55%, respectively), and in ten healthy relatives from selected studied families (Table 2, Fig. 2). The PCR genotyping method is now available for first routine screening of the recurrent pathogenic variant in women at risk in our laboratory (Fig. 3).

Fig. 1
figure 1

Chromatographic sequence of the BRCA1gene exon 11 surrounding the identified pathogenic variant c.815_824dup.

Table 2 c.815_824dup10 pathogenic variant status in the study population.
Fig. 2: Pedigrees of six families carrying the BRCA1 pathogenic variant c.815_824dup10 identified by Sanger sequencing.
figure 2

Blue color indicates individuals diagnosed with breast cancer. P: index case, E+: Individual with pathogenic variant, E−: Individual without pathogenic variant.

Fig. 3: 4% agarose gel electrophoresis of PCR products for BRCA1 pathogenic variant c.815_824dup10 genotyping.
figure 3

PM molecular weight marker, TN DNA negative control, NM non-carrier index case, AB-FY-FGF-FW-ML-SN-WM-SM-NAG-AT: index cases carrying the mutation (AB-FY-FGF-FW-ML-AT belong to the first group of 15 index cases recruited and SN-WM-SM-NAG belong to the second group of 12 recruited index cases). This gel derived from a single experiment.

Any other pathogenic variant of the BRCA1 gene was identified in the remaining nine index cases of the first group while one of them had a novel BRCA2 pathogenic variant.23

An ancient founder haplotype linked to the recurrent pathogenic variant

As the pathogenic variant is recurrent in our study population, we searched for its founding effect by haplotype analysis. We genotyped seven microsatellite markers flanking the BRCA1 gene and distributed in 2.15 Mb in ten index cases with the pathogenic variant, and 15 unrelated healthy controls. Haplotype analysis showed that specific alleles sizes 144, 156, and 173 bp of three microsatellite markers D17S855, D17S1323, and D17S1325, respectively segregated together in the index cases, and constituted a common haplotype of ~400 kb. This haplotype is not found in any of the healthy controls studied. This common haplotype suggested a founder effect of the pathogenic variant in the study population (Table 3).

Table 3 Haplotypes of ten probands with the BRCA1 c.815_824dup10 mutation and 15 healthy controls.

We then estimated the age of the pathogenic variant in number of generations, using the following formula G = logδ/log (1 − θ) as described in the methodology. The pathogenic variant is supposed to appear in Senegal 55.5 generations ago, ~1400 years.

Discussion

Breast cancer in sub-Saharan Africa has a clinical epidemiology characterized by an early age at diagnosis (<50 years), aggressive tumors with poor prognosis.2,24,25 This particular epidemiology has been observed in our study population. The average age of diagnosis was 39.5 years and is the lowest reported in sub-Saharan Africa: 45 years in Senegal and Nigeria,25 46 years in Mali,26 and 49 years in Ghana.27 In Canada, the US, and Australia there has been an average age at diagnosis of about 45 years for sporadic breast cancer and 39.9 years for inherited breast cancer.28 A study conducted in Senegal in 2017 has reported a mean age at diagnosis of 47.5 years in women with sporadic breast cancer.29 The average age observed in this study is then concordant with that reported in inherited breast cancer over the world.

Tumors characteristics showed a late stage diagnosis with aggressive tumors of bad prognosis mainly grade SBR II and III. This has also been reported in sporadic breast cancer in sub-Saharan Africa.30,31,32,33,34,35,36,37 This late stage diagnosis could be linked to the lack of awareness programs for breast cancer symptomatology and diagnosis, and the high cost of breast cancer therapy, most patients first resorted to traditional medicine.

Despite the need expressed by gynecologists and oncologists, molecular phenotyping of tumors is not available in most sub-Saharan African countries. Available data came from abroad at expensive costs inaccessible to most patients. For the few laboratories that have local facilities, the poor quality of biopsies, the inadequacy of fixation time, the lack of equipment and specialists, are pitfalls for achieving these analyzes.2 Our results showed that the majority of tumors were triple negative (42.8%) or Her2 enriched (28.5%), all considered as poor prognostic tumors. Added to this, is the young age at diagnosis with 40.7% of our population under 35 years of age. It has been reported that breast cancer diagnosed in women under 40 years with triple negative or Her2-enriched tumors, is of poor prognosis and high metastasis incidence.38,39,40,41 It therefore appears that breast cancer in young women could constitute a biologically different entity. Germline mutations of BRCA1 and BRCA2 genes, and a familial aggregation are frequently observed in this case.42,43 Therefore, a BRCA gene mutation will be suspected, when an index case has a young age at diagnosis, and/or a family history of breast or ovarian cancer and a poor prognosis tumor. Patients in our study were relatively young (mean age 39.5 years) and all had at least one affected relative, while 14.8% had associated ovarian cancer. These characteristics are in favor of BRCA genes inherited mutations and thus a genetic predisposition.

Hereditary breast cancer accounts for 5–10% of breast cancers in women.1 Molecular genetics have led to a better understanding of the genetic basis of predisposition to breast/ovarian cancer. Several genes have been involved with two major genes BRCA1 and BRCA2 and minor genes PALB2, P53, PTEN, PALB2, P53, PTEN, CDH1, RAD51, MLH1, MSH2, PMS2, and EPCAM. Pathogenic variants of these genes play an important role in the genetic predisposition to breast or ovarian cancer. The cumulative risk of breast cancer at age 80 is estimated at 72% for BRCA1 carriers and 69% for BRCA2 carriers, while for ovarian cancer it is estimated at 44% for BRCA1 carriers and 17% for BRCA2 carriers.44

Predisposition to inherited breast cancer has been conducted in European or American populations while in Africa particularly in sub-Saharan Africa, very few studies have been conducted. We report here for the first time in Senegal, a recurrent pathogenic variant of the BRCA1 gene, c.815_824dup10 involved in the predisposition to hereditary breast cancer. Allelic frequency of the mutation was estimated to 27.7%. This is the highest allelic frequency of this mutation reported in a population. Review from the literature has shown that the variant has been described for the first time in a breast cancer patient from Ivory Coast living in the US.45 Later it was also reported in patients from Mexico and the Bahamas,16 among African-Americans and Hispanics living in the US,46 in Peru,17 but also in some European populations. Among African Americans with hereditary breast cancer, it is the most common mutation with a frequency of 16%.46 According to CIMBA database, 65 people carrying the variant have been identified worldwide, mostly of African or Hispanic origin. Studies in West Africa, particularly in Nigeria13 and Burkina Faso12 have not report it in these populations.

When a mutation is identified at a high frequency within a population, exploring for its founding effect becomes important for cancer prevention. Several founder mutations have been reported in different populations. Variants c.66_67delAG and c.5266dupC of BRCA1 have been used for breast cancer prevention among Jewish women.13 In Africa, founder mutations have been reported in Yoruba population from Nigeria (c.303T>G;(p.Tyr101Ter)) in a series of four families,13 in Afrikaners from South Africa (c.2641G>T (p.Glu881Ter))14 in five families, and in Morocco (c.5309G>T; (p.Gly1770Val)) in five families.11 The founding effect of the variant c.815_824dup10 was reported first in five nonrelated families from US20 and the mutation was supposed to be of West African origin.

When we screened for its founder effect in Senegal, we identified a haplotype of about 400 kb containing the variant in hereditary breast cancer patients and not in any of the healthy controls. This Senegalese haplotype is shorter compared with the one identified in the US, which spanned 700 kb, as well as the one associated with the Jewish mutation c.66_67delAG.21,22 This suggested that the Senegalese haplotype would be older than the African-American haplotype which was estimated to be 200 years old.20 Age estimation of the Senegalese haplotype suggested that the variant c.815_824dup10 arose around 1400 years ago. Then we supposed that the variant appeared first in Senegal and was spread throughout the world by population migration, especially by slave trade. Senegal by its geographical position in West Africa was one of the major ports during the trans-Atlantic slave trade toward the European and the US continents.47 But it would be of interest to screen for this mutation in other West African countries. Studies conducted in Nigeria and Burkina Faso have not reported it.12,13

While the allelic frequency of c.815_824dup10 in our study is high (27.7%), it is necessary to recruit more index cases and more controls in order to estimate the exact allelic frequency of this variant in Senegalese population.

In this study we identified a founder pathogenic variant involved in predisposition to inherited breast cancer in Senegalese women. Screening of the variant in women at risk in Senegal and other West African countries (Mali, Gambia, Ivory Coast, Ghana, and Benin) will be of interest for breast cancer prevention strategies. It will also lead the ground for oncogenetic counselling in Senegal.

Methods

Patients

Female index cases with biopsy-proven breast cancer and family history of breast or ovarian cancer, followed up at the Joliot Curie Institute and the Senology department of Aristide le Dantec Hospital in Dakar were recruited. After free written informed consent all participants were interviewed to collect demographic data and medical history. Twenty-seven unrelated female index cases were included in this study. Pedigrees were drawn with Progeny software and 5 ml blood sample was collected from each participant. Characteristics of each index cases and pedigrees are available upon request. From the 27 index cases recruited a first group of 15 was screened for mutations in all exons of the BRCA1 and BRCA2 genes. After identification of a recurrent mutation in this first group, the twelve (12) consecutives index patients were screened for this recurrent mutation by PCR genotyping. For index cases with the BRCA1 pathogenic variant, healthy relatives were further recruited for genetic testing after written informed consent.

We have also recruited a control population including 90 healthy women without known cancer who came for routine check-up at the Laboratory of Biology of Le Dantec Hospital and 80 women with sporadic breast cancer and without family history from the Joliot Curie Institute. This study was approved by the ethics committee of Cheikh Anta DIOP University under Protocol 014/2014/CER/UCAD. All participants gave their informed written consent before participation in the study.

Molecular analysis

DNA extraction and Sanger sequencing

Genomic DNA was extracted from whole blood with a QiAmp® DNA blood purification kit (Qiagen). BRCA1 and BRCA2 exons were amplified by PCR with specific primers located in intron/exon boundaries. Twenty-eight fragments covering the 22 coding exons of BRCA1 and 32 fragments covering the 26 coding exons of BRCA2 gene were amplified.48 The large exons 10 and 11 of BRCA2 were amplified as two and nine fragments, respectively, while exon 11 of BRCA1 was amplified as seven fragments (Supplementary Table 1). PCRs were carried out with initial denaturation at 95 °C for 10 min followed by 40 cycles of 95 °C for 30 s, 55 °C for 30 s, and 72 °C for 30 s with a GeneAmp® PCR System 9700 (Applied Biosystems) as described previously.48

The PCR products were purified with a MinElute 96UF kit and sequenced using a Big Dye terminator V3.1 sequencing kit on a 3730 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). Both forward and reverse strands were sequenced. Obtained sequences were compared with BRCA1 GenBank reference sequence (NM_007294.3) with Alamut Software. This method was used for the first 15 index cases recruited.

PCR genotyping of c.815_824dup10 recurrent pathogenic variant of BRCA1

For the following 12 patients and healthy relatives recruited, the recurrent pathogenic variant identified in the first group was genotyped by PCR with primers 5′-TGCTTGTGAATTTTCTGAGACGG-3′ and 5′-TGAGCATGGCAGTTTCTGCT-3′ using standard PCR conditions, followed by a 4% agarose gel migration during 4 h at 100 V. Individuals with the pathogenic variant showed two fragments at 422 and 412 bp (Supplementary Fig. 2).

Haplotype analysis

We selected ten index cases with the pathogenic variant and 15 unrelated healthy controls for haplotype analysis. Seven microsatellite markers flanking the BRCA1 gene located in 2.30 Mb were genotyped by PCR (locus order: cen-D17S1793-D17S1320-D17S855-D17S1323-D17S1325-D17S951-D17S1183-tel) with fluorescent labelled primers (Supplementary Table 2). The PCR products were analysed in automated sequencer ABI Prim 310 using Genscan 3.1.2 software (Applied Biosystems). Allele sizes are given as size of the PCR amplicons containing the microsatellites. Genotyping was performed by Inqaba BiotecTM.

Age estimation of the founder mutation

The age of c.815_824dup10 pathogenic variant in generations (G) was calculated using the following equation: G = logδ/log (1 − θ). Linkage disequilibrium (δ; delta) between the mutation and each of the closest recombinant microsatellite markers D17S855 and D17S1325 was calculated as δ = (Pd − Pn)/(1 − Pn), with Pd being the frequency of the ancestral microsatellite marker allele among the chromosomes carrying the mutated BRCA1 and Pn being the frequency of that microsatellite allele on chromosomes not carrying the mutation. The symbol θ (Teta) represents the recombination fraction between a marker and the gene.49 The genetic distances were inferred from Ensembl database (https://www.ensembl.org/index.html).

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this paper.