Multiple ancestral haplotypes harboring regulatory mutations cumulatively contribute to a QTL affecting chicken growth traits

Wang, Yuzhe; Cao, Xuemin; Luo, Chenglong; Sheng, Zheya; Zhang, Chunyuan; Bian, Cheng; Feng, Chungang; Li, Jinxiu; Gao, Fei; Zhao, Yiqiang; Jiang, Ziqin; Qu, Hao; Shu, Dingming; Carlborg, Örjan; Hu, Xiaoxiang; Li, Ning

doi:10.1038/s42003-020-01199-3

Download PDF

Article
Open access
Published: 28 August 2020

Multiple ancestral haplotypes harboring regulatory mutations cumulatively contribute to a QTL affecting chicken growth traits

Yuzhe Wang ORCID: orcid.org/0000-0003-3474-7957^1,2^na1,
Xuemin Cao¹^na1,
Chenglong Luo³,
Zheya Sheng^1,4,
Chunyuan Zhang^1,5,
Cheng Bian ORCID: orcid.org/0000-0002-3638-5550¹,
Chungang Feng¹,
Jinxiu Li¹,
Fei Gao^1,2,5,
Yiqiang Zhao^1,5,
Ziqin Jiang¹,
Hao Qu³,
Dingming Shu³,
Örjan Carlborg⁶,
Xiaoxiang Hu ORCID: orcid.org/0000-0001-7045-0283¹ &
…
Ning Li¹

Communications Biology volume 3, Article number: 472 (2020) Cite this article

3018 Accesses
15 Citations
1 Altmetric
Metrics details

Subjects

Abstract

In depth studies of quantitative trait loci (QTL) can provide insights to the genetic architectures of complex traits. A major effect QTL at the distal end of chicken chromosome 1 has been associated with growth traits in multiple populations. This locus was fine-mapped in a fifteen-generation chicken advanced intercross population including 1119 birds and explored in further detail using 222 sequenced genomes from 10 high/low body weight chicken stocks. We detected this QTL that, in total, contributed 14.4% of the genetic variance for growth. Further, nine mosaic precise intervals (Kb level) which contain ancestral regulatory variants were fine-mapped and we chose one of them to demonstrate the key regulatory role in the duodenum. This is the first study to break down the detail genetic architectures for the well-known QTL in chicken and provides a good example of the fine-mapping of various of quantitative traits in any species.

Genome-wide association study reveals the genetic determinism of growth traits in a Gushi-Anka F2 chicken population

Article Open access 28 September 2020

Yanhua Zhang, Yuzhe Wang, … Xiangtao Kang

Genome-wide identification of candidate copy number polymorphism genes associated with complex traits of Tibetan-sheep

Article Open access 12 October 2023

Dehong Tian, De Sun, … Kai Zhao

Mutant alleles differentially shape fitness and other complex traits in cattle

Article Open access 02 December 2021

Ruidong Xiang, Ed J. Breen, … Michael E. Goddard

Introduction

The molecular mechanisms of many monogenic traits have been identified in agricultural animals and plants, facilitated by their distinct genotype to phenotype associations^1,2,3. In contrast, complex quantitative traits tend to follow Fisher’s⁴ minor-polygene hypothesis, and are difficult to disentangle due to small allelic effects for each locus, complex patterns of linkage disequilibrium in associated regions, limited ability to screen for putative functional polymorphisms, the balance between artificial and natural selection for complex traits, and variable effects of loci across populations and environments^5,6,7. In human genetics, genetic variants contributing to complex traits such as human height are often distributed across the entire genome and an infinitesimal model seems to hold⁸. In domestic animals, although some prominent examples including Insulin-like growth factor 2 (IGF2)⁹ and Myostatin (MSTN)^10,11 prove that artificial selection has resulted in fixation of large effect mutations, the on-going grand challenge in the field is still to provide a better empirical understanding of the basis for polygenic traits.

A major-effect, quantitative trait locus (QTL) has been mapped in many chicken populations at the distal end of chromosome 1, where it has been confirmed to be significantly associated with several growth-related traits including body weight, abdominal fat, muscle weight, and feed conversion ratio^{12,13,14,15,16,17,18,19,20,21}. Many candidate genes reside in the associated region, including Retinoblastoma 1 (RB1), Forkhead box O1 (FOXO1)^20,21 and an insertion mutation of miR-16 have been suggested as possible causal polymorphism¹⁵. However, there is still no conclusive consensus among the detailed genetic architectural, biological mechanisms of this locus. In our previous study, an F₂ intercross between a slow growing meat type chicken, the Huiyang Bearded (HB, Table 1), and a fast-growing commercial broiler breed, High Quality chicken Line A03 (HQLA, Table 1) were constructed. We also mapped this QTL for growth-related traits in this region to the distal end of GGA1¹⁸.

Table 1 Summary of phenotype and sequencing information for the breeds included in the study.

Full size table

Here, we report the construction of an F₁₅ advanced intercross line (AIL) derived from the previously reported F₂ population and we use the large F₉ population for QTL mapping and birds from the F₁₅ generation for functional studies. The strongest association was detected to a mosaic haplotype that was further fine-mapped and functionally explored by evaluating the relationship between the haplotype detected in the HQLA-HB population and in two independently selected broiler breeds, six ancient low body weight breeds and two jungle fowl populations (Table 1)^22,23. Whole genome sequencing revealed multiple ancestral haplotypes harboring regulatory mutations cumulatively contributing to the major QTL affecting chicken growth traits. We further identified one instance where sequence variants within a regulatory element was identified as candidate causal mutations regulating the expression of the calcium binding protein 39 like (CAB39L) gene. This study reports the fine-mapping and functional dissection of a complex quantitative trait locus, contributing new insights to the genetic and biological mechanisms underlying growth in chicken.

Results

Genome-wide association studies replicate a major growth trait QTL on GGA1

Genome-wide association study (GWAS) analyses for several body weight and carcass traits (Supplementary Tables 1 and 2) were performed separately in the F₂ and F₉ generations from the HQLA × HB intercross population using in total 41,758 and 291,772 SNPs²⁴, respectively. In the F₂ generation (n = 493), a single, genome-wide 1% significant (p = 8.1 × 10^–7) association signal was detected for 11 of 15 evaluated traits at the distal end of GGA1 (165 Mb to 175 Mb; Supplementary Table 3; Supplementary Fig. 1). The most significant association was to a SNP (rs14917305) located at 169,795,686 bp (p = 1.6 × 10^–18) for the combined weight of the ventriculus and the proventriculus (SW; Table 2 and Supplementary Fig. 2). For body weight, the most significant associations were for weight at 10 weeks of age (BW10; GGaluGA055630 at 171,387,660 bp; p = 1.2 × 10^–10; Table 2 and Supplementary Fig. 1), and weight at 8 weeks of age (BW8; GGaluGA055630; p = 2.1 × 10^–9; Fig. 1a).

Table 2 Top SNPs for each trait in F₂ and F₉ generations.

Full size table

**Fig. 1: Fine mapping of a growth and body composition QTL on GGA1.**

In the F₉ generation (n = 595 after quality control from 602 samples), the strongest association was to BW8 (S1_168536487 at GGA1: 169,241,142 bp; p = 3.4 × 10^–16; Fig. 1b; Table 2 and Supplementary Table 4). The QTL peak was narrower in the F₉ than in the F₂ population due to the additional recombinants and higher density of SNP markers (41 K vs. 292 K). The top SNP (S1_168536487) was significantly associated with 10 traits, including weight at 2–14 weeks of age, growth rates (0–4 and 4–8 weeks of age) and intestinal length (IL) (Table 2 and Supplementary Table 4). To further define the associated haplotype around S1_168536487, SNPs were aggregated using r² = 0.3 with the top SNP as a criterion. This identified a target region of 3.1 Mb from 168.6 Mb to 171.7 Mb (Fig. 1c). In addition to the GGA1 QTL, a second QTL was discovered for BW8 in F₉ on GGA27 (3.60 Mb to 3.75 Mb; S27_3406188 at 3,620,306 bp; p = 7.6 × 10^–6), containing genes including Insulin-like growth factor 2 binding protein 1 (IGF2BP1) and Phosphoethanolamine/phosphocholine phosphatase (PHOSPHO1).

Identifying breed-level recombinants in the 3.1Mb region on GGA1

The GWAS P-values and the difference in allele-frequency between HQLA and HB at the significant SNPs (ΔAF(HQLA-HB) were highly correlated (r = 0.68, Supplementary Fig. 3). This correlation is consistent with the basic assumption in our analyses that, at many loci, alleles with significantly different effects on growth (growth-increasing Q in HQLA and growth-decreasing q in HB) were present in considerably different frequencies in these two phenotypically divergent populations. To trace haplotypes inherited from the founder breeds through the experimental cross, we first used founder-discriminatory markers (breed-level) to perform an identical-by-descent (IBD) analysis from F₀ to F₉. Only 2 of the ~800 GBS markers in the selected 3.1 Mb region (168.6 Mb–171.7 Mb; Fig. 1c) were informative of founder-breed origin (defined as having delta allele-frequency differences between the HQLA and HB, |ΔAF_(HQLA-HB)| > 0.95). Therefore, we sequenced the 31 F₀ founders of the AIL to 10× individual coverage to identify additional markers for tracing the recombination events between the HQLA and HB founder haplotypes in the associated GGA1 QTL region at high resolution. The 3.1 Mb peak QTL region was screened for polymorphic sites in the sequence data (Supplementary Table 5) and in total 46 SNP polymorphisms with |ΔAF_(HQLA-HB)| > 0.95 were identified. Next, 31 of these 46 SNPs were selected for genotyping using Fluidigm® in from the F₉ generation (n = 575 after quality control from 602 samples). An IBD analysis identified four breed-level recombination breakpoints in the QTL interval. In total, 503 F₉ individuals carried unrecombined progenitor chromosomes (HQLA/HQLA, HQLA/HB, HB/HB) in the 3.1 Mb region and significant differences in BW8 were detected between individuals carrying the three different founder-breed origin genotypes (Fig. 1d). The other 72 F₉ individuals carried one or two copies of the recombinant haplotypes (the different recombinant haplotypes found among these 72 individuals are shown in Fig. 1e). An association analysis to the IBD-status of the five blocks defined by the four recombination events found that only the first two proximal blocks were significantly associated with BW8 (p = 0.0007 and 0.011, respectively, Fig. 1e) and only the most proximal block was significantly associated with BW14 (p = 0.0009, Supplementary Fig. 4). For each block interval, we also computed the detailed phenotypic scales for the three genotypes and significant differences were observed among the three groups only in the first two proximal blocks (p < 0.05, Fig. 1f). The differences between Block2 and Block3 were due to 28 HQLA to HB conversions (Supplementary Fig. 5), resulting in a significantly lower BW8 of HQLA/HB and HB/HB individuals in Block1 and Block2 compared to Block3, Block4, and Block5 (HQLA/HQLA not included due to small sample size, Fig. 1f). Besides, this interval (Block1+Block2) overlapped with the peak QTL region in other mapping studies^12,15,21 and the most significant SNP in our genome-wide analysis was located in this region. In total, this segment tagged by GBS markers explained 14.4% of the genetic variance in BW8 in the F₉ generation of the AIL. Conservatively, we selected the ~1.2 Mb region covered by Block1 and Block2 (168.6–169.8 Mb) for further analysis.

Identifying a mosaic pattern of the 1.2 Mb QTL interval

In an attempt to utilize individual-level haplotype diversity in the founders to fine-map the region further, additional genotyping of markers segregating in the two intercrossed founder populations was performed across the region. Only 7 of the 31 genotyped founder-population informative SNPs (|ΔAF_(HQLA-HB)| > 0.95) were located in the 1.2 Mb candidate region. For this, an additional 76 SNPs with |ΔAF_(HQLA-HB)| > 0.75, including the peak SNP S1_168536487 in the previous F₉ GWAS, were selected in this interval (detail in “Methods” section). These were genotyped in the same F₉ cohort as before for use in additional association analysis and exploration of common, segregating haplotypes. Of these, 60 SNPs were selected as TagSNPs using a 1% genome-wide Bonferroni corrected significance threshold in the single SNP F₉ association analysis across these markers. Nine of these SNPs were more significant in this analysis than the earlier top-SNP S1_168536487 (Supplementary Data 1). When evaluating the distribution of the associated SNPs in the target region in more detail, it was noticed that the TagSNPs were not continuously distributed across the region. They were instead located in clusters separated by regions with SNPs showing low or no association to growth (Fig. 1g). It is considered unlikely that the non-continuous association signal observed was due to a recombination breakup of founder haplotypes in the AIL due to the limited number of generations of intercrossing (F₀ to F₉) and the recombination rate in chicken. Instead, it is considered more likely to have arisen from variable allele frequencies of the SNPs in the founders (Supplementary Fig. 6). No strong positive selection signal was found using standard measures of the genetic diversity (ΔAF/π/haplotype diversity (H)/Fst/XP-EHH; Supplementary Figs. 6, 7, 8, and 9). These results are consistent with the breeding history of the commercial HQLA stock (details in Materials section), that was formed by first crossing two divergent but outbred population (increasing the haplotype diversity) and then followed by strong directional selection for increased weight over 10 generations. This likely resulted in selection signatures due to selection of longer old haplotypes than creation of new via genetic hitchhiking across multiple individually contributing SNPs at candidate functional loci. This distinct mosaic pattern observed is therefore different from the genetic architecture expected around a strongly selected single causative mutation, making it difficult to exclude any significant SNPs as tags of contributing variants. Thus, we postulated the hypothesis that this 1.2 Mb region might contain multiple linked functional polymorphisms in the founders, which are to a large extent co-inherited through the pedigree causing the extended association signal.

Haplotype analysis suggests multiple contributing loci in this region

To identify the haplotypes contributing to the association signal, a multilocus backward-elimination analysis was performed across the 60 tag SNPs in the 1.2 Mb region^25,26. In addition, 18 markers outside the QTL with associations in the F₉-GWAS (p < 0.0001) were selected to control for genetic background effects. In total, this analysis identified 8 SNPs (backward-elimination SNPs, BESNPs, Supplementary Data 1) in the 1.2 Mb region with statistically independent associations to BW8 at a 5% False Discovery Rate (FDR) threshold. Next, the haplotypes tagged by these 8 BESNPs were estimated in the F₉ population and, in total, 100 haplotypes were detected of which 16 existed at a frequency > 0.01 (“Major haplotypes”). The remaining 84 were grouped (“Other haplotypes”; total frequency = 0.19).

The additive haplotype substitution effects on BW8 were estimated, and there was a gradient distribution of haplotype allele effects from decreasing BW8 by 59 g to increasing it by 65 g (Fig. 2). The founder haplotypes with HB and HQLA origin have effects at the opposite ends of the effect spectrum. Two non-recombined founder haplotypes, one from HB (AF_HB = 0.70) and one from HQLA (AF_HQLA = 0.04), had bigger weight decreasing/increasing effects, respectively, than the other founder line haplotypes. The recombinant haplotypes had intermediate substitution effects to those of the contributing founder haplotypes. Inheriting a partial founder-line haplotype segment is thus not sufficient to reach their full weight increasing/decreasing effects, respectively. The part of the 1.2 Mb segment transmitted to a recombinant haplotype appears to matter. Inheriting a “HB” segment in proximal part of the haplotype provides most of the “low-weight” effect, with only a smaller contribution by the “HB” segment in the distal part. In contrast, the “high-weight” effect appears to be more evenly contributed by the corresponding “HQLA” segments in the proximal and distal parts. The results from the association analyses therefore lead us to propose the hypothesis that this 1.2 Mb segment contain at least two loci contributing to the differences in weight between HB and HQLA, and that both founder lines are likely to segregate for at least two haplotypes across this region with different effects on weight.

**Fig. 2: Haplotype association analysis for 8-week body weight in the 1.2 Mb candidate region on GGA1.**

Haplotype-sharing discovered 9 sub-haplotypes as causal candidates

A haplotype-sharing analysis across multiple high-weight and low-weight chicken breeds was next used to refine the 1.2 Mb region further. We here assumed that one or more of the multiple favorable regions of the large-effect, multi-locus QTL haplotype identified in the cross between HB × HQLA (the “Q-haplotype”) would also have been selected in other high-weight breeds and that these breeds would therefore, at least in parts, share the Q-haplotype in the region. However, this analysis is also a potential source of false negative variant discrimination. The 253 re-sequenced individuals were divided into two groups (high and low body weight, Table 1, Supplementary Figs. 10 and 11). In the 1.2 Mb segment, nine sub-haplotypes were identified as having large frequency differences (ΔAF(Q) > 0.4) between the high and low-weight breeds (Fig. 3a). The low level of pairwise LD across these segments in the multi-breeds analysis (Supplementary Fig. 12), together with the difference in haplotype frequencies between high and low breeds, suggests that multiple haplotypes that are mosaics of these sub-segments have undergone positive selection. This is consistent with the results found in the association analysis that multiple sub-regions in the segment contribute to the differences between HB and HQLA. It is hard to discriminate the functional mutations from all SNPs inside each haplotype because they are almost complete linkage disequilibrium across all high-weight chicken. However, candidate intervals have been narrowed to a few Kb levels (lengths range from 2 Kb to 12 Kb, Fig. 3a) and causal mutations are expected to be located within the minimum shared haplotypes present in these breeds.

**Fig. 3: Deciphering the genetic architecture, gene expression and regulation mode using multi-population and multi-omics.**

Multiple genes are differentially expressed in different tissues

All nine sub-haplotypes are located in non-coding regions (introns or intergenic) which may influence gene expression at local or long distances and in tissue-specific manners. To examine the potential functional effect of the variants in these Q-haplotypes, the expression levels of the 23 genes located in the 1.2 Mb interval (Fig. 3b and Supplementary Fig. 13) were evaluated using 9 birds with high 7-week weight (BW7: 882.56 ± 18.66 g), and 10 with low 7-week weight (BW7: 717.60 ± 21.15 g) from the F₁₅ generation of the deep intercross line. Five digestive tract tissues (proventriculus, ventriculus, duodenum, jejunum and ileum) were selected for gene expression analysis detected by quantitative real-time PCR, since phenotypes relating to these tissues were associated to this region in the GWAS analyses (Supplementary Tables 3 and 4). Multiple genes were differentially expressed in different tissues (Fig. 3b and Supplementary Fig. 13) including ITM2B located in sub-region Q2 and CAB39L in sub-region Q4. Outside of these, MLNR and SPRYD7 were more highly expressed in the high body weight individuals in multiple tissues. These results suggest that polymorphisms in the selected Q haplotypes may contribute to chicken weight by altering gene expression in the same chromosomal segment, perhaps via a network regulation involving multiple target tissues and genes.

Identification of a regulatory mutation located in the Q4 haplotype

We use ATAC-seq to identify chromatin accessibility, which is a crucial component of genome regulation. Here we first focus on chicken duodenum as an example of in-depth research because the most significant associations of the evaluated carcass traits (dressed weight (DW), abdominal fat weight (AFW), eviscerated weight (EW) and intestinal length (IL)) in F₉ was IL and duodenum (the first part of small intestine) is known to have important roles in digestion, appetite regulation and growth^27,28,29. We conducted ATAC-seq to profile the accessible chromatin in duodenum samples from two chickens with high and low body weights (HBW1, HBW2, LBW1, LBW2) from the F₁₂ generation of the intercross. We obtained 86.9–131.7 million unique mapped reads and 7,303–29,724 peaks from each sample (Supplementary Table 6). We assessed the genomic distribution of duodenum ATAC-seq peaks and found a characteristic enrichment near gene transcriptional start sites and more intronic and intergenic non-coding sequences (Fig. 3c, Supplementary Table 7 and Supplementary Fig. 14). Notably, only one overlapping region, which is located upstream of the TSS of CAB39L and SETDB2 genes (within Q4 haplotype), was found compared with the haplotype-sharing results (Figs. 3a and 3c). Within the crucial component of genome regulation, two tag SNPs (SNP1 chr1:169,208,105 and SNP2 chr1:169,208,133) were significantly associated with body weight and other growth traits (GR/CW/IL) in the GWAS analysis. The QQ (CG) and qq (AA) haplotype sequences from the open chromatin region for the 2 SNPs (Chr1:169,207,831–169,208,840; in total 1,010 bp) were cloned in both orientations (SETDB2(F) and CAB39L(R)) into the pGL3-basic and pGL3-promoter luciferase reporter vectors. We transfected DF1 cells and measured luciferase activity after 48 h. Figure 4 shows the comparison of the promoter activity for these, compared to the promoter-less vector. All fragments were found to increase luciferase activity compared with empty vector: ~53/40-fold increase (qq/QQ) in SETDB2 direction (F) and ~97/218-fold (qq/QQ) in CAB39L direction (R). The luciferase activity was significantly higher for the QQ constructs than for the qq constructs in CAB39L direction (R) (p < 0.001; ~2.25-fold difference). For enhancer activity detection, only the QQ constructs showed significantly increased luciferase activity compared with enhancer less vector and the corresponding qq constructs (~1.4/1.2-fold increase in SETDB2 direction (F)/CAB39L direction (R)). This example shows the Q4 haplotype contains functional element affecting gene regulation which may affect growth.

**Fig. 4: Luciferase reporter assay for the open chromatin region in the Q4 haplotype region.**

Discussion

Deep intercross-populations between divergent lines is a powerful experimental design for identifying and fine-mapping chromosomal regions (QTL) contributing to complex traits via the accumulation of recombination events in each generation. A series of analyses were performed in the F₂ and F₉ generations of a cross between native Chinese (HB) and commercial broiler (HQLA) populations. The results strongly indicated that the associated region contained multiple segregation haplotypes in both founder populations harboring regulatory mutations cumulatively contributing to a major QTL affecting chicken growth traits.

Earlier works in other chicken populations involving this region on chromosome 1 have proposed several candidate genes and/or suggestive functional mutations in the region. A study of an F₂ population identified a 54-bp insertion on miR-16 that was significantly associated with increased body weight¹⁵. This mutation does, however, not segregate at significantly different allele frequencies in the HQLA and HB founders of this cross, and also not in a sample of 50 additional HQLA/HB samples and other 240 high/low body weight individuals collected for diagnostic tests (Supplementary Table 8). Therefore, we consider it a highly unlikely casual mutation for the differences in body weight between the founder lines of this study. The situation is similar for RB1, an earlier proposed candidate gene for BW and bone traits²¹ located at around 168.8 Mb on chromosome 1, where the maximum |ΔAF_(HQLA-HB)| < 0.4 in this study. Another association detected in this region is that of Yuan et al. reporting the top association to a SNP (rs13553102) located at upstream of miR-15a (in Block2 of this study) resulting in a significant decrease in feed conversion ratio¹². A relatively high |ΔAF_(HQLA-HB)| was found for this SNP (0.65), but for the other breeds in the low body weight chicken breed cluster the trend for this polymorphism was inconsistent (Supplementary Table 9). A reasonable explanation for these results is with slightly different sub-phenotypes of growth traits being the target of selection in different breeds, the preferred polymorphisms in the breeding might be different to further enhance this trend. The selection of the target phenotype will make the selection pressure and fitness for each functional site different, change the genetic architecture gradually, and further lead to different fine-mapping results. From another aspect, the presence of multiple, potentially causative, mutations in this region are coherent with our finding that multiple sub-haplotypes are likely to have been selected in this region across the commercial high-weight breeds for multiple traits including feed conversion ratio, growth rate, meat quality and so on.

In addition to different selection pressure on alleles affecting different sub-phenotypes of growth, non-additive genetic architectures and complex linkage patterns in the founder-populations used for different breeds are also likely important explanations for the inconsistent results observed across populations³⁰. One of the most in-depth explorations of growth in chicken have been performed in the Virginia body weight chicken lines. Starting with a single significant QTL in the F₂ population³¹, more than 20 loci have been shown to contribute to long-term responses in Virginia population, many of which being linked, epistatic and contributing minor effects from alleles present as standing variations in the base-population³². In this study, both the major QTL interval on GGA1 and the minor QTL region on GGA27 (Fig. 1b) overlapped with QTLs detected in the Virginia lines. But we have not considered the potential impact of high-order genetic interactions (epistasis) on the modeling of genetic associations between sequence-level variations^25,33.

The segregation of multiple haplotypes across the fine-mapped 1.2 Mb segment in the two founder-lines, and the differential expression of multiple genes in this segment between high-weight and low-weight birds, together reflect the complexity in the genetics of growth trait regulation even in a small fragment of the genome. Functional experiment of the open chromatin region located in Q4 haplotype highlighted two SNPs inside a regulatory element as candidate causal mutations and CAB39L as a functional candidate gene based on its differential expression in high-weight and low-weight birds. CAB39L is a scaffold protein that binds and stabilizes the LKB1 activation loop in a conformation required for phosphorylation of substrates³⁴. In particular, its presence enhances the regulatory effect of the pseudokinases, STE20-like kinase family, STRADα and STRADβ, on the activity of LKB1. The AMP-activated-protein-kinase (AMPK) is activated upon phosphorylation of Thr172 within the catalytic alpha-subunit by the LKB1–STRAD–CAB39L complex^35,36,37. AMPK plays a critical role in hormonal and nutrient-derived anorexigenic and orexigenic signals and in energy balance³⁸. Interestingly, selection for muscle growth in pigs resulted in a high frequency of a missense mutation in a muscle-specific isoform of the AMPK gamma chain³⁹. However, the sample size in the differential expression analysis (n = 19) is small for this quantitative trait and it can therefore not be expected that other than the largest effects are detected.

Genetic analysis of domestic animals provides a unique opportunity to discover the evolution of genomes under intense selection^40,41. In the Virginia chicken lines, the large selection responses for body-weight has been shown to be due to a highly polygenic genetic architecture, and where most contributing extended QTL were complex and fine-mapped to multiple linked loci^25,26,42. Earlier work has shown that most of the selection in that experimental population has been due to selection on standing genetic variation²⁶. Here, an evaluation of the region in the ancestral Red junglefowl and other breeds showed that the sub-haplotypes contributing to high weight in the commercial populations were present also in the RJF and several local breeds (Fig. 5a and Supplementary Data 2). According to the theory of genetic hitchhiking, selection on favorable alleles makes them reach high frequencies in a population faster than recombination can degrade the core haplotype around it⁴³. Given the recombination rate in chicken⁴⁴, it is unlikely that the mosaic pattern observed in the region explored here has been generated during the limited time—10 s or perhaps a few hundred generations—of intense breeding for meat producing chicken breeds. A more reasonable explanation is that multiple haplotypes containing different variants across these segments already existed in the population used as founders of the high-weight broiler breeds, to eventually produce the mosaic selection signature observed in this study. Similar patterns have also earlier been discovered in other species⁴⁵, and may play an important role in adaptation to diverse habitats for animals and plants^46,47,48. In Fig. 5b, we schematically illustrate our hypothesis for how the multiple standing haplotypes across the selected region first emerged, likely by combining standing variants from the red junglefowl during the 1000s of years of chicken domestication, before they were rapidly accumulated in the high-weight chicken breeds during intense artificial selection (Fig. 5b). In this mosaic model, different haplotypes harbor the standing genetic variants and each haplotype does not diversify the population extensively during normal conditions. These can later allow for first rapid and later long-term adaptations by releasing genetic selectable variation by generating new and favored haplotypes via recombination when the population is subjected to intense selection, similar to what has been observed for the Virginia chicken lines⁴⁷. A potentially valuable practical consequence of the results in this study emerges from the observation that all sub-segments in the Q-haplotype have not yet been fixed in all the evaluated broiler breeds (Fig. 5a). This suggests that selection on this major locus is still an on-going event, where different combinations of beneficial variants are selected in the breeds suggesting that there might still be considerable space for improvement by introgression.

**Fig. 5: Illustration of the proposed model of mosaic artificial haplotype selection for high-weight during chicken domestication.**

In conclusion, the association analyses inferred multiple haplotypes with different effects on weight in chicken, likely due to segregating polymorphisms at multiple, tightly linked regulatory mutations loci in the region. We proposed nine Kb-level candidate segments and selectively validated one of the regulatory roles played in the duodenum. The multiple shorter haplotype genetic architecture of this fine-mapped major QTL region made us propose a mosaic positive selection model consistent with this and earlier findings in chickens. This is an illustration of fine mapping of multiple minor genes in a major QTL. It also provides a new perspective on the genomic consequences of strong artificial selection in chicken, as well as other domestic animals, and new ideas for developing the next generation of genomic selection based breeding.

Methods

Experimental population and phenotyping

A large intercross pedigree was established from two divergent chicken lines, High Quality chicken Line A03 (HQLA), a broiler line bred by Guangdong Wiz Agricultural Science and Technology, Co. (Guangzhou, China), and Huiyang Bearded chicken (HB), a native Chinese breed. In brief, HQLA is a Chinese commercial broiler line founded by the commercial Anak broiler breed and a Chinese indigenous chicken line, followed by strong artificial selection over 10 generations, according to a weight-based selection index. HB is a breed characterized by slow growth and high meat quality. The body weight of HB chickens at 7 weeks of age were, on average, less than one-third of that of chickens from HQLA (Table 1), which is a stock that has been under selection for fast growth for more than 10 generations. Detailed feeding regimes, and F₀ to F₂ mating schemes, have been described earlier in Sheng et al.¹⁸. Later AIL generations (F₃ to F₉) were founded by birds from the F₂ population and bred using random mating. The number of individuals produced in the generations varied from 144 to 294 (Supplementary Table 1).

For the F₂ and F₉ generations, live body weights were measured in grams at hatching, 2, 4, 6, 8, 10, 12 weeks of age (BW0, BW2, BW4, BW6, BW8, BW10, BW12). Weights at 14 weeks of age (BW14) were measured in F₉ only. Growth rates were calculated as the weight gain during the periods 0–4, 4–8, and 8–12 weeks of age (GR0–4, GR4–8, GR8–12). In total, 499 F₂ and 602 F₉ individuals were slaughtered at week 13 (F₂) or 14 (F₉). After slaughter, dissections were performed to measure dressed weight (DW), Abdominal fat weight (AFW), eviscerated weight (EW), the combined weight of the ventriculus and the proventriculus (SW, F₂ only)¹⁸, all in grams, and intestine length (IL, F₉ only) in centimeters. In the F₉ individuals, Cholesterol (CHO, mM), Triglyceride (TG, mM) and lactate dehydrogenase (LDH, IU/L) were also measured. Descriptive statistics for these phenotypes were provided in¹⁸ for F₂ and in Supplementary Table 2 and Supplementary Data 3 for F₉.

Ethics approval

All animals used in this study were cared for, and experiments conducted using procedures, that complied with the requirements of the Animal Welfare Committee of Agrobiotechnology of China Agricultural University (approval SKLAB-2014-06-07).

Whole genome resequencing sample information

Comparative population genomics involved 253 additional chickens from a range of domestic chicken breeds and two wild junglefowl populations. Analyzed sequences included those from F₀ founders for the HQLA-HB AIL (HQLA n = 16 and HB n = 15), two commercial broiler lines (Cornish n = 29, Recessive white Rock n = 30), and four Chinese native breeds (Silkies n = 30, Chahua chicken n = 30, Tibetan chicken n = 30, Daweishan Mini chicken n = 31; Table 1). In addition, we made use of previously published sequences from the Xishuangbanna game fowl (XSD n = 8)²², Indonesian native breeds (SUM n = 15 including Sumatera n = 5 and Kedu Hitam n = 10)²³ and two jungle fowls (Red jungle fowls; RJF n = 5 from the study by ref. ⁴⁹, and RJFs n = 5 and Green jungle fowls; GJFs n = 9 from ref. ²³). These chickens were divided into two groups (high and low body weight), with the high-weight group containing the commercial broiler lines (HQLA, CB1, CB2) whose weights were much higher than those of other breeds (Table 1).

Blood sampling and DNA extraction

DNA was extracted from EDTA-anticoagulated blood from the wing vein using the phenol-chloroform method (F₂) or the Qiagen DNeasy Blood and Tissue Kit (F₀, F₉ and other breeds) according to the manufacturer’s instructions (Qiagen, Hilden, Germany). All animals used in this study were cared for, and experiments conducted using procedures, that complied with the requirements of the Animal Welfare Committee of Agrobiotechnology of China Agricultural University (approval SKLAB-2014-06-07).

SNP Genotyping in the F₂ and F₉ populations

In total, 493 individuals from the F₂ generation were genotyped using the Illumina Chicken 60 K SNP Beadchip (Illumina, San Diego, CA), containing in total 57,636 SNPs⁵⁰. SNPs on the sex chromosomes (Z/W), mitochondrial SNPs, and SNPs that could not be assigned to known chromosomes were excluded from the raw data. SNPs that failed to meet the following criteria were removed: individual call rate (>0.9), individual SNP call frequency (>0.9), and minor allele frequency (MAF > 0.05). In total, 493 F₂ samples and 41,758 autosomal SNPs were retained for further analysis.

In the F₉ generation, double-enzyme digestion genotyping by sequencing (ddGBS) was performed. An EcoRI- MseI library was prepared as in ref. ²⁴ and sequencing performed on an Illumina Nextseq500 sequencer. Qualified reads were aligned to the chicken reference genome Gallus gallus 5.0 (released 2015) and the TASSEL-4.0 GBS analysis pipeline was used to discover SNPs⁵¹. The SNP filtering options used in TASSEL are described in detail in ref. ²⁴. In total, 595 F₉ samples and 291,772 SNPs that were evenly distributed across the genome in F₉ were retained for further analyses.

Joint variant calling of new and downloaded data

Whole genome sequencing was performed for n = 211 new samples described above, and sequences from n = 42 other birds where downloaded from earlier studies, for a total of n = 253 processed samples. Approximately 100 ng of genomic DNA (for each of n = 211 samples) was fragmented to a mean size of about 300–400 bp using Covaris E210. The sample preparation workflow was complied with TruSeq Nano DNA Library Prep for NeoPrep Reference Guide (15049722 v01). The quality and concentration of the libraries were assessed with a Qubit2.0 Fluorometer (Thermo, MA, U.S.A.) and an Agilent 2100 Bioanalyzer (Agilent, Santa Clara, CA, U.S.A.). Paired-end libraries were sequenced on the HiSeq X platform with 2 × 151 cycles.

GATK best practices (https://software.broadinstitute.org/gatk/best-practices/) were employed to analyze the n = 253 chicken genomes. Prior mapping, adapter sequences were deleted and then the reads which contained more than 50% low quality bases, or more than 5% N bases, were removed. Qualified reads were aligned to the chicken reference genome (Gallus gallus 5.0) with BWA-MEM (version 0.7.10)⁵² using ‘-t 10 -M’ as parameters. Initial BAM files were further processed with reordering, sorting and duplicates marking utilizing the Picard (picard-tools-1.56) package followed by base quality recalibration using the BaseRecalibrator tools in the Genome Analysis Toolkit (GenomeAnalysisTK-3.6, GATK)⁵³. Raw variants were called for individual bases using HaplotypeCaller. This process resulted in 253 individual *.gvcf files which were subjected to joint variant calling using GenotypeGVCFs. The VariantFiltration command was employed to exclude potential false-positive variant calls with the parameter ‘–filterExpression “QD < 2.0 || FS > 60.0 || MQ < 40.0 || ReadPosRankSum < −8.0” for SNPs and “QD < 2.0 || FS > 60.0 || MQ < 40.0 || ReadPosRankSum < −8.0” for Indels’. Ultimately credible SNPs were identified after using strict filtering criteria with parameters: minor allele frequency <0.05 and call rate >90% in each population. The SNPEff program⁵⁴ was used, with the chicken reference genome sequence and GTF annotation files downloaded from Ensembl (version 5.0.89) to annotate variants. In GGA1, structural variations (SV) including deletions (DEL), insertions (INS), inversions (INV) and tandem duplications (DUP) were discovered using the Pindel (V0.2.4) software⁵⁵ and copy number variation (CNV) were detected using the CNVnator (V0.3) software⁵⁶.

Genome-wide association analyses

A mixed linear model (MLM) approach was used for the genome-wide association analyses as implemented in the GCTA package (v1.24)⁵⁷. The MLM model used is described in detail in ref. ⁵⁸. The statistical model during analyses of BW2–14 included the sex and batch as discrete covariates and birth weight as quantitative covariate. For DW and EW, sex and batch were included as discrete covariates and BW13/BW14 (F₂/F₉) as quantitative covariate. For all other traits, only the sex and batch were included as discrete covariates. A quantile-quantile (Q-Q) plot generated in R (v3.0.2) was used to assess the potential impact of population stratification on the genetic association studies (Supplementary Fig. 2). To account for multiple testing across the genome, a Bonferroni correction was applied correcting for the number estimated independent markers from a PCA analysis performed as follows. A subset of SNPs that were in approximate linkage equilibrium with each other was obtained by removing one in each pair of SNPs if the LD was greater than 0.4 using the PLINK v1.07 ‘--indep-pairwise’ command⁵⁹. The squared correlation coefficient (r²) between the genotypes was calculated using the vcftools ‘--geno-r2’ command⁶⁰. Consequently, for the F₂ generation, the genome-wide 1% significance threshold was determined as p-value < 8.12 × 10^–7 (0.01/12,310), and a suggestive association as 4.06 × 10^–6 (0.05/12,310). For the F₉ generation, the thresholds were 1.84 × 10^–7 (0.01/54,399) and 9.19 × 10^–7 (0.05/54,399), respectively. The genotype in major QTL on GGA1 (165–175 Mb) of F₉ generation was shown in Supplementary Data 4. The GWAS raw data to BW8 in F₂ and F₉ were shown in Supplementary Data 5 and 6.

Fine mapping the QTL

Stage I: The founder-line origin of the chromosomal segments inherited by the F₉ individuals were traced back to facilitate identity-by-descent (IBD) mapping of the segment assuming divergent QTL-allele fixation in the founders. The absolute allele frequency differences (ΔAF) between HQLA and HB were calculated for all markers across the segment using the SNPs identified using whole genome resequencing of the F₀ founders (see above). A subset of 46 segregating SNPs (|ΔAF_(HQLA-HB)| ≥ 0.95, i.e., almost completely fixed for alternative alleles in the two lines) from candidate QTL region were selected to discriminate between alleles inherited from HQLA and HB. These were then genotyped on the Fluidigm platform (SNPtype Assays for SNP Genotyping on the 96.96 Dynamic Array IFCs) in 602 F₉ individuals using the run thermal cycling protocol ‘SNPtype 96×96 v1’. SNPs and individuals were excluded due to bad calling quality (default parameters), individual call rate < 0.9 and SNPs call frequency < 0.9. In total, 31 SNPs and 575 F₉ individuals passed this filtering and missing genotypes were imputed and corresponding recombinant IBD blocks deduced using BEAGLE V4.0⁶¹. For each recombinant block, the distributions of BW8 and BW14 were evaluated in the 72 individuals carrying recombinant segments (in total 144 chromosomes). The significance of weight differences between chromosomal IBD segments of HQLA-origin and HB-origin were calculated using an unpaired T-test (Supplementary Data 7 and 8).

Stage II: The two blocks with the strongest associations to weight (blocks 1 and 2; 168.6–169.8 Mb) were subjected to a second round of targeted genotyping in the 602 available individuals from the F₉ generation. New SNPs were chosen based on the following three principles. First, to also tag segregating variants in the founders the allele-frequency selection criterion was relaxed to |ΔAF_(HQLA-HB)| ≥ 0.75 (mean|ΔAF| ± two times of standard deviation in this 1.2 Mb interval). When multiple SNPs met this criterion and were separated by less than 100 bp, all but one were removed. We selected 61 SNPs in this step (including 9 SNPs genotyped earlier using the GBS method). Second, if a detected SNP was predicted as a missense/splice mutation or was located in the 3′/5′ UTR region of a gene, a softer selection criteria was used as |ΔAF_(HQLA-HB)| ≥ 0.3. 18 SNPs were selected under this condition. Third, 10 SNPs that reached the genome-wide significance in the earlier F₂ Chip-GWAS, but were not part of the F₉ GBS set were also included. These 89 SNPs were genotyped using matrix-assisted laser desorption/ionization-time of flight technology (MALDI-TOF, Sequenom®). Thirteen SNPs were excluded after quality control due to bad calling, resulting in 76 SNPs being appended to the third GWAS in the same F₉ cohort.

Selective sweep scans

To investigate the signatures of selection in the three sequenced broilers and other breeds, five statistical tests were used including the frequency spectrum-based Tajima’s D, π and haplotype diversity (H) methods, the linkage disequilibrium-based XP-EHH method and the population differentiation-based Fst method. Three different genomic segments were analyzed: (1) GGA1:168.6–169.8 Mb, (2) GGA1: 165–175 Mb, and (3) the whole GGA1. For these three intervals, the Tajima’s D statistic was obtained in bins of 250 bp or 25 Kbp. Calculations of Fst between HQLA and HB were done on a per-site basis or in bins of 20 Kbp. The nucleotide diversity (π) was measured in sliding windows, with a window size of 500 bp and a step size of 250 bp or a window size of 50 Kbp and a step size of 25 Kbp. XP-EHH value normalization at each locus between HQLA and HB was estimated by selscan program⁶² and the genetic map for our population was 2.8 cM/Mb for GGA1¹⁸. Haplotype diversity (H) was calculated for each population as described by Nei and Tajima⁶³. The H statistic was calculated in bins with 2 Kb.

Haplotype blocks

Genome resequencing data was used to construct haplotypes. Eleven populations were divided into a high-weight group (high; 3 broiler populations) and a low-weight group (low; other 8 breeds except GJF). Haplotype block estimation was performed using PLINK with 2 Kb bins across the 1.2 Mb candidate region. The regions defining the Q-haplotype of HQLA was identified using a criterion of ΔAF_Q(High-Low) ≥ 0.4. If the haplotypes in two consecutive bins both reach this criterion, the windows will be merged to create a longer Q haplotype sub-segment.

Haplotype-based association analyses

In total, 100 haplotypes were detected of which 16 existed at a frequency > 0.01 (“Major haplotypes”) and the remaining 84 were grouped (“Other haplotypes”; total frequency = 0.19) (Supplementary Data 9). A haplotype-based association analysis was performed in the 1.2 Mb fine-mapped QTL region using model²⁵:

$$Y = X\beta + Zu + e$$

Here, Y is a column vector containing the BW8 of the F₉ individuals. X is the design matrix including the coding for the sex of the birds. Z is the design matrix for 17 columns containing the each haplo-genotype number (coded as 0,1,2) of each individual. β is a vector with the estimate of the fixed effect of sex. u is a column vector with the estimates for allele substitution effects for each haplo-genotype, and e is the normally distributed residual.

Examination of population structure

The population-level phylogeny was inferred using the Neighbor–Joining method⁶⁴. Evolutionary analyses were conducted in MEGA7⁶⁵ based on independent-pairwise SNPs and the tree is drawn by interactive tree of life (iTOL) online tool⁶⁶. All SNPs were pruned using the indep-pairwise option in PLINK, with a window size of 50 SNPs, a step of 5 SNPs, and r² threshold of 0.1.

RNA extraction and real-time PCR assay (Q-PCR)

Nine birds with high 7-week weight (BW7: 882.56 ± 18.66 g), and 10 with low 7-week weight (BW7: 717.60 ± 21.15 g) from the F₁₅ generation of the deep intercross line were used for the quantitative real-time PCR (Supplementary Data 10). Total RNA was extracted using HiPure Universl miRNA Kit (Magen, Beijing, China) followed the manufacturer’s instructions. RNA extractions were treated with RNase free DNase I (Magen, Beijing, China) to remove potentially contaminating DNA. RNA was reverse-transcribed into cDNA with PrimeScript™ RT Master Mix (Perfect Real Time) (Takara, Japan) using 1 μg total RNA. The expression of specific gene (except miRNA) was quantified by real-time PCR using Biomark HD System with 96.96 dynamic arrays (Fluidigm Corporation, CA, USA) according to the “Fast Gene Expression Analysis Using EvaGreen on the BioMark™ or BioMark HD System” in the user guide. miRNA was reverse-transcribed into first-strand cDNA with TransScript miRNA First-Strand cDNA Synthesis SuperMix Kit (TransGen Biotech, Beijing, China). The expression of miRNA was quantified by real-time PCR using a Roche LightCycler® 480 instrument with the Roche LightCycler® 480 SYBR Green I Master Mix (Roche Applied Science, Indianapolis City, U.S.A.). gga-U6 was the reference gene for normalization for miRNA and GAPDH (glyceraldehyde 3-phosphate dehydrogenase) was the reference gene for other target genes. For all genes relative expression levels were calculated using the 2^−ΔΔCt method. Primers used for Q-PCR are listed in Supplementary Table 10, the universal miRNA qPCR Primer was provided by TransScript miRNA First-Strand cDNA Synthesis SuperMix Kit (TransGen Biotech, Beijing, China).

Genotyping the miR-15a-16 mutation

PCR sequencing was performed to identify reported variations of miR-15a-16 in all F₀ individuals. PCR amplification primers were forward: tcctcagtaaatacccacata and reverse: gaactgcattaactacaaaatc according the reference¹⁵. For more confirmation, another 25 unrelated individuals of HQLA and 25 unrelated individuals of HB were used for additional diagnostic tests.

Luciferase reporter assay

Both allelic forms (QQ and qq) of the 1010 bp open chromatin fragment were synthesized and cloned into the pGL3-basic and pGL3-promoter luciferase reporter vectors (Promega Corporation). The sequence and orientation of the inserts were confirmed by sequencing. For cell culture, DF1 (a chicken fibroblast cell line) cells were cultured in 24-well plates with DMEM (Gibico, Carlsbad, CA, USA) supplemented with 10% FBS (Gibico, Carlsbad, CA, USA) in a 37 °C incubator with 5% CO₂. Using Lipofectamine 2000 (Invitrogen, Carlsbad, CA) according to the manufacturer’s recommendations, we transfected per well cell in a 24 well plate (~80%–90% confluency) with a mixture comprising 720 ng of the pGL3 firefly luciferase reporter construct, 90 ng of the pRL-TK Renilla luciferase construct (Promega Corporation) and 3 µl Lipofectamine 2000. The luciferase assay was performed 48 h after transfection using the Dual Luciferase Reporter Assay system (Promega Corporation) and an Infinite F200 Luminometer (Tecan, Switzerland). Ratios of firefly luminescence/Renilla luminescence were calculated. For each test construct, one expression value was obtained as the average of three technical replicates in each plate (Supplementary Data 11).

ATAC-Seq

Duodenum of high and low body weight chickens at 7 weeks (n = 4) were used for ATAC-Seq. All experiments were performed based on the method of Jason D. Buenrostro⁶⁷. Sequencing was performed on Illumina HiSeq platform. Clean reads were obtained from the raw reads by removing the adaptor sequences. The clean reads were then aligned using the bwa program. Peak calling was conducted using macs2 software with cutoff q value < 0.05.

Statistics and reproducibility

Details number of biological samples or replicates can be found in the figure or figure legends. Statistical testing was performed in R, GraphPad Prism 8 and Excel. Different standards are used to considered as statistically significant for GWAS, gene expression and ATAC peak calling (see the detail method of each section). All analyses are reproducible with access to genetic data (see “Data availability”).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The new sequence reads have been deposited in the SRA database (SRA accession: PRJNA547951, PRJNA647930). We declare that the data generated in this study are available within the article and its Supplementary Data files.

References

Merveille, A. C. et al. CCDC39 is required for assembly of inner dynein arms and the dynein regulatory complex and for normal ciliary motility in humans and dogs. Nat. Genet. 43, 72–U98 (2011).
Article CAS PubMed Google Scholar
Imsland, F. et al. The Rose-comb mutation in chickens constitutes a structural rearrangement causing both altered comb morphology and defective sperm motility. PLoS Genet. 8, e1002775 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wu, W. et al. A single-nucleotide polymorphism causes smaller grain size and loss of seed shattering during African rice domestication. Nat. Plants 3, 17064 (2017).
Article CAS PubMed Google Scholar
Fisher, R. A. The correlation between relatives on the supposition of Mendelian inheritance. Trans. R. Soc. Edinb. 52, 399–433 (1919).
Article Google Scholar
Schielzeth, H., Rios Villamil, A. & Burri, R. Success and failure in replication of genotype-phenotype associations: How does replication help in understanding the genetic basis of phenotypic variation in outbred populations? Mol. Ecol. Resour. 18, 739–754 (2018).
Article PubMed Google Scholar
Messer, P. W. & Petrov, D. A. Population genomics of rapid adaptation by soft selective sweeps. Trends Ecol. Evol. 28, 659–669 (2013).
Article PubMed Google Scholar
Wellenreuther, M. Balancing selection maintains cryptic colour morphs. Mol. Ecol. 26, 6185–6188 (2017).
Article PubMed Google Scholar
Lango Allen, H. et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467, 832–838 (2010).
Article CAS PubMed PubMed Central Google Scholar
Van Laere, A. S. et al. A regulatory mutation in IGF2 causes a major QTL effect on muscle growth in the pig. Nature 425, 832–836 (2003).
Article PubMed CAS Google Scholar
Grobet, L. et al. A deletion in the bovine myostatin gene causes the double-muscled phenotype in cattle. Nat. Genet. 17, 71–74 (1997).
Article CAS PubMed Google Scholar
Clop, A. et al. A mutation creating a potential illegitimate microRNA target site in the myostatin gene affects muscularity in sheep. Nat. Genet. 38, 813–818 (2006).
Article CAS PubMed Google Scholar
Yuan, J. et al. Genome-wide association study reveals putative role of gga-miR-15a in controlling feed conversion ratio in layer chickens. BMC Genomics 18, 699 (2017).
Article PubMed PubMed Central CAS Google Scholar
Besnier, F. et al. Fine mapping and replication of QTL in outbred chicken advanced intercross lines. Genet Sel. Evol. 43, 3 (2011).
Article PubMed PubMed Central Google Scholar
Brandt, M., Ahsan, M., Honaker, C. F., Siegel, P. B. & Carlborg, O. Imputation-based fine-mapping suggests that most QTL in an outbred chicken advanced intercross body weight line are due to multiple, linked loci. G3-Genes Genomes Genet. 7, 119–128 (2017).
CAS Google Scholar
Jia, X., Lin, H., Nie, Q., Zhang, X. & Lamont, S. J. A short insertion mutation disrupts genesis of miR-16 and causes increased body weight in domesticated chicken. Sci. Rep. 6, 36433 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liu, X. et al. Fine-mapping quantitative trait loci for body weight and abdominal fat traits: effects of marker density and sample size. Poult. Sci. 87, 1314–1319 (2008).
Article CAS PubMed Google Scholar
Podisi, B. K., Knott, S. A., Burt, D. W. & Hocking, P. M. Comparative analysis of quantitative trait loci for body weight, growth rate and growth curve parameters from 3 to 72 weeks of age in female chickens of a broiler-layer cross. BMC Genet. 14, 22 (2013).
Article PubMed PubMed Central Google Scholar
Sheng, Z. et al. Genetic dissection of growth traits in a Chinese indigenous x commercial broiler chicken cross. BMC Genomics 14, 151 (2013).
Article PubMed PubMed Central Google Scholar
Uemoto, Y. et al. Genetic mapping of quantitative trait loci affecting growth and carcass traits in F-2 intercross chickens. Poult. Sci. 88, 477–482 (2009).
Article CAS PubMed Google Scholar
Xie, L. et al. Genome-wide association study identified a narrow chromosome 1 region associated with chicken growth traits. PLoS ONE 7, e30910 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zhang, H. et al. Fine-mapping of quantitative trait loci for body weight and bone traits and positional cloning of the RB1 gene in chicken. J. Anim. Breed. Genet. 128, 366–375 (2011).
Article CAS PubMed Google Scholar
Wang, M. S. et al. Genomic analyses reveal potential independent adaptation to high altitude in Tibetan chickens. Mol. Biol. Evol. 32, 1880–1889 (2015).
Article CAS PubMed Google Scholar
Ulfah, M. et al. Genetic features of red and green junglefowls and relationship with Indonesian native chickens Sumatera and Kedu Hitam. BMC Genomics 17, 320 (2016).
Article PubMed PubMed Central CAS Google Scholar
Wang, Y. et al. Optimized double-digest genotyping by sequencing (ddGBS) method with high-density SNP markers and high genotyping accuracy for chickens. PLoS ONE 12, e0179073 (2017).
Article PubMed PubMed Central CAS Google Scholar
Zan, Y. et al. Artificial selection response due to polygenic adaptation from a multilocus, multiallelic genetic architecture. Mol. Biol. Evol. 34, 2678–2689 (2017).
Article CAS PubMed Google Scholar
Sheng, Z. Y., Pettersson, M. E., Honaker, C. F., Siegel, P. B. & Carlborg, O. Standing genetic variation as a major contributor to adaptation in the Virginia chicken lines selection experiment. Genome Biol. 16, 1–12 (2015).
Castaneda, T. R., Tong, J., Datta, R., Culler, M. & Tschop, M. H. Ghrelin in the regulation of body weight and metabolism. Front. Neuroendocrinol. 31, 44–60 (2010).
Article CAS PubMed Google Scholar
Schulze, K. Imaging and modelling of digestion in the stomach and the duodenum. Neurogastroenterol. Motil. 18, 172–183 (2006).
Article CAS PubMed Google Scholar
Anderberg, R. H. et al. Glucagon-like peptide 1 and its analogs act in the dorsal raphe and modulate central serotonin to reduce appetite and body weight. Diabetes 66, 1062–1073 (2017).
Article CAS PubMed Google Scholar
Li, Y. R. & Keating, B. J. Trans-ethnic genome-wide association studies: advantages and challenges of mapping in diverse populations. Genome Med. 6, 91 (2014).
Jacobsson, L. et al. Many QTLs with minor additive effects are associated with a large difference in growth between two selection lines in chickens. Genet. Res. 86, 115–125 (2005).
Article CAS PubMed Google Scholar
Pettersson, M., Besnier, F., Siegel, P. B. & Carlborg, O. Replication and explorations of high-order epistasis using a large advanced intercross line pedigree. PLoS Genet. 7, e1002180 (2011).
Kroymann, J. & Mitchell-Olds, T. Epistasis and balanced polymorphism influencing complex trait variation. Nature 435, 95–98 (2005).
Article CAS PubMed Google Scholar
Pulito, C. et al. Metformin-induced ablation of microRNA 21-5p releases Sestrin-1 and CAB39L antitumoral activities. Cell Discov. 3, 17022 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hawley, S. A. et al. Complexes between the LKB1 tumor suppressor, STRAD alpha/beta and MO25 alpha/beta are upstream kinases in the AMP-activated protein kinase cascade. J. Biol. 2, 28 (2003).
Article PubMed PubMed Central Google Scholar
Carling, D., Sanders, M. J. & Woods, A. The regulation of AMP-activated protein kinase by upstream kinases. Int J. Obes. 32(Suppl 4), S55–S59 (2008).
Article CAS Google Scholar
Shaw, R. J. et al. The tumor suppressor LKB1 kinase directly activates AMP-activated kinase and regulates apoptosis in response to energy stress. Proc. Natl Acad. Sci. USA 101, 3329–3335 (2004).
Article CAS PubMed PubMed Central Google Scholar
Ferreri, A. J., Illerhaus, G., Zucca, E., Cavalli, F. & International Extranodal Lymphoma Study, G. Flows and flaws in primary central nervous system lymphoma. Nat. Rev. Clin. Oncol. 7, 10:1038/nrclinonc 2010 9-c2 (2010).
Milan, D. et al. A mutation in PRKAG3 associated with excess glycogen content in pig skeletal muscle. Science 288, 1248–1251 (2000).
Article CAS PubMed Google Scholar
Andersson, L. Molecular consequences of animal breeding. Curr. Opin. Genet. Dev. 23, 295–301 (2013).
Article CAS PubMed Google Scholar
Rubin, C. J. et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature 464, 587–591 (2010).
Article CAS PubMed Google Scholar
Lillie, M. et al. Genomic signatures of 60 years of bidirectional selection for 8-week body weight in chickens. Poult. Sci. 97, 781–790 (2018).
Article CAS PubMed Google Scholar
Innan, H. & Kim, Y. Pattern of polymorphism after strong artificial selection in a domestication event. Proc. Natl Acad. Sci. USA 101, 10667–10672 (2004).
Article CAS PubMed PubMed Central Google Scholar
Groenen, M. A. et al. A high-density SNP-based linkage map of the chicken genome reveals sequence features correlated with recombination rate. Genome Res. 19, 510–519 (2009).
Article CAS PubMed PubMed Central Google Scholar
Tengvall, K. et al. Genome-wide analysis in German shepherd dogs reveals association of a locus on CFA 27 with atopic dermatitis. PLoS Genet. 9, e1003475 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wu, Q. et al. Long-term balancing selection contributes to adaptation in Arabidopsis and its relatives. Genome Biol. 18, 217 (2017).
Article PubMed PubMed Central CAS Google Scholar
Caspermeyer, J. Eat more chicken: scientists hone in on genetics behind chicken weight adaptation. Mol. Biol. Evol. 34, 2730–2731 (2017).
PubMed Google Scholar
Murray, G. G. R. et al. Natural selection shaped the rise and fall of passenger pigeon genomic diversity. Science 358, 951–954 (2017).
Article CAS PubMed Google Scholar
Wang, M. S. et al. Positive selection rather than relaxation of functional constraint drives the evolution of vision during chicken domestication. Cell Res. 26, 556–573 (2016).
Article CAS PubMed PubMed Central Google Scholar
Groenen, M. A. et al. The development and characterization of a 60K SNP chip for chicken. BMC Genomics 12, 274 (2011).
Article PubMed PubMed Central Google Scholar
Glaubitz, J. C. et al. TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS ONE 9, e90346 (2014).
Article PubMed PubMed Central CAS Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ye, K., Schulz, M. H., Long, Q., Apweiler, R. & Ning, Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25, 2865–2871 (2009).
Article CAS PubMed PubMed Central Google Scholar
Abyzov, A., Urban, A. E., Snyder, M. & Gerstein, M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 21, 974–984 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 46, 100–106 (2014).
Article PubMed PubMed Central CAS Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Browning, B. L. & Browning, S. R. Improving the accuracy and efficiency of identity-by-descent detection in population data. Genetics 194, 459–471 (2013).
Article PubMed PubMed Central Google Scholar
Szpiech, Z. A. & Hernandez, R. D. selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Mol. Biol. Evol. 31, 2824–2827 (2014).
Article CAS PubMed PubMed Central Google Scholar
Nei, M. & Tajima, F. DNA polymorphism detectable by restriction endonucleases. Genetics 97, 145–163 (1981).
CAS PubMed PubMed Central Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Article CAS PubMed PubMed Central Google Scholar
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 44, W242–W245 (2016).
Article CAS PubMed PubMed Central Google Scholar
Buenrostro, J. D., Wu, B., Chang, H. Y. & Greenleaf, W. J. ATAC-seq: a method for assaying chromatin accessibility genome-wide. Curr. Protoc. Mol. Biol. 109, 21 29 1–21 29 9 (2015).
Article Google Scholar

Download references

Acknowledgements

Jiangli Ren, Zhuolin Huang, and Ying Guo are acknowledged for their valuable contributions during DNA Libraries generation. We also thank Da Yang and Yanjun Zan for useful input on the statistical analysis and the manuscript. This study was financially supported by the National Natural Science Foundation of China (NSFC, 31672411, 31961133003 to X.H. and 31902143 to Y.W.), the 948 Program of the Ministry of Agriculture of China (2012-G1[4] to X.H.), the Earmarked Fund for Modern Agri-industry Technology Research System (Grant No. CARS-41 to D.S.).

Author information

These authors contributed equally:Yuzhe Wang, Xuemin Cao.

Authors and Affiliations

State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
Yuzhe Wang, Xuemin Cao, Zheya Sheng, Chunyuan Zhang, Cheng Bian, Chungang Feng, Jinxiu Li, Fei Gao, Yiqiang Zhao, Ziqin Jiang, Xiaoxiang Hu & Ning Li
College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
Yuzhe Wang & Fei Gao
State Key Laboratory of Livestock and Poultry Breeding, Guangdong Key Laboratory of Animal Breeding and Nutrition, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
Chenglong Luo, Hao Qu & Dingming Shu
Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
Zheya Sheng
Beijing Advanced Innovation Center for Food Nutrition and Human Health, China Agricultural University, Beijing, 100193, China
Chunyuan Zhang, Fei Gao & Yiqiang Zhao
Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, SE-751 23, Sweden
Örjan Carlborg

Authors

Yuzhe Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xuemin Cao
View author publications
You can also search for this author in PubMed Google Scholar
Chenglong Luo
View author publications
You can also search for this author in PubMed Google Scholar
Zheya Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Chunyuan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Bian
View author publications
You can also search for this author in PubMed Google Scholar
Chungang Feng
View author publications
You can also search for this author in PubMed Google Scholar
Jinxiu Li
View author publications
You can also search for this author in PubMed Google Scholar
Fei Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yiqiang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Ziqin Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Hao Qu
View author publications
You can also search for this author in PubMed Google Scholar
Dingming Shu
View author publications
You can also search for this author in PubMed Google Scholar
Örjan Carlborg
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoxiang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Ning Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.H., N.L., D.S., and H.Q. initiated the study and designed the project with Ö.C.; C.L. developed, planned and bred the HQLA-HB advanced intercross chicken lines; Y.W., X.C., C.L., C.F., J.L., F.G., and Z.J. designed, planned, bred, bled, phenotyped and extracted DNA; Y.W., X.C., and Z.S. performed the quality control of the genotype data; X.C. performed the gene expression analyses; Ö.C. and Y.Z. designed the statistical analyses; Y.W., C.Z., and C.B. performed the data analyses; Y.W. and Ö.C. summarized the results and wrote the manuscript.

Corresponding authors

Correspondence to Dingming Shu, Örjan Carlborg or Xiaoxiang Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Y., Cao, X., Luo, C. et al. Multiple ancestral haplotypes harboring regulatory mutations cumulatively contribute to a QTL affecting chicken growth traits. Commun Biol 3, 472 (2020). https://doi.org/10.1038/s42003-020-01199-3

Download citation

Received: 23 December 2019
Accepted: 03 August 2020
Published: 28 August 2020
DOI: https://doi.org/10.1038/s42003-020-01199-3

This article is cited by

Phenotypic divergence between broiler and layer chicken lines is regulated at the molecular level during development
- Renata Erbert Contriciani
- Carla Vermeulen Carvalho Grade
- Lúcia Elvira Alvares
BMC Genomics (2024)
Whole-genome variants dataset of 209 local chickens from China
- Xiaodong Tan
- Jiawen Zhang
- Deqian Wang
Scientific Data (2024)
Serum metabolic profile and metabolome genome-wide association study in chicken
- Jing Tian
- Xiaoning Zhu
- Xiaoxiang Hu
Journal of Animal Science and Biotechnology (2023)
Genomic prediction based on selective linkage disequilibrium pruning of low-coverage whole-genome sequence variants in a pure Duroc population
- Di Zhu
- Yiqiang Zhao
- Xiaoxiang Hu
Genetics Selection Evolution (2023)
QTL Mapping and Genetic Map for the Ornamental Sunflower in China
- Jixia Liu
- Junjian Shan
- Ping Wang
Plant Molecular Biology Reporter (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.