A sheep pangenome reveals the spectrum of structural variations and their effects on tail phenotypes

  1. Yu Jiang1,14
  1. 1Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China;
  2. 2State Key Laboratory of Sheep Genetic Improvement and Healthy Production, Xinjiang Academy of Agricultural and Reclamation Sciences, Shihezi, Xinjiang 832000, China;
  3. 3Department of Animal Science, Fars Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education & Extension Organization (AREEO), Shiraz 7155863511, Iran;
  4. 4Department of Animal Science, Shahid Bahonar University of Kerman, Kerman 7616914111, Iran;
  5. 5Reproductive Biotechnology Research Center, Avicenna Research Institute, ACECR, Tehran 1983969412, Iran;
  6. 6State Key Laboratory of Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China;
  7. 7Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China;
  8. 8Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, 3052 Victoria, Australia;
  9. 9Sherbrooke Research and Development Centre, Agriculture and Agri-Food Canada, Sherbrooke, Quebec J1M 0C8, Canada;
  10. 10Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, United Kingdom;
  11. 11Animal Genomics and Improvement Laboratory, USDA-ARS, Beltsville, Maryland 20705, USA;
  12. 12Faculty of Veterinary Medicine, Utrecht University, Utrecht 3508 TD, The Netherlands;
  13. 13College of Coastal Agricultural Sciences, Guangdong Ocean University, Zhanjiang 524088, China;
  14. 14Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
  1. 15 These authors contributed equally to this work.

  • Corresponding authors: yu.jiang{at}nwafu.edu.cn, shangquangan{at}gdou.edu.cn
  • Abstract

    Structural variations (SVs) are a major contributor to genetic diversity and phenotypic variations, but their prevalence and functions in domestic animals are largely unexplored. Here we generated high-quality genome assemblies for 15 individuals from genetically diverse sheep breeds using Pacific Biosciences (PacBio) high-fidelity sequencing, discovering 130.3 Mb nonreference sequences, from which 588 genes were annotated. A total of 149,158 biallelic insertions/deletions, 6531 divergent alleles, and 14,707 multiallelic variations with precise breakpoints were discovered. The SV spectrum is characterized by an excess of derived insertions compared to deletions (94,422 vs. 33,571), suggesting recent active LINE expansions in sheep. Nearly half of the SVs display low to moderate linkage disequilibrium with surrounding single-nucleotide polymorphisms (SNPs) and most SVs cannot be tagged by SNP probes from the widely used ovine 50K SNP chip. We identified 865 population-stratified SVs including 122 SVs possibly derived in the domestication process among 690 individuals from sheep breeds worldwide. A novel 168-bp insertion in the 5′ untranslated region (5′ UTR) of HOXB13 is found at high frequency in long-tailed sheep. Further genome-wide association study and gene expression analyses suggest that this mutation is causative for the long-tail trait. In summary, we have developed a panel of high-quality de novo assemblies and present a catalog of structural variations in sheep. Our data capture abundant candidate functional variations that were previously unexplored and provide a fundamental resource for understanding trait biology in sheep.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.277372.122.

    • Freely available online through the Genome Research Open Access option.

    • Received September 30, 2022.
    • Accepted February 21, 2023.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server