Quantitative analysis of population-scale family trees with millions of relatives
Science (IF 56.9), Pub Date: 2018-03-01, DOI: 10.1126/science.aam9309
Joanna Kaplanis 1, 2 , Assaf Gordon 1, 2 , Tal Shor 3, 4 , Omer Weissbrod 5 , Dan Geiger 4 , Mary Wahl 1, 2, 6 , Michael Gershovits 2 , Barak Markus 2 , Mona Sheikh 2 , Melissa Gymrek 1, 2, 7, 8, 9 , Gaurav Bhatia 10, 11 , Daniel G. MacArthur 7, 9, 10 , Alkes L. Price 10, 11, 12 , Yaniv Erlich 1, 2, 3, 13, 14

Quantitative analysis of millions of relatives

Human relationships, as documented by family trees, can elucidate the heritability of a host of medical and biological parameters. Kaplanis et al. collected 86 million publicly available profiles from a crowd-sourced genealogy website and used them to examine the genetic architecture of human longevity and migration patterns (see the Perspective by Lussier and Keinan). Various models of inheritance suggested that life span is predominantly attributable to additive genetic effects, with a smaller component from dominant genetic inheritance. The data also suggested that relatedness between individuals is less attributable to advances in human transportation than to cultural changes.

Science, this issue p. 171; see also p. 153

Crowdsourced genealogy data are useful for testing genetic hypotheses.

Family trees have vast applications in fields as diverse as genetics, anthropology, and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. We collected 86 million profiles from publicly available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of human longevity and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other data sets with our resource.

Updated: 2018-03-01