A Faster and More Accurate Algorithm for Calculating Population Genetics Statistics Requiring Sums of Stirling Numbers of the First Kind.
G3: Genes, Genomes, Genetics (IF 2.1), Pub Date: 2020-10-27, DOI: 10.1534/g3.120.401575
Swaine L Chen, Nico M Temme

Ewens's sampling formula is a foundational theoretical result that connects probability and number theory with molecular genetics and molecular evolution; it was the analytical result required for testing the neutral theory of evolution, and has since been used, directly or indirectly, in a number of population genetics statistics. Ewens's sampling formula, in turn, is deeply connected to Stirling numbers of the first kind. Here, we explore the cumulative distribution function of these Stirling numbers, which enables a single direct estimate of the sum, using representations in terms of the incomplete beta function. This estimator enables an improved method for calculating an asymptotic estimate for one useful statistic, Fu's Fs. By reducing the calculation from a sum of terms involving Stirling numbers to a single estimate, we simultaneously improve accuracy and dramatically increase speed.
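For orientation, the term-by-term sum that the abstract refers to can be sketched as follows. This is not the authors' beta-function estimator; it is only the direct calculation that the estimator is meant to replace, written with exact rational arithmetic. It assumes the standard definitions: under Ewens's sampling formula the probability of observing k alleles in a sample of size n is |S(n,k)| θ^k divided by the rising factorial θ(θ+1)...(θ+n-1), and Fu's Fs is ln(S'/(1-S')), where S' is the probability of observing at least k_obs alleles. The function names and the example inputs are illustrative, not taken from the paper.

```python
from fractions import Fraction
from math import log


def unsigned_stirling_first_row(n):
    """Return [c(n,0), ..., c(n,n)], the unsigned Stirling numbers of the first kind."""
    row = [1]  # c(0,0) = 1
    for m in range(1, n + 1):
        new = [0] * (m + 1)
        for k in range(m + 1):
            # Recurrence: c(m,k) = c(m-1,k-1) + (m-1) * c(m-1,k)
            left = row[k - 1] if k >= 1 else 0
            same = row[k] if k < m else 0
            new[k] = left + (m - 1) * same
        row = new
    return row


def allele_count_pmf(n, theta):
    """P(K = k | n, theta) for k = 0..n, as exact fractions."""
    theta = Fraction(theta)
    rising = Fraction(1)
    for i in range(n):
        rising *= theta + i  # theta * (theta+1) * ... * (theta+n-1)
    row = unsigned_stirling_first_row(n)
    return [row[k] * theta**k / rising for k in range(n + 1)]


def fu_fs(n, theta, k_obs):
    """Fu's Fs by direct summation; this sketch assumes 2 <= k_obs <= n."""
    pmf = allele_count_pmf(n, theta)
    s_prime = sum(pmf[k_obs:])  # P(K >= k_obs)
    return log(float(s_prime / (1 - s_prime)))


# Illustrative inputs only: sample size 4, theta = 1, 3 observed alleles.
print(fu_fs(4, 1, 3))
```

The cost of this direct approach grows quickly with n, because the Stirling numbers become very large integers and every term from k_obs to n must be summed; that is the expense a single direct estimate of the sum avoids.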




Updated: 2020-11-06