Combining multiple genetic estimates of $N_e$

Robin S. Waples

Published on Jun 27, 2025


Abstract

Researchers often use more than one genetic method to estimate contemporary effective population size ($N_e$), but few formally combine multiple estimates despite the potential benefits for increasing precision. Maximizing these benefits requires an optimal, inverse-variance weighting scheme. A precondition for combining estimates is that they must estimate the same parameter, which can be appropriate either for estimates using the same method applied to different time periods, or for estimates using different methods applied to the same time period. Previous approaches focused on $\mathrm{var}(\hat{N}_e)$ for weighting, but that is problematic because $\hat{N}_e$ is highly skewed and can be infinitely large. A new approach is described here using weights that are inversely proportional to $\mathrm{var}(1/\hat{N}_e)$. The quantity $1/\hat{N}_e$ is the drift signal that $N_e$-estimation methods respond to, and its distribution is close to normal even when $\hat{N}_e$ assumes extreme values. Benefits are maximized under three general conditions: the estimators have approximately equal variances; they are uncorrelated or have weak positive correlations; and individual estimates have low precision (i.e., if data are limited and/or true $N_e$ is large).
Analytical and numerical results demonstrate that: (1) existing theory allows robust estimates of $\mathrm{var}(1/\hat{N}_e)$ for the temporal and LD methods, which provide independent information about $N_e$, and both results facilitate optimally combining those methods; (2) estimates for the LD and sibship methods are essentially uncorrelated when data are limited but can be strongly positively correlated in genomics-scale datasets. General theory predicting $\mathrm{var}(1/\hat{N}_e)$ for the sibship method is lacking, but values for specific scenarios have been published.
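
The weighting idea in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the function names `combine_ne_estimates` and `equal_variance_pair` are hypothetical, the variances of the reciprocal estimates are assumed to be supplied by the user (for example from published theory or simulations), and the simple variance formula in the first function assumes the estimators are uncorrelated.

```python
import math


def combine_ne_estimates(ne_hats, var_inv_ne):
    """Inverse-variance weighting on the 1/N_e scale (illustrative sketch).

    ne_hats    : point estimates of N_e; math.inf is allowed (1/inf = 0).
    var_inv_ne : assumed variance of each reciprocal estimate 1/N_e-hat.
    Returns (combined N_e, combined 1/N_e, variance of combined 1/N_e).
    """
    if not ne_hats or len(ne_hats) != len(var_inv_ne):
        raise ValueError("need one positive variance per estimate")
    if any(v <= 0 for v in var_inv_ne):
        raise ValueError("variances must be positive")

    # Work with the reciprocal of each estimate (the drift signal).
    inv = [1.0 / n for n in ne_hats]
    # Weights are inversely proportional to var(1/N_e-hat).
    weights = [1.0 / v for v in var_inv_ne]
    total = sum(weights)

    inv_combined = sum(w * x for w, x in zip(weights, inv)) / total
    # Assumption: estimators are uncorrelated, so variances of the
    # weighted terms simply add.
    var_combined = 1.0 / total
    # A non-positive combined signal is reported as an infinite estimate.
    ne_combined = math.inf if inv_combined <= 0 else 1.0 / inv_combined
    return ne_combined, inv_combined, var_combined


def equal_variance_pair(v, rho):
    """Variance of the equal-weight mean of two estimators that share
    variance v and have correlation rho: v * (1 + rho) / 2."""
    return v * (1.0 + rho) / 2.0


ne, inv, var = combine_ne_estimates([100.0, 200.0], [4e-6, 4e-6])
print(round(ne, 1), inv, var)
print(equal_variance_pair(4e-6, 0.0), equal_variance_pair(4e-6, 0.8))
```

With equal weights, the combined estimate is the harmonic mean of the individual estimates, and an infinite estimate contributes a reciprocal of zero instead of dominating the result. The second helper shows why correlation matters: for two estimators with equal variance, the variance is halved when they are uncorrelated, and the gain shrinks toward zero as the correlation approaches 1.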
