- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Luikart, G.
- Articles by Cornuet, J.-M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Luikart, G.
- Articles by Cornuet, J.-M.
Estimating the Effective Number of Breeders From Heterozygote Excess in Progeny
Gordon Luikarta and Jean-Marie Cornuetba Laboratoire de Biologie des Populations d'Altitude, Université Joseph Fourier, CNRS, F-38041 Grenoble, Cedex 9, France
b Laboratoire de Modèlisation et de Biologie Evolutive, INRA/URLB, 34090 Montpellier Cedex, France
Corresponding author: Gordon Luikart, Laboratoire de Biologie des Populations d’Altitude, CNRS, UMR 5553, Université Joseph Fourier, F-38041 Grenoble, Cedex 9, France., gluikart{at}ujf-grenoble.fr (E-mail)
Communicating editor: G. B. GOLDING
| ABSTRACT |
|---|
The heterozygote-excess method is a recently published method for estimating the effective population size (Ne). It is based on the following principle: When the effective number of breeders (Neb) in a population is small, the allele frequencies will (by chance) be different in males and females, which causes an excess of heterozygotes in the progeny with respect to Hardy-Weinberg equilibrium expectations. We evaluate the accuracy and precision of the heterozygote-excess method using empirical and simulated data sets from polygamous, polygynous, and monogamous mating systems and by using realistic sample sizes of individuals (15120) and loci (530) with varying levels of polymorphism. The method gave nearly unbiased estimates of Neb under all three mating systems. However, the confidence intervals on the point estimates of Neb were sufficiently small (and hence the heterozygote-excess method useful) only in polygamous and polygynous populations that were produced by <10 effective breeders, unless samples included > ~60 individuals and 20 multiallelic loci.
THE effective population size (Ne) is an important parameter in evolutionary genetics and conservation biology because it influences the rate of inbreeding and loss of genetic variation. It also influences the efficiency of natural selection in maintaining beneficial alleles and purging deleterious ones. For example, when Ne is very small, genetic drift will often be too strong for natural selection to efficiently maintain or purge alleles. Unfortunately, Ne has proven very difficult to estimate in natural populations (![]()
![]()
The (current) effective population size can be estimated using genetic data and the four following statistical methods: the loss of heterozygosity method (e.g., ![]()
![]()
![]()
![]()
![]()
![]()
![]()
Here, we extend the evaluation of ![]()
eb from finite samples of progeny, (ii) quantify the bias of
eb, (iii) determine the number of loci and individuals that must be sampled to achieve precise estimates of Neb, and (iv) test if monogamy generates an interfamily Wahlund effect (i.e., heterozygote deficiency) that counteracts the heterozygote excess generated by small Neb. To conduct these evaluations, we use data from simulations and natural populations. We focus on populations with a small Neb (4100) because a heterozygote excess is generated only when Neb is small.
| PRINCIPLE AND METHODS |
|---|
The principle of the heterozygote-excess method is as follows: When the number of breeders is small, the allele frequencies in males and females will be different due to binomial sampling error. This difference generates an excess of heterozygotes in the progeny relative to the proportion of heterozygotes expected under Hardy-Weinberg equilibrium (![]()
![]()
![]()

where n is the number of haploid genomes in the mothers or fathers and p and q are the frequencies of alleles at a locus, in the population from which the parents were drawn. Because the excess of heterozygotes expected in progeny increases as the number of parents decreases, ![]()
![]()
Now
eb can be estimated as follows:

The ratio Hexp/(Hobs - Hexp) is the reciprocal of ![]()
![]()

The following more exact equation was derived by ![]()
![]() |
(1) |
The above expression is for two alleles. For a multiallelic locus, one should average D over all k alleles as
![]() |
(2) |

Hobs[i] is the total proportion of heterozygotes having allele i, and Hexp[i] is simply 2pi(1 - pi) · 2N/(2N - 1), where i is the ith allele.
We note that when the sample size of individuals is finite, Hexp must be estimated using the following unbiased estimator of 2pq (![]()
eb will give a biased estimate Neb (especially when N is small). In nature, Neb can range from only two to near infinity and
eb can be negative (e.g., in the case of an overall deficit of heterozygotes). In our analyses, we considered Neb values as infinite if
eb was negative or >10,000 (an arbitrary but reasonably large value). If
eb is infinite, it simply means that the heterozygote-excess signal is obscured by the noise (i.e., sampling error) associated with small samples of loci or individuals.
The simulation model that we used to evaluate the heterozygote-excess method has three main steps. First, it assigns genotypes to the parental generation using random numbers and a predefined allele frequency distribution. We modeled loci with 2, 3, and 5 alleles and the following three allele frequency distributions: equal (e.g., H = 0.8, for 5 alleles), triangular (e.g., 0.33, 0.30, 0.20, 0.13, 0.07, and H = 0.74; see ![]()
, where Nm and Nf are the number of breeding males and females, respectively; ![]()
eb from a random sample of 15120 progeny using Equation 1 and Equation 2. These three steps were repeated 5002000 times per combination of the following parameters: Neb, sample size of individuals and loci, allele number, allele frequency distribution, and mating system.
| RESULTS AND DISCUSSION |
|---|
Bias:
Our simulations suggest that the heterozygote-excess method slightly overestimates the Neb when using finite samples and Nei's unbiased estimator of Hexp. When Neb was 4, 20, and 100, the harmonic mean estimates of Neb (from 500 simulations) were 4.1, 22.2, and 103.4, respectively, when sampling 30 individuals and 10 loci (five alleles/locus) with triangular allele frequency distributions and a polygamous breeding system (Table 1). When more individuals were sampled, the bias was slightly lower. Harmonic mean estimates of Neb were nearly identical for loci with two, three, and five alleles and with widely different allele frequency distributions (Table 1; Figure 1, horizontal bars inside box plots). The largest bias occurred under the polygynous mating system (e.g., =
eb 5.0 when true Neb
4; Figure 1). This bias was not surprising given the assumption of the heterozygote-excess method that there are equal numbers of male and female parents. Still, the bias was small enough not to substantially diminish the usefulness of the heterozygote-excess method.
|
|
The harmonic mean Neb estimates were similar for the monogamous and polygamous mating systems (
eb was 4.5 and 4.1, respectively, when Neb was 4.0; Figure 1). This suggests that there is little impact of an interfamily Wahlund effect on the accuracy of Neb estimates. When only approximately two to three large families exist (e.g., under monogamy with Neb equaling 46), sampling across families does not generate a large heterozygote deficiency (i.e., Wahlund effect). However, a Wahlund heterozygote deficiency is expected when many families exist (A. PUDOVKIN, personal communication). Such a deficiency would at least partially cancel the heterozygote excess caused by small Neb, and thereby cause biased (high) estimates of Neb. Although monogamy did not cause a large bias, it did substantially increase the variance among Neb estimates (see below and Figure 1).
Precision:
To quantify the precision of the Neb estimates, we used the Student's t-distribution to compute a 95% confidence interval for each
eb (as in ![]()
eb contained the true Neb in 9296% of the simulation estimates of Neb when using loci with five alleles (Table 1). As expected, approximately half of the confidence intervals that did not contain the true Neb were too low (L) and half were too high (H). This suggests that the Student's t-distribution is useful for computing confidence intervals, even though Neb is not exactly normally distributed. For loci with three alleles or for monogamous mating systems, the confidence intervals also contained the true Neb ~9296% of the time. However, when using loci with only two alleles, the confidence intervals were generally too narrow and contained the true Neb in only 8389% of the simulation estimates of Neb (Table 1). Thus, confidence intervals must be interpreted with caution or computed by alternative methods (e.g., bootstrap resampling) when using loci with only two alleles (e.g., many allozyme loci).
Under extreme polygyny (e.g., one male mating with 99 females), the confidence intervals were often too high. For example, when Neb was four, ~25% of the 500 simulated confidence intervals were slightly higher than the true Neb, and none were lower than Neb. Although the confidence intervals were often too high, they were also much narrower under polygyny than under monogamy or polygamy (Figure 1). This narrowness substantially increases the usefulness of the heterozygote-excess method under polygyny. Thus, under extreme polygamy, the heterozygote-excess method will be useful for detecting a small Neb but will be less useful for quantifying the exact size of Neb.
To determine the minimum number of loci and individuals that must be sampled to achieve a high probability of obtaining narrow confidence intervals, we plotted the distribution of the (upper and lower) 95% confidence limits obtained from 500 simulation estimates of Neb. When the true Neb is only 4, at least 10 loci (with five alleles) and 30 individuals must be sampled to achieve an 80% probability of obtaining an upper 95% confidence limit < ~20 (Figure 2B). In other words, the statistical power is 0.80 when testing the null hypothesis that the true Neb
20 (and when the true Neb is actually only 4). The power will be slightly higher when using a one-tailed test and the null hypothesis that true Neb
20 (the alternative hypothesis is Neb < 20).
|
These results show that the heterozygote-excess method is sufficiently powerful for detecting a small Neb when sampling reasonable numbers of individuals and loci with five alleles. Such results are important for conservation biology and the management of captive and natural populations. The precision of
eb is often increased more by analyzing a larger number of loci than by sampling more individuals. Doubling the number of loci from 10 to 20 (compare the first box plot in Figure 2B and Figure C) generally reduces confidence intervals more than doubling the number of individuals sampled from 15 to 30 (compare the first two box plots in Figure 2B). However, the benefit from doubling the number of loci depends on the number and frequency of alleles (Figure 1).
When true Neb is 10, we must sample >20 polymorphic loci and 60 individuals to have an 80% probability of obtaining confidence intervals that are <50 (and to have a 95% probability of obtaining confidence intervals <100; Figure 2E). When the true Neb is 100, >80% of the confidence intervals include infinity, even when sampling 120 individuals and 20 loci with five alleles (data not shown). Clearly, when Neb is >10, very large samples of loci and individuals are required to achieve a high probability of obtaining reasonably small confidence intervals. Thus, the main limitation of the usefulness of the heterozygote-excess method is its poor precision, i.e., its wide confidence intervals. The confidence intervals are generally too wide for the method to be useful when using diallelic loci, loci with mostly rare alleles, or when studying strictly monogamous species (Figure 1).
When applied to data from natural populations, the heterozygote-excess method often gave estimates of Neb equal to infinity. For example,
eb was infinity in 5 of 10 cohorts for which the total number of parents was known (or estimated) to be small (i.e., three to a few dozen). Further, only 2 of the 10 estimates gave 95% confidence intervals that did not include infinity as an upper limit (Table 2). This poor precision is not surprising in that only 59 polymorphic loci were analyzed, and only 1125 progeny were sampled. Additional empirical evaluations are needed, but it is extremely difficult to find large data sets containing individuals produced from a known number of parents.
|
One potential limitation of the method is the requirement for random, representative sampling. For example, if a sample contains only one or few families (due to sampling error) then we could obtain a very low Neb estimate, even though many families actually exist and Neb is large. Another obvious limitation is that the method will work only in species with separate sexes. The method will work for haplo-diploid species (e.g., Hymenopterans), but will require the derivation of equations different from those presented here.
Four approaches may help circumvent the problem of poor precision. First, one can compute 80% confidence intervals (in addition to 95% confidence intervals). This will reduce the likelihood that the upper confidence limit will include infinity and be uninformative. Second, one could explore alternative methods for computing confidence intervals (e.g., nonparametric methods such as bootstrap resampling of loci). Third, one could combine estimates of Neb from several generations or cohorts by computing the harmonic mean of
eb over the multiple generations or cohorts. This can reduce the probability of obtaining infinity for
eb because, when computing a harmonic mean, the low estimates carry far more weight than high ones (e.g., infinity). Finally, one can potentially combine estimates of Ne obtained from several independent Ne estimators by computing the harmonic mean of the Ne estimates. Other promising Ne estimators include those based on gametic disequilibrium (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank I. Till-Bottraud and two anonymous reviewers for helpful comments on earlier versions of this manuscript, M. Schwartz for sharing unpublished simulation data, and especially P. Spruell, F. W. Allendorf, A. Estoup, and M. Brown for providing data sets. Support was provided by the French "Bureau Ressources Génétiques," a postdoctoral fellowship (for G.L.) from National Science Foundation/North Atlantic Treaty Organization, and the Laboratoire de Biologie des Populations d'Altitude.
Manuscript received June 17, 1998; Accepted for publication November 20, 1998.
| LITERATURE CITED |
|---|
CROW, J. F., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Burgess Publishing, Minneapolis.
ESTOUP, A., F. ROUSSET, Y. MICHALAKIS, J.-M. CORNUET, and M. ADRIAMANGA et al., 1998 Comparative analysis of microsatellite and allozyme markers: a case study investigating microgeographic differentiation in brown trout (Salmo trutta). Mol. Ecol. 7:339-353[Medline].
FALCONER, D. S., 1989 Introduction to Quantitative Genetics, Ed. 3. Longman Scientific & Technical with John Wiley & Sons, New York.
FRANKHAM, R., 1995 Effective population size/adult population size ratios in wildlife: a review. Genet. Res. 66:95-106.
HARRIS, R. B. and F. W. ALLENDORF, 1989 Genetically effective population size of large mammals: an assessment of estimators. Conserv. Biol. 3:181-191.
HILL, W. G., 1981 Estimation of linkage disequilibrium in randomly mating populations. Heredity 33:229-239.
KRIMBAS, C. B. and S. TSASKAS, 1971 The genetic of Dacus oleae V. Changes of esterase polymorphism in a natural population following insecticide controlSelection or drift? Evolution 25:454-460.
LUIKART, G., 1997 Usefulness of molecular markers for detecting population bottlenecks and monitoring genetic change. Ph.D. Thesis, University of Montana, Missoula, MT.
LUIKART, G., J.-M. CORNUET, and F. W. ALLENDORF, 1998 Temporal changes in allele frequencies provide estimates of population bottleneck size. Conserv. Biol. 89:238-247.
NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.
PUDOVKIN, A. I., D. V. ZAYKIN, and D. HEDGECOCK, 1996 On the potential for estimating the effective number of breeders from heterozygote-excess in progeny. Genetics 144:383-387[Abstract].
RASMUSSEN, D. I., 1979 Sibling clusters and gene frequencies. Am. Nat. 113:948-951.
ROBERTSON, A., 1965 The interpretation of genotypic ratios in domestic animal populations. Anim. Prod. 7:319-324.
SELANDER, R. K., 1970 Behavior and genetic variation in natural populations. Am. Zool. 10:53-66[Medline].
WAPLES, R. S., 1989 A generalized approach for estimating effective population size from temporal changes in allele frequency. Genetics 121:379-391
WAPLES, R. S., 1991 Genetic methods for estimating the effective size of Cetacean populations, pp. 279300 in Genetic Ecology of Whales and Dolphins, Special Issue 13, edited by A. R. HOELZEL. International Whale Commission, London.
This article has been cited by other articles:
![]() |
O. L. Zhdanova and A. I. Pudovkin Nb_HetEx: A Program to Estimate the Effective Number of Breeders J. Hered., November 1, 2008; 99(6): 694 - 695. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Beaumont Estimation of Population Growth or Decline in Genetically Monitored Populations Genetics, July 1, 2003; 164(3): 1139 - 1160. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wang and M. C. Whitlock Estimating Effective Population Size and Migration Rates From Genetic Samples Over Space and Time Genetics, January 1, 2003; 163(1): 429 - 446. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Berthier, M. A. Beaumont, J.-M. Cornuet, and G. Luikart Likelihood-Based Estimation of the Effective Population Size Using Temporal Changes in Allele Frequencies: A Genealogical Approach Genetics, February 1, 2002; 160(2): 741 - 751. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Vitalis and D. Couvet Estimation of Effective Population Size and Migration Rate From One- and Two-Locus Identity Measures Genetics, February 1, 2001; 157(2): 911 - 925. [Abstract] [Full Text] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Luikart, G.
- Articles by Cornuet, J.-M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Luikart, G.
- Articles by Cornuet, J.-M.



4).

