- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Andolfatto, P.
- Articles by Przeworski, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Andolfatto, P.
- Articles by Przeworski, M.
A Genome-Wide Departure From the Standard Neutral Model in Natural Populations of Drosophila
Peter Andolfatto1,a and Molly Przeworski1,ba Institute for Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
b Committee on Evolutionary Biology, University of Chicago, Chicago, Illinois 60637
Corresponding author: Molly Przeworski, Statistics Department, University of Oxford, 1 South Parks Rd., Oxford OX1 3TG, United Kingdom., molly{at}stats.ox.ac.uk (E-mail)
Communicating editor: A. G. CLARK
| ABSTRACT |
|---|
We analyze nucleotide polymorphism data for a large number of loci in areas of normal to high recombination in Drosophila melanogaster and D. simulans (24 and 16 loci, respectively). We find a genome-wide, systematic departure from the neutral expectation for a panmictic population at equilibrium in natural populations of both species. The distribution of sequence-based estimates of 2Nc across loci is inconsistent with the assumptions of the standard neutral theory, given the observed levels of nucleotide diversity and accepted values for recombination and mutation rates. Under these assumptions, most estimates of 2Nc are severalfold too low; in other words, both species exhibit greater intralocus linkage disequilibrium than expected. Variation in recombination or mutation rates is not sufficient to account for the excess of linkage disequilibrium. While an equilibrium island model does not seem to account for the data, more complicated forms of population structure may. A proper test of alternative demographic models will require loci to be sampled in a more consistent fashion.
THE standard assumptions of the neutral theory of molecular variation (![]()
= 4Nµ, can be estimated from sequence polymorphism data (where N is the effective population size of the species, c the rate of recombination per base pair per generation, and µ the rate of mutation per base pair per generation). The ratio
/
is then an estimate of the recombination rate between adjacent bases, scaled to the mutation rate per base pair. Estimates of these same parameters, c and µ, can be obtained from genetic and physical map data and from nucleotide divergence between closely related species, respectively. This allows for a direct comparison between the two methods of estimation (![]()
Several authors have pointed out that estimates of C/
for specific loci are different from those expected from independent estimates of c and µ (![]()
![]()
![]()
![]()
![]()
![]()
have been interpreted as reflecting strong linkage disequilibrium (e.g., ![]()
![]()
![]()
![]()
Excess linkage disequilibrium is often interpreted as reflecting the action of natural selection at the locus (e.g., ![]()
![]()
![]()
![]()
![]()
/
values are systematically too small to be consistent with the theoretical predictions of the standard neutral theory, given reasonable estimates of c and µ. In other words, levels of linkage disequilibrium (as measured by
) are almost always significantly higher than expected.
| METHODS |
|---|
We use polymorphism data sets for regions of normal to high recombination (>5 x 10-9 per base pair per generation) that have more than three segregating sites (24 genes in D. melanogaster and 16 genes in D. simulans). We include biallelic single nucleotide polymorphisms but not mutations that overlap deletions as they represent incomplete information. The sequences for most loci can be obtained from GenBank at http://www.ncbi.nlm.nih.gov/Entrez/. The data sets analyzed are available upon request to P. Andolfatto. Polymorphism data sets used in this study are the following: Acp26A (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
12 kb upstream of the Sod coding region (![]()
Recombination estimates:
For each locus, laboratory estimates of the regional rate of crossing over, c, are obtained as follows: for every chromosomal arm, polynomial curves were fitted to plots of cumulative genetic distance as a function of cumulative physical distance. The derivative of the polynomial at a given physical map position is taken to be the recombination rate (![]()
![]()
![]()
To obtain an estimate of C, we need estimates of c and N (we refer to this estimate of C as Cmap). Since males do not recombine in D. melanogaster and D. simulans (![]()
)4Nc = 2Nc for autosomal genes and (
)3Nc = 2Nc for X-linked genes. The recombination rate c is taken to be the crossing-over rate estimated from laboratory crosses (see DISCUSSION). To estimate N, we equate the observed
w at silent and noncoding sites of each locus with 4Nµ (or 3Nµ for X-linked loci). Under the standard neutral model,
w (![]()
. Our estimate of the mutation rate
is obtained from
, the estimated rate of divergence per year (which depends on
, the estimated time to the common ancestor of the melanogaster and obscura species groups), and
, the estimated number of generations per year. If
= 3 x 10-8|
= 30 million years (my) and
= 10, our estimate of µ is 3 x 10-9/bp/generation. (We discuss the validity of these assumptions in detail in the DISCUSSION.) Given these estimates for d, g, and T, the average
across loci is roughly 106 for D. melanogaster and 2 x 106 for D. simulans.
HUDSON's (1987) estimator of C (henceforth referred to as Chud) was calculated using a modification of a program kindly provided by J. Wakeley. Ideally, one would like to use all the information in the data. However, full-likelihood approaches (e.g., ![]()
![]()
![]()
10,000 is taken to be 10,000. Similarly, if there is more association between sites than expected under the standard neutral model with no recombination, Chud is set to zero. Note that Chud is much more dependent on model assumptions than is our estimate of c and can only be interpreted as an estimate of 4Nc under a very restricted set of models.
Coalescent simulations of the standard neutral model:
Coalescent simulations (![]()
Although we refer to our model as the "standard neutral model," standard coalescent simulations use the parameter
and treat S as a random variable. Here, we generate genealogies and then place the observed number of mutations (at both synonymous and nonsynonymous sites), S, on the tree (![]()
(![]()
Note that S refers to the number of segregating sites, not the number of mutations, so we are effectively ignoring multiple hits. This choice is conservative for our purposes, since multiple hits will tend to decrease linkage disequilibrium. There are nine data sets in D. melanogaster with visible multiple hits (at most three sites) and seven in D. simulans (Adh has six sites with more than two alleles; the six other data sets have fewer than three). Results are virtually unchanged if we consider S to be the number of inferred mutations instead of the number of segregating sites (results not shown).
For each locus, given the observed number of segregating sites and our estimate of C, we ask what proportion P of simulated runs have a value of Chud smaller than or equal to the observed value. That is, for locus i, Pi = Pr(simulated Chud
observed Chud | SNM, Cmap). If loci are independent, C = Cmap, and the standard neutral model is accurate, the distribution of P values across loci should be uniformly distributed between 0 and 1. We test for a departure from uniformity by using the fact that, for n data sets, -
ni=02 ln(Pi) should be
2 distributed with 2n d.f. (Fisher 1954, as cited in ![]()
Heterogeneity in selective constraints:
We ran coalescent simulations to test the effect of a nonuniform distribution of mutations on estimates of Chud. We model this spatial heterogeneity as variation in mutation rate. Coalescent simulations are run with the same parameters as for the panmictic, uniform mutation case. The sequence is divided into three parts, spanning one-fourth of the length of the sequence, one-half, and one-fourth, respectively (Fig 1). This case is meant to represent an exon flanked by two introns. Since on average one site out of four is silent in a coding region, the mutation rate in "introns" is taken to be fourfold higher than in the middle half. This model assumes that introns and silent sites have similar levels of constraint (cf. ![]()
![]()

where n is the number of segments, Li is the length of segment i, and Ti is the total branch length of the genealogy for segment i. To add stepwise variation in mutation rates, we weight these probabilities by the "relative mutation rate" for each interval (y-axis in Fig 1).
|
Population subdivision:
We also ran coalescent simulations for a symmetric island model. Since geographic subdivision increases the extent of linkage disequilibrium (![]()
![]()
![]()
![]()
If we assume a symmetric two-deme model, observed values of FST (![]()
![]()
![]()
To evaluate the fit of a symmetric island model, we use both Chud and B' (![]()
![]()
![]()
![]()
![]()
observed B' | SNM, C = Cmap). A low value of P(B') indicates high levels of linkage disequilibrium. Simulations for B' were run with the total number of (inferred) mutations (which in all cases equaled the sum of the number of segregating sites and the number of multiple hits). Sites with multiple hits were treated as multiple mutations with missing information. For each site, there are only three ways the missing information can be filled in, depending on which of the three alleles is ancestral. This leads to a range of values for B', from which the most conservative one is taken.
In some cases, we wish to demonstrate that there is too little linkage disequilibrium in the data; this corresponds to the other tail of the B' statistic or Pr(simulated B'
observed B'| SNM, C = Cmap). Note that this probability is not 1 - Pr(simulated B'
observed B') since B' is discrete. Using the number of inferred mutations is no longer conservative, so we rerun the simulations with the number of segregating sites to calculate this tail. We do not examine the other tail of Chud (i.e., large values) because, for small samples, Chud is expected to be much larger than the true mean (![]()
| RESULTS |
|---|
Chud/
w is consistently too low:
Fig 2 and Fig 3 present two estimates of the number of recombination events per mutation for each locus. The first (represented by squares) is based on direct laboratory measurements of the crossing-over rate. This estimate is c/2µ if the gene is autosomal and 2c/3µ if it is X-linked, where c is the recombination rate per base pair. Chud/
w (shown with circles) is estimated from sequence polymorphism data.
w is based on the number of (silent and noncoding) segregating sites, not on the number of mutations; i.e., multiple hits are ignored. (This is conservative for our purposes since it leads to a smaller value of
w than if the estimate were based on the number of mutations.) Chud is a measure of linkage disequilibrium (see METHODS). Since both C and
are scalar multiples of the effective population size under standard neutral assumptions, dividing Chud by
w should make the ratio independent of the effective population size under the null model. This is of use because we do not have an estimate of the effective population size that is independent of genetic diversity levels. Scaling Chud to
w could also be important if background selection is reducing the effective population size for some loci (![]()
w is almost always smaller than c/2µ (or 2c/3µ if X-linked).
|
|
One reason for using Chud is that the median is known to be above the true mean (![]()
(![]()
![]()
w will be >c/2µ (or 2c/3µ for X-linked loci). This was confirmed by simulation (results not shown). Since the loci are independent, we can use a signs test with probability one-half that Chud/
w is above c/2µ (or 2c/3µ for X-linked loci). Such a test is highly conservative, yet significant for both species (P = 0.0032 for D. melanogaster and P = 0.019 for D. simulans, one-tailed). Laboratory estimates of the crossing-over rate are not available for chromosome 2 of D. simulans. If the rates from D. melanogaster are used as surrogates, an additional four data sets can be considered (for 16 genes, P = 0.038, one-tailed); the true rates in D. simulans are probably higher (see ![]()
P values are not uniformly distributed:
Of interest is not only the direction of the discrepancy between Chud/
w and c/2µ but also the magnitude of the difference. To quantify this, we ran coalescent simulations under the assumption that C = Cmap. Tabulated in Table 1 and Table 2 for each locus are the proportion P of 10,000 simulated data sets with a Chud value smaller than or equal to the observed one. Recall that under the null model, the distribution of P values across loci should be uniform; this is exceedingly unlikely: P < 10-15 and P < 10-9 for D. melanogaster and D. simulans, respectively. We might expect an excess of high P values since many of our assumptions are conservative. Instead we observe too many low P values (Fig 4): for 13 of the 24 D. melanogaster loci, P(Chud) < 0.05. Similarly, in D. simulans, 7 out of 16 loci have P values below 0.05. We conclude that either Chud is too small, i.e., there is too much linkage disequilibrium given our assumed recombination rate, or
is too large, i.e., there is too much diversity given our assumed mutation rate.
|
|
|
| DISCUSSION |
|---|
Recombination rates:
Our results rely on the assumption that the laboratory rates of crossing over, which are interpolated from measurements over large distances, are not overestimates. The correlation between diversity levels and recombination rates (![]()
![]()
![]()
w is 18-fold smaller than predicted in the sample from D. melanogaster. Second, differences between laboratory and natural conditions such as age and temperature appear to have only minor effects on the rate of recombination (![]()
![]()
![]()
A concern in Drosophila melanogaster is the presence of high-frequency chromosomal inversions in natural populations (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Mutation rates:
A second assumption is that the neutral mutation rate per base pair per generation is
= 3 x 10-9. This assumption enters into the signs test directly and as a means to estimate N for our simulations (see METHODS). In what follows, we review what is known about the mutation rate in Drosophila and discuss the sensitivity of our results to this parameter.
Synonymous divergence estimates vary across loci from 0.37 x 10-8/bp/year to 2.98 x 10-8 (with an average of 1.56 x 10-8; ![]()
= 30 my for the time to the common ancestor of the melanogaster and obscura species groups (![]()
could be in error (e.g., ![]()
= 25 mya), the substitution rates estimated for D. melanogaster are consistent with several independent estimates from Hawaiian Drosophila (![]()
![]()
![]()
![]()
![]()
Comparing per-generation rates to per-year rates is complicated by the fact that we do not know the annual number of generations in the wild. In the laboratory, the generation time is
14.5 days (i.e., 25 generations a year) at 20° (![]()
16 generations a year) although it is generally shorter for temperatures between 20° and 30°. Both species considered here, which are now cosmopolitan, are thought to have spent most of their evolutionary history in warm climates (![]()
Variation in mutation rates:
If our mutation rates and laboratory estimates of recombination are roughly correct, then Chud is unexpectedly low; i.e., there is a genome-wide excess of linkage disequilibrium. Several departures from the standard neutral model assumptions could potentially generate an excess of linkage disequilibrium. In our simulations, mutations are placed uniformly along the sequence while in actual data sets, mutations are clustered, presumably because of heterogeneity in selective constraints. Thus, the average distance between segregating sites may be smaller in actual data than in simulated data, perhaps resulting in more linkage disequilibrium.
To test this possibility, we ran coalescent simulations with the same parameters as before (see METHODS) but with a simple model of variation in mutation rates (see Fig 1). The results for D. melanogaster are presented in Table 3. To verify that our model produces at least as much spatial heterogeneity as observed in the data, we tabulated the longest distance (Dmax) between any two consecutive polymorphisms in the actual data set (cf., ![]()
|
Variation in recombination rates:
Chud does not use information about the distance between segregating sites, only the extent of association among them. Small-scale variation in recombination rate should look similar to variation in mutation rates, with the history of sites within a locus more or less correlated than expected given the physical distance between them. In fact, for an infinite sites model, variation in recombination rates can be implemented in the same way as is done for variation in mutation rates (see METHODS). The simulated sequence can be thought of as a sequence in genetic rather than physical distance. Variation in recombination rates affects the translation of the genetic distance into physical distance by a factor similar to the one used to model variation in mutation rates. For example, if a particular interval has a low rate of recombination, then the number of mutations placed on it will be larger than expected given the (genetic) distance. So if recombination rates vary to the same extent as modeled for mutation rates, they should have only minor effects on Chud.
On a larger scale, if the Drosophila genome were a collection of few hotspots and many coldspots for recombination, rates averaged over larger distances could yield systematic overestimates of the mean rate. Since we observe few data sets where Chud is above the laboratory rate, recombination rates at these hotspots would have to be several orders of magnitude above those observed at most loci. Whether this is plausible is unclear. Recombination hotspots of this magnitude have been reported in fungi, humans, and mice (![]()
![]()
![]()
Departures from the demographic assumptions of the standard neutral model:
Departures from the demographic assumptions of a panmictic equilibrium can also generate an excess of linkage disequilibrium (and possibly decrease Chud). If there is population structure, for example, the effective population recombination rate should be decreased relative to panmixia since haplotypes in different subpopulations will not have a chance to recombine as often. In the data analyzed here, sampling schemes vary greatly across loci, with a third sampled entirely outside of Africa. If samples are partitioned by sampling location, population-specific estimates of Chud are sometimes higher and sometimes lower than are estimates based on total samples, so there is no clear effect of pooling different populations (results not shown). However, few samples include a large number of sequences from multiple populations, so this approach is not very informative.
Instead, we try to assess the fit of the data to simple demographic models. Several researchers have argued that populations sampled at present are the result of the recent admixture of previously subdivided populations (e.g., ![]()
![]()
![]()
![]()
![]()
![]()
We evaluate the fit of a symmetric two-island model by considering changes in the distribution of P(Chud) and P(B') as the migration parameter 4Nm varies from 0.2 to 1. These values of 4Nm are lower than what is suggested by most FST values. For all parameter values we tried, sampling from both demes was a worse fit to the data than sampling entirely from one (results not shown). Thus, we present results only for the latter case. As expected, as the migration rates decrease, there is more linkage disequilibrium in the simulated data. As a result, low values of P(Chud) and P(B') become more likely. In Table 4, we report the probability U that the distributions of P(Chud) and P(B') are uniform (as they should be if the null model is correct). In both species, many Chud values are still too low for 4Nm > 1 (U = 0.006 for D. melanogaster and 0.004 for D. simulans). For 4Nm
1, there are too many high P(B') values, i.e., too little linkage disequilibrium in the data relative to the predictions of the model. Many of the assumptions made in this analysis are no longer conservative when considering the other tail (i.e., too little linkage disequilibrium). However, this analysis does suggest that a simple island model will only account for the low values of Chud if migration rates are lower than suggested by levels of population differentiation and that the model can explain some aspects of linkage disequilibrium (Chud) but not others (B'). When a summary of the frequency spectrum (![]()
|
It may not be surprising that an equilibrium island model is an inadequate demographic model, as the history of the species is likely to have been much more complex. In particular, it has been argued that non-African populations have experienced a population bottleneck, e.g., with the European colonization of the Americas (![]()
![]()
![]()
![]()
![]()
![]()
Interestingly, for loci in D. melanogaster, Chud values appear to be higher when estimated on the basis of African populations alone than when estimated from non-African populations (such a comparison is possible only for Acp26a, Adh, In(2L)t, and Vermilion). This is not true of the two suitable loci in D. simulans (G6pd and Vermilion). These findings might suggest that African D. melanogaster populations are closer to linkage equilibrium than are non-African ones. However, samples from Africa tend to be smaller than those from outside Africa, and we expect the median Chud value to be larger for smaller samples, even in the absence of population structure (![]()
Natural selection:
An alternative to a demographic explanation is that natural selection has acted on or near many of the loci. (These two alternatives are not mutually exclusive.) While a demographic departure from model assumptions may be a more parsimonious explanation for a systematic trend, it remains possible that natural selection is pervasive and commonly leads to increased linkage disequilibrium. In addition, it should be kept in mind that many of the loci analyzed here were collected because of prior evidence for selection.
One mode of selection thought to be prevalent in the Drosophila genome are "selective sweeps" in which a rare advantageous allele is rapidly fixed in the population (![]()
![]()
![]()
w above the median of loci (results not shown). The scenario just outlined assumes that a variant is selected while rare and swept to fixation in a panmictic population. A variation on this model is transient selection, where a variant is only swept to intermediate frequency (![]()
![]()
![]()
![]()
Epistatic interactions between nearby sites can also generate considerable linkage disequilibrium. In this type of model, recombinant haplotypes are selected against, reducing the effective frequency of crossing over. As an example, Adh protein production in D. melanogaster has been shown to be determined by epistatic interactions among multiple polymorphisms (![]()
![]()
In conclusion, unless it can be demonstrated that laboratory-based estimates of the crossing-over rate are systematic overestimates of the recombination rate, any theory for patterns of variability in the genome will have to accommodate a genome-wide excess of linkage disequilibrium (as measured by Chud). If this excess is best explained by a demographic departure from the standard neutral model, this has important implications for both parameter estimation (such as estimates of
) and for inferences from patterns of polymorphism at a particular locus. An alternative interpretation is that the mutation rates are much higher than indicated by levels of divergence at silent sites and noncoding DNA for any gene. This would point to a different but equally serious problem with the neutral theory.
| FOOTNOTES |
|---|
1 These authors contributed equally to this work. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank B. Charlesworth, M. Foote, M. Hamblin, R. Hudson, M. Kreitman, and J. Wall for helpful discussions and comments on an earlier version. This manuscript was significantly improved by comments from A. Clark, C. Langley, and an anonymous reviewer. R. Hudson, J. Wakeley, and J. Wall provided computer programs, S.-C. Tsaur provided unpublished Adh data, and J. Comeron and J. True provided genetic and physical map data.
Manuscript received July 24, 1999; Accepted for publication May 25, 2000.
| LITERATURE CITED |
|---|
AGUADÉ, M., N. MIYASHITA, and C. H. LANGLEY, 1992 Polymorphism and divergence in the Mst26A male accessory gland gene region in Drosophila. Genetics 132:755-770[Abstract].
ANDOLFATTO, P. and M. KREITMAN, 2000 Molecular variation at the In(2L)t proximal breakpoint site in natural populations of Drosophila melanogaster and D. simulans.. Genetics 154:1681-1691
ANDOLFATTO, P. and M. NORDBORG, 1998 The effect of gene conversion on intralocus associations. Genetics 148:1397-1399
ANDOLFATTO, P., J. D. WALL, and M. KREITMAN, 1999 Unusual haplotype structure at the proximal breakpoint of In(2L)t in a natural population of Drosophila melanogaster.. Genetics 153:1297-1311
ASHBURNER, M., 1989 Drosophila: A Laboratory Handbook. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
AYALA, F. J. and D. L. HARTL, 1993 Molecular drift of the bride of sevenless (boss) gene in Drosophila. Mol. Biol. Evol. 10:1030-1040[Abstract].
AYALA, F. J., B. S. CHANG, and D. L. HARTL, 1993 Molecular evolution of the Rh3 gene in Drosophila. Genetica 92:23-32[Medline].
BEGUN, D. J. and C. F. AQUADRO, 1992 Levels of naturally-occurring DNA polymorphism correlate with recombination rates in Drosophila melanogaster.. Nature 356:519-520[Medline].
BEGUN, D. J. and C. F. AQUADRO, 1993 African and North American populations of Drosophila melanogaster are very different at the DNA level. Nature 365:548-550[Medline].
BEGUN, D. J. and C. F. AQUADRO, 1994 Evolutionary inferences from DNA variation at the 6-Phosphogluconate Dehydrogenase locus in natural populations of Drosophila: selection and geographic differentiation. Genetics 136:155-171[Abstract].
BEGUN, D. J. and C. F. AQUADRO, 1995 Molecular variation at the vermilion locus in geographically diverse populations of Drosophila melanogaster and Drosophila simulans.. Genetics 140:1019-1032[Abstract].
BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995 The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics 140:783-796[Abstract].
BROOKS, L. D. and R. W. MARKS, 1986 The organization of genetic variation for recombination in Drosophila melanogaster.. Genetics 114:525-547
CHARLESWORTH, B., 1996 Background selection and patterns of genetic diversity in Drosophila melanogaster.. Genet. Res. 68:131-149[Medline].
CHARLESWORTH, D., B. CHARLESWORTH, and M. T. MORGAN, 1995 The pattern of neutral molecular variation under the background selection model. Genetics 141:1619-1632[Abstract].
CHOVNICK, A., 1973 Gene conversion and transfer of genetic information within the inverted region of inversion heterozygotes. Genetics 75:123-131
CICERA, S. and M. AGUADÉ, 1997 Evolutionary history of the sex-peptide (Acp70A) gene region in Drosophila melanogaster.. Genetics 147:189-197[Abstract].
CLARK, A. G. and L. WANG, 1997 Molecular population genetics of Drosophila immune system genes. Genetics 147:713-724[Abstract].
COMERON, J. M., M. KREITMAN, and M. AGUADÉ, 1999 Natural selection on synonymous sites is correlated with gene length and recombination in Drosophila. Genetics 151:239-249
COOKE, P. H. and J. G. OAKESHOTT, 1989 Amino acid polymorphisms for Esterase 6 in Drosophila melanogaster.. Proc. Natl. Acad. Sci. USA 86:1426
DAVID, J. R. and P. CAPY, 1988 Genetic variation of Drosophila melanogaster natural populations. Trends Genet. 4:106-111[Medline].
DOONER, H. K. and I. M. MARTINEZ-FEREZ, 1997 Recombination occurs uniformly within the bronze gene, a meiotic recombination hotspot in the maize genome. Plant Cell 9:1633-1646[Abstract].
EANES, W. F., M. KIRCHNER, and J. YOON, 1993 Evidence for adaptive evolution of the G6pd gene in Drosophila melanogaster and D. simulans.. Proc. Natl. Acad. Sci. USA 90:7475-7479
EANES, W. F., M. KIRCHNER, J. YOON, C. BIERMANN, and I. N. WANG et al., 1996 Historical selection, amino acid polymorphism and lineage-specific divergence at the G6pd locus in Drosophila melanogaster and D. simulans.. Genetics 144:1027-1041[Abstract].
FLEISCHER, R. C., C. E. MCINTOSH, and C. L. TARR, 1998 Evolution on a volcanic conveyor belt: using phylogeographic reconstructions and K-Ar-based ages of the Hawaiian Islands to estimate molecular evolutionary rates. Mol. Ecol. 7:533-545[Medline].
GILLESPIE, J. H., 1997 Junk ain't what junk does: neutral alleles in a selected context. Gene 205:291-299[Medline].
GOSS, P. J. E. and R. C. LEWONTIN, 1996 Detecting heterogeneity of substitution along DNA and protein sequences. Genetics 143:589-602[Abstract].
GRIFFITHS, R. C. and P. MARJORAM, 1996 Ancestral inference from samples of DNA sequences with recombination. J. Comp. Biol. 3:479-502.
HALE, L. R. and R. S. SINGH, 1987 Mitochondrial DNA variation and genetic structure in populations of Drosophila melanogaster.. Mol. Biol. Evol. 4:622-637[Abstract].
HAMBLIN, M. T. and C. F. AQUADRO, 1996 High nucleotide sequence variation in a region of low recombination in Drosophila simulans is consistent with background selection. Mol. Biol. Evol. 13:1133-1140[Abstract].
HAMBLIN, M. T. and C. F. AQUADRO, 1997 Contrasting patterns of nucleotide sequence variation at the Glucose Dehydrogenase (Gld) locus in different populations of Drosophila melanogaster.. Genetics 145:1053-1062[Abstract].
HARADA, K., S. I. KUSAKABE, T. YAMAZAKI, and T. MUKAI, 1993 Spontaneous mutation rates in null and band-morph mutations of enzyme loci in Drosophila melanogaster.. Jap. J. Genet. 68:605-616.
HASSON, E. and W. F. EANES, 1996 Contrasting histories of three gene regions associated with In(3L)Payne of Drosophila melanogaster.. Genetics 144:1565-1575[Abstract].
HASSON, E., I. N. WANG, L. W. ZENG, M. KREITMAN, and W. EANES, 1998 Nucleotide variation in the Triose Phosphate Isomerase (Tpi) locus of Drosophila melanogaster and D. simulans.. Mol. Biol. Evol. 15:756-769[Abstract].
HEY, J. and R. M. KLIMAN, 1993 Population genetics and phylogenetics of DNA sequence variation at multiple loci within the Drosophila melanogaster species complex. Mol. Biol. Evol. 10:804-822[Abstract].
HEY, J. and J. WAKELEY, 1997 A coalescent estimator of the recombination rate. Genetics 145:833-846[Abstract].
HUDSON, R. R., 1987 Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50:245-250[Medline].
HUDSON, R. R., 1990 Gene genealogies and the coalescent process, pp. 144 in Oxford Surveys in Evolutionary Biology, Vol. 7, edited by D. FUTUYMA and J. ANTONOVICS. Oxford University Press, Oxford.
HUDSON, R. R., 1993 The how and why of generating gene genealogies, pp. 2326 in Mechanisms of Molecular Evolution, edited by N. TAKAHATA and A. G. CLARK. Japan Sci. Soc., Tokyo.
HUDSON, R. R., D. D. BOOS, and N. L. KAPLAN, 1992a A statistical test for detecting geographic subdivision. Mol. Biol. Evol. 9:138-151[Abstract].
HUDSON, R. R., M. SLATKIN, and W. P. MADDISON, 1992b Estimating levels of gene flow from DNA sequence data. Genetics 132:583-589[Abstract].
HUDSON, R. R., K. BAILEY, D. SKARECKY, J. KWIATOWSKI, and F. J. AYALA, 1994 Evidence for positive selection in the Superoxide Dismutase (Sod) region of Drosophila melanogaster.. Genetics 136:1329-1340[Abstract].
HUDSON, R. R., A. G. SAEZ, and F. J. AYALA, 1997 DNA variation at the Sod locus of Drosophila melanogaster: an unfolding story of natural selection. Proc. Natl. Acad. Sci. USA 94:7725-7729
INOMATA, N., H. SHIBATA, E. OKUYAMA, and T. YAMAZAKI, 1995 Evolutionary relationships and sequence variation of alpha-Amylase variants encoded by duplicated genes in the Amy locus of Drosophila melanogaster.. Genetics 141:237-244[Abstract].
IRVIN, S. D., K. A. WETTERSTRAND, C. M. HUTTER, and C. F. AQUADRO, 1998 Genetic variation and differentiation at microsatellite loci in Drosophila simulans: evidence for founder effects in New World populations. Genetics 150:777-790
JEFFREYS, A. J., R. BARBER, P. BOIS, J. BUARD, and Y. E. DUBROVA et al., 1999 Human minisatellites, repeat DNA instability and meiotic recombination. Electrophoresis 20:1665-1675[Medline].
KAMBYSELLIS, M. P., E. M. CRADDOCK, F. PIANO, M. PARISI, and J. COHEN, 1995 Pattern of ecological shifts in the diversification of Hawaiian Drosophila inferred from a molecular phylogeny. Curr. Biol. 5:1129-1139[Medline].
KAROTAM, J., A. C. DELVES, and J. G. OAKESHOTT, 1993 Conservation and change in structural and 5' flanking sequences of esterase 6 in sibling Drosophila species. Genetica 88:11-28[Medline].
KEIGHTLEY, P. D. and A. EYRE-WALKER, 1999 Terumi Mukai and the riddle of deleterious mutation rates. Genetics 153:515-523
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, United Kingdom.



