- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Comeron, J. M.
- Articles by Kreitman, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Comeron, J. M.
- Articles by Kreitman, M.
The Correlation Between Intron Length and Recombination in Drosophila: Dynamic Equilibrium Between Mutational and Selective Forces
Josep M. Comerona and Martin Kreitmanaa Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637
Corresponding author: Josep M. Comeron, Department of Ecology and Evolution, University of Chicago, 1101 E. 57th St., Chicago, IL 60637., jcomeron{at}midway.uchicago.edu (E-mail)
| ABSTRACT |
|---|
Intron length is negatively correlated with recombination in both Drosophila melanogaster and humans. This correlation is not likely to be the result of mutational processes alone: evolutionary analysis of intron length polymorphism in D. melanogaster reveals equivalent ratios of deletion to insertion in regions of high and low recombination. The polymorphism data do reveal, however, an excess of deletions relative to insertions (i.e., a deletion bias), with an overall deletion-to-insertion events ratio of 1.35. We propose two types of selection favoring longer intron lengths. First, the natural mutational bias toward deletion must be opposed by strong selection in very short introns to maintain the minimum intron length needed for the intron splicing reaction. Second, selection will favor insertions in introns that increase recombination between mutations under the influence of selection in adjacent exons. Mutations that increase recombination, even slightly, will be selectively favored because they reduce interference among selected mutations. Interference selection acting on intron length mutations must be very weak, as indicated by frequency spectrum analysis of Drosophila intron length polymorphism, making the equilibrium for intron length sensitive to changes in the recombinational environment and population size. One consequence of this sensitivity is that the advantage of longer introns is expected to decrease inversely with the rate of recombination, thus leading to a negative correlation between intron length and recombination rate. Also in accord with this model, intron length differs between closely related Drosophila species, with the longest variant present more often in D. melanogaster than in D. simulans. We suggest that the study of the proposed dynamic model, taking into account interference among selected sites, might shed light on many aspects of the comparative biology of genome sizes including the C value paradox.
NUCLEAR spliceosomal introns are nucleotide sequences that are transcribed but spliced out of the precursor mRNA and they generally do not encode any other polypeptide (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Might there be a universal selective benefit that could explain the evolutionary persistence of introns? The recent discovery of a correlation between intron length and recombination rate in Drosophila melanogaster (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Assuming the mutational deletion bias to be true, the presence of long introns can only mean that selection must favor (and preserve) their lengths in some instances, in opposition to the natural mutational bias. The discovery of the negative correlation between intron length and recombination rate further indicates that the strength or efficacy of this selection must be in some way recombination rate dependent. There is no evidence for strong selective constraints in long intron sequences, a possible explanation for the persistence of their long lengths. In fact, in Drosophila, short introns (defined as <8090 bp; ![]()
![]()
![]()
![]()
![]()
![]()
D. melanogaster's genome exhibits a wide range of recombination rates and the level of silent nucleotide polymorphism is known to vary across the genome and to be positively correlated with local recombination rates (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Models of strong selection and linkage have been invoked to explain the correlation between silent polymorphism and recombination, but an effect can also be produced with very weak selection acting on many sites, both for complete linkage (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Theoretical studies of ![]()
![]()
![]()
In this article, we investigate whether the preferred fixation of small insertions over deletions in introns as enhancers of recombination can explain the existence of longer introns in regions of low recombination in D. melanogaster. We also investigate whether relative mutation rates for insertions and deletions are the same in genomic regions of high and low recombination. Recombination rate heterogeneity in this species allows us to explore predictions of selection and interference influencing intron length. Comparison of D. melanogaster with both D. simulans and humans allows us to further test our predictions by taking advantage of their very different effective population sizes (Ne).
| MATERIALS AND METHODS |
|---|
D. melanogaster sequences:
We obtained all completely sequenced nuclear coding regions in D. melanogaster from ![]()
To avoid the inclusion in our dataset of introns whose lengths were influenced by the insertion of transposable elements (TE), we screened introns >200 bp for the putative presence of TEs. We conducted BLAST2 (![]()
Polymorphism analyses:
Insertion and deletion polymorphic events were studied in 31 genomic regions of D. melanogaster. The regions analyzed, the sample sizes, the cytological position, and estimated recombination rates (see below) are given in Table 1. Nucleotide sequences were available for most of the analyzed regions; information from high-resolution restriction fragment length polymorphisms (4-cutter technique; ![]()
![]()
|
Recombination rates:
Following ![]()
![]()
![]()
![]()
Estimation of selection coefficients on synonymous mutations based on the synonymous codon usage in D. melanogaster:
The scaled selection coefficient (
= 4Nes) on synonymous mutations was estimated for the 620 genes studied in D. melanogaster. The analysis focuses on the two-fold degenerate amino acids because in this case a symmetry argument allowed us to assign identical selection coefficients (but of opposite sign) to mutations to preferred (+s) or to unpreferred (-s) codons. The estimation of selection coefficients for the different synonymous codons for three-, four-, and sixfold degenerate amino acids depends on the selective model associated with the different codons (i.e., the different relative contribution that each codon makes to fitness), which has not been yet established. Following ![]()
on synonymous mutations was estimated from the frequency (P2) of the advantageous codon (preferred codon; ![]()
![]()
![]()
![]()
Confidence intervals for the number of replacement substitutions per site (Ka) between D. melanogaster and D. simulans:
The average Ka estimate between D. melanogaster and D. simulans for regions of low and high recombination is Ka = 0.0164 (five genes) and Ka = 0.0095 (eight genes), respectively (![]()
![]()
Computer simulations to study intron length under a mutation-selection-drift model:
We studied a model of selection in which mutations that increment the length of the transcripts are weakly deleterious with semidominant effects. The model imposes a minimum intron length (Lmin), with mutations that decrease the length below Lmin being strongly selected against. We generated and studied the equilibrium for a population of N = 500 diploid individuals and Nµ = 0.001, where µ is the indel mutation rate per base pair. Each generation was obtained by randomly choosing N individuals of the previous generation, with a probability (Pw) proportional to their relative fitness, according to the equations wi,k = 1 - s(Li,k - Lmin), where s is the absolute deleterious selection coefficient per base pair, and Li,k is the length of the kth chromosome (k = 1, 2) of the ith individual, and

where
i is the average fitness of the two chromosomes of the ith individual. The 2N genomes (lengths in this case) were then randomly paired to generate the new generation of N diploid individuals. The analyses of simulated data began after a minimum of 250N generations to ensure equilibrium. This was confirmed by visual inspection of each trajectory between 500N and 2500N generations. Deleterious selection coefficients (s) per base pair included
(4Ns) of 0, 0.04, 0.08, and 0.2, unless when Li,k < Lmin, in which case wi,k = 0. In all simulations, Lmin = 60 bp and Li,k << 1/s. All statistical analyses were carried out using STATISTICA FOR WINDOWS 5.1 (1997).
| RESULTS |
|---|
Intron length distribution in D. melanogaster:
![]()
![]()
|
Longer introns in regions of low recombination in D. melanogaster:
If intron length is a result of neutral mutational processes alone, then intron length should be independent of recombination rate. On the other hand, if indel mutations in Drosophila introns are weakly selected, either as modifiers of recombination or transcription time, then intron length and recombination rates could be correlated. Recombination rates in D. melanogaster correlate negatively with individual intron length (Spearman's rank correlation, R = -0.1822, P < 1 x 10-6; n = 1345), average intron length (R = -0.2048, P = 1.3 x 10-5; n = 447), and total intron length (R = -0.2163, P = 4 x 10-6, n = 447; see Fig 2A). A Kruskall-Wallis ANOVA test of data divided into three recombination rate groups and using the same three measures of intron length also reveals that introns in regions of low recombination are significantly longer than in regions of high recombination (H = 40.28, H = 21.12, and H = 20.82, respectively; P < 1 x 10-5 for all cases; see Fig 2B).
|
Relationship between recombination in D. melanogaster and both the length of the coding region and intron presence:
The total lengths of coding regions and the number of exons are both weakly negatively correlated with recombination rates (R = -0.1047, P = 0.0091, n = 620; and R = -0.085, P = 0.034, n = 620, respectively). Since intron length is negatively correlated with recombination rate, we also detect a positive and significant correlation between total coding region length and total intron length (R = -0.3833 and R = -0.4766 for all genes and for genes with introns, respectively; P < 10-6 in both cases). Similarly, average intron length correlates positively with total coding region length (R = 0.2589, P < 10-6; n = 447) as well as with average exon length (R = 0.1682, P = 0.00036; n = 447). These correlations suggest at least two possibilities: (1) a recombination-sensitive mutational mechanism that affects both the introns and exons of a gene or (2) weak selection for smaller total gene length, including introns and exons, which is sensitive to the recombinational environment of a gene. Further consideration is given to these possibilities in the following section and in the DISCUSSION.
Similar mutational bias in regions of high and low recombination:
One possible explanation for a negative correlation between intron length and recombination rates is a change in the deletion bias, i.e., the deletion/insertion ratio, with recombination rate. To investigate this possibility we analyzed available data on polymorphic insertion and deletion events in D. melanogaster in introns and noncoding regions, with the aim of capturing the mutational deletion bias. Because the number of segregating sites (involving length polymorphisms) is expected to be less affected by selection than their frequency in the population (![]()
With a total of 256 polymorphic indel events, we identified a significant excess of DEs compared to IEs (147 DEs vs. 109 IEs; G = 5.66, P = 0.017), with an overall polymorphic deletion bias (PDB) of 1.35. As shown in Table 1 there is no indication that the ratio of IEs to DEs differs between regions of high and low recombination (G = 0.289, P = 0.591). Neither intergenic regions nor introns show signs of a DE/IE bias that changes in relation to recombination rate (intergenic regions, G = 0.020, P = 0.888; introns only, G = 0.006, P = 0.938). The DE/IE ratios are similar for intergenic regions and introns (G = 2.479, P = 0.115) and the ratios do not change when the data are further subdivided between low and high recombination regions (G = 0.718, P = 0.397; and G = 1.568, P = 0.210, respectively). We conclude, therefore, that the negative correlation between intron length (or coding region length) and recombination is not caused by a mutational deletion/insertion bias or by a highly biased repair mechanism (see DISCUSSION) associated with recombination rates.
Size distribution of polymorphic indels:
Fig 3 shows the frequency distribution of indel sizes obtained from the polymorphism analyses in noncoding regions. Analyzing all indels shorter than 1000 bp, most length polymorphisms are 12 bp long (43.0 and 43.7% for insertion and deletions, respectively) and the great majority are within the range 110 bp (84.0 and 77.0% for insertion and deletions, respectively). The equivalent analyses only for indels located in introns and intergenic regions also show that the great majority of them are within the range 110 bp (89.2 and 75.4% for insertions and deletions, respectively, in introns, and 75.6 and 78.8% for insertions and deletions, respectively, in intergenic regions).
|
Even though long indels (longer than 100 bp) are a small fraction of all length changes, they nevertheless affect the estimation of average lengths. When only indels shorter than 100 bp are taken into account, the average length for insertions and deletions is 5.13 (±0.65 SE) and 7.01 (±0.76) bp, respectively, for all regions, and 5.04 (±0.84) and 7.52 (±1.20) bp for introns. The analyses of all indels shorter than 1000 bp, however, yield average insertion and deletion lengths of 40.3 (±15.1) and 16.3 (±4.7) bp, respectively, and 10.3 (±5.3) and 19.6 (±8.8) for insertions and deletions, respectively, in introns. Moreover, the only three polymorphic indels longer than 1000 bp detected in introns are insertions in genes located in regions of high recombination. This last observation, together with the previously indicated DE/IE ratio, confirms that there are no identifiable differences in the mutational deletion bias between regions of high and low recombination that could generate longer introns in regions of low recombination.
Shorter introns in D. simulans than in D. melanogaster:
Consistently larger effective population sizes (Ne) have been reported for D. simulans than for D. melanogaster, on the basis of analyses of nucleotide polymorphism levels (![]()
![]()
![]()
![]()
![]()
![]()
![]()
We compared a total of 211 fixed length differences between homologous sequences of the two species (Table 2). To reduce the probability of classifying polymorphic indels in one species as fixed differences, we used only those regions where multiple sequences are available in D. melanogaster and, when possible, in D. simulans (see Table 1). There is a significant excess of cases where the D. melanogaster noncoding sequence is longer than the homologous region in D. simulans (130 vs. 81, P = 0.0009; two-tailed sign test). This trend is highly significant for length differences in introns (66 vs. 27, P = 0.00007) but not in flanking regions (55 vs. 50, P = 0.70).
|
We also investigated the distribution of fixed indel differences along the two lineages leading to the common ancestor of these two species by comparing the homologous sequences in D. yakuba. Both the lineage and the direction of the indel mutation could be assigned without ambiguity for only 14 fixed indel mutations in introns (located in the genes Adh, Ci, per, and Pgi). The D. melanogaster lineage shows three IE and three DE, while the D. simulans lineage shows two IE and six DE. While suggestive of a possible difference, these numbers are too small to conclude that the observed trend is real.
| DISCUSSION |
|---|
The deletion to insertion mutational bias in Drosophila:
Results from our analysis of 256 length polymorphisms suggest an overall deletion to insertion bias of 1.35; i.e., an excess of deletions compared to insertions (P = 0.017). A 95% confidence interval (two-tailed G-test for goodness of fit) for deletion bias compatible with the estimated PDB is 1.021.78. In mammals, the average deletion bias obtained from analyzing 156 processed pseudogenes is 2.74 (![]()
![]()
This deletion bias in D. melanogaster is in the same direction although smaller (G = 178.1; P < 1 x 10-6) than the value of 8.7 estimated from analysis of the retroposon Helena in both D. melanogaster and D. virilis groups (![]()
![]()
![]()
![]()
![]()
![]()
50% of deletions longer than 10 bp in Helena sequences compared to 23% of deletions longer than 10 bp in our data). The lack of correspondence between data sets and approaches indicates the need for additional studies on the mutational deletion bias, the putative differences between heterochromatic and euchromatic (i.e., actively transcribed) regions, and the correlation with recombination rates. Comparison of polymorphism data and fixed differences, we propose, will be a valuable tool.
Potential effect of selection on D. melanogaster indel polymorphism:
We have investigated and can reject the possibility that our estimate of the ratio of DEs:IEs based on polymorphism data is strongly biased (i.e., reduced) by the action of selection under a strong deletion bias. Strong selection (
>> 1;
= 4Nes) acting on small noncoding indels can be eliminated from consideration because, if it were the predominant form of selection, each species would be essentially at mutation-selection balance for indels, and each species would be monomorphic, which is not the case. Weak selection (
1) is also unlikely under a strong deletion bias because it would inescapably produce a difference in the PDB (see Fig 4) in regions of high and low recombination. With a 10- to 20-fold difference in the effective population sizes of high and low recombination regions of the D. melanogaster genome (![]()
![]()
|
Intron presence and recombination rates in D. melanogaster:
We report here a weak, but significant, negative correlation between recombination rates and the number of exons (R = -0.085, P = 0.034; n = 620). This observation is quite unexpected because there has been an extensive gene rearrangement within chromosomal arms in Drosophila evolution, meaning that genes located in regions of low recombination in D. melanogaster have been in the present recombinational environment for a relatively short period of time. Therefore, the observation of a tendency for genes located in regions of low recombination to exhibit a greater number of introns may be an indication that the tempo of intron gain/loss might be faster than usually believed.
Longer coding regions in the region of low recombination in D. melanogaster:
We also noted a trend toward longer coding regions in regions of low recombination than in regions of high recombination in D. melanogaster (R = 0.1047, P = 0.0091; n = 620), a relationship previously undetected (![]()
![]()
![]()
Natural selection acting on minimum intron length:
It is generally accepted that there are structural constraints limiting minimum intron length and that natural selection must govern this and other requirements for proper splicing (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Natural selection acting on average intron length:
Two lines of evidence suggest that natural selection is acting on intron length. First, as indicated in RESULTS, a mutational bias alone is unlikely to be causing the observed correlation between intron and recombination rate. Polymorphism data based both on the ratio of indel events and on the total size differences in introns comparing regions with high and low rates of recombination rule out this possibility. Thus there is a disparity between polymorphic data and the observed average intron length in D. melanogaster in relationship with recombination rates. Second, there are a higher number of fixed insertion differences in introns in D. melanogaster than in D. simulans that are unlikely to be caused by distinct mutational mechanisms between these two closely related species.
Among selective explanations for the general negative relationship between intron length and recombination, only weak selection acting on indels in introns would allow the evolution of different intron lengths without evident differences at the polymorphism level. Restriction fragment length polymorphism and direct sequencing studies show that small length polymorphisms are common in introns, indicating that many of them are not subject to strong selection. We propose, therefore, that the observed difference in intron length in regions of high and low recombination is molded by weak selection, and specifically that intron length in D. melanogaster is the result of a mutation-selection-drift balance.
Weak selection has considerably less influence on the presence of polymorphisms and their frequency in populations than it does on the probability of fixation. Fig 5A and Fig B, shows the predicted effect of weak selection on the deletion bias with respect to both polymorphism (PDB) and fixation probabilities [fixed deletion bias (FDB)] under the infinitely many sites model with free recombination and semidominance, and assuming a MDB
1.50. As the figure shows, selection coefficients small enough to be hardly detected in polymorphism data on the basis of small sample sizes can lead to conspicuous differences in intron length evolution (fixed differences) due to small differences in the expected segregating frequency. In accord with this prediction, and as commented above, there is no significant difference between the PDB observed in the region of high and low recombination. Also, we do not detect a significant difference in the average frequency of the 256 polymorphic insertions and deletions we analyzed (0.295 and 0.183, respectively; P = 0.32, applying Mann-Whitney U-test). We do find, however, significantly elevated frequency of insertions in regions of high recombination (0.3696 vs. 0.1732, respectively; Mann-Whitney U-test, P = 0.025). A similar trend is not detected in regions of low recombination (0.178 and 0.197 for insertions and deletions, respectively). Using the 14 genomic regions with at least two IEs and at least two DEs, we also calculated Tajima's D statistic (![]()
![]()
|
In yeast, the repair of mispaired loops in heteroduplex DNA shows distinct biases, efficiencies, and repair pathways, for short and long indels, for nicked and continuous DNA, as well as for different genomic regions (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Models of weak selection:
What kind of selection might be operating to generate a correlation between intron length and recombination rate? Two nonmutually exclusive selective hypotheses are proposed and below we discuss their possible roles in shaping intron length in Drosophila: (i) selection favoring shorter transcripts to reduce transcriptional time and energy, and (ii) the selective advantage of long introns as reducers of interference among selected mutations.
Transcriptional efficiency:
Biologically, longer transcripts must be inherently more costly to synthesize than short transcripts, owing to the increment in both transcription time and energetic costs of DNA and RNA synthesis. Assume for the purpose of this discussion that the absolute selective cost of a small insertion (or conversely the selective advantage of a small deletion) is small, i.e.,
1. Then, given the 10- to 20-fold difference in the effective population size between high and low recombination regions of the D. melanogaster genome, it follows that the efficacy of selection acting on these indels will differ. Genes located in regions with higher recombination rates in D. melanogaster would be expected, under this scenario, to have shorter introns; these same introns will also be expected to be longer than their homologues in D. simulans, a species with a somewhat larger effective population size.
There are problems, however, with this hypothesis. Imagine that selection is nearly neutral in regions of low recombination (
LR
1). Then, for regions of high recombination selection will be relatively strong,
HR > 10, and this will generate an obvious difference in the PDB, one that is not detected between regions (see above). As an alternative, imagine that
LR << 1 and
HR = 1. In this case, evolution of insertions in regions of low recombination will be determined by the mutational process (and biases) and not by selection. Since there is a significant deletion bias detected at the polymorphism level, mutational processes would push intron length toward uniformly short lengths, again something that is not observed in the data.
We used a forward computer simulation to explore a more complex mutation-selection-drift scenario (see MATERIALS AND METHODS for details; Fig 6). In the presence of a mutational deletion bias, even a relatively weak one, and no selection, average intron length will evolve to be close to the minimum permissible length. Our simulations show that only when the deletion bias is weak and the occurrence of long insertions is frequent, is it possible for average intron length to be far enough from the minimum length to be compatible with the observed average size of introns in low recombination regions. Under these conditions, it could be possible for selection to act in regions of high recombination, where the effective population size is larger, to produce smaller average intron lengths. This particular scenario is unlikely, however, for the simple reason that sufficiently long insertions must occur at a vastly greater rate than is indicated from population genetic and evolutionary data to explain the difference between the average intron length in regions of high and low recombination (a difference of
400 bp). Thus, we do not believe that the energetic cost hypothesis and a mutation-selection-drift balance is a tenable unique explanation for the observed correlation between intron length and recombination rate in D. melanogaster.
|
An additional indication that selection favoring shorter introns to increase transcriptional efficiency is unlikely to be the general cause of longer introns in regions of low recombination comes from the study of human introns. Isochore GC richness is known to be correlated positively with recombination rates (![]()
![]()
![]()
![]()
![]()
![]()
1) are expected to behave as effectively neutral in humans (
<< 1). Therefore, the observed negative relationship between intron length and recombination in both Drosophila and humans is hardly explained only by weak selection acting on transcriptional costs in both species, unless selection coefficients against longer transcripts in humans were much higher than in Drosophila for unknown reasons. Efficacy of selection arguments alone cannot explain the large differences in intron length observed across recombination rates.
Introns as modifiers of recombination in Drosophila:
Given a mutational bias toward DNA loss, even if it is moderate, why do most eukaryotic genes have introns, and why, indeed, are some of them very long? The possible insertion of repetitive elements as a cause of long introns does not seem to have a significant role in D. melanogaster based on database search (see MATERIALS AND METHODS; also see ![]()
As an explanation for intron persistence and long lengths, we propose that there might be situations in which a longer intron length is selectively advantageous. ![]()
![]()
).
Longer introns reduce the Hill-Robertson effect compared to the same intron with a deletion and will also increase the hitchhiking effect between the favorable mutations and the longest variant. One selective advantage of introns, then, is to increase the recombination rate between selected mutations in different exons. This selective advantage of longer introns is expected to be greatest when the recombination rate is low and to diminish with increasing recombination (![]()
![]()
We further speculate that interference among many relatively weakly selected mutations will be the major cause of selection to reduce interference. This class of mutations is both abundant as polymorphisms and segregates at high frequencies in populations. Moreover, interference is maximized when selection coefficients are of equivalent magnitude (![]()
In Drosophila, synonymous mutations are the canonical example of weakly selected mutations, commonly observed as polymorphisms in coding regions (![]()
![]()
![]()
![]()
![]()
for synonymous mutations in D. melanogaster is 1.46 (95% confidence intervals of 0.262.9 and a mode of 1.31; see MATERIALS AND METHODS for details). In addition, we have previously shown that selection for codon bias is even detectable in regions of very low recombination (![]()
![]()
Codon bias is not the only form of weak selection that can promote selection for interference reduction. In Drosophila, both amino acid replacement changes (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
A scenario can be envisaged in which a mutational bias favoring deletions and a selection bias favoring long introns in regions of low recombination will establish a dynamic equilibrium that will change according to the recombinational environment within a given genome and/or gene. The fact that intron length and recombination are negatively correlated in both Drosophila and humans suggests that selection acting on intron length might be a general characteristic in eukaryotic genomes. The outcome of this complex equilibrium with mutational biases toward shorter introns, weak selection (which would be species specific), and strong selection controlling endpoints for proper splicing, might explain different modal intron lengths for different organisms. For instance, comparison of model organisms (Fig 7) shows modal lengths of 47, 58, 86, and 130 bp for Caenorhabditis elegans, D. melanogaster, Arabidopsis thaliana, and Homo sapiens, respectively. Small differences in the deletion bias alone will not explain these differences.
|
![]()
![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank C. Bergman for alerting us to ways in which DNA repair mechanisms may influence in/del bias. We also thank P. Andolfatto, C. Bergman, A. Llopart, M. Long, T. Nagylaki, S. Orzack, E. Stahl, K. Thornton, and C. Toomajian for helpful discussions and J. Hey and three anonymous reviewers for their constructive comments. J.M.C. is supported by a Postdoctoral Fellowship from Ministerio de Educación y Ciencia, Spain. This research was supported by National Institutes of Health grant GM-39355 to M.K., and a Postdoctoral Fellowship from Ministerio de Educación y Ciencia, Spain, to J.M.C. J.M.C. dedicates this work to Maria Teresa Graupera Vila.
Manuscript received August 2, 1999; Accepted for publication July 14, 2000.
| LITERATURE CITED |
|---|
AGUADÉ, M., and C. H. LANGLEY, 1994 Polymorphism and divergence in regions of low recombination in Drosophila, pp. 6776 in Non-Neutral Evolution: Theories and Molecular Data, edited by G. B. GOLDING. Chapman & Hall, New York.
AKASHI, H., 1995 Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 139:1067-1076[Abstract].
AKASHI, H., 1996 Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster.. Genetics 144:1297-1307[Abstract].
AKASHI, H., 1999 Inferring the fitness effects of DNA mutations from polymorphism and divergence data: statistical power to detect directional selection under stationarity and free recombination. Genetics 151:221-238






