- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Vieira, J.
- Articles by Charlesworth, B.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Vieira, J.
- Articles by Charlesworth, B.
Evidence for Selection at the fused Locus of Drosophila virilis
Jorge Vieiraa and Brian Charlesworthaa Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
Corresponding author: Jorge Vieira, Institute of Cell, Animal and Population Biology, University of Edinburgh, Ashworth Laboratories, King's Bldgs., W. Mains Rd., Edinburgh EH9 3JT, United Kingdom., j.vieira{at}ed.ac.uk (E-mail)
Communicating editor: A. G. CLARK
| ABSTRACT |
|---|
The genomic DNA sequence of a 2.4-kb region of the X-linked developmental gene fused was determined in 15 Drosophila virilis strains. One common replacement polymorphism is observed, where a negatively charged aspartic amino acid is replaced by the noncharged amino acid alanine. This replacement variant is located within the serine/threonine kinase domain of the fused gene and is present in ~50% of the sequences in our sample. Significant linkage disequilibrium is detected around this replacement site, although the fused gene is located in a region of the D. virilis X chromosome that seems to experience normal levels of recombination. In a 600-bp region around the replacement site, all eight alanine sequences are identical; of the six aspartic acid sequences, three are also identical. The occurrence of little or no variation within the aspartic acid and alanine haplotypes, coupled with the presence of several differences between them, is very unlikely under the usual equilibrium neutral model. Our results suggest that the fused alanine haplotypes have recently increased in frequency in the D. virilis population.
DATA on the level and patterns of within-population nucleotide polymorphisms can give information about the action of natural selection on DNA and protein sequences. This action is revealed as deviations from the patterns expected if the variation is selectively neutral (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
To obtain further evidence for selection acting on this gene, we have here analyzed a 2.4-kb region of fu in 15 D. virilis strains and one D. lummei strain; the region includes most of the coding region of fu, the four introns of this gene, and a small part of the 5' flanking region. The level and distribution of variability along the gene suggest that one of the alleles has recently increased in frequency within the D. virilis population.
| MATERIALS AND METHODS |
|---|
The D. virilis strains used in this work are all the group A strains listed in ![]()
![]()
![]()
![]()
A 2440-bp DNA fragment, which includes a small region of the 5' noncoding flanking region of fu, most of its coding region, and the four introns of this gene (Fig 1), was amplified using the primers FUF and FU4IR (Table 1) from a single male from each strain. Genomic DNA extraction was performed as described in ![]()
|
|
The numbers of synonymous, nonsynonynous, intron, and 5' flanking region differences between pairs of sequences were calculated using the DnaSP software (![]()
![]()
| RESULTS |
|---|
Levels of nucleotide variability:
The nucleotide polymorphisms found in our sample of 15 D. virilis alleles are shown in Fig 2, and the estimated level of DNA polymorphism is summarized in Table 2. There are 36 segregating sites within the sequenced region of the D. virilis fu gene, of which only 5 are replacement sites (3 of which are singletons). The replacement site at position 132 is located in the N-terminal serine/threonine kinase domain of fu and is present in ~50% of the sequences. In this case, a negatively charged aspartic amino acid is replaced by the noncharged amino acid alanine. The alanine haplotypes are not confined to any particular geographic region. The 13 alanine haplotypes so far analyzed are from different localities in Japan, Russia, Georgia, Caucasus, England, California, and Argentina, while the 18 aspartic acid haplotypes are from different localities in Japan, China, Georgia, Malta, England, Holland, California, Mexico, and Chile (![]()
![]()
|
|
The replacement site at position 2099 (serine/threonine) is the second most frequent replacement polymorphism in our sample, and it is located in the C-terminal domain of fu that may differentially regulate the fu catalytic domain (![]()
![]()
Not surprisingly, the level of nucleotide site diversity,
(![]()
values are similar to the average level of intron variation for a sample of six D. virilis X-linked genes (1.36%; ![]()
values from autosomal genes). A similar level of nucleotide variation at synonymous and intron positions is not the general pattern for D. virilis. The average level of variation at synonymous sites has been estimated to be approximately half of the level of variation at intron sites (![]()
is larger than our estimate of
(Watterson's estimator for 4Neµ, in which µ is the neutral mutation rate and Ne is the effective population size; ![]()
![]()
An inverse measure of codon bias, the effective number of codons (ENC; ![]()
![]()
![]()
Recombination parameters and linkage disequilibrium:
The fu gene is located in a region of the X chromosome that seems to experience normal levels of recombination (![]()
![]()
(where
is 0.013) are shown in Table 3. Hudson's estimator of C (![]()
![]()
|
In our data the nonneutral causes of the observed significant linkage disequilibrium between several pairs of sites (see below), together with an excess of intermediate frequency polymorphisms at intron sites (although not statistically significant), could in principle have an impact on the estimates of C. In general, as long as
, any nonneutral process that increases the variance of the number of base pair differences between DNA sequences will decrease the value of Hudson's estimator of C; similarly, any process that reduces incongruency will decrease the value of Hey and Wakeley's estimator. Using computer simulations of the coalescent process with recombination, under the usual neutral scenario (![]()
![]()
![]()
![]()
Although recombination was detected in our sample of 15 sequences, significant linkage disequilibrium was detected between several pairs of sites (Fig 3), especially in the regions 132231 and 20792124 [significant by Fisher's exact tests at P < 0.05, without Bonferroni correction for multiple tests; with the sequential Bonferroni correction for multiple tests (![]()
|
Evidence for selection:
Balancing selection acting on or near a region can give rise to an apparent pattern of locally reduced recombination, because theory (![]()
![]()
![]()
![]()
![]()
![]()
|
The replacement site at position 132 is located in the N-terminal serine/threonine kinase domain of fu and is present in ~50% of the sequences. In this case, a negatively charged aspartic amino acid is replaced by the noncharged amino acid alanine. Sequences with the aspartic residue have twice as much variability at synonymous and intron sites and three times as many segregating sites as sequences with alanine (Table 4). Furthermore, of the 25 polymorphisms that are not singletons, 8 are shared between the aspartic acid and alanine haplotypes, 14 are polymorphic only within the aspartic acid sequences, and 3 are polymorphic only within the alanine sequences. Therefore, it is possible that the alanine sequences may have risen in frequency only recently and acquired most of their variability through recombination with the aspartic acid sequences rather than through mutation. If the alanine haplotypes rose in frequency only recently, then the average level of divergence (0.0055) between the aspartic acid and alanine sequences should be similar to the average level of polymorphism for the oldest haplotypes, in this case the aspartic acid haplotypes, as is observed (the value is 0.0056; Table 5).
|
|
Because the aspartic acid/alanine replacement is common, it is possible in principle to evaluate if selection is maintaining this replacement polymorphism. However, the power of tests for detecting selection in regions of normal recombination is low when test statistics that assume no recombination are used (![]()
![]()
![]()
![]()
![]()
The strongest deviations from the expected neutral pattern should be detected near the putative site under selection. If only the first 600 bp of sequence are analyzed (this is where the common replacement variant is located), the aspartic acid sequences and the alanine sequences do not share any of the five polymorphic sites found in this region. Furthermore, all eight alanine sequences are identical, but of the six aspartic acid sequences, only three are identical. The occurrence of little or no variation within groups of sequences, coupled with the presence of several differences between them, is very unlikely under the usual equilibrium neutral model, even if no recombination is assumed [P = 0 using the ![]()
![]()
Divergence between D. virilis and D. lummei:
We have also determined the DNA sequence of the fu gene of one D. lummei strain (a close relative of D. virilis; ![]()
![]()
![]()
![]()
![]()
Furthermore, by comparing the level of divergence between D. lummei and D. virilis to the level of polymorphism in D. virilis, it is possible to determine whether the polymorphism peaks observed in the latter species could be attributed to regions of unusually low constraint. These regions seem not to be diverging more than surrounding regions that are less polymorphic and therefore these regions do not seem to represent regions of unusually low constraint (Fig 4). However, we have failed to show that there is significant heterogeneity in the distribution of polymorphic sites relative to fixed differences between D. lummei and D. virilis using the test statistics described in ![]()
![]()
| DISCUSSION |
|---|
Overall, the statistical tests presented above suggest that the lack of variability within the haplotypes with the codon for alanine (GCC) at nucleotides 131133 relative to those with the codon for aspartic acid (GAT) is inconsistent with an equilibrium neutral model and suggest a history of selection at the fu locus. There are three main such possibilities that can be imagined. The first is that balancing selection has been maintaining the amino acid polymorphism at this position (or at a closely linked site) for much longer than the standard neutral coalescence time; this is expected to produce a window of enhanced variability in the neighborhood of the target of selection, as observed here (![]()
![]()
![]()
![]()
The fit of the data to these models was investigated by simulations of a structured coalescent model, in which there are two alleles (A1 and A2) that are assumed to be the target of selection. The aspartic acid to alanine substitution is equated with the mutation from A1 to A2. Allele A2 originated at time T in the past, measured in units of 2Ne generations, where Ne is the effective size of the population. If T is >> 1, the two haplotypes are old; if T << 1, A2 originated recently. The population frequencies of A1 and A2 at the time of sampling are p and q; if q is small, A2 must have been maintained at a low frequency since its time of origination and has only recently increased to its current value of ~0.5.
Variation at a set of neutral sites linked with recombination frequency r to the selected site was modeled by tracing the ancestry of an initial set of genes with n1 copies that were A1 in state and n2 copies that were A2. No recombination within the set of neutral sites was permitted; this is a conservative assumption in view of the nature of the hypotheses being tested (see below) and greatly simplifies the calculations. Up to time T, the rate per unit coalescence time at which a neutral site that is currently associated with allele A1 is derived from A2 is R12 = qR, where R = 2Ner; the complementary rate for a neutral site associated with A2 is R21 = pR (![]()
![]()
Using the standard assumption that the possible events involved follow competing exponential distributions with the above rate constants (![]()
![]()
Given the fact that this approach assumes no recombination between the neutral sites in question, it is only applicable to a group of closely linked sites. We have therefore applied it to the block of five segregating sites that are closest to the sites 132133, which are involved in the aspartic acid/alanine substitution. This block spans sites 198231 and shows no evidence for recombination in the set of 15 sampled fu alleles. From Fig 2, it is evident that the vector of the numbers of segregating sites in categories (i) to (iv), respectively, is (5, 0, 0, 0). An overall measure of the goodness of fit of the simulations to the data is thus given by the frequency of replicates in which all five segregating sites are found only among the A1 alleles.
Fig 5 shows the results of the simulations. A gives the measure of the goodness of fit for the case when A1 and A2 have been maintained at equal frequencies (q = 0.5), as a function of R for various values of T. Since we have no a priori evidence as to whether the aspartic acid or alanine variants are ancestral, the probability values generated by the simulations have been multiplied by two. It is evident that only the cases with a relatively recent origin of the alanine haplotype (T = 0.1 or 0.01) have probabilities >>1%. It is not possible to distinguish clearly between these two T values, although T = 0.01 consistently generates higher probabilities, nor to assign a value to R with much confidence, although the extreme values in Table 3 are both consistent with the data. The hypothesis that the two alleles have been maintained by selection at intermediate frequencies for a period that is as long or longer than the coalescence time is decisively rejected, however.
|
We have recently shown that the fu gene is duplicated in three closely related species of the virilis group (D. americana, D. texana, and D. novamexicana) but, by direct sequencing, there is no evidence for the presence of this duplication in any other species of the virilis group (J. VIEIRA, unpublished results). Of the nine D. virilis polymorphic sites observed in the first 600 bp, including the aspartic acid/alanine replacement polymorphism, seven have been observed either as fixed differences between the two duplicated fu genes or as polymorphisms in a small sample of D. americana and D. texana sequences (J. VIEIRA, unpublished results). Therefore, most of the polymorphisms observed in D. virilis in this region are ancient and must predate the divergence between D. americana/D. texana/D. novamexicana and D. virilis. This observation is clearly incompatible with a relatively recent origin of the D. virilis alanine haplotype as required by the above model.
B displays the results of simulations where allele A2 had a much lower frequency than A1 up to the time of sampling, consistent with its lower level of variability, assuming T = 10. Since prior information is being used, it is legitimate to use the one-tailed probabilities generated by the simulations in this case. Overall, a value of q of 0.001 or 0.01 is consistent with the data at the 5% probability level for all R values considered; q = 0.1 is consistent with the data only for R in the neighborhood of one. Clearly, the alanine haplotype must have been kept at low frequency in the lineage leading to D. virilis. Because neither the aspartic acid nor the alanine haplotypes are confined to any particular geographic region, this implies that the alanine mutation has recently increased in frequency throughout the D. virilis populations. Consistent with this view is the observation that there is no polymorphism in this region in a worldwide D. virilis sample of 13 alanine haplotypes (![]()
Because it seems that the two fu gene copies of D. americana and D. texana may be distinguished by the presence or absence of an aspartic acid/alanine at the same position where the common D. virilis replacement polymorphism is found, although the sample size is small (J. VIEIRA, unpublished results), it is possible that there may be an advantage in having both fu haplotypes. We therefore speculate that this may not be a selective sweep caught in midstream, but rather a balanced polymorphism that has experienced a shift in frequency. Our data fit the general observation that old balanced polymorphisms with intermediate allele frequencies seem to be rare in Drosophila. In the past few years, the pattern and level of intraspecific variation at the nucleotide level has been analyzed in detail for seven D. melanogaster loci with common allozyme polymorphisms (Adh, 6Pgd, G6pd, Gpdh, Sod, Est-6, and Tpi; ![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank D. Filatov for providing us with his computer program for performing the statistical tests with recombination, M. Przeworski for performing the recombination simulations in relation to Table 3, and Table 3D. Charlesworth and C. P. Vieira for helpful comments on the work. J.V. is supported by the Fundação para a Ciencia e Tecnologia (PRAXIS XXI/BPD/14120/97). B.C. is supported by the Royal Society.
Manuscript received August 12, 1999; Accepted for publication April 7, 2000.
| LITERATURE CITED |
|---|
ASHBURNER, M., 1989 Drosophila: A Laboratory Handbook. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
BLANCHET-TOURNIER, M. F., H. TRICOIRE, D. BUSSON, and C. LAMOUR-ISNARD, 1995 The segment-polarity gene fused is highly conserved in Drosophila. Gene 161:157-162[Medline].
BAUER, V. L. and C. F. AQUADRO, 1997 Rates of DNA sequence evolution are not sex-biased in Drosophila melanogaster and D. simulans.. Mol. Biol. Evol. 14:1252-1257[Abstract].
FILATOV, D. A. and D. CHARLESWORTH, 1999 DNA polymorphism, haplotype structure and balancing selection in the Leavenworthia PgiC locus. Genetics 153:1423-1434
HANKS, S. K., A. M. QUINN, and T. HUNTER, 1988 The protein kinase family: conserved features and deduced phylogeny of the catalytic domains. Science 241:42-52
HASSON, E., I. N. WANG, L. W. ZENG, M. KREITMAN, and W. F. EANES, 1998 Nucleotide variation in the triosephosphate isomerase (Tpi) locus of Drosophila melanogaster and Drosophila simulans.. Mol. Biol. Evol. 15:756-769[Abstract].
HEY, J. and J. WAKELEY, 1997 A coalescent estimator of the population recombination rate. Genetics 145:833-846[Abstract].
HSU, T. C., 1952 Chromosomal variation and evolution in the virilis group of Drosophila. Univ. Texas Publ. 5204:35-72.
HUDSON, R. R., 1987 Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50:245-250[Medline].
HUDSON, R. R., 1990 Gene genealogies and the coalescent process, pp. 144 in Oxford Surveys in Evolutionary Biology, Vol. 7, edited by D. FUTUYMA and J. ANTONOVICS. Oxford University Press, Oxford.
HUDSON, R. R. and N. L. KAPLAN, 1985 Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164
HUDSON, R. R. and N. L. KAPLAN, 1988 The coalescent process in models with selection and recombination. Genetics 120:831-840
HUDSON, R. R., K. BAILEY, D. SKARECKY, J. KWIATOWSKI, and F. J. AYALA, 1994 Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster.. Genetics 136:1329-1340[Abstract].
INGHAM, P. W., 1993 Localized hedgehog activity controls spatial limits of wingless transcription in the Drosophila embryo. Nature 366:560-562[Medline].
KAPLAN, N. L., T. DARDEN, and R. R. HUDSON, 1988 The coalescent process in models with selection. Genetics 120:819-829
KELLY, J. K., 1997 A test of neutrality based on interlocus associations. Genetics 146:1197-1206[Abstract].
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, United Kingdom.
KIMURA, M. and T. OHTA, 1972 On the stochastic model for estimation of mutational distance between homologous proteins. J. Mol. Evol. 2:87-90[Medline].
KREITMAN, M. and H. AKASHI, 1995 Molecular evidence for natural selection. Annu. Rev. Ecol. Syst. 26:403-422.
MCDONALD, J. H., 1996 Detecting non-neutral heterogeneity across a region of DNA sequence in the ratio of polymorphism to divergence. Mol. Biol. Evol. 13:253-260[Abstract].
MCDONALD, J. H., 1998 Improved tests for heterogeneity across a region of DNA sequence in the ratio of polymorphism to divergence. Mol. Biol. Evol. 15:377-384[Abstract].
MCDONALD, J. H. and M. KREITMAN, 1991 Adaptive protein evolution at the Adh locus in Drosophila. Nature 351:652-654[Medline].
MCVEAN, G. A. T. and J. VIEIRA, 1999 The evolution of codon preferences in Drosophila: a maximum likelihood approach to parameter estimation and hypothesis testing. J. Mol. Evol. 49:63-75[Medline].
NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.
NORDBORG, M., 1997 Structured coalescent processes on different time scales. Genetics 146:1501-1514[Abstract].
NURMINSKY, D. I., E. N. MORIYAMA, E. R. LOZOVSKAYA, and D. L. HARTL, 1996 Molecular phylogeny and genome evolution in the Drosophila virilis species group: duplications of the alcohol dehydrogenase gene. Mol. Biol. Evol. 13:132-149[Abstract].
OHTA, T., 1992 The nearly neutral theory of molecular evolution. Annu. Rev. Ecol. Syst. 23:263-286.
PREAT, T., P. THÉROND, C. LAMOUR-ISNARD, B. LIMBOURG-BOUCHON, and H. TRICOIRE et al., 1990 A putative serine/threonine protein kinase encoded by the segment-polarity fused gene of Drosophila. Nature 347:87-89[Medline].
RICE, W. R., 1989 Analyzing tables of statistical tests. Evolution 43:223-225.
ROZAS, J. and R. ROZAS, 1997 DnaSP version 2.0: a novel software package for extensive molecular population genetics analysis. Comput. Appl. Biosci. 13:307-311.
STROBECK, C., 1983 Expected linkage disequilibrium for a neutral locus linked to a chromosomal arrangement. Genetics 103:545-555
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595
TAJIMA, F., 1993 Measurement of DNA polymorphism, pp. 3760 in Mechanisms of Molecular Evolution, edited by N. TAKAHATA and A. G. CLARK. Sinauer Associates, Sunderland, MA.
THÉROND, P., G. ALVES, B. LIMBOURG-BOUCHON, H. TRICOIRE, and E. GUILLEMET et al., 1996 Functional domains of fused, a serine-threonine kinase required for signaling in Drosophila. Genetics 142:1181-1198[Abstract].
TOMINAGA, H. and S. NARISE, 1995 Sequence evolution of the Gpdh gene in the Drosophila virilis species group. Genetica 96:293-302[Medline].
VIEIRA, J. and B. CHARLESWORTH, 1999 X chromosome DNA variation in Drosophila virilis.. Proc. R. Soc. Lond. B Biol. Sci. 266:1905-1912[Medline].
WAKELEY, J., 1996 The variance of pairwise differences in two populations with migration. Theor. Popul. Biol. 49:39-57[Medline].
WALL, J. D., 1999 Recombination and the power of statistical tests of neutrality. Genet. Res. 74:65-79.
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-275[Medline].
WRIGHT, F., 1990 The `effective number of codons' used in a gene. Gene 87:23-29[Medline].
This article has been cited by other articles:
![]() |
J. Vieira Two Divergent Species of the Virilis Group, Drosophila littoralis and Drosophila virilis, Share a Replacement Polymorphism at the fused Locus Mol. Biol. Evol., April 1, 2002; 19(4): 579 - 581. [Full Text] [PDF] |
||||
![]() |
C. Schlotterer A Microsatellite-Based Multilocus Screen for the Identification of Local Selective Sweeps Genetics, February 1, 2002; 160(2): 753 - 763. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. R. Hudson Two-Locus Sampling Distributions and Their Application Genetics, December 1, 2001; 159(4): 1805 - 1817. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Vieira, B. F. McAllister, and B. Charlesworth Evidence for Selection at the fused1 Locus of Drosophila americana Genetics, May 1, 2001; 158(1): 279 - 290. [Abstract] [Full Text] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Vieira, J.
- Articles by Charlesworth, B.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Vieira, J.
- Articles by Charlesworth, B.





