Genetics, Vol. 164, 1471-1480, August 2003, Copyright © 2003

Natural Selection Drives Drosophila Immune System Evolution

Todd A. Schlenkea and David J. Beguna
a Section of Evolution and Ecology, Division of Biological Sciences, University of California, Davis, California 95616

Corresponding author: Todd A. Schlenke, Storer Hall, University of California, Davis, CA 95616., taschlenke{at}ucdavis.edu (E-mail)

Communicating editor: W. STEPHAN


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Evidence from disparate sources suggests that natural selection may often play a role in the evolution of host immune system proteins. However, there have been few attempts to make general population genetic inferences on the basis of analysis of several immune-system-related genes from a single species. Here we present DNA polymorphism and divergence data from 34 genes thought to function in the innate immune system of Drosophila simulans and compare these data to those from 28 nonimmunity genes sequenced from the same lines. Several statistics, including average KA/KS ratio, average silent heterozygosity, and average haplotype diversity, significantly differ between the immunity and nonimmunity genes, suggesting an important role for directional selection in immune system protein evolution. In contrast to data from mammalian immunoglobulins and other proteins, we find no strong evidence for the selective maintenance of protein diversity in Drosophila immune system proteins. This may be a consequence of Drosophila's generalized innate immune response.


INVESTIGATING the relationship between functional properties of genes and patterns of evolution is a major goal of evolutionary genetics. For example, genes functioning in host-pathogen interactions may be targets of directional or balancing selection more often than genes from other functional categories. Evidence of these unusual evolutionary forces may be manifest in the distribution of DNA sequence variation in host immunity genes within and between species (e.g., TANAKA and NEI 1989 Down; HUGHES et al. 1990 Down; BISHOP et al. 2000 Down; STAHL and BISHOP 2000 Down). Population genetic data from immunity proteins may also clarify the mechanisms of host response to pathogens over evolutionary time and the underlying population dynamics of pathogen virulence.

Drosophila has an innate immune system, which mediates a rapid, generalized response to invading pathogens. Innate immunity is conserved from insects to vertebrates (for reviews see KIMBRELL and BEUTLER 2001 Down; SILVERMAN and MANIATIS 2001 Down), although vertebrates have added the acquired immune response, which allows more specificity and "memory" of previous infections. Drosophila immunity has two main components: the cellular response and the humoral (systemic) response. The cellular response relies on hemocytes, which possess cell-surface pattern recognition receptors and which engulf or encapsulate foreign cells. The humoral response is mounted in the fat body, the immunity organ of Drosophila. The presence of pathogen antigens in the hemolymph triggers a signaling cascade across fat body cell membranes. Transduction of the signal culminates in the translocation of transcription factors into the nucleus, where they elevate transcription of immunity peptides. These peptides are released from the fat body cells into the hemolymph where they lyse invading microbial cells.

Advances in the genetic and biochemical description of Drosophila immunity have allowed us to pursue a large-scale molecular population genetic investigation of immune system genes. Here we present polymorphism and divergence data from 34 genes thought to be involved in recognition and signaling in the cellular and humoral immune responses of Drosophila simulans. Previously published data from 28 nonimmunity genes sequenced from the same D. simulans lines serve as a basis for comparison with the immunity data, such that the genomic effects of demographic history may be distinguished from the gene-specific effects of positive selection. Our goal is to investigate the relative importance of directional selection, balancing selection, and genetic drift in determining the evolution of host immune system proteins.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

All D. simulans sequence data are from a set of highly inbred lines made from field-caught inseminated females collected in the Wolfskill Orchard, Winters, California, in summer 1995 (BEGUN and WHITLEY 2000A Down). Sampling from a single population avoids complications in population genetic inference that can arise when genes are sampled from disparate populations that may have different demographic and selective histories. D. melanogaster sequence data were taken from GenBank. For a subset of the genes, a D. yakuba sequence was used as the outgroup to the D. melanogaster/D. simulans species pair. PCR products were directly sequenced on an Applied Biosystems (Foster City, CA) 377 automated sequencer. Information on primer sequences used for both PCR and sequencing reactions for all genes are available upon request from T.A.S. The analysis section in the DnaSP program (ROZAS and ROZAS 1999 Down) was used for most sequence analyses.

The 34 immunity genes surveyed are located throughout the D. simulans genome; an average of 7.85 alleles with an average of 1384 bp were sequenced per gene. Theoretical results suggest that sample sizes in this range allow for reasonable estimation of population heterozygosity, at least under the neutral model (TAJIMA 1983 Down; KLIMAN and HEY 1993 Down). A subset of the genes surveyed here are linked to the immune system not by direct experimentation but by virtue of homology to known immunity proteins. The known or inferred function of each immunity gene is provided in supplementary material A at http://www.genetics.org/supplemental. Data from 28 nonimmunity genes located on both chromosome 3R and the X chromosome were also included in this study (BEGUN and WHITLEY 2000B Down; BEGUN 2001 Down). Thirteen of the nonimmunity genes are located on chromosome 3R; the average sample size is 7.31 D. simulans alleles with an average of 1199 bp sequenced per allele per gene. For the X chromosome, 15 nonimmunity genes are analyzed with an average of 7.13 alleles and 1182 bp sequenced per gene. These nonimmunity genes were originally chosen on the basis of their cytological locations, not because of a priori hypotheses regarding the action of selection (BEGUN and WHITLEY 2000B Down). Thus, patterns of variation and divergence in a set of genes belonging to a coherent functional category, the immune system, can be compared to this functionally diverse, "random" set of genes. The cytological location, the sample size, the number of sites sequenced (coding and noncoding), and the presence/absence of a D. yakuba allele for all genes analyzed are listed in Table 1. Immunity gene sequences (excluding Relish; BEGUN and WHITLEY 2000A Down) have been deposited in GenBank under accession nos. AF544231, AF544232, AF544233, AF544234, AF544235, AF544236, AF544237, AF544238, AF544239, AY349684, AY349685, AY349686, AY349687, AY349688, AY349689, AY349690, AY349691, AY349692, AY349693, AY349694, AY349695, AY349696, AY349697, AY349698, AY349699, AY349705, AY349706, AY349707, AY349708, AY349709, AY349710, AY349711, AY349712, AY349713, AY349714, AY349715, AY349716, AY349717, AY349718, AY349719, AY349720, AY349721, AY349722, AY349723, AY349724, AY349725, AY349726, AY349727, AY349728, AY349729, AY349730, AY349731, AY349732, AY349733, AY349734, AY349735, AY349736, AY349745, AY349746, AY349747, AY349748, AY349749, AY349750, AY349751, AY349752, AY349761, AY349932, AY352227, AY352228, AY352229, AY352230, AY352231, AY352232, AY352233, AY352234, AY352235, AY352236, AY352237, AY352238, AY352239, AY352240, AY352241, AY352242, AY352243, AY352244, AY352245, AY352246, AY352247, AY352248, AY352249, AY352250, AY352251, AY352252, AY352253, AY352254, AY352255, AY352256, AY352257, AY352258, AY352259, AY352260, AY352261, AY352262, AY352263, AY352264, AY352265, AY354407, AY354408, AY354409, AY354410, AY354411, AY354412, AY354413, AY354414, AY354415, AY354416, AY354417, AY354418, AY354419, AY354420, AY354421, AY354422, AY354423, AY354424, AY354425, AY354426, AY354427, AY354428, AY354429, AY354430, AY354431, AY354432, AY354433, AY354434, AY354435, AY354436, AY354437, AY354438, AY354439, AY354440, AY354441, AY354442, AY354443, AY354444, AY354445, AY354446, AY354447, AY354448, AY354449, AY354450, AY354451, AY354452, AY354453, AY354454.


 
View this table:
In this window
In a new window

 
Table 1. Summary of the data collected

For some analyses, fixed differences between D. simulans and D. melanogaster were polarized using parsimony. For example, if D. melanogaster and D. yakuba share a base at a particular site but all D. simulans alleles have a different base, then a mutation is inferred to have arisen and fixed along the D. simulans lineage. Mutations were assigned to the D. simulans lineage only if the D. melanogaster and D. yakuba alleles were identical. Polymorphisms within D. simulans were polarized in a similar manner (e.g., AKASHI 1996 Down). Silent site mutations were classified as preferred or unpreferred through the use of the outgroup method as described by AKASHI 1996 Down.

Silent and replacement divergence levels do not significantly differ between autosomal and sex-linked genes in D. simulans and D. melanogaster (BAUER and AQUADRO 1997 Down; BEGUN and WHITLEY 2000B Down; BEGUN 2002 Down). BEGUN and WHITLEY 2000B Down and BEGUN 2001 Down, however, found that silent heterozygosity levels and the polymorphism frequency distribution do significantly differ across chromosome arms. Thus, for comparisons of heterozygosity and the polymorphism frequency distribution between immunity and nonimmunity genes, nonimmunity chromosome 3R and X-linked data will be treated separately. In these cases, the most relevant comparison is immunity genes vs. autosomal nonimmunity genes because 30 of the 34 sampled immunity genes are autosomal. For all other analyses, the nonimmunity chromosome 3R and the X-linked data are combined. Polymorphism and divergence data for individual genes are provided in supplementary material B at http://www.genetics.org/supplemental.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Tests for recurrent directional selection
KA/KS ratio: The KA/KS ratio compares the number of replacement (amino acid altering) substitutions per site and silent (synonymous) substitutions per site among different DNA sequences (LI et al. 1985 Down). This ratio will tend to be higher for genes experiencing recurrent fixation of beneficial replacement mutations. The average KA/KS value between D. simulans and D. melanogaster is significantly greater for immunity genes than for nonimmunity genes (Mann-Whitney U-test, P = 0.049, Table 2). Given that silent divergence is not significantly different between these groups (Mann-Whitney U-test, P = 0.34, Table 2), the higher ratio is attributable primarily to elevated replacement divergence (Mann-Whitney U-test, P = 0.02, Table 2). This finding is consistent with that of a similar study comparing rates of mammalian immunity-gene evolution to rates of evolution of other classes of genes (MURPHY 1993 Down). KA/KS estimates along the D. simulans and D. melanogaster lineages may be made independently using D. yakuba as an outgroup. Average KA/KS along the D. simulans lineage is significantly greater for immunity than for nonimmunity genes (Mann-Whitney U-test, P = 0.022, Table 3). For D. melanogaster, the difference is nearly significant (Mann-Whitney U-test, P = 0.053, Table 3).


 
View this table:
In this window
In a new window

 
Table 2. Comparison of replacement divergence and silent divergence


 
View this table:
In this window
In a new window

 
Table 3. KA/KS values along the D. simulans and D. melanogaster lineages

McDonald-Kreitman tests: Although the elevated KA/KS ratio in immunity genes is consistent with directional selection on immunity proteins, it is also consistent with a higher neutral mutation rate at replacement sites in immunity genes. Joint consideration of polymorphic and fixed mutations can provide a means of distinguishing these alternatives. Under the neutral model of molecular evolution, the ratio of replacement to silent fixations between species should equal the ratio of replacement to silent polymorphisms within a species, regardless of the level of functional constraint on a gene (KIMURA 1983 Down; MCDONALD and KREITMAN 1991 Down). However, because of the short sojourn time of replacement mutations under directional selection, such mutations make a relatively smaller contribution to standing protein polymorphism than to protein divergence, thereby elevating the replacement fixation to silent fixation ratio (relative to the replacement-polymorphism-to-silent-polymorphism ratio). The McDonald-Kreitman test (MCDONALD and KREITMAN 1991 Down) uses a 2 x 2 contingency table to test for differences in these ratios.

Table 4 shows the total numbers of replacement and silent polymorphisms and replacement and silent fixed differences for immunity and nonimmunity genes. Fixed differences between D. simulans and D. melanogaster were included only if they could be polarized to the D. simulans lineage (i.e., only genes in which a D. yakuba allele has been sequenced were analyzed: 33 immunity genes and 15 nonimmunity genes). Both immunity and nonimmunity genes as groups show highly significant deviations from the neutral expectation in these 2 x 2 contingency tables ({chi}2 = 62.19, P << 10-5; {chi}2 = 18.03, P < 10-4, respectively, Table 4). In principle, such deviations from neutrality may be caused by any of the cells in the 2 x 2 table. Results of the type observed in our data, however, are usually interpreted as evidence for adaptive protein evolution, i.e., an excess of replacement fixations. We address the possibility of selection on silent sites in a later section.


 
View this table:
In this window
In a new window

 
Table 4. McDonald-Kreitman test results with and without outlier genes

Because individual genes are not equally weighted in our McDonald-Kreitman tests, ratios of polymorphic-to-fixed mutations pooled across genes may be skewed by data from a small number of genes. Such a pattern could obscure the fact that most genes are evolving neutrally. This appears to be the case for the nonimmunity genes, as the departure from neutrality in the pooled McDonald-Kreitman test is attributable to one gene, mei-218, which has 33 of the 50 nonimmunity replacement fixations (although mei-218 data alone are not significantly different from neutral expectation). However, the significant McDonald-Kreitman test for immunity genes cannot be explained by data from a small number of outliers. Seven immunity genes (dorsal, Dredd, imd, Relish, Spn43Ac, Tehao, and Toll) individually deviate from neutrality in the McDonald-Kreitman test (P < 0.05). Data from the remaining 26 immunity genes still significantly deviate from the neutral expectation in a direction consistent with excess replacement fixations (Table 4).

Selection on silent sites is thought to have led to the high amounts of codon bias typically observed in Drosophila genes. Although immunity genes have slightly lower levels of codon bias than nonimmunity genes, average values for three different measures of codon bias do not significantly differ between immunity and nonimmunity genes (Table 5), suggesting that the intensity of selection for codon bias is similar in these gene groups. AKASHI 1996 Down, AKASHI 1999 Down hypothesized that selection on silent sites could contribute to the pattern typically attributed to adaptive protein evolution in McDonald-Kreitman tests. This model posits that many silent mutations in Drosophila are borderline deleterious (unpreferred) and thus may make a larger contribution to polymorphism than to divergence (SHARP and LI 1986 Down; BULMER 1988 Down; AKASHI 1995 Down). Table 6 compares the ratios of preferred to unpreferred silent polymorphisms with the ratios of preferred to unpreferred silent fixations. As expected (AKASHI 1996 Down, AKASHI 1999 Down; TAKANO 1998 Down; BEGUN 2001 Down), nonimmunity genes show a significant deviation from the neutral expectation in the direction of "too many" unpreferred polymorphisms ({chi}2 = 6.17, P = 0.01). The immunity genes do not show a significant deviation ({chi}2 = 3.59, P = 0.06), although these data also show a trend toward an excess of unpreferred polymorphisms.


 
View this table:
In this window
In a new window

 
Table 5. Comparison of average codon bias between gene groups


 
View this table:
In this window
In a new window

 
Table 6. Silent site evolution and modified McDonald-Kreitman test

One strategy for minimizing the potential complications of weakly deleterious silent mutations on interpretation of the McDonald-Kreitman test is to use only preferred silent mutations. Under the mutation-selection-drift model, preferred codons are beneficial. Thus, if the ratio of replacement to preferred silent fixations is significantly greater than the ratio of replacement to preferred silent polymorphisms, one might have more confidence in the inference that replacement sites are under directional selection. Comparison of replacement and preferred silent mutations in immunity genes reveals a highly significant deviation in the direction of an excess of replacement fixations ({chi}2 = 8.21, P < 10-2). However, replacement and preferred silent mutations from nonimmunity genes show no deviation ({chi}2 = 0.56, P = 0.45). This result provides additional support for the notion that directional selection plays a greater role in immunity protein evolution than in nonimmunity protein evolution. This result also suggests that selection on silent sites may have inflated McDonald-Kreitman estimates of the effects of positive selection observed in previous studies (FAY et al. 2002 Down; SMITH and EYRE-WALKER 2002 Down) utilizing pooled nonimmunity Drosophila data.

Tests for recent selection
Polymorphism levels: Several types of selection may lead to reductions of linked heterozygosity (MAYNARD-SMITH and HAIGH 1974 Down; KAPLAN et al. 1989 Down; GILLESPIE 1997 Down, GILLESPIE 1999 Down). Thus, if immunity genes experience directional selection more often than nonimmunity genes, immunity genes should have lower nucleotide diversity (relative to divergence) than nonimmunity genes. The ratio of silent polymorphism to divergence is significantly lower for immunity genes (0.19) than for autosomal nonimmunity genes (0.32, Mann-Whitney U-test, P = 0.03, Table 7). Although BEGUN and WHITLEY 2000B Down previously reported that X-linked genes have reduced heterozygosity compared to autosomal genes in D. simulans, heterozygosity in the subset of their X-linked and autosomal data analyzed here are not significantly different (Mann-Whitney U-test, P = 0.08). Silent heterozygosity-to-divergence ratios for immunity and X-linked genes are not different (Mann-Whitney U-test, P = 0.79). Reduced silent heterozygosity in immunity genes supports the hypothesis that linked selection has a greater effect on these genes than on a random sample of autosomal genes in D. simulans. Although greater effects of linked purifying selection (i.e., background selection; CHARLESWORTH et al. 1993 Down) on immunity proteins could also contribute to this pattern, this seems unlikely given the higher rate of protein divergence in immunity genes compared to nonimmunity genes.


 
View this table:
In this window
In a new window

 
Table 7. Average silent heterozygosity-to-divergence ratios

Polymorphic-site allele frequencies: In addition to reducing linked heterozygosity, directional selection may skew the frequency of derived alleles at linked polymorphic sites toward an excess of high- and low-frequency alleles (BRAVERMAN et al. 1995 Down; FAY and WU 2000 Down; PRZEWORSKI 2002 Down). This skew may occur if a selected allele does not fix or if recombination dissociates alleles at linked polymorphic sites from the selected allele before fixation. Table 8 compares the number of derived polymorphisms in low-frequency (<=0.2), high-frequency (>=0.8), and intermediate-frequency classes among immunity and nonimmunity X-linked and autosomal genes. Given that between six and eight D. simulans alleles were sequenced for each gene, the low-frequency class corresponds to singletons, while the high-frequency class includes only the highest possible frequency mutations in the sample. The ratio of low-to-intermediate-to-high-frequency silent polymorphisms is significantly heterogeneous for the three pairwise comparisons among the three categories of genes (Table 8), with the immunity genes showing the highest proportion of both high- and low-frequency polymorphisms. This pattern is even more extreme when the proportions of low- and high-frequency polymorphisms are calculated on a gene-by-gene basis and then averaged, giving equal weight to every gene (data not shown). Nonimmunity X-linked genes have by far the smallest proportion of low-frequency mutants (see also Table 3 of BEGUN 2001 Down).


 
View this table:
In this window
In a new window

 
Table 8. Numbers of derived silent polymorphisms at low, intermediate, and high frequency

Polymorphic-site allele distributions: Several statistics are available for determining whether the distributions of alleles at polymorphic sites among sampled chromosomes conform to the neutral equilibrium expectation. We focus on one such statistic, haplotype diversity (Hd), given by equation 8.4 of NEI 1987 Down, but we replace 2n by n: Hd = n(1 - {sum}x2)/(n - 1), where x is the frequency of each haplotype and n is the number of alleles sampled. Recent directional selection events that rapidly elevate the frequency of a favored haplotype tend to decrease overall haplotype diversity at a locus. Immunity genes have the lowest average haplotype diversity, and both immunity genes and X-linked nonimmunity genes have significantly lower haplotype diversities than the autosomal nonimmunity genes (Mann-Whitney U-test, P < 10-2, P < 10-2, respectively; Table 9). Low haplotype diversity in immunity genes cannot be explained by a lack of segregating sites, since average S (including noncoding mutations) for immunity genes (31.1) is greater than that for nonimmunity autosomal genes (29.3). There is no evidence for different average haplotype diversities between immunity and nonimmunity X-linked genes, although nonimmunity X-linked genes have significantly lower average S (14.4, Mann-Whitney U-test, P < 10-2).


 
View this table:
In this window
In a new window

 
Table 9. Average per gene haplotype diversity (Hd)

Balancing selection
If balancing selection (e.g., overdominance, negative frequency-dependent selection) were common in immunity proteins, one might predict that compared to nonimmunity genes, immunity genes would show elevated replacement heterozygosity. Table 10 shows that the average replacement heterozygosity (relative to divergence) for immunity genes is not significantly greater than that observed for autosomal nonimmunity genes (Mann-Whitney U-test, P = 0.21). In fact, the ratio for immunity genes (0.12) is substantially less than that for autosomal nonimmunity genes (0.31). However, this difference is due primarily to the "outlier" replacement heterozygosity value for the nonimmunity gene Hsc70 (1.89). If data from Hsc70 are omitted, replacement heterozygosity for immunity and autosomal nonimmunity genes is very similar (0.12 and 0.16, respectively). This analysis would seem to provide no evidence for excess protein polymorphism in immunity genes. A complication in interpreting this result, however, is the evidence for reduced silent heterozygosity in immunity genes (Table 7). If reduced silent heterozygosity in immunity genes results from linked selection, then we would also expect to observe reduced replacement heterozygosity in immunity genes. Furthermore, our analyses of polymorphism and divergence provide evidence that directional selection elevates rates of amino acid divergence in immunity genes. Overall, then, we might expect replacement heterozygosity-to-divergence ratios for immunity genes to be significantly lower than those for nonimmunity genes. Thus, one interpretation of the fact that replacement heterozygosity is not clearly reduced in immunity genes is that some level of diversity-enhancing selection acts on amino acid variants in immune system genes.


 
View this table:
In this window
In a new window

 
Table 10. Average replacement heterozygosity-to-divergence ratios


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Molecular population genetic analyses of large numbers of genes allow investigation of the relative importance of various evolutionary forces acting on proteins in different functional classes. Comparing genes from different functional classes also allows one to move beyond the assumptions and testing of the neutral equilibrium model. This is important because deviations from neutral model expectations can often be explained by either selection or demographic phenomena (e.g., population bottlenecks, population expansion, population subdivision). Comparisons of different classes of genes within a single population sample can help distinguish between these alternatives because effects of selection tend to be gene specific while effects of demography tend to be genome wide.

The D. simulans immune system DNA sequence data presented here are mainly from proteins thought to be involved in recognition of pathogens (in the cellular and/or humoral immune responses) and in upstream humoral response-signaling pathways. The distribution of DNA sequence variation within and between species suggests that directional selection plays a more important role in immunity gene evolution than in nonimmunity genes. Immunity genes have a significantly higher average KA/KS ratio (Table 2), a greater proportion of genes contributing to significant McDonald-Kreitman test results (Table 4), a significantly lower average standardized silent heterozygosity (Table 7), a greater proportion of both low- and high-frequency silent polymorphisms (Table 8), and significantly lower average haplotype diversity (Table 9) than nonimmunity genes sampled from the same set of inbred lines from a single California population.

Some population genetic differences between immunity and nonimmunity genes might be explained by differences in the levels of recombination experienced by these gene groups, as opposed to higher levels of positive selection in the immunity genes. For example, reduced silent heterozygosity in immunity genes could potentially be explained by lower levels of recombination in immunity genes (BEGUN and AQUADRO 1992 Down; AQUADRO et al. 1994 Down). Genetic data from D. simulans, although limited, suggest that there is little heterogeneity in recombination rates across the genome (STURTEVANT 1929 Down; OHNISHI and VOELKER 1979 Down; TRUE et al. 1996 Down). Although reduced recombination tends to be associated with proximity to centromeres and telomeres, the centromere effect of reduced crossing over encompasses a smaller physical region in D. simulans than in D. melanogaster. These genetic data are consistent with population genetic data showing that genes located within only five cytological divisions from centromeres, or within two cytological divisions from telomeres, have "normal" levels of polymorphism (HASSON et al. 1998 Down; BEGUN and WHITLEY 2000B Down; BEGUN 2002 Down).

Of the 34 immunity genes considered here, 12 (crq, Dif, dorsal, Dredd, IKK{gamma}, {kappa}B-Ras, Lectin-galC1, Mvl, ref(2)P, Spn43Ac, Tak1, and tube) are located within five cytological divisions of a centromere or within two cytological divisions of a telomere. The same is true of only 2 of the 28 nonimmunity genes (bnb, Pgd). However, neither the average standardized silent heterozygosity nor the average haplotype diversity of these 12 immunity genes is significantly different from that of the remaining 22 immunity genes (Mann-Whitney U-test, P = 0.59, P = 0.23 respectively; Table 11). Although the 22 immunity genes from regions of normal recombination no longer have levels of silent heterozygosity significantly lower than those of the autosomal nonimmunity genes (Mann-Whitney U-test, P = 0.06, Table 11), the similarity in average standardized silent heterozygosity levels between the immunity gene groups (0.19 vs. 0.18, respectively) suggests that this loss of significance is more reflective of a loss of power due to the removal of the 12 immunity genes. The 22 immunity genes in regions of normal recombination still have significantly lower average haplotype diversity than the autosomal nonimmunity genes (Mann-Whitney U-test, P < 10-2, Table 11). Overall, there is little evidence that a difference in recombination rates between sampled immunity and nonimmunity genes contributes significantly to their different population genetic patterns.


 
View this table:
In this window
In a new window

 
Table 11. Potential effects of physical location on silent heterozygosity and haplotype diversity

The fact that a subset of humoral response immunity genes surveyed here also functions in early development (BELVIN and ANDERSON 1996 Down) could compromise our inference that the unusual population genetic data from "immunity" genes are attributable to their role in immunity. However, patterns of variation in immunity genes that also have developmental roles (18w, cactus, dorsal, pelle, spz, Toll, and tube) are indistinguishable from patterns observed in other immunity genes (supplementary material B at http://www.genetics.org/supplemental). Removing these genes does not affect any of our inferences regarding the population genetics of immunity genes vs. nonimmunity genes (results not shown).

The evidence for positive selection in immunity protein evolution spans genes from both the cellular and the humoral immune responses. A growing body of literature reveals that pathogen genomes encode a wide array of immunomodulatory molecules specifically designed to interfere with host proteins involved in recognition and attack of pathogens and in immunity-signaling pathways (SPRIGGS 1996 Down; HUECK 1998 Down). For example, pathogenic bacteria utilize type III secretion systems to inject proteins into the cytoplasm of host cells that specifically disrupt immunity signal transduction (e.g., SCHESSER et al. 1998 Down). The importance of directional selection for D. simulans immune system evolution supports the notion that pathogen proteins interact with and exert strong selection pressures on D. simulans immune system proteins from multiple pathways and functional groups. Our data are consistent with models of coevolution such as an arms race (DAWKINS and KREBS 1979 Down), in which a specialist pathogen constantly evolves to escape its host's own evolving defenses, or with a turnover of pathogen species to which the host must repeatedly evolve resistance. Distinguishing between these models will require much additional data regarding the number, frequencies, and infection characteristics of the microbial pathogens that infect D. simulans in nature.

Given the effects of directional selection on D. simulans immune system genes, it seems reasonable to propose that immunity genes of other host species are also strongly influenced by pathogen-mediated directional selection. Previous studies of antimicrobial peptides (the most downstream steps in the humoral response pathways) from the sister species, D. melanogaster, provided no evidence for adaptive protein divergence (CLARK and WANG 1997 Down; DATE et al. 1998 Down; RAMOS-ONSINS and AGUADE 1998 Down). However, without comparable data from D. simulans antimicrobial peptides we cannot be confident that this difference in evolutionary histories results from a lineage effect. Instead, the simplest pathogen virulence strategy may be to interfere with upstream signaling and recognition proteins rather than with the smaller and more numerous downstream antimicrobial peptides.

To better compare and contrast D. simulans and D. melanogaster immunity protein evolution, we collected D. melanogaster polymorphism data for five of the immunity genes that were sampled in D. simulans (Relish, Spn43Ac, spz, Sr-CI, Toll—GenBank accession nos. AY349649, AY349650, AY349651, AY349652, AY349653, AY349654, AY349655, AY349656, AY349657, AY349658, AY349659, AY349660, AY349661, AY349662, AY349663, AY349664, AY349665, AY349666, AY349667, AY349668, AY349669, AY349670, AY349671, AY349672, AY349673, AY349674, AY349675, AY349700, AY349701, AY349702, AY349703, AY349704 with the exception of Relish; BEGUN and WHITLEY 2000A Down). The McDonald-Kreitman test from these five genes along the D. simulans lineage is highly significant ({chi}2 = 43.79, P < 10-4, Table 12), while the same test on D. melanogaster data is not significant ({chi}2 = 1.24, P = 0.27). Thus, neither the signaling and recognition proteins nor the antimicrobial peptides of D. melanogaster show evidence of adaptive protein divergence. Although additional data from D. melanogaster immunity genes will be required to make strong statements regarding the comparative population genetics of the immune system in these two species, one possibility is that D. simulans immunity genes have recently experienced an unusually intense bout of selection that D. melanogaster immunity genes have not. Alternatively, population genetic characteristics unique to D. melanogaster, such as a recent elevation of the silent-site substitution rate (AKASHI 1995 Down, AKASHI 1996 Down) or an excess of replacement polymorphism (ANDOLFATTO 2001 Down), could obscure the signal of adaptive protein evolution that one might otherwise observe along the D. melanogaster lineage using the McDonald-Kreitman test.


 
View this table:
In this window
In a new window

 
Table 12. Comparison of replacement and silent variation in homologous immunity genes in D. simulans and D. melanogaster

In contrast to genes involved in the mammalian acquired immune response (HUGHES and NEI 1988 Down, HUGHES and NEI 1989 Down; TANAKA and NEI 1989 Down; HUGHES et al. 1990 Down), we found little evidence for an important role of balancing selection in the population genetics of host-pathogen interactions in D. simulans. However, the Drosophila immune system relies on generalized pathogen-associated molecular patterns (such as lipopolysaccharide of gram-negative bacterial cell walls) to recognize microbes and apparently has little specificity for distinguishing members of large classes of pathogen species (LEMAITRE et al. 1997 Down). Therefore, lack of evidence for selective maintenance of protein variation in D. simulans immunity genes may not be surprising. It will be interesting to investigate whether mammalian innate immunity proteins that interact directly with pathogens evolve in a manner more akin to the specialized mammalian acquired immune system genes or in a manner more akin to the generalized Drosophila innate immune system genes.


*  FOOTNOTES

Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos. AF544231, AF544232, AF544233, AF544234, AF544235, AF544236, AF544237, AF544238, AF544239, AY349649, AY349650, AY349651, AY349652, AY349653, AY349654, AY349655, AY349656, AY349657, AY349658, AY349659, AY349660, AY349661, AY349662, AY349663, AY349664, AY349665, AY349666, AY349667, AY349668, AY349669, AY349670, AY349671, AY349672, AY349673, AY349674, AY349675, AY349684, AY349685, AY349686, AY349687, AY349688, AY349689, AY349690, AY349691, AY349692, AY349693, AY349694, AY349695, AY349696, AY349697, AY349698, AY349699, AY349700, AY349701, AY349702, AY349703, AY349704, AY349705, AY349706, AY349707, AY349708, AY349709, AY349710, AY349711, AY349712, AY349713, AY349714, AY349715, AY349716, AY349717, AY349718, AY349719, AY349720, AY349721, AY349722, AY349723, AY349724, AY349725, AY349726, AY349727, AY349728, AY349729, AY349730, AY349731, AY349732, AY349733, AY349734, AY349735, AY349736, AY349745, AY349746, AY349747, AY349748, AY349749, AY349750, AY349751, AY349752, AY349761, AY349932, AY352227, AY352228, AY352229, AY352230, AY352231, AY352232, AY352233, AY352234, AY352235, AY352236, AY352237, AY352238, AY352239, AY352240, AY352241, AY352242, AY352243, AY352244, AY352245, AY352246, AY352247, AY352248, AY352249, AY352250, AY352251, AY352252, AY352253, AY352254, AY352255, AY352256, AY352257, AY352258, AY352259, AY352260, AY352261, AY352262, AY352263, AY352264, AY352265, AY354407, AY354408, AY354409, AY354410, AY354411, AY354412, AY354413, AY354414, AY354415, AY354416, AY354417, AY354418, AY354419, AY354420, AY354421, AY354422, AY354423, AY354424, AY354425, AY354426, AY354427, AY354428, AY354429, AY354430, AY354431, AY354432, AY354433, AY354434, AY354435, AY354436, AY354437, AY354438, AY354439, AY354440, AY354441, AY354442, AY354443, AY354444, AY354445, AY354446, AY354447, AY354448, AY354449, AY354450, AY354451, AY354452, AY354453, AY354454. Back


*  ACKNOWLEDGMENTS

Two anonymous reviewers provided useful comments. This work was supported by National Institutes of Health grant GM-55298, the National Science Foundation, and the Sloan Foundation. T.A.S. is cofounder of the Institute of Drosophila Immunomics, Ithaca, NY.

Manuscript received January 15, 2003; Accepted for publication April 18, 2003.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AKASHI, H., 1995  Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 139:1067-1076.[Abstract]

AKASHI, H., 1996  Molecular evolution between Drosophila melanogaster and D. simulans, reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster.. Genetics 144:1297-1307.[Abstract]

AKASHI, H., 1999  Inferring the fitness effects of DNA mutations from polymorphism and divergence data: statistical power to detect directional selection under stationarity and free recombination. Genetics 151:221-238.[Abstract/Free Full Text]

ANDOLFATTO, P., 2001  Contrasting patterns of X-linked and autosomal nucleotide variation in Drosophila melanogaster and Drosophila simulans.. Mol. Biol. Evol. 18:279-290.[Abstract/Free Full Text]

AQUADRO, C. F., D. J. BEGUN and E. C. KINDAHL, 1994 Selection, recombination, and DNA polymorphism in Drosophila, pp. 46–56 in Non-Neutral Evolution: Theories and Molecular Data, edited by B. GOLDING. Chapman & Hall, New York.

BAUER, V. L. and C. F. AQUADRO, 1997  Rates of DNA sequence evolution are not sex-biased in Drosophila melanogaster and D. simulans. Mol. Biol. Evol. 14:1252-1257.[Abstract]

BEGUN, D. J., 2001  The frequency distribution of nucleotide variation in Drosophila simulans.. Mol. Biol. Evol. 18:1343-1352.[Abstract/Free Full Text]

BEGUN, D. J., 2002  Protein variation in Drosophila simulans, and comparison of genes from centromeric versus noncentromeric regions of chromosome 3. Mol. Biol. Evol. 19:201-203.[Free Full Text]

BEGUN, D. J. and C. F. AQUADRO, 1992  Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster.. Nature 356:519-520.[Medline]

BEGUN, D. J. and P. WHITLEY, 2000a  Adaptive evolution of Relish, a Drosophila NF-{kappa}B/I{kappa}B protein. Genetics 154:1231-1238.[Abstract/Free Full Text]

BEGUN, D. J. and P. WHITLEY, 2000b  Reduced X-linked nucleotide polymorphism in Drosophila simulans.. Proc. Natl. Acad. Sci. USA 97:5960-5965.[Abstract/Free Full Text]

BELVIN, M. P. and K. V. ANDERSON, 1996  A conserved signaling pathway: the Drosophila toll-dorsal pathway. Annu. Rev. Cell Dev. Biol. 12:393-416.[Medline]

BISHOP, J. G., A. M. DEAN, and T. MITCHELL-OLDS, 2000  Rapid evolution in plant chitinases: molecular targets of selection in plant-pathogen coevolution. Proc. Natl. Acad. Sci. USA 97:5322-5327.[Abstract/Free Full Text]

BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995  The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics 140:783-796.[Abstract]

BULMER, M., 1988  Are codon usage patterns in unicellular organisms determined by selection-mutation balance? J. Evol. Biol. 1:15-26.

CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303.[Abstract]

CLARK, A. G. and L. WANG, 1997  Molecular population genetics of Drosophila immune system genes. Genetics 147:713-724.[Abstract]

DATE, A., Y. SATTA, N. TAKAHATA, and S. I. CHIGUSA, 1998  Evolutionary history and mechanism of the Drosophila cecropin gene family. Immunogenetics 47:417-429.[Medline]

DAWKINS, R. and J. R. KREBS, 1979  Arms races between and within species. Proc. R. Soc. Lond. B Biol. Sci. 205:489-511.[Medline]

FAY, J. C. and C.-I WU, 2000  Hitchhiking under positive Darwinian selection. Genetics 155:1405-1413.[Abstract/Free Full Text]

FAY, J. C., G. J. WYCKOFF, and C.-I WU, 2002  Testing the neutral theory of molecular evolution with genomic data from Drosophila.. Nature 415:1024-1026.[Medline]

GILLESPIE, J. H., 1997  Junk ain't what junk does: neutral alleles in a selected context. Gene 205:291-299.[Medline]

GILLESPIE, J. H., 1999  The role of population size in molecular evolution. Theor. Popul. Biol. 55:145-156.[Medline]

HASSON, E., I. N. WANG, L. W. ZENG, M. KREITMAN, and W. F. EANES, 1998  Nucleotide variation in the triosephosphate isomerase (Tpi) locus of Drosophila melanogaster and Drosophila simulans.. Mol. Biol. Evol. 15:756-769.[Abstract]

HUECK, C. J., 1998  Type III protein secretion systems in bacterial pathogens of animals and plants. Microbiol. Mol. Biol. Rev. 62:379-433.[Abstract/Free Full Text]

HUGHES, A. L. and M. NEI, 1988  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature 335:167-170.[Medline]

HUGHES, A. L. and M. NEI, 1989  Nucleotide substitution at major histocompatibility complex class II loci: evidence for overdominant selection. Proc. Natl. Acad. Sci. USA 86:958-962.[Abstract/Free Full Text]

HUGHES, A. L., T. OTA, and M. NEI, 1990  Positive Darwinian selection promotes charge profile diversity in the antigen-binding cleft of class I major-histocompatibility-complex molecules. Mol. Biol. Evol. 7:515-524.[Abstract]

KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989  The "hitch-hiking" effect revisited. Genetics 123:887-899.[Abstract/Free Full Text]

KIMBRELL, D. A. and B. BEUTLER, 2001  The evolution and genetics of innate immunity. Nat. Rev. Genet. 2:256-267.[Medline]

KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.

KLIMAN, R. M. and J. HEY, 1993  DNA sequence variation at the period locus within and among species of the Drosophila melanogaster complex. Genetics 133:375-387.[Abstract]

LEMAITRE, B., J.-M. REICHHART, and J. A. HOFFMANN, 1997  Drosophila host defense: differential induction of antimicrobial peptide genes after infection by various classes of microorganisms. Proc. Natl. Acad. Sc