Genetics, Vol. 150, 823-833, October 1998, Copyright © 1998

Molecular Evolution of an Imprinted Gene: Repeatability of Patterns of Evolution Within the Mammalian Insulin-Like Growth Factor Type II Receptor

Nick G. C. Smitha and Laurence D. Hursta
a Centre for Mathematical Biology, School of Biology and Biochemistry, University of Bath, Bath BA2 7AY, United Kingdom

Corresponding author: Nick G. C. Smith, Centre for Mathematical Biology, School of Biology and Biochemistry, University of Bath, Claverton Down, Bath BA2 7AY, UK., n.smith{at}bath.ac.uk (E-mail).

Communicating editor: G. B. GOLDING


*  ABSTRACT
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The repeatability of patterns of variation in Ka/Ks and Ks is expected if such patterns are the result of deterministic forces. We have contrasted the molecular evolution of the mammalian insulin-like growth factor type II receptor (Igf2r) in the mouse-rat comparison with that in the human-cow comparison. In so doing, we investigate explanations for both the evolution of genomic imprinting and for Ks variation (and hence putatively for mutation rate evolution). Previous analysis of Igf2r, in the mouse-rat comparison, found Ka/Ks patterns that were suggested to be contrary to those expected under the conflict theory of imprinting. We find that Ka/Ks variation is repeatable and hence confirm these patterns. However, we also find that the molecular evolution of Igf2r signal sequences suggests that positive selection, and hence conflict, may be affecting this region. The variation in Ks across Igf2r is also repeatable. To the best of our knowledge this is the first demonstration of such repeatability. We consider three explanations for the variation in Ks across the gene: (1) that it is the result of mutational biases, (2) that it is the result of selection on the mutation rate, and (3) that it is the product of selection on codon usage. Explanations 2 and 3 predict a KaKs correlation, which is not found. Explanation 3 also predicts a negative correlation between codon bias and Ks, which is also not found. However, in support of explanation 1 we do find that in rodents the rate of silent C {Rightarrow} T mutations at CpG sites does covary with Ks, suggesting that methylation-induced mutational patterns can explain some of the variation in Ks. We find evidence to suggest that this CpG effect is due to both variation in CpG density, and to variation in the frequency with which CpGs mutate. Interestingly, however, a GC4 analysis shows no covariance with Ks, suggesting that to eliminate methyl-associated effects CpG rates themselves must be analyzed. These results suggest that, in contrast to previous studies of intragenic variation, Ks patterns are not simply caused by the same forces responsible for Ka/Ks correlations.


IN mammals there exists much variation in synonymous substitution rates (Ks) and the ratio of nonsynonymous to synonymous substitution rates (Ka/Ks), both within (ALVAREZ-VALIN et al. 1998 Down) and between (WOLFE and SHARP 1993 Down) genes. There are two alternative explanations for such variation: that it is simply a consequence of chance (e.g., some genes, or parts of them, might just happen to mutate faster than others) or that it is due to deterministic effects. Possible deterministic forces include selection and repeatable mutational biases. Analysis of the repeatability of patterns of molecular evolution in independent comparisons is one means to distinguish between stochastic and deterministic explanations (MOUCHIROUD et al. 1995 Down). Here we ask about the repeatability of patterns of intragene variation in Ka/Ks and Ks. To the best of our knowledge this is the first example where repeatability of intragene variation in both of these parameters has been examined. The gene that we examine, the insulin-like growth factor type II receptor (Igf2r), is of interest as it is one of a small class of mammalian genes that are imprinted. It has also recently been implicated in the determination of IQ (CHORNEY et al. 1998 Down).

Genomically imprinted genes are those for which expression is dependent upon the sex of the parent from which they are derived (EFSTRATIADIS 1994 Down). Welldescribed examples include murine insulin-like growth factor II (Igf2), a growth factor expressed from the paternally derived genome in fetuses, and the murine insulin-like growth factor type II receptor (Igf2r) expressed from the maternally derived genome. In eutherians and marsupials, but not in birds (ZHOU et al. 1995 Down), the protein product of this gene binds to Igf2 and takes it to the lysosome for digestion. The gene is also known as the cation-independent mannose-6-phosphate (M6P) receptor as it is also known to bind M6P (KORNFELD 1992 Down). The latent complex of transforming growth factor beta (TGFß) also binds to the Igf2r protein via the M6P binding domains (DENNIS and RIFKIN 1991 Down). Additionally the Igf2r protein has the ability to bind retinoic acid (KANG et al. 1998 Down). The binding activities of Igf2 and M6P are known to occur at different sites within the Igf2r protein. In rats Igf2r is known to be imprinted in the liver (MILLS et al. 1998 Down) and in humans IGF2R appears to be imprinted in some but not all individuals (OGAWA et al. 1993 Down; XU et al. 1993 Down). We are unaware of data pertinent to the imprinting status of the gene in cow. However, from our knowledge that the mouse, rat, and human IGF2R genes are now imprinted, we can be sure that some of the changes revealed by human-cow and mouse-rat comparisons of Igf2r will have occurred in an imprinted gene.

Aside from allowing us to ask whether patterns of Ka/Ks and of Ks within genes are repeatable and hence deterministically based, the patterns of molecular evolution in this gene are of interest for two reasons. First, the pattern of variation in Ks across the gene may shed light on the evolution of mutation rates or of codon usage patterns. Second, the variation in Ka/Ks is potentially informative as to the evolution of imprinting. One theory for the evolution of imprinting would find discriminating support from evidence for antagonistic coevolution between imprinted genes (see MCVEAN and HURST 1997B Down; TRIVERS and BURT 1998 Down; but see also HAIG 1997 Down).


*  Ks variation, mutation rate evolution, and imprinted genes
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

We consider four hypotheses to explain variation in Ks across Igf2r.

Hypothesis 1:
Ks variation may be caused by stochastic forces, in which case Ks will not be repeatable. The remaining three hypotheses suppose that Ks variation is the result of deterministic forces that would cause Ks repeatability.

Hypothesis 2:
If silent sites are considered neutral then Ks is an unbiased estimator of the mutation rate (KIMURA 1983 Down), and so Ks variation may be the result of deterministic mutational processes, with mutational biases dependent on perhaps sequence composition or regional effects. The two final hypotheses, (3) and (4), both suggest that Ks variation is molded by selection, but differ with respect to the character that selection is acting upon.

Hypothesis 3:
Decreasing mutation rates is expensive both in time and energy, while the benefits of a lower mutation rate depend on the functional importance and mutational sensitivity of the region concerned. Such reasoning has been used to explain the constant per genome mutation rate across many unicellular organisms (DRAKE 1991 Down). If this trade-off theory of mutation rate evolution is correct, and if selection is strong enough, one might imagine that selection would favor imprinted genes to have low mutation rates. This prediction derives from imprinted genes having haploid expression and hence recessive mutations in imprinted genes being unmasked. Hence it may be optimal for an organism to invest more into reducing the mutation rate of imprinted genes below the mutation rate of comparable nonimprinted ones. It should, however, be noted that selection on the modifier is likely to be very weak.

From the same logic, one might expect that the most important parts of an imprinted gene should be provided with especially low mutation rates. Thus if silent sites in the mammalian Igf2r gene are neutral, then the variation in Ks (and hence mutation rate) across the gene may be the result of selection acting to optimize mutation rates. We term this explanation of Ks variation the "selected mutation rate" hypothesis.

Hypothesis 4:
ALVAREZ-VALIN et al. 1998 Down have found intragenic correlations between synonymous and nonsynonymous substitution rates in mammalian genes. They conclude that these correlations might well be due to common selective constraints, such as selection for translational accuracy, between synonymous and nonsynonymous sites. This argument suggests that Ks in mammals is not the mutation rate (as often assumed), but instead reflects the local strength of selection. We term this explanation of Ks variation the "common constraints" hypothesis.

Just as for the selected mutation rate argument, the common constraints hypothesis would predict that imprinted genes, due to their haploid expression, would be under greater selective pressures, and thus should have lower Ks values than nonimprinted genes. The empirical data on this point are equivocal. Previous analyses have found Igf2 to have one of the lowest Ks values in the rodent genome (MCVEAN and HURST 1997B Down). Imprinted genes as a class have a Ks (Ks = 19.87, N = 7) between that of autosomes (Ks = 22.9, N = 238) and X-linked genes (Ks = 14.63, N = 33; MCVEAN and HURST 1997A Down, MCVEAN and HURST 1997B Down). However, on nonparametric analysis the imprinted genes are found to be not significantly different from autosomal genes. Given sample size limitations, however, the issue cannot be considered fully resolved.


*  Ka/Ks variation and the evolution of genomic imprinting
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Numerous theories have been proposed for the evolution of genomic imprinting (for a review see HURST 1997 Down). The "conflict" hypothesis (MOORE and HAIG 1991 Down) proposes that imprinting is an intraindividual manifestation of classic parent-offspring conflict (TRIVERS 1974 Down). According to this model, paternally derived genes are under selection to extract resources from mothers whereas maternally expressed genes are under selection to oppose this (for models see MOCHIZUKI et al. 1996 Down; SPENCER et al. 1998 Down). This hypothesis makes predictions about the growth effects of paternally and maternally expressed genes and it is presently unclear whether these predictions are borne out (HURST and MCVEAN 1997 Down).

Given that maternal and paternal genes have antagonistic interests, the conflict theory is unique in predicting that an arms race may develop between maternally and paternally derived genes for the control of fetal growth demands. Such antagonistic coevolution may be mediated through changes in the structure of the proteins concerned. Were one to find evidence for rapid evolution of imprinted genes (i.e., a high Ka/Ks ratio) this, it has been argued, could then reasonably be taken as evidence favoring the conflict hypothesis (MCVEAN and HURST 1997B Down; see also TRIVERS and BURT 1998 Down). Unfortunately this is not a falsifying prediction as the finding of slow evolution need not be considered evidence against the hypothesis (HAIG 1997 Down; MCVEAN and HURST 1997B Down) for a variety of reasons.

Comparable maternal-fetal conflict is a good candidate explanation for the rapid evolution of numerous genes, such as the placental lactogens (WALLIS 1993 Down), prolactin (WALLIS 1981 Down), the homeobox gene Pem (SUTTON and WILKINSON 1997 Down), growth hormones (WALLIS 1994 Down), the pregnancy-associated glycoproteins (XIE et al. 1997 Down), and numerous others (see MCVEAN and HURST 1997B Down). Antagonistic coevolution is similarly the most likely explanation for the rapid changes seen in antigenic components of parasites and antigen recognition components of immune systems (for example see HUGHES et al. 1994 Down), and may be invoked to explain rapid changes in a gene putatively involved in sexual conflict (TSAUR and WU 1997 Down).

Perhaps surprisingly then, an analysis of the molecular evolution of seven imprinted genes revealed them to be no faster evolving at the protein level (controlling for Ks, and hence possibly for mutation rate, variation) than ordinary receptors and significantly slower evolving than immune system genes (MCVEAN and HURST 1997B Down). In the same study, an analysis of molecular evolution within Igf2r, compared between mouse and rat, showed that where Igf2r binds to Igf2 a low Ka/Ks ratio is found [indicating, contrary to initial expectations (MCVEAN and HURST 1997B Down), stabilizing rather than disruptive selection]. This analysis also shows two high peaks in Ka/Ks that were not commented on. One of these was at the signal sequence and one at the position at which Igf2r binds M6P.

Neither of these peaks in the pattern of protein evolution rates are expected. Signal sequences direct protein transport within the cell, and possess the conserved structure of a hydrophobic core flanked by a polar basic region and a hydrophilic region (KENDREW 1994 Down). Since artificial signal sequence constructs lacking such features fail to perform their normal functions (IZARD et al. 1996 Down), one would expect strong amino acid conservation at signal sequences and hence low Ka/Ks values. Igf2r assists the transport of acid hydrolases to the lysosomes by binding the enzymes at their phosphomannosyl residues using the M6P binding site (KORNFELD 1992 Down). Because the phosphomannosyl residues are high-affinity ligands there should be high sequence specificity at the M6P binding site, and hence strong amino acid conservation and low Ka/Ks values.

Here we ask whether the peaks and troughs in the Ka/Ks pattern across the gene revealed in the mouse-rat comparison are also observed in the human-cow comparison or whether they are statistical artifacts and hence not worth further investigation. That the mouse-rat patterns might be artifacts is possible since the sliding window used was small and hence the expected error in estimates of Ka/Ks per window is high.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Extraction and analysis of genes:
NCBI Entrez (http://www.ncbi.nlm.nih.gov/Entrez/) was used to search for complete coding mammalian Igf2r sequences. Four were obtained: human, cow, mouse, and rat (accession numbers J03528, J03527, U04710, and U59809, respectively). The orthologies of these sequences were confirmed using BLASTN (ALTSCHUL et al. 1990 Down) and the HOVERGEN database (DURET et al. 1994 Down).

Sequence manipulation and alignment was performed using programs in the Wisconsin package (GENETICS COMPUTER GROUP 1994). All four mammalian Igf2r protein-coding DNA sequences were aligned using the default DNA alignment parameters of the program PILEUP. The program GAPFRAME was used to move gaps to codon boundaries, and the resultant alignment was very similar to the protein alignment obtained using the default protein alignment parameters of PILEUP. All four gapped sequences were then cut into 24 orthologous nonoverlapping sections each of 300 bp, or 74 orthologous overlapping sections (300 bp every 100 bp). Synonymous and nonsynonymous nucleotide substitution rates (Ks and Ka) were estimated for all mouse-rat and human-cow orthologous pairs using the program KESTIM (COMERON 1995 Down) with KIMURA's (1980) two-parameter model for multiple hits correction. The mouse-rat and human-cow species pairs, which share no evolutionary history, were chosen to ensure independence and thus a valid test of repeatability.

A nonparametric statistical test of repeatability was used to avoid assumptions about possible distributions of rates of molecular evolution. For both comparisons each window was ranked across the gene. Then the two patterns of ranks across the gene were compared using a rank correlation test. Thus a statistic was obtained (Prank) to describe the probability of the ranking patterns being so similar through chance alone.

Analysis of similarity:
For all of the nonsynonymous substitutions in the two comparisons, we characterized the extent to which these changes were conservative. This provides an indication as to whether it is likely that the replacements seen might be owing to weak stabilizing selection (in which case conservative changes are likely) or due to positive selection (in which case nonconservative changes are more likely; cf. TUCKER and LUNDRIGAN 1995 Down). To characterize changes as being conservative or not we employed PAM matrices. Such matrices use data on the difference between the expected number of replacements of one amino acid with another (from knowledge of relative positions in the genetic code), and data on the actual rate at which such substitutions occur. The difference in the two rates is thought to indicate some measure of physicochemical similarity between the two amino acids under consideration such that similar amino acids are more likely to be interchangeable.

Different PAM matrices are appropriate for genes of different levels of divergence. We obtained appropriate matrices from the Amino Acid Index Database (http://www.cbi.pku.edu.cn/srs5bin/cgi-bin/wgetz?-fun+Pagelibinfo+-info+AAINDEX/). As the mouse-rat protein comparison shows 93% identity, we employed the BENS940101 PAM6-8 matrix (BENNER et al. 1994 Down). The human-cow protein comparison shows 81% identity, so we used the BENS940102 PAM22-29 matrix (BENNER et al. 1994 Down). For each segment in the gene each amino acid difference was assigned the relevant value from the appropriate PAM matrix. The mean value per amino acid change per segment could then be calculated. We can then compare the regions of interest with the evolution in the rest of the gene.

Signal sequence evolution:
To ask whether the signal sequence of Igf2r has an unusual mode of evolution we compared its evolution with that of comparable signal sequences in other genes. We used NCBI Entrez (http://www.ncbi.nlm.nih.gov/Entrez/) and the HOVERGEN database (DURET et al. 1994 Down) to find nine other genes for which signal sequence information was available for mouse and rat orthologues. Alignments were prepared using the Wisconsin package (GENETICS COMPUTER GROUP 1994). DNA alignments were prepared for both signal sequences and entire protein-coding sequences using the default settings of PILEUP. The program GAPFRAME was used to move all gaps to codon boundaries. Nucleotide substitution rates were estimated with the Moriyama package (MORIYAMA and POWELL 1997 Down), using the synonymous/nonsynonymous method of LI 1993 Down and the multiple substitution correction method of TAMURA and NEI 1993 Down.

The genes (accession numbers in brackets, mouse then rat) were as follows: interleukin 4 receptor (M29854, X69903), low density lipoprotein receptor (X64414, X13722), tumor necrosis factor receptor 1 (M60468, M63122), calreticulin (X14926, X53363), cholesterol esterase (U33169, X16054), glutamate dehydrogenase (X57024, X14223), procathepsin E (X97399, D38104), immunoglobulin light chain V-{lambda}-1 (J00590, M17092), and oxytocin-neurophysin I (M88355, K01701).


*  RESULTS
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Repeatability of Ka/Ks:
With the 24 nonoverlapping sections, Ka/Ks patterns across Igf2r are significantly repeatable (Prank < 0.01, see Figure 1). It appears that the large-scale fluctuations in Ka/Ks rather than the small-scale fluctuations are responsible for the observed repeatability (our unpublished results). Ka/Ks is an indicator of both the intensity and type of selection. Stronger stabilizing selection reduces Ka/Ks, while both weaker stabilizing selection and stronger positive selection increase Ka/Ks. A Ka/Ks ratio significantly above one is the strict requirement for evidence of positive selection, but Ka/Ks ratios less than one do not preclude positive selection because different sites within the same segment might be under different selection pressures. The repeatability of patterns of selection across Igf2r means that selection is able to respond differently to various small subdomains across the Igf2r protein. This result is consistent with the conservation of the various functions of the different parts of the Igf2r molecule across the cow, human, mouse, and rat species.



View larger version (31K):
In this window
In a new window
Download PPT slide
 
Figure 1. Ka/Ks patterns are repeatable across Igf2r. The nonsynonymous over synonymous substitution rates (Ka/Ks) across the Igf2r gene are shown for both the mouse-rat and human-cow comparisons. The data are for the 74 overlapping windows. Note the repeated peaks at the signal sequence and the first M6P binding site, and the repeated troughs at the Igf2 binding site.

The significant repeatability of Ka/Ks across the Igf2r gene means that the unexpected Ka/Ks patterns are not artifacts, but are representative of Igf2r's molecular evolution. We therefore need to explain the low rate of protein evolution at the Igf2 binding site, and the high rates of protein evolution both at the signal sequence and at one of the M6P binding sites.

Similarity of amino acid substitutions: To examine the selection pressures acting on the fast-evolving signal sequence and M6P binding site, we compared the amino acids that had changed in each nonoverlapping window to ask how conservative the changes were, using data from the appropriate PAM matrices to ascribe similarity.

In the human-cow comparison the changes in the signal sequence were on average exceptionally nonconservative (see Figure 2). Indeed, none of the windows shows a lower level of similarity and the average level of similarity in the gene as a whole is significantly higher (P = 0.0017, two-tailed Mann-Whitney U-test). This suggests that positive selection might be acting on this sequence. In the mouse-rat comparison the picture is not so clear. The signal sequence does show a low similarity but three other windows have lower values. Compared with the rest of the gene the signal sequence does have a lower value (P = 0.0387, two-tailed Mann-Whitney U-test).



View larger version (10K):
In this window
In a new window
Download PPT slide
 
Figure 2. Mean PAM distances per amino acid change for different regions of Igf2r. Data are presented for both the human-cow (h-c) and mouse-rat (m-r) comparisons. The human-cow and mouse-rat data were obtained using the PAM22-29 and PAM6-8 matrices, respectively (see MATERIALS AND METHODS for references). Signal, the window containing the signal sequence; M6P, the two windows containing the rapidly evolving mannose-6-phosphate binding site; and Rest, all other parts of the gene. Bars show SE.

At the M6P binding site there appears to be nothing unusual about the amino acids being replaced in both mouse-rat and human-cow, suggesting that this region is under weak stabilizing selection (two-tailed Mann-Whitney U-tests between the M6P site and the rest of the gene give P = 0.6248 and P = 0.6829 for the human-cow and mouse-rat comparisons, respectively).

The signal sequence of Igf2r is unusually fast evolving: One would expect signal sequences to be generally slow evolving (see Introduction), but is this actually true; i.e., Is the signal sequence of Igf2r unusually fast evolving? When signal sequence Ka/Ks is plotted against the entire coding sequence Ka/Ks for the nine genes and for Igf2r it becomes clear that Igf2r's signal sequence is evolving unusually fast (see Figure 3). The only gene with a comparable signal sequence rate is immunoglobulin light chain V-{lambda}-1. However, the rate of signal sequence evolution in this gene is comparable to the rate of evolution in the rest of the gene. Igf2r is hence unusual in that the signal sequence shows an unusually high ratio of Ka/Ks given the rate of evolution of the rest of the sequence. Compared with a large sample of genes (unpublished data) the Ka/Ks ratio of the majority of Igf2r is not unusual. The value in the signal sequence is high in these terms. Hence the discrepancy between the signal sequence and the rest of the gene is better understood as unusually fast evolution of the signal sequence rather than unusually slow evolution of the majority of the gene.



View larger version (10K):
In this window
In a new window
Download PPT slide
 
Figure 3. Ka/Ks of signal sequences vs. Ka/Ks of entire genes. A comparison is shown of the mouse-rat Ka/Ks values of the signal sequence and of the entire protein coding sequence for ten genes (full gene names and accession numbers in MATERIALS AND METHODS). Both Igf2r and the immunoglobin gene show higher signal sequence Ka/Ks values than the other genes.

Signal sequence hydrophobicity: Is the rapid evolution of the Igf2r signal sequence at the expense of its functionality? We examined the hydrophobicity plots of the four species' signal sequences using the Staden package (STADEN 1996 Down). The conserved signal sequence structure (see Introduction) predicts a central hydrophobic peak. The signal sequences of human, rat, and mouse fulfilled this prediction but the signal sequence of cow showed no such feature (Figure 4). This observation could explain the high divergence in the human-cow comparison, but offers no clues as to why both the human-cow and mouse-rat signal sequence divergences are so high.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 4. Hydrophobicity plots across Igf2r for four species. Sliding window plots of hydrophobicity values [as produced by STADEN's (1996) package] across the signal sequences of the Igf2r gene in mouse, rat, cow, and human are shown. The signal sequence of cow is the only one that does not show the prototypical pattern of a central peak. The numbers give the lengths of the different signal sequences in amino acids (showing that the plots are not aligned).

Repeatability of Ks:
With the 24 nonoverlapping sections, Ks variation is significantly repeatable (Prank = 0.05; see Figure 5). With the rank of size change between neighboring overlapping sections as a potentially more powerful test, Ks patterns remained significantly repeatable (Prank < 0.05). The repeatability of Ks across the Igf2r gene thus implies that the variation in Ks within the gene is being shaped by deterministic forces. We have performed several tests to attempt to distinguish between the potential deterministic forces described above (see Introduction). First, however, we ask whether the repeatability might be an artifact.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 5. Ks patterns are repeatable across Igf2r. The mouse-rat and human-cow synonymous substitution rates (Ks) across the IGF2R gene are shown in the upper plot. The data are of the 74 overlapping windows. The actual human-cow values are used, but the mouse-rat values are scaled up so that the means of both sets of Ks values are the same, to allow easier comparison. The lower plot shows the correlation between the mouse-rat and human-cow orthologous overlapping window Ks values.

Ks repeatability is probably not a methodological artifact: Unless multiple substitution correction methods take account of variation in sequence composition, nucleotide substitution rate estimates can be biased (PESOLE et al. 1995 Down). Furthermore, the use of different substitution rate estimation methods can lead to different conclusions using the same dataset (SMITH and HURST 1998 Down). Could this effect be responsible for our observation of Ks repeatability, since KIMURA's (1980) two parameter model does not take account of variation in sequence composition? Under this hypothesis our finding of Ks repeatability would be an artifact caused by compositional repeatability leading to repeatable bias in Ks estimates. To test this idea, we reanalyzed the section alignments with Moriyama's package (MORIYAMA and POWELL 1997 Down) using the synonymous/nonsynonymous method of LI 1993 Down and the multiple substitution correction method of TAMURA and NEI 1993 Down(that uses all four nucleotide frequencies, as well as a transversion substitution rate and two transition substitution rates). A significant Ks repeatability was observed (Prank < 0.05), and thus the repeatability of Ks is probably not a methodological artifact.

Ks repeatability is not due to the majority of Igf2r consisting of repeated units: The large extracellular domain of IGF2R consists almost entirely of 15 conserved repeat units, between which there is roughly 20% amino acid identity in the human gene (MORGAN et al. 1987 Down). If compositional features affect Ks values, one might expect to see each repeat giving similar Ks patterns. Using the Wisconsin package (GENETICS COMPUTER GROUP 1994), alignments of the repeats were prepared using the same alignment gaps as for the preparation of overlapping and nonoverlapping sections. For each repeat, and for both species pairs, three alignments were prepared: one of the 5' three-fifths of the repeat (section 1), one of the middle three-fifths (section 2), and one of the 3' three-fifths (section 3). Overlapping sections were used because of the small size of the repeats (~150 bp). Ks was then estimated for each alignment using Moriyama's package (MORIYAMA and POWELL 1997 Down).

It was then determined for each repeat in both species comparisons which of the three sections in each repeat (first, second, or third) had the highest Ks. The null hypothesis that there is no link between repeats and mutation rates predicts that in any given repeat all three thirds of the sequence are equally likely to have the fastest rate of evolution. The alternative hypothesis that the repeat structure does influence Ks would suppose that repeatably one of the three sections will have the highest Ks. Thus for both species comparisons the null expectation was that of the 15 repeats, five sections should have the first third having the highest Ks, five should have the second with highest Ks, etc. The human-cow comparison gave (6, 5, 4) while the mouse-rat comparison gave (7, 3, 5). Neither result was significantly different from the null expectation (the chi-squared test gave P > 0.5 for both comparisons). Thus the null hypothesis of no link between repeats and Ks patterns cannot be rejected, and so we conclude that the repeatability of Ks patterns is probably not due to the repeatability of molecular evolution in repeats. We note, however, that this is a relatively weak test.

Having established that the repeatability in Ks is probably not an artifact, we now turn to the question of why this repeatability is found. We consider three hypotheses (see Introduction for details): either (1) the variation is due to differences in composition along the gene allied with some form of mutational bias, or (2) the variation is due to selection favoring important sites to have low mutation rates (selected mutation rate hypothesis), or (3) Ks variation is the result of varying selection on codon usage (common constraints hypothesis). Hypotheses 1 and 2 assume that silent site mutations are neutral. Hypotheses 2 and 3 propose that the repeatability is selection-driven rather than mutation-driven. We shall start with these.

Adaptive explanations for Ks repeatability:
Here we evaluate the extent to which adaptive explanations for Ks variability and repeatability can explain the data. The common constraints argument predicts that codon bias should be negatively correlated with Ks [because both depend on the strength of selection, for Drosophila and bacterial data see LI 1997 Down], and that there should be a correlation between Ka and Ks (because both are influenced by similar selective pressures). The selected mutation rate argument predicts that regions of the gene under strong stabilizing selection (low Ka) should have low mutation rates (low Ks), and conversely regions under weak stabilizing selection or even positive selection (high Ka) should have high mutation rates (high Ks). For both adaptive hypotheses, Ks repeatability follows from selection patterns being conserved across all four species. Thus the selected mutation rate argument agrees with the common constraints argument in predicting Ks repeatability and a correlation between Ka and Ks, but differs in predicting no negative correlation between codon bias and Ks.

Evidence from codon bias data: To differentiate between the selected mutation rate and common constraints hypotheses, we looked for a relationship between codon bias and Ks within the Igf2r gene. We used the Moriyama package (MORIYAMA and POWELL 1997 Down) to determine effective numbers of codons (ENC; WRIGHT 1990 Down) for the sections. For both species comparisons the ENC data of the two species were averaged. There was no correlation between mouse-rat ENC and Ks (Prank {Gt} 0.1), but the human-cow comparison showed a negative correlation on the edge of significance (Prank = 0.05). A negative correlation between ENC and Ks is equivalent to a positive correlation between codon bias and Ks (the lower the effective number of codons the stronger the codon bias), and so both species comparisons rejected the prediction of the common constraints hypothesis. The codon bias data show us that silent sites do not appear to be affected by selection, and thus Ks can be taken as an indicator of mutation rate.

No correlation between Ks and Ka: Both selective explanations of Ks repeatability predict a positive correlation between Ks and Ka. However, neither the mouse-rat nor the human-cow comparison give a significant positive correlation (Prank > 0.5 for both; see Figure 6). This result provides evidence against both adaptive explanations of Ks repeatability. We note additionally, that presence of a significant positive correlation between Ka and Ks could not have been taken as conclusive evidence in favor of the selected mutation rate argument because there are several other possible explanations for such a correlation (see LI 1997 Down).




View larger version (17K):
In this window
In a new window
Download PPT slide
 
Figure 6. No intragenic correlation between Ka and Ks. The Ka value of each nonoverlapping section is plotted against the corresponding Ks value for both the mouse-rat and the human-cow comparisons. The linear regression lines for both comparisons indicate no significant positive correlation between Ka and Ks.

No correlation between Ks and functional importance: We separated the gene into sections according to predicted functional importance, and then examined whether regions likely a priori to be important (such as binding sites) showed a systematically lower Ks than the rest of the gene. Both the common constraints and selected mutation rate hypotheses would predict lower Ks values for more important sections of the gene because in such regions selection is likely to be more powerful. However, both the mouse-rat and human-cow comparisons gave a higher mean Ka/Ks for the extracellular ligand-binding regions than the extracellular regions not associated with ligand binding (0.27 vs. 0.17 and 0.20 vs. 0.11, respectively). This result contradicts the prediction of greater functional importance of binding sites (with stabilizing selection, the stronger the selection the lower the Ka/Ks values), and thus renders this test inapplicable. However, the observation that the Ks at the Igf2 binding site is not low despite the Ka/Ks being unusually low (see Figure 1 and Figure 5) provides further, albeit circumstantial, evidence against both the common constraints and selected mutation rate hypotheses.

Mutational explanations for Ks repeatability:
Here we consider two possible compositional correlates to Ks variation. First we ask whether GC composition might explain the patterns. Second we ask more particularly whether methylation at CpG sites might explain some of the variance.

GC composition: If local base composition influences mutation rates (and hence Ks), then the repeatability of Ks between highly related (and hence compositionally similar) sequences follows. For example, a correlation between GC content and Ks has been proposed (WOLFE et al. 1989 Down). However, we found no significant correlation between fourfold site GC content and Ks for either the mouse-rat (Prank > 0.1) or human-cow (Prank > 0.1) comparisons. WOLFE et al. 1989 Down is now discredited due to (1) a small sample size effect (WOLFE and SHARP 1993 Down) and (2) a biased estimating protocol (BERNARDI et al. 1997 Down). Correction for both finds no correlation between chromosomal site (defined by GC content) and mutation rate given by Ks (BERNARDI et al. 1997 Down), as we have also found (SMITH and HURST 1998 Down). The finding that the repeats within the Igf2r gene do not show similar Ks plots (see above) also suggests that composition might not explain the patterns of Ks variation.

The influence of methylation: Even though local GC content seems not to influence local Ks, more complicated compositional characters may well influence local Ks (for example, see MORTON et al. 1997 Down), and thus the above result does not falsify the compositional hypothesis. One compositional feature that might well influence Ks patterns, and be responsible for the Ks repeatability under the compositional hypothesis, is the methylation pattern. Methylation is known to affect mutation rates with a methylated cytosine in a CpG pair 10 to 20 times more mutable than an unmodified cytosine (KENDREW 1994 Down).

To test this possibility we examined the frequency of CpG {Rightarrow} TpG mutations. For each segment we found the number of times a CpG was found in one of the two sequences and TpG was found in the other, where the C existed at a third site. This protocol gives all possible synonymous CpG {Rightarrow} TpG changes (we assume all mutational changes to be in this direction). We find that the variation in Ks between segments is correlated to the frequency of such mutations (see Figure 7). The mouse-rat comparison shows a significant correlation (Prank = 0.02), while the human-cow comparison shows a tendency in the same direction (0.1 > Prank > 0.05). We also find that this class of mutations accounts for a reasonable proportion of all mutations (15% in the human-cow comparison, and 21% in the mouse-rat comparison). We conclude that methyl-induced mutations can account for some of the repeatability that we observe.




View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 7. Silent CpG {Rightarrow} TpG changes correlate with Ks. For both comparisons of mouse-rat and human-cow, and for each nonoverlapping window, Ks is plotted against the number of silent CpG {Rightarrow} TpG changes. The correlations show that such methylation-induced mutations provide a possible explanation for repeatable Ks patterns.

Two hypotheses might be considered to account for this pattern. At one extreme, all segments might have an equal number of CpG sites that might change, but these segments differ in the density of methylation. Alternatively, the segments might have the same density of methylation per CpG, but differ in the total number of CpG sites.

We find that variation in total number of mutable CpGs partially explains the pattern in Ks. We calculated the number of times CpG was found in both sequences (i.e., the CpG/CpG frequency) again with the C at the third site. Adding the CpG/CpG figure to the CpG/TpG figure we arrive at a figure for the total number of mutable or mutated silent CpG sites. This total correlates well with Ks (Prank < 0.05 for the mouse-rat comparison, though Prank > 0.1 for the human-cow comparison), indicating that the variance in Ks explained by methylation is to some extent a result of differences between segments in CpG content.

Conversely, we can ask whether segments also differ in their propensity to methylate CpG sites. The methylation of HpaII sites of the mouse E15 embryo Igf2r gene has been shown to vary from 50% to 90% throughout the locus (STOGER et al. 1993 Down), which at least suggests that there might be enough variation in methylation patterns to account for the highly variable mutation rate patterns (Figure 5). Asking whether segments differ significantly in the proportion of CpGs that become TpGs we find heterogeneity between segments (chi-squared test for heterogeneity gives P < 0.05 for the mouse-rat comparison, and 0.5 > P > 0.1 for the human-cow comparison). This supports the view that the observed variation in CpG > TpG changes (and hence Ks) is at least partially due to differences in the methylation density of potentially methylated sites.

This finding contrasts with human polymorphism data. The M6P/IGF2R Information Core (http://www.radonc.duke.edu/~jirtle/homepage.html) provides a human mutation database for IGF2R. Six substitutional mutations resulting in disease symptoms are described, with no C to T mutations. However, this small dataset suffers from an ascertainment bias, in that not all substitutional mutations are equally likely to cause disease symptoms.


*  DISCUSSION
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The mammalian Igf2r gene is multifunctional, and plays an important role in glycoprotein transport via M6P binding (KORNFELD 1992 Down) and growth and development via IGF2 and RA binding (KANG et al. 1998 Down), and also tumor suppression (DESOUZA et al. 1997 Down). In keeping with this functional complexity mammalian Igf2r displays complex patterns of molecular evolution. The significant repeatability of both Ka/Ks and Ks means that these patterns are driven by deterministic forces, and we have tested a number of hypotheses concerning the nature of these forces.

Ka/Ks variation and the conflict theory of imprinting:
A simple prediction of the conflict theory of imprinting is that the Igf2 binding site should exhibit a high rate of molecular evolution driven by positive selection (MCVEAN and HURST 1997B Down; TRIVERS and BURT 1998 Down). Instead we find evidence from Ka/Ks patterns for strong stabilizing selection at the Igf2 binding site in both the mouse-rat and human-cow comparisons. This finding cannot then be dismissed as a statistical artifact. Fast evolution at this site might have provided good evidence supporting the conflict hypothesis (MCVEAN and HURST 1997B Down; TRIVERS and BURT 1998 Down). We have looked for positive selection in other parts of the gene, and have found evidence from PAM matrix data that the signal sequence might be under positive selection. We have shown that signal sequence evolution in Igf2r is exceptional when compared to other examples of signal sequence evolution. This is the first evidence for any form of molecular evolution (sensu stricto) in an imprinted gene that is different from the form of molecular evolution in most nonimmune system/nonplacental genes [note, however, that imprinted genes have a few other unusual properties, such as intronic dimensions (HURST et al. 1996 Down), sex specific recombination rates (PALDI et al. 1995 Down), and repeat structures (NEUMANN et al. 1995 Down)].

The evolution of Igf2r's signal sequence might be a result of antagonistic coevolution, in which case we can speculate that the signal sequence might be coevolving with the gene product of an imprinted gene (or genes) expressed from the paternally derived genome. The antagonism might concern the location of Igf2r. The putative paternally derived product could, in theory, attempt to remove the protein to a cellular location where its effects on Igf2 levels are minimal. It would be interesting to see the effect of site-directed mutagenesis of the signal sequence on the cellular location of Igf2r (no mutations in the IGF2R signal sequence causing human disease are listed at the M6P/IGF2R Information Core at http://www.radonc.duke.edu/~jirtle/homepage.html). Alternatively, the unusual evolution at this site might be a property of Igf2r independent of imprinting. It would then be informative to analyze the molecular evolution of Igf2r in birds for example, assuming that avian Igf2r is not imprinted (as yet, chicken is the only bird for which the Igf2r gene has been sequenced).

Both this study and a previous analysis of the Ka/Ks values of imprinted genes (MCVEAN and HURST 1997B Down) assume that the conflict theory of imprinting predicts rapid protein evolution at the sites of functional antagonism. This assumption is in keeping with the results of studies of classic parent-offspring conflict (see Introduction), though such fast rates of evolution might be due to the acquisition of new roles. However, HAIG 1997 Down has suggested an alternative prediction, that the mutual dependence of the roles of Igf2r and Igf2 leads to an ESS rather than a continuing arms race. Haig suggests that the a priori expectation is unclear. This failure to provide a testable prediction of the rates of evolution at sites of functional antagonism might be considered a weakness of the current models of the conflict theory of imprinting.

Ks variation and the evolution of mutation rates:
The result of Ks repeatability in mammalian Igf2r, along with evidence from codon bias data that the silent sites appear to be neutral, extends our understanding of mutation rate variation: to quote CASANE et al. 1997 Down "it is still not clear whether the mutation pattern is constant over the entire genome of an organism or is variable among regions of the genome." Large scale variation in mammalian mutational patterns has been previously demonstrated on two levels: between different chromosomes [Ks used as a measure of mutation rate in MCVEAN and HURST 1997A Down] and between pseudogenes inserted into different genomic regions (CASANE et al. 1997 Down). The repeatability of Ks patterns within Igf2r provides strong evidence that there is deterministic variation in mutation rates at a fine scale, between different regions within the same gene.

We have tested various hypotheses to explain the Ks repeatability. Since the Ks repeatability does not appear to be a methodological artifact, the remaining hypotheses can be divided into two groups, mutational and adaptive explanations. The selected mutation rate argument holds that local mutation rates, given by Ks, are the results of conflicting selection pressures governing mutation rates (see Introduction). Alternatively, the common constraints argument proposes that silent sites may be under selection, and so Ks is not the mutation rate but instead depends on selective constraints (see Introduction). The lack of a negative correlation between codon bias and Ks provides evidence against the common constraints hypothesis, and the lack of a correlation between Ks and Ka provides evidence against both adaptive explanations. The comparison of local mutation rates and local functional importance provided further, but circumstantial, evidence against both adaptive hypotheses.

Repeatable composition of repeats does not cause the repeatable Ks patterns. We did, however, find that silent C {Rightarrow} T mutations at CpG sites do covary with Ks, suggesting that the variation in Ks is somehow connected with methylation-induced mutational patterns. The variation in silent C {Rightarrow} T mutations at CpG sites appears to be the result of variation both in the number of mutable sites, and in the density of methylation.

It has previously been reported that, at least for GC-rich genes, the variation in Ks does covary with the variation in Ka (ALVAREZ-VALIN et al. 1998 Down). This was interpreted as evidence for selection differentially affecting codon usage across the gene. Were this the only force affecting Ks variation, then one would not have predicted Ks repeatability in a gene, such as Igf2r, in which Ka and Ks do not covary. We hence conclude that there must be some other forces determining variation in Ks. Variation in silent CpG mutation appears in our case to be an important variable and it is noteworthy that analysis of GC content alone did not detect this.


*  ACKNOWLEDGMENTS

We thank Etsuko Moriyama and Josep Comeron for providing us with their programs for evolutionary analysis, Gil McVean for suggestions concerning methodology, David Haig for discussions on imprinting, Denise Barlow for advice on methylation data, and two anonymous referees for suggesting many improvements.

Manuscript received March 2, 1998; Accepted for publication June 30, 1998.


*  LITERATURE CITED
*TOP
*ABSTRACT
*Ks variation, mutation rate...
*Ka/Ks variation and the...
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ALTSCHUL, S. F., W. GISH, W. MILLER, E. W. MYERS, and D. J. LIPMAN, 1990  Basic local alignment search tool. J. Mol. Biol. 215:403-410[Medline].

ALVAREZ-VALIN, F., K. JABBARI, and G. BERNARDI, 1998  Synonymous and nonsynonymous substitutions in mammalian genes: intragenic correlations. J. Mol. Evol. 46:37-44[Medline].

BENNER, S. A., M. A. COHEN, and G. H. GONNET, 1994  Amino-acid substitution during functionally constrained divergent evolution of protein sequences. Prot. Eng. 7:1323-1332[Abstract/Free Full Text].

BERNARDI, G., D. MOUCHIROUD and C. GAUTIER, 1997 Isochores and synonymous substitutions in mammalian genes, pp. 137–168 in DNA and Protein Sequence Analysis, edited by M. J. BISHOP and C. J. RAWLINGS. Oxford University Press, Oxford.

CASANE, D., S. BOISSINOT, B. H. J. CHANG, L. C. SHIMMIN, and W. H. LI, 1997  Mutation pattern variation among regions of the primate genome. J. Mol. Evol. 45:216-226[Medline].

CHORNEY, M. J., K. CHORNEY, N. SEESE, M. J. OWEN, and J. DANIELS et al., 1998  A quantitative trait locus associated with cognitive ability in children. Psychol. Sci. 9:159-166.

COMERON, J. M., 1995  A method for estimating the numbers of synonymous and nonsynonymous substitutions per site. J. Mol. Evol. 41:1152-1159[Medline].

DENNIS, P. A. and D. B. RIFKIN, 1991  Cellular activation of latent transforming growth factor beta requires binding to the cation-independent mannose 6-phosphate/insulin-like growth factor type II receptor. Proc. Natl. Acad. Sci. USA 88:580-584[Abstract/Free Full Text].

DESOUZA, A. T., T. YAMADA, J. J. MILLS, and R. L. JIRTLE, 1997  Imprinted genes in liver carcinogenesis. FASEB J. 11:60-67[Abstract].

DRAKE, J. W., 1991  A constant rate of spontaneous mutation in DNA-based microbes. Proc. Natl. Acad. Sci. USA 88:7160-7164[Abstract/Free Full Text].

DURET, L., D. MOUCHIROUD, and M. GOUY, 1994  HOVERGEN—a database of homologous vertebrate genes. Nucleic Acids Res. 22:2360-2365[Abstract/Free Full Text].

EFSTRATIADIS, A., 1994  Parental imprinting of autosomal mammalian genes. Curr. Opin. Genet. Dev. 4:265-280[Medline].

GENETICS COMPUTER GROUP, 1994 Program Manual for the Wisconsin Package, Version 8, September 1994, Genetics Computer Group, 575 Science Drive, Madison, WI.

HAIG, D., 1997  Parental antagonism, relatedness asymmetries, and genomic imprinting. Proc. R. Soc. Lond. B 264:1657-1662[Medline].

HUGHES, A. L., M. K. HUGHES, C. Y. HOWELL, and M. NEI, 1994  Natural-selection at the class-II major histocompatibility complex loci of mammals. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 346:359-366[Medline].

HURST, L. D., 1997 Evolutionary theories of genomic imprinting, pp. 211–237 in Frontiers in Molecular Biology: Genomic Imprinting in Mammals, edited by W. REIK and A. SURANI. Oxford University Press, Oxford.

HURST, L. D. and G. T. MCVEAN, 1997  Growth effects of uniparental disomies and the conflict theory of genomic imprinting. Trends Genet. 13:436-443[Medline].

HURST, L. D., G. T. MCVEAN, and T. MOORE, 1996  Imprinted genes have few and small introns. Nat. Genet. 12:234-237[Medline].

IZARD, J. W., S. L. RUSCH, and D. A. KENDALL, 1996  The amino-terminal charge and core region hydrophobicity interdependently contribute to the function of signal sequences. J. Biol. Chem. 271:21579-21582[Abstract/Free Full Text].

KANG, J. K., Y. LI, and A. LEAF, 1998  Mannose-6-phosphate/insulin-like growth factor-II receptor is a receptor for retinoic acid. Proc. Natl. Acad. Sci. USA 95:13671-13676.

KENDREW, J. (Editor), 1994 The Encyclopedia of Molecular Biology. Blackwell, Oxford.

KIMURA, M., 1980  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16:111-120[Medline].

KIMURA, M., 1983 The Neutral Theory of Evolution. Cambridge University Press, Cambridge, UK.

KORNFELD, S., 1992  Structure and function of the mannose 6-phosphate insulin-like growth factor-II receptors. Annu. Rev. Biochem. 61:307-330[Medline].

LI, W. H., 1993  Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36:96-99[Medline].

LI, W. H., 1997 Molecular Evolution. Sinauer Associates, Sunderland, MA.

MCVEAN, G. T. and L. D. HURST, 1997a  Evidence for a selectively favourable reduction in the mutation rate of the X chromosome. Nature 386:388-392[Medline].

MCVEAN, G. T. and L. D. HURST, 1997b  Molecular evolution of imprinted genes: no evidence for antagonistic coevolution. Proc. R. Soc. Lond. Ser. B Biol. Sci. 264:739-746[Medline].

MILLS, J. J., J. G. FALLS, A. T. DESOUZA, and R. C. JIRTLE, 1998  Imprinted MGP/Igf2 receptor is mutated in rat liver tumours. Oncogene 16:2797-2802[Medline].

MOCHIZUKI, A., Y. TAKEDA, and Y. IWASA, 1996  The evolution of genomic imprinting. Genetics 144:1283-1295[Abstract].

MOORE, T. and D. HAIG, 1991  Genomic imprinting in mammalian development: a parental tug-of-war. Trends Genet. 7:45-49[Medline].

MORGAN, D. O., J. C. EDMAN, D. N. STANDRING, V. A. FRIED, and M. C. SMITH et al., 1987  Insulin-like growth factor-ii receptor as a multifunctional binding-protein. Nature 329:301-307[Medline].

MORIYAMA, E. N. and J. R. POWELL, 1997  Synonymous substitution rates in Drosophila: Mitochondrial versus nuclear genes. J. Mol. Evol. 45:378-391[Medline].

MORTON, B. R., V. M. OBERHOLZER, and M. T. CLEGG, 1997  The influence of specific neighboring bases on substitution bias in noncoding regions of the plant chloroplast genome. J. Mol. Evol. 45:227-231[Medline].

MOUCHIROUD, D., C. GAUTIER, and G. BERNARDI, 1995  Frequencies of synonymous substitutions in mammals are gene-specific and correlated with frequencies of nonsynonymous substitutions. J. Mol. Evol. 40:107-113[Medline].

NEUMANN, B., P. KUBICKA, and D. P. BARLOW, 1995  Characteristics of imprinted genes. Nat. Genet. 9:12-13[Medline].

OGAWA, O., D. M. BECROFT, I. M. MORISON, M. R. ECCLES, and J. E. SKEEN et al., 1993  Human insulin-like growth-factor type-I and type-II receptors are not imprinted. Hum. Mol. Genet. 2:2163-2165[Abstract/Free Full Text].

PALDI, A., G. GYAPAY, and J. JAMI, 1995  Imprinted chromosomal regions of the human genome display sex-specific meiotic recombination frequencies. Curr. Biol. 5:1030-1035[Medline].

PESOLE, G., G. DELLISANTI, G. PREPARATA, and C. SACCONE, 1995  The importance of base composition in the correct assessment of genetic-distance. J. Mol. Evol. 41:1124-1127.

SMITH, N. G. C. and L. D. HURST, 1998  Sensitivity of patterns of molecular evolution to alterations in methodology. J. Mol. Evol. in press.

SPENCER, H. G., M. W. FELDMAN, and A. G. CLARK, 1998  Genetic conflicts, multiple paternity and the evolution of genomic imprinting. Genetics 148:893-904[Abstract/Free Full Text].