- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Eyre-Walker, A.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Eyre-Walker, A.
Evidence of Selection on Silent Site Base Composition in Mammals: Potential Implications for the Evolution of Isochores and Junk DNA
Adam Eyre-Walkeraa Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton, BN1 9QG, United Kingdom
Corresponding author: Adam Eyre-Walker
| ABSTRACT |
|---|
It has been suggested that mutation bias is the major determinant of base composition bias at synonymous, intron, and flanking DNA sites in mammals. Here I test this hypothesis using population genetic data from the major histocompatibility genes of several mammalian species. The results of two tests are inconsistent with the mutation hypothesis in coding, noncoding, CpG-island, and non-CpG-island DNA, but are consistent with selection or biased gene conversion. It is argued that biased gene conversion is unlikely to affect silent site base composition in mammals. The results therefore suggest that selection is acting upon silent site G + C content. This may have broad implications, since silent site base composition reflects large-scale variation in G + C content along mammalian chromosomes. The results therefore suggest that selection may be acting upon the base composition of isochores and large sections of junk DNA.
BASE composition varies along mammalian chromosomes over scales of hundreds of kilobases to megabases, with G + C contents fluctuating from ~35% up to ~55% (![]()
![]()
![]()
![]()
The reason for the compositional variation along mammalian chromosomes remains unknown. It has been variously suggested that it is due to mutation bias (![]()
![]()
![]()
![]()
![]()
![]()
Here I consider whether mutation bias is responsible for variation in the G + C content of synonymous sites, and intron and flanking DNA sequences, using two population genetic tests. The first of these is a derivative of a test suggested by ![]()
![]()
![]()
![]()
![]()
![]()
AT substitutions must equal the number of AT
GC substitutions (by definition), and hence there must be equal numbers of GC
AT and AT
GC mutations (![]()
AT and AT
GC mutations as AT and GC mutations, respectively. The frequency distributions of AT and GC mutations segregating in a population are also expected to be the same under an infinite-sites model, since all neutral mutations have the same (average) frequency distribution (![]()
![]()
There are therefore two independent tests of whether the composition of a sequence is determined by mutation bias: a test of whether the number of GC mutations equals the number of AT mutations and a test of whether the frequency distributions of AT and GC mutations are the same. I have applied these two tests to polymorphisms segregating at synonymous, intron, and flanking DNA sites, in the MHC genes of several mammals. I follow ![]()
MHC genes were used because they are highly polymorphic and have been extensively sequenced in mammals. However, MHC genes pose a number of potential problems for an analysis of this sort. First, the alleles are not a random sample; generally alleles have been sequenced because they have a unique amino acid sequence; and second, there is balancing selection upon some of the amino acid variants (![]()
![]()
![]()
![]()
![]()
![]()
| MATERIALS AND METHODS |
|---|
Data:
MHC genes for which multiple alleles have been sequenced within a mammalian species were extracted from alignments held on the EBI server (ftp://ftp.ebi.ac.uk/pub/databases/hla/) in the case of the human HLA-A/B/C, DPB1, DQA1, DQB1, and DRB1 genes, or from GenBank. Care was taken to ensure that sequences were genuinely allelic, and not alleles from different paralogous loci. The HLA-CW*0301 allele was removed from the EBI alignment because it is incorrect according to the associated "readme" file. The transmembrane encoding exon of PERB11.1 was excluded since there is a frameshift in some sequences (![]()
Stationarity tests:
The two population genetic tests detailed above are only valid when the base composition of the sites under consideration is stationary (i.e., not changing systematically). To test for stationary base composition, I used the test suggested by ![]()
![]()
![]()
Infinite sites:
Both population genetic tests assume that each polymorphism we observe arose from a single mutation. To investigate whether the data are consistent with the infinite-sites model, two statistics were calculated for each coding, exon, intron, and flanking DNA sequence: the average pairwise silent site divergence, otherwise known as the nucleotide diversity (
), and the maximum silent site divergence between alleles. The maximum silent site divergence was calculated to ensure there were not large differences between allelic classes. In an ideal population
=
(![]()
< 0.1 the model conforms fairly closely to the infinite-sites model. Silent site divergences were calculated as the proportion of silent sites that had a silent difference. The analysis was restricted to those codons with no nonsynonymous variation segregating in the sample and to codons for which there were at least 10 sequences.
Polymorphism analysis:
If a sequence is subject solely to mutation bias then under the infinite-sites model the number of GC mutations equals the number of AT mutations, and their frequency distributions are the same. The direction of the mutation that gave rise to each polymorphism was inferred from the frequency of the alleles segregating at each site; the more common nucleotide was inferred to be the ancestral nucleotide. This method is unbiased under the null hypothesis and was chosen because parsimony is unreliable when base compositions are biased (![]()
![]()
![]()
Whether the number of AT mutations segregating was the same as the number of GC mutations in a gene was tested using a two-tailed binomial test. The frequency distributions of AT and GC mutations were compared for each gene using a two-tailed Mann-Whitney test. The difference in the average frequency of AT and GC mutations was also calculated for each gene. Note the average frequency in each case was calculated using only those sites at which a polymorphism was segregating. A two-tailed single sample t-test was performed on these frequency differences to test whether GC mutations are on average segregating at higher or lower frequencies than AT mutations across genes.
Shared polymorphisms:
The three human class I genes and two mouse class I genes are fairly closely related, so they were checked for shared polymorphisms. HLA-A, -B, and -C coding sequences share 8 out of 95 scored synonymous AT and GC polymorphisms; HLA-A, -B, and -C intron 1 sequences share 2 out of 30 polymorphisms; H-2K and H-2D share 4 out of 50 polymorphisms; and H-2D and H-2K 3' flanking sequences share 3 out of 36 polymorphisms. The results from these genes are therefore largely independent.
CpG islands:
CpG islands were identified using two techniques. First, the position of CpG and GpC dinucleotides was plotted along the genomic sequence or, when the genomic sequence was not available, along the mRNA sequence. CpG islands are evident on this basis as regions in which the frequency of CpG is similar to the frequency of GpC, whereas outside the island, the frequency of CpG is considerably lower [see examples in ![]()
![]()
| RESULTS |
|---|
Before we can test whether the base composition of silent sites is solely due to mutation bias we need to ensure that the base composition of the sequences is stationary and that the infinite-sites model holds. If both of these conditions are met, and the G + C content of silent sites is due to mutation bias alone, then we expect the number of silent AT mutations segregating in a population to be equal to the number of GC mutations and their frequency distributions to be same.
Stationary base composition:
If the base composition of a sequence is changing in a systematic fashion, it is possible to explain any pattern of polymorphism in terms of changes in the pattern of mutation (![]()
![]()
![]()
gene is significantly higher in G + C content than the human DMA gene.
|
Infinite-sites model:
Under the infinite-sites model each mutation occurs at a unique site, so each polymorphism is the product of a single mutation; i.e., there have been no multiple hits. To investigate whether the infinite-sites model is reasonable for silent site polymorphisms in MHC genes, the average and the maximum pairwise silent site divergences between alleles were calculated for each sequence (Table 2). The values are low; the maximum silent site divergence between any pair of alleles is 0.15, and most of the nucleotide diversities are below 0.05. The average and maximum pairwise divergences for individual exons are also low and fairly consistent along each gene; e.g., for HLA-A the nucleotide diversities for the eight exons are 0.039, 0.031, 0.016, 0.033, 0.024, 0.027, 0.003, and 0.000; for the three DQB1 exons for which we have data, they are 0.049, 0.022, and 0.045. In an ideal population this level of nucleotide diversity would be consistent with the infinite-sites model; i.e., when
= 0.05,
4Neu. The data therefore suggest that the infinite-sites assumption is reasonable, a conjecture supported by the observation that out of 514 polymorphic sites that are able to have more than two nucleotides segregating, only 15 sites actually have more than two.
|
Polymorphism analysis:
Since the sequences appear to be stationary in base composition and conform to the infinite-sites model, we expect the number of AT mutations segregating to equal the number of GC mutations, if mutation is responsible for the bias in silent site G + C content. For coding sequences, however, the number of synonymous AT mutations exceeds the number of GC mutations with only one exception in 15 MHC genes that show a difference (P < 0.001; Table 3). The difference is significant for many genes individually, for all three groups of organisms considered (Table 3), and overall (Table 4).
|
|
The frequency distributions of synonymous GC and AT mutations are also expected to be the same if mutation bias is responsible for synonymous codon bias. For 11 out of 12 genes for which there are both AT and GC mutations segregating, the average frequency of GC mutations is greater than the average frequency of AT mutations (P < 0.01). The average frequency of GC mutations is also significantly greater than the average frequency of AT mutations across human genes and over the whole data set (Table 4). The average difference in the frequency of GC and AT mutations is also nearly significant for the mouse genes (P < 0.08).
The patterns are similar for noncoding sequences. In all but two sequences the number of AT mutations is greater than the number of GC mutations (P < 0.05). The number of AT mutations is significantly greater than the number of GC mutations for a number of genes individually, in both humans and mice, and over the whole data set (Table 4). However, the average frequency of GC mutations is only significantly greater than the average frequency of AT mutations for the mouse sequences. This may be due to the fact that the frequency distribution test is weaker than the numbers of polymorphisms test.
Sequencing errors:
These patterns of polymorphism are not due to sequencing errors. If we repeat the analyses removing all singletons (i.e., sites at which one of the alleles is present as a single copy), the results remain qualitatively unchanged (Table 4). There is a large excess of AT mutations segregating in both coding and noncoding sequences, and for synonymous mutations the average frequency of GC mutations is significantly greater than the average frequency of AT mutations.
CpG islands:
Many of the MHC genes contain a CpG island (see MATERIALS AND METHODS for a list). These are short (~1 kb), G + C biased sequences, rich in the dinucleotide CpG, which have been implicated in the control of gene expression (![]()
![]()
The CpG-island DNA may give more significant results because on average CpG-island DNA is more G + C rich than non-CpG-island DNA (0.856 vs. 0.625 for coding sequences; 0.768 vs. 0.500 for noncoding). Under the alternative hypotheses of selection and biased gene conversion, the difference between the number of AT and GC mutations, and the difference between their average frequencies, are expected to be greater in sequences of high G + C content (see below).
| DISCUSSION |
|---|
The pattern of silent site polymorphism in mammalian MHC sequences is inconsistent with the action of mutation bias: we expect equal numbers of AT and GC mutations segregating in the population, but we observe significantly more AT mutations than GC mutations; furthermore, we expect GC and AT mutations to be segregating at similar frequencies, but silent GC mutations are segregating at significantly higher frequencies than AT mutations.
Stationarity:
If the sequences are not stationary, however, then it is always possible to explain the pattern of polymorphism in terms of changes in the pattern of mutation (![]()
![]()
![]()
![]()
Three observations suggest that the pattern of silent site polymorphism is not due to a change in the pattern of mutation. First, there is no evidence that the sequences are undergoing systematic changes in composition (Table 1). Second, there would have to have been at least two independent changes in the mutation pattern to explain the synonymous site data. Since none of the species used in this study share polymorphisms, we would need one recent change in the mutation pattern to explain the significant synonymous site frequency distribution result in humans. We would then need another change to explain the significant excess of AT mutations in the other organisms. If we note that the frequency distribution result is also approaching a significant level in mice, we might need as many as three independent changes in the mutation pattern, in the same direction, to explain the data. This seems improbable. Third, the change in the mutation pattern would have to be extreme. We can estimate the change in the mutation pattern that would explain the polymorphism data and express it as the change in the G + C content that would eventually result [see Equation 9 in ![]()
Infinite sites:
If the infinite-sites assumption of the two tests is violated we expect an excess of AT mutations segregating, with GC mutations segregating at higher frequencies on average, in G + C biased sequences. However, two lines of evidence suggest that the infinite-sites assumption is reasonable. First, only 3% of the polymorphic sites have more than two nucleotides segregating (excluding twofold degenerate sites); the number of multiple hits must therefore be low. Second, the silent site nucleotide diversities are low; for most genes the nucleotide diversity is <0.05, and this is also true for synonymous variation in most individual exon sequences. This level of nucleotide diversity would be consistent with the infinite-sites assumption in an ideal population since 4Neu/(1 + 8Neu)
4Neu when
< 0.1.
It is possible to calculate the proportion of mutations that would be classified as AT in an ideal population if mutation bias is the cause of compositional bias (see Appendix 1). The proportion of mutations classified as AT increases with
and, to some extent, with sample size (Figure 1). For all but one gene, sheep DQA2, the nucleotide diversity is below 0.05, and the average across genes is 0.023; with
= 0.05 we expect at most 56% of the mutations to be classified as AT in sequences that are 80% G + C. In contrast, 80% (±4%) of the polymorphisms in sequences (individual exons, introns, and flanking sequences) with silent site G + C contents between 75 and 85% are AT in the MHC sequences examined. To explain these values
would have to be ~0.2, which is ~10 times greater than the average nucleotide diversity observed.
|
It is ultimately difficult to conclusively prove that the results are not due to departures from an infinite-sites model. However, the available evidence suggests the infinite-sites assumption is reasonable. It therefore seems unlikely that the patterns of silent site polymorphism in MHC genes are due to mutation biases.
Alternative explanations:
The pattern of silent site polymorphism is, however, consistent with either selection or biased gene conversion (BGC). Under both of these hypotheses the pattern of polymorphism is expected to be biased in one direction, with the less common type of mutation segregating at higher frequencies on average. This can be demonstrated formally for directional selection and biased gene conversion, but it is also evident from the following intuitive argument, which also applies to stabilizing selection. Let us assume for simplicity that the rate at which a G or C site mutates to an A or T is equal to the rate of the reverse process (i.e., there is no mutation bias). Consider a sequence in which selection or biased gene conversion has elevated the G + C content from 50 to 80%; it is evident that 80% of the new mutations in each generation must be GC
AT changes, and 20% AT
GC changes (ignoring A
T and G
C mutations). Furthermore, if we assume the sequence is stationary in composition, the number of GC
AT substitutions must equal the number of AT
GC substitutions; therefore the fixation probability of each GC mutation must be higher than each AT mutation, since GC mutations are rarer; hence we expect GC mutations to segregate at higher frequencies. The results from the MHC genes are therefore consistent with selection or BGC acting in most sequences to elevate the G + C content above that expected under mutation alone.
Gene conversion:
Gene conversion is thought to have played a major role in generating haplotype diversity in MHC genes (![]()
![]()
1 to affect base composition (![]()
![]()
Two observations suggest silent site G + C content of the MHC genes is not affected by biased gene conversion. First, the process of recombination has been extensively studied in the mouse MHC, but BGC has almost never been observed either in recombinant or nonrecombinant chromosomes (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Selection:
Directional and stabilizing selection are both consistent with the population genetic results reported here. Furthermore, stabilizing selection can explain why third position G + C content is greater than intron G + C content. If there is an optimal G + C content for a sequence, third position G + C content would be expected to compensate for the constraint upon the first two codon positions imposed by protein function and, hence, be greater than intron G + C content (![]()
![]()
![]()
![]()
CpG islands:
CpG islands have high G + C content. This is perhaps not surprising given that CpG dinucleotides occur at very low frequencies outside CpG islands. However, the difference in G + C content between island and nonisland DNA appears to be greater than one would expect from the suppression of CpGs in nonisland DNA. For example, the G + C content of synonymous sites in our sample is 86% in the island, compared to 63% in the nonisland DNA; it is 77 vs. 50% in the noncoding sequences. This difference appears too great to be attributed to the removal of one dinucleotide. The results presented here suggest that there might be selection to increase the G + C content of the CpG-island DNA. This may not be surprising given that CpG islands have been implicated in the control of gene expression (![]()
![]()
Isochores:
Third codon position and intron G + C contents are highly correlated to the G + C content of a large block (>100 kb) of DNA, or isochore, in which a gene resides (![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
I am very grateful to Monty Slatkin who suggested using MHC genes in this analysis and to Hiroshi Akashi, Nick Barton, Adrian Bird, Brian Charlesworth, Brandon Gaut, Shirin Khambata, Richard Kliman, Monty Slatkin, John Maynard Smith, Colm O'hUigin, Joel Peck, three anonymous referees, and Jody Hey for many helpful discussions and comments on this manuscript. The author is supported by the Royal Society.
Manuscript received May 8, 1998; Accepted for publication February 22, 1999.
| APPENDIX 1 |
|---|
Let us consider a two-allele system in which the mutation rate from GC to AT is u and the mutation rate from AT to GC is v. In a diploid the equilibrium distribution of the gene frequency of GC mutations, x, is
![]() |
(A1) |

where Ne is the effective population size (![]()
![]() |
(A2) |
(![]()
![]() |
(A3) |
The expected nucleotide diversity is
![]() |
(A4) |
The factor n/(n - 1) corrects for bias caused by sampling error.
| LITERATURE CITED |
|---|
AISSANI, B., G. D'ONOFRIO, D. MOUCHIROUD, K. GARDINER, and C. GAUTIER et al., 1991 The compositional properties of human genes. J. Mol. Evol. 32:493-503[Medline].
AKASHI, H., 1995 Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 139:1067-1076[Abstract].
AKASHI, H. and S. W. SCHAEFFER, 1997 Natural selection and the frequency distributions of "silent" DNA polymorphism in Drosophila. Genetics 146:295-307[Abstract].
BALLARD, J. W. and M. KREITMAN, 1994 Unraveling selection in the mitochondrial genome of Drosophila. Genetics 138:757-772[Abstract].
BERNARDI, G., 1995 The human genome: organization and evolutionary history. Annu. Rev. Genet. 29:445-476[Medline].
BERNARDI, G. and G. BERNARDI, 1986 Compositional constraints and genome evolution. J. Mol. Evol. 24:1-11[Medline].
BERNARDI, G., B. OLOFSSON, J. FILIPSKI, M. ZERIAL, and J. SALINAS et al., 1985 The mosaic genome of warm blooded vertebrates. Science 228:953-958
BIRD, A. P., 1987 CpG islands as gene markers in the vertebrate nucleus. Trends Genet. 3:342-347.
CLAY, O., S. CACCIO, Z. ZOUBAK, D. MOUCHIROUD, and G. BERNARDI, 1996 Human coding and noncoding DNA: compositional correlations. Mol. Phylogenet. Evol. 5:2-12[Medline].
COLLINS, T. M., P. H. WIMBERGER, and G. J. P. NAYLOR, 1994 Compositional bias, character state bias and character state reconstruction using parsimony. Syst. Biol. 43:482-496.
CROSS, S. H. and A. P. BIRD, 1995 CpG islands and genes. Curr. Opin. Genet. Dev. 5:309-314[Medline].
CROW, J., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Harper & Row, New York.
CULLEN, M., J. NOBLE, H. ERLICH, K. THORPE, and S. BECK et al., 1997 Characterization of recombination in the HLA class II region. Am. J. Hum. Genet. 60:397-407[Medline].
EYRE-WALKER, A., 1992 Studies of synonymous codon evolution in mammals. Ph.D. Thesis. University of Edinburgh, Edinburgh, Scotland.
EYRE-WALKER, A., 1993 Recombination and mammalian genome evolution. Proc. R. Soc. Lond. Ser. B 252:237-243[Medline].
EYRE-WALKER, A., 1994 DNA mismatch repair and synonymous codon evolution in mammals. Mol. Biol. Evol. 11:88-98[Abstract].
EYRE-WALKER, A., 1997 Differentiating selection and mutation bias. Genetics 147:1983-1987[Medline].
EYRE-WALKER, A., 1998 Problems with parsimony in sequences of biased base composition. J. Mol. Evol. 47:686-690[Medline].
FILIPSKI, J., 1987 Correlation between molecular clock ticking, codon usage,
delity of DNA repair, chromosome banding and chromatin compactness in germline cells. FEBS Lett. 217:184-186[Medline].
GARDINER-GARDEN, M. and M. FROMMER, 1987 CpG islands in vertebrate genomes. J. Mol. Biol. 196:261-282[Medline].
GAUDIERI, S., C. LEELAYUWAT, D. C. TOWNEND, J. MULLBERG, and D. COSMAN et al., 1997 Allelic and interlocus comparison of the PERB11 multigene family in the MHC. Immunogenetics 45:209-216[Medline].
HOLMQUIST, G. P., 1992 Chromosome bands, their chromatin flavors and their functional features. Am. J. Hum. Genet. 51:17-37[Medline].
HUGHES, A. L. and M. YEAGER, 1997 Comparative evolutionary rates of introns and exons in murine rodents. J. Mol. Evol. 45:125-130[Medline].
KHAMBATA, S., 1996 Analysis of the meiotic recombinational hotspot associated with the Ea gene of the mouse major histocompatibility complex. Ph.D. Thesis. Rutgers University, New Brunswick, NJ.
KHAMBATA, S., J. MODY, A. MODZELEWSKI, D. HEINE, and H. C. PASSMORE, 1996 Ea recombinational hot spot in the mouse major histocompatibility complex maps to the fourth intron of the Ea gene. Genome Res. 6:195-201
KOBORI, J. A., E. STRAUSS, K. MINARD, and L. HOOD, 1986 Molecular analysis of the hotspot of recombination in the murine major histocompatibility. Science 234:173-179
LEWIS, J. and A. P. BIRD, 1991 DNA methylation and chromatin structure. FEBS Lett. 205:155-159.
LI, W.-H., 1997 Molecular Evolution. Sinauer Associates, Sunderland, MA.
LI, W.-H. and L. A. STADLER, 1991 Low nucleotide diversity in man. Genetics 129:513-523[Abstract].
MACDONALD, J. H. and M. KREITMAN, 1991 Adaptive evolution at the Adh locus in Drosophila.. Nature 351:652-654[Medline].
PADGETT, K. A., D. C. SHREFFLER, and B. K. SATA, 1991 Molecular mapping of murine I region recombinants. J. Immunol. 147:2764-2770
PARHAM, P. and T. OHTA, 1996 Population biology of antigen presentation by MHC class I molecules. Science 272:67-74[Abstract].
PERNA, N. T. and T. D. KOCHER, 1995 Unequal base frequencies and the estimation of substitution rates. Mol. Biol. Evol. 12:359-361.
SANT'ANGELO, D. B., W. P. LAFUSE, and H. C. PASSMORE, 1992 Evidence that nucleotide sequence identity is a requirement for meiotic crossing over within the mouse Eb recombinational hot spot. Genomics 13:1334-1336[Medline].
SAWYER, S., D. E. DYKHUIZEN, and D. L. HARTL, 1987 Confidence interval for the number of selectively neutral amino acid polymorphisms. Proc. Natl. Acad. Sci. USA 84:6225-6228
SHIROISHI, T., T. SAGAI, N. HANZAWA, H. GOTOH, and K. MORIWAKI, 1991 Genetic control of sex-dependent meiotic recombination in the major histocompatibility complex of the mouse. EMBO J. 10:681-686[Medline].
SUOEKA, N., 1988 Directional mutation pressure and neutral molecular evolution. Proc. Natl. Acad. Sci. USA 85:2653-2657
SZOSTAK, J. W., T. L. ORR-WEAVER, R. J. ROTHSTEIN, and F. W. STAHL, 1983 The double-strand-break model for recombination. Cell 33:25-35[Medline].
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595
UEMATSU, Y., H. KIEFER, R. SCHULZE, K. FISCHER-LINDAHL, and M. STEINMETZ, 1986 Molecular characterization of a meiotic recombinational hotspot enhancing homologous equal crossing-over. EMBO J. 5:2123-2129[Medline].
WOLFE, K. H., P. M. SHARP, and W.-H. LI, 1989 Mutation rates differ among regions of the mammalian genome. Nature 337:283-285[Medline].
This article has been cited by other articles:
![]() |
F. Squartini and P. F. Arndt Quantifying the Stationarity and Time Reversibility of the Nucleotide Substitution Process Mol. Biol. Evol., December 1, 2008; 25(12): 2525 - 2535. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. R. Dreszer, G. D. Wall, D. Haussler, and K. S. Pollard Biased clustered substitutions in the human genome: The footprints of male-driven biased gene conversion Genome Res., October 1, 2007; 17(10): 1420 - 1430. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Hernandez, S. H. Williamson, L. Zhu, and C. D. Bustamante Context-Dependent Mutation Rates May Cause Spurious Signatures of a Fixation Bias Favoring Higher GC-Content in Humans Mol. Biol. Evol., October 1, 2007; 24(10): 2196 - 2202. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bernardi The neoselectionist theory of genome evolution PNAS, May 15, 2007; 104(20): 8385 - 8390. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Schmegner, J. Hoegel, W. Vogel, and G. Assum The Rate, Not the Spectrum, of Base Pair Substitutions Changes at a GC-Content Transition in the Human NF1 Gene Region: Implications for the Evolution of the Mammalian Genome Structure Genetics, January 1, 2007; 175(1): 421 - 428. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Comeron Weak selection and recent mutational changes influence polymorphic synonymous mutations in humans PNAS, May 2, 2006; 103(18): 6940 - 6945. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Shabalina, A. Y. Ogurtsov, and N. A. Spiridonov A periodic pattern of mRNA secondary structure created by the genetic code. Nucleic Acids Res., January 1, 2006; 34(8): 2428 - 2437. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Galtier, E. Bazin, and N. Bierne GC-Biased Segregation of Noncoding Polymorphisms in Drosophila Genetics, January 1, 2006; 172(1): 221 - 228. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Balakirev, V. R. Chechetkin, V. V. Lobzin, and F. J. Ayala Entropy and GC Content in the {beta}-esterase Gene Cluster of the Drosophila melanogaster Subgroup Mol. Biol. Evol., October 1, 2005; 22(10): 2063 - 2072. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Gaffney and P. D. Keightley The scale of mutational variation in the murid genome Genome Res., August 1, 2005; 15(8): 1086 - 1094. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Vinogradov Noncoding DNA, isochores and gene expression: nucleosome formation potential Nucleic Acids Res., January 26, 2005; 33(2): 559 - 563. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Bazin, L. Duret, S. Penel, and N. Galtier Polymorphix: a sequence polymorphism database Nucleic Acids Res., January 1, 2005; 33(suppl_1): D481 - D484. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Kern and D. J. Begun Patterns of Polymorphism and Divergence from Noncoding Sequences of Drosophila melanogaster and D. simulans: Evidence for Nonequilibrium Processes Mol. Biol. Evol., January 1, 2005; 22(1): 51 - 62. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Moury Differential Selection of Genes of Cucumber Mosaic Virus Subgroups Mol. Biol. Evol., August 1, 2004; 21(8): 1602 - 1611. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Comeron Selective and Mutational Patterns Associated With Gene Expression in Humans: Influences on Synonymous Composition and Intron Presence Genetics, July 1, 2004; 167(3): 1293 - 1304. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Kudla, A. Helwak, and L. Lipinski Gene Conversion and GC-Content Evolution in Mammalian Hsp70 Mol. Biol. Evol., July 1, 2004; 21(7): 1438 - 1444. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Lercher, J.-V. Chamary, and L. D. Hurst Genomic Regionali |









