- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Duret, L.
- Articles by Galtier, N.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Duret, L.
- Articles by Galtier, N.
Vanishing GC-Rich Isochores in Mammalian Genomes
Laurent Dureta, Marie Semona, Gwenaël Piganeaub, Dominique Mouchirouda, and Nicolas Galtierca Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558 Université Claude Bernard Lyon 1, 69622 Villeurbanne Cedex, France,
b Centre for the Study of Evolution, School of Biological Sciences, Falmer, Brighton BN1 9QG, United Kingdom
c Laboratoire Génome, Populations, Interactions, UMR CNRS 5000 Université Montpellier 2, 34095 Montpellier Cedex 5, France
Corresponding author: Laurent Duret, UMR CNRS 5558 Université Claude Bernard Lyon 1, 16 rue Raphaël Dubois, 69622 Villeurbanne Cedex, France., duret{at}biomserv.univ-lyon1.fr (E-mail)
Communicating editor: P. D. KEIGHTLEY
| ABSTRACT |
|---|
To understand the origin and evolution of isochoresthe peculiar spatial distribution of GC content within mammalian genomeswe analyzed the synonymous substitution pattern in coding sequences from closely related species in different mammalian orders. In primate and cetartiodactyls, GC-rich genes are undergoing a large excess of GC
AT substitutions over AT
GC substitutions: GC-rich isochores are slowly disappearing from the genome of these two mammalian orders. In rodents, our analyses suggest both a decrease in GC content of GC-rich isochores and an increase in GC-poor isochores, but more data will be necessary to assess the significance of this pattern. These observations question the conclusions of previous works that assumed that base composition was at equilibrium. Analysis of allele frequency in human polymorphism data, however, confirmed that in the GC-rich parts of the genome, GC alleles have a higher probability of fixation than AT alleles. This fixation bias appears not strong enough to overcome the large excess of GC
AT mutations. Thus, whatever the evolutionary force (neutral or selective) at the origin of GC-rich isochores, this force is no longer effective in mammals. We propose a model based on the biased gene conversion hypothesis that accounts for the origin of GC-rich isochores in the ancestral amniote genome and for their decline in present-day mammals.
THE mammalian genome is heterogeneous with respect to base composition. In human, the GC content of large genomic DNA fragments (>100 kb) ranges from 35 to 65% (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Bernardi and colleagues proposed a model of mammalian genomes consisting in a mosaic of compositionally homogeneous regionsthe so-called isochores (see ![]()
![]()
![]()
![]()
![]()
Comparison of base composition in different vertebrate species indicates that the ancestral genome of tetrapodes was probably relatively homogeneous and AT rich, and that the acquisition of GC-rich isochores occurred in the amniote lineage, after the split with amphibians, but before the divergence of mammals, birds, and reptiles, i.e.,
310350 million years ago (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The major selectionist hypothesis, namely adaptation to homeothermy, was dismissed after isochores were discovered in several cold-blooded species (![]()
![]()
An important progress toward the solution of this controversial issue has been brought by the analysis of polymorphism data. Indeed, the VMB model can be distinguished from the other two (selection or BGC) by studying the process of allele fixation. Under the VMB model, the probability of fixation is expected to be the same for alleles resulting from an AT
GC mutation (i.e., from A or T to G or C) as for alleles resulting from a GC
AT mutation. Hence, the pattern of GC
AT substitution is expected to be identical to the pattern of mutation. Conversely, the BGC and the selectionist models predict a fixation bias in favor of GC alleles (i.e., GC alleles should have a higher probability of fixation than AT alleles). Analyses of human and mouse polymorphism data provided evidence for such a fixation bias, leading to the conclusion that GC-rich isochores result from BGC or selection, but not from mutation bias (![]()
![]()
![]()
![]()
![]()
![]()
The GC content of a genomic fragment is at equilibrium when the numbers of GC
AT and AT
GC substitutions occurring in this fragment are equal. Interestingly, the analysis of the complete sequence of the human genome has shown that some sequences are not at equilibrium (![]()
AT substitutions are significantly more frequent than AT
GC substitutions. This excess of GC
AT substitutions in transposons is observed both in GC-rich and GC-poor isochores. This suggests that the GC content of the human genome might be decreasing (i.e., not stationary), questioning the conclusions mentioned above. It should be noted, however, that the pattern of substitution in transposable elements might be different from that of nonrepetitive DNA, because repeated sequences can be subject to specific mutation patterns (![]()
The aim of this article was therefore to directly determine whether the GC content of nonrepetitive sequences is or is not at equilibrium in mammalian genomes. For this purpose, we analyzed orthologous gene sequences in closely related species from three mammalian orders (primates, rodents, cetartiodactyls). We show that in these mammals, the GC content of genes located in GC-rich isochores has been decreasing. These results indicate that GC-rich isochores are slowly disappearing from mammalian genomes. The predictions of the three models for the origin of isochores are reevaluated in this nonequilibrium situation and compared to human sequence polymorphism data. The issue of the origin and evolution of isochores is discussed in the context of a nonstationary process.
| MATERIALS AND METHODS |
|---|
We selected orthologous genes from closely related species in three mammalian orders, namely primates, Rodentia, and Cetartiodactyla. In each order, a triplet of species of the form [[ingroup1, ingroup2], outgroup] was defined. In primates the ingroups were human and another Hominidae (chimpanzee, gorilla, or orangutan), the outgroup any Catarrhini (external but as close as possible to the two ingroups, generally Papio). In Rodentia, the triplet was generally [[rat, mouse], hamster], but any triplet of species matching the [[Rattus, Mus], non-Murinae Muridae] pattern was considered acceptable. In Cetartiodactyla all comparisons involved [[Caprinae, Bovinae], Suidae], excepting one gene for which the outgroup was a Cervidae. The coding sequences of every gene available in a triplet of species as defined above were collected using the Hovergen database (release 40, May 2000; ![]()
![]()
|
The GC content expected at equilibrium at synonymous codon position (GC3eq) was estimated using only the third position of fourfold degenerate codons. GC3eq was computed by the ratio u/(u + v), where u is the rate of AT
GC substitutions and v the rate of GC
AT substitutions. We measured u (and v) at the third position of fourfold degenerate codons by dividing the number of AT
GC substitutions (GC
AT substitutions) observed in the two ingroups by the number of AT (GC) in the ancestral sequence inferred by the maximum parsimony method (ambiguous bases were ignored).
| SYNONYMOUS SUBSTITUTION PATTERNS IN MAMMALIAN GENES |
|---|
Excess of GC
AT synonymous substitutions in GC-rich genes:
To determine whether the GC content of mammalian genes is at equilibrium, we analyzed orthologous coding sequences from different taxa: rodents (194 genes), cetartiodactyls (79 genes), and primates (55 genes; Table 1). The synonymous substitution process was approached using the parsimony criterion. A substitution from nucleotide X to nucleotide Y was inferred when both the outgroup and one ingroup shared state X, but the other ingroup showed state Y. We analyzed all informative substitutions: 7058 substitutions in rodents, 817 substitutions in cetartiodactyls, and 197 substitutions in primates. Genes were classified in three groups depending on their GC3 (<57%, 5775%, >75%; ![]()
AT over AT
GC changes is found for GC-rich genes in the three orders. It has been shown previously that the GC content of GC-rich genes had been decreasing in the rodent lineage (![]()
|
In primates and cetartiodactyls, the present GC content is very far from the value expected at equilibrium given the observed substitution pattern (Table 2). The bias is less strong in GC-median genes, and virtually no bias is found in GC-poor genes.
Is it possible that the excess of GC
AT substitution is due to recent translocations of GC-rich genes into a GC-poor genomic context? To test this hypothesis, we examined the synonymous substitution pattern in primates according to the GC content of the surrounding genomic region. For this purpose, we retrieved for each human gene the largest overlapping genomic sequence available in GenBank (163 kb in average). The 55 genes were split into three groups, according to the GC content of their genomic context. As shown in Table 3, the results of our analysis remain unchanged (compare with Table 2): there is a large excess of GC
AT substitutions, especially for the genes located in regions of high GC content.
|
Methodological artifact?
Is it possible that this result is due to a methodological artifact? The pattern [[ingroup1 = X, ingroup2 = Y], outgroup = X] is interpreted by the parsimony method as a single X
Y substitution in the ingroup2 lineage. Parsimony can fail in case of multiple substitutions (e.g., the above pattern can occur through two independent Y
X substitutions in the ingroup1 and outgroup lineages). The method remains unbiased as long as the base composition is balanced (i.e., X% = Y%): failures are equiprobable for X
Y and Y
X actual changes. When base composition is unbalanced, however, the maximum parsimony method method tends to overestimate the proportion of substitutions from common bases to rare bases (![]()
X substitution is higher than the rate of X
Y substitution), the method will overestimate the proportion of X
Y changes. The bias can be strong even for moderate levels of divergence between sequences (![]()
![]()
![]()
To assess the extent of the bias caused by the maximum-parsimony method in this analysis, we made use of a restricted data set of 20 primate genes for which an additional species was available. This fourth species was used to measure the rate of error of the maximum-parsimony method. We selected quartets of the form [[[ingroup1, ingroup2], ingroup3], outgroup] (generally [[[human, chimpanzee], gorilla or orangutan], a non-Hominidae Catarrhini]). We examined all the substitutions previously inferred from the three-species analysis, and we counted as ambiguous all the cases where the nucleotide in the ingroup3 was different from the outgroup (i.e., patterns different from [[[X, Y], X], X]). Among the 78 substitutions analyzed, 9 (12%) were ambiguously oriented (i.e., counted as X
Y whereas they might correspond to Y
X substitutions). The error rate was similar for GC
AT (13%, 7/55) and for AT
GC (9%, 2/23) substitutions. In GC-rich genes (i.e., GC3 > 75%), there are four times as many GC
AT as AT
GC substitutions (12 vs. 3, a ratio very close to what was measured in the full three-species data set, see Table 2), and this ratio remains unchanged when erroneously orientated substitutions (13%, 2/15) are removed. In other words, the bias in the parsimony method is weak for this data set (presumably because evolutionary distances are very short) and cannot account for the fourfold excess of GC
AT substitutions over AT
GC substitutions observed in GC-rich genes from primates.
This test could not be performed in rodents and cetartiodactyls because of lack of data. We therefore analytically assessed the amount of bias of the maximum-parsimony method using EYRE-WALKER's (1998) equations. These equations give the expected number of GC
AT and AT
GC substitutions maximum parsimony would infer, given base composition and distances between analyzed sequences. Under the hypothesis that base composition is at equilibrium, the ratios of GC
AT substitutions over AT
GC substitutions that we would have expected to measure in rodent, cetartiodactyl, and primate GC-rich genes are, respectively, 1.9, 1.4, and 1.2 (compared with 1.7, 3.2, and 3.7 observed in the data set, see Table 2). Hence, the hypothesis of equilibrium is clearly rejected in cetartiodactyls and primates (in agreement with the previous test), but not in rodents. This does not mean that base composition is at equilibrium in rodents, but simply that we cannot totally exclude that the observed excess of GC
AT substitutions is due to a methodological artifact of the maximum-parsimony method (see DISCUSSION).
Hypermutability of CpG dinucleotides:
It is well known that in mammals, CpG dinucleotides are hotspots of C
T mutations because of the deamination of methylated cytosines (![]()
![]()
AT substitutions, we removed from our analyses all GC
AT substitutions that occurred within a CpG dinucleotide in the reconstructed ancestral sequence. The substitutions at CpG dinucleotides correspond to 14% (rodents), 28% (primates), and 31% (cetartiodactyls) of GC
AT substitutions in GC-rich genes. But in all taxa, the number of GC
AT substitutions remained higher than the number of AT
GC substitutions after CpG's were removed (Table 2). Hence, the decrease in GC content of GC-rich genes is not due solely to the hypermutability of CpG dinucleotides.
| FIXATION BIAS IN FAVOR OF GC ALLELES |
|---|
An asymmetric pattern of GC/AT polymorphisms in human has been reported by ![]()
AT than AT
GC polymorphic sites was found. In these analyses, it was assumed that the GC content was at equilibrium (i.e., that the number of GC
AT substitutions was equal to the number of AT
GC substitutions). Hence, the difference between the mutation pattern (revealed by polymorphism data) and the substitution pattern (inferred from the hypothesis of stationarity) was interpreted as the result of a biased fixation process, either selection or BGC, favoring GC alleles (![]()
![]()
We reanalyzed the SNP data set (![]()
![]()
AT or AT
GC), we focused on the distribution of the frequencies at which these alleles occur in the sample. We chose this approach because it is not affected by the fact that the GC content is not at equilibrium: the fixation dynamics of a mutation, given that it has occurred, is independent of the stationary status of base composition. Hence, in the absence of any fixation bias, similar distributions of allele frequency are expected for GC
AT or AT
GC polymorphisms, whatever the mutation process. Conversely, if GC alleles had a higher probability of fixation than AT alleles (because of selection or BGC), the allele frequency distribution of AT
GC polymorphisms should be shifted to the right (advantageous mutations segregate at higher frequencies, on average).
Polymorphisms were oriented using the chimpanzee as an outgroup. Noncoding and synonymous polymorphisms were pooled. Nonsynonymous polymorphisms were excluded from this analysis as they undergo selection at the protein level. The total number of SNPs analyzed was 410. ![]()
AT and AT
GC polymorphisms in GC-rich (GC3 > 75%, 169 SNPs), GC-median (96 SNPs), and GC-poor (GC3 < 57%, 145 SNPs) genes are shown (Fig 1). A visual inspection suggests that AT
GC mutations segregate at higher frequencies than GC
AT mutations (i.e., that GC alleles have a higher probability of fixation than AT alleles), especially in GC-rich genes.
|
A likelihood-ratio test was performed to assess the significance of this apparent discrepancy between the two observed distributions. Two multinomial models were fitted to the allele frequency data. The joint multinomial model (null hypothesis) assumes that the allelic frequencies of AT
GC and GC
AT polymorphic sites are drawn from a unique multinomial distribution (three parameters). The full multinomial model allows one free parameter for every frequency class of AT
GC and GC
AT polymorphisms (six parameters): distinct distributions are assumed for the two categories of SNPs. The likelihood was calculated under the two models according to the multinomial formula. Optimal parameter values are obvious: the expected proportions of each class of allele frequency are taken as the observed ones, separating (full multinomial model) or pooling (joint multinomial model) the distributions of AT
GC and GC
AT polymorphisms. Twice the difference in log-likelihood between the two models follows a
2 distribution (3 d.f.) under the null hypothesis of a common distribution. Results are displayed in Table 4. No difference between the two distributions was detected in GC-poor genes, but AT
GC polymorphisms appeared to segregate at significantly higher allele frequencies, on average, than GC
AT polymorphisms in both GC-median and GC-rich genes.
|
A population genetics model was also fitted to the allele frequency data set, following ![]()
![]()
![]()
This peculiar feature of GC-rich genes is a bit surprising. It can hardly be explained by a departure from demographic equilibrium or panmixy since the population history is shared by every part of the genome. We speculate that this uncommon distribution might result from a possible variation of fixation bias, w, among GC-rich regions of the genomethe above analysis assumes that a unique w applies to all the SNPs pooled in any one category. The "GC-rich gene" category might include a fraction of SNPs located in regions of the genome undergoing an exceptionally high fixation bias (see the "GC-factory" hypothesis below), the remaining SNPs of this category being located in "regular" GC-rich regions. This would explain the occurrence of many high-frequency AT
GC polymorphisms (contributed by the putative w hotspots), leading to a departure from distributions expected under the "single w" model.
The above analyses treat SNPs as independent data. This is not the case actually: the 410 analyzed SNPs are located in the vicinity of 106 genes, so that some of them are presumably genetically linked, and their allele frequency is correlated. Please note that this should not result in a bias toward a higher frequency of some kind (e.g., GC) of variantslinkage occurs independently of the AT/GC status of mutants. Linkage should essentially reduce the signal; i.e., the sampling variance will be higher than that of a sample of 410 independent SNPs. Our detection of a significant fixation bias, therefore, appears conservative in this respect.
Overall, these results confirm, on a larger gene data set, a previous analysis of allele frequency distribution in human and mouse MHC genes (![]()
AT mutation rate. This scenario, however, seems unlikely because the putative change in mutation pattern must have occurred very recently (<
4N generations ago).
| DISCUSSION |
|---|
Erosion of GC-rich isochores:
An analysis of substitution pattern in defective DNA transposons revealed an excess of GC
AT substitutions over AT
GC substitutions, suggesting that the human genome was not at equilibrium (![]()
We did not analyze noncoding sequences because there are presently not enough species for which such data are available. However, numerous works have shown that in mammals there is a strong correlation between the GC content at synonymous codon positions and the GC content of the genomic region in which a gene is located (e.g., AÏSSANI et al. 1991; ![]()
![]()
|
It was shown previously that the GC content of rodent genomes was less heterogeneous than that of primates (i.e., GC-rich genes are less GC rich in rodents than in primates, and conversely for GC-poor genes; ![]()
![]()
![]()
![]()
Our results show that the decrease of GC-rich isochores, previously reported in rodents, is actually occurring in primate and cetartiodactyl genomes. As far as rodents are concerned, we observed both a decrease in GC content in GC-rich isochores and an increase in GC-poor isochores (Table 2), suggesting that the murid shift might be an ongoing process. However, evolutionary distances between the rodent species we analyzed are too high for the maximum-parsimony method to be trusted. Sequence data involving more closely related species would be required for confirming the observed trend in rodents. Overall, the reports of a long-term (rodents, ![]()
|
The second major conclusion of our study, based on the analysis of allele frequency distribution, is that a directional evolutionary force is biasing the fixation process of GC/AT polymorphism toward GC in GC-rich, but not in GC-poor regions of the genome. However, this force is not strong enough to maintain GC-rich isochores, since GC
AT substitutions are more frequent than AT
GC substitutions. In other words, in GC-rich genes, GC alleles have a higher probability of fixation than AT alleles, but this bias is not sufficient to overcome the large excess of GC
AT mutations.
Fixation bias favoring GC alleles: biased gene conversion:
The fixation bias favoring GC alleles is likely to be a result of BGC, not selection. Indeed, different observations support the BGC model: experiments in mammalian cells have shown that the repair of DNA mismatches is biased in favor of GC; genes that frequently undergo conversions are GC rich, and there is an overall correlation between GC content and recombination rate (reviewed in ![]()
![]()
![]()
Origin and evolution of GC-rich isochores:
The origin and evolution of isochores becomes even more mysterious in this nonequilibrium context. As mentioned in the Introduction, the acquisition of GC-rich isochores occurred in the amniote lineage,
310350 million years ago. However, the substitution process measured from the comparison of closely related species indicates that GC-rich isochores are disappearing from mammalian genomes. The fact that similar patterns are found in three different mammalian orders suggests that this erosion may have started >80 million years ago (see Fig 4).
|
How to explain the rapid acquisition of GC-rich isochores in the amniote ancestral genome and their slow decay in mammals? A part of the answer probably lies in the asymmetry between the processes of GC increase and GC decrease. The data reported in this study are consistent with the existence of an AT-biased mutation process (u < v, where u is the AT
GC mutation rate and v the GC
AT mutation rate) vs. a GC-biased fixation process (w). In regions of the genome/periods of time of negligible fixation bias (4Nw << 1), the GC content therefore approaches the mutational equilibrium through mutation/drift, at neutral rate
v - (1 -
)u, where
is the current GC content. When the fixation bias is strong (4Nw >> 1), however, advantaged GC mutations fix with a probability 2w (vs. 1/2N in absence of fixation bias), so that the GC content will increase at a rate close to 4(1 -
)uNw, possibly much higher than the neutral rate. Therefore, short and/or localized episodes of GC increase can significantly contribute to the total amount of G and C, even if they concern a small fraction of the genome/time scale. In the light of these notions, we now propose several scenarios about the origin of GC-rich isochores, accounting for the newly reported nonequilibrium situation.
Model 1: the GC-factory hypothesis:
This scenario invokes a spatial, but not a temporal heterogeneity of BGC coefficient. Imagine that a small fraction of the genome is undergoing a strong fixation bias toward GC (GC factory), while the major part of the genome is evolving neutrally (i.e., without selection or BGC). Now assume that these two fractions communicate in some way (translocations or duplications). Sequences entering a GC factory would undergo a rapid increase of GC content. Such GC-rich sequences would then be released in the major component and form GC-rich isochores. Now when randomly sampling in the genome one would essentially sample genes from the major, neutrally evolving component and observe a global decrease of GC content. An example of such a process is provided by the Fxy gene in mouse. This gene used to undergo a rapid increase of GC content after it was translocated in the pseudoautosomal region (![]()
GC polymorphisms assigned to the GC-rich gene category (see above).
In this hypothesis, the genome would be at a mutation-selection-migration equilibrium, where migration refers to movements of genes between regions of the genome. Although appealing, this scenario appears incompatible with the observed conservation of isochores among mammalian orders. The GC3's of orthologous genes sampled in primates, Cetartiodactyla, Carnivora, and Lagomorpha are highly correlated (![]()
Model 2: a unique origin of GC-rich isochores:
The hypothetical scenario that we propose for the origin of GC-rich isochores is the following. In the ancestral amniote, a change occurred in a DNA repair system, resulting in the preferential repair of AT:GC mismatches into GC and hence to a biased gene conversion favoring the fixation of GC alleles. As suggested by ![]()
![]()
Why did BGC cease to be effective? As mentioned previously, BGC significantly affects the fixation of GC alleles only if the product 4Nw is greater than one (where N is the effective population size and w the BGC coefficient). We propose two possible explanations for the decline of GC-rich isochores.
First, the evolution of GC content might be linked to variations in effective population size. For a given distribution of w among regions of the genome, the fraction of the genome undergoing significant fixation bias (and eventually becoming GC rich) increases with N, as illustrated by Fig 5. GC-rich isochores might therefore have appeared in an ancestral species larger than today's population size. After a subsequent decrease of population size, most of the genome would be under too low a fixation bias (4Nw) for GC-rich isochores to be maintained.
|
The second possible explanation is that the evolution of GC-content might be linked to variations in recombination (and, therefore, BGC) rate. Such variations could have occurred as a result of chromosome rearrangements. The GC content of autosomal chromosomes is negatively correlated to chromosome size in human (R2 = 0.32, P = 0.006; data from ![]()
![]()
![]()
![]()
![]()
![]()
These mechanisms are possibly simplistic and not mutually exclusive. We leave them as an open hypothesis for future work on this issue. To conclude, it should be stressed that whatever the evolutionary force (neutral or selective) was at the origin of GC-rich isochores, this force is no longer effective to maintain them in mammalian genomes.
| ACKNOWLEDGMENTS |
|---|
We thank Nick Smith and Adam Eyre-Walker for their helpful comments. This work was supported by the Centre National de la Recherche Scientifique.
Manuscript received March 12, 2002; Accepted for publication September 9, 2002.
| LITERATURE CITED |
|---|
AISSANI, B., G. D'ONOFRIO, D. MOUCHIROUD, K. GARDINER, and C. GAUTIER et al., 1991 The compositional properties of human genes. J. Mol. Evol. 32:493-503.[Medline]
AKASHI, K. and S. W. SCHAEFFER, 1997 Natural selection and the frequency distribution of "silent" DNA polymorphisms in Drosophila. Genetics 146:295-307.[Abstract]
ANDREOZZI, L., C. FEDERICO, S. MOTTA, S. SACCONE, and A. L. SAZANOVA et al., 2001 Compositional mapping of chicken chromosomes and identification of the gene-richest regions. Chromosome Res. 9:521-532.[Medline]
BERNARDI, G., 2000 Isochores and the evolutionary genomics of vertebrates. Gene 241:3-17.[Medline]
BERNARDI, G. and G. BERNARDI, 1990 Compositional transitions in the nuclear genomes of cold-blooded vertebrates. J. Mol. Evol. 31:282-293.[Medline]
BERNARDI, G., B. OLOFSSON, J. FILIPSKI, M. ZERIAL, and J. SALINAS et al., 1985 The mosaic genome of warm-blooded vertebrates. Science 228:953-958.
BERNARDI, G., S. HUGHES, and D. MOUCHIROUD, 1997 The major compositional transitions in the vertebrate genome. J. Mol. Evol. 44:S44-S51.
BIELAWSKI, J. P., K. A. DUNN, and Z. YANG, 2000 Rates of nucleotide substitution and mammalian nuclear gene evolution: approximate and maximum-likelihood methods lead to different conclusions. Genetics 156:1299-1308.
BIRD, A. P., 1980 DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 8:1499-1504.
CARGILL, M., D. ALTSHULER, J. IRELAND, P. SKLAR, and K. ARDLIE et al., 1999 Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat. Genet. 22:231-238.[Medline]
CLAY, O., S. CACCIO, S. ZOUBAK, D. MOUCHIROUD, and G. BERNARDI, 1996 Human coding and noncoding DNA: compositional correlations. Mol. Phylogenet. Evol. 5:2-12.[Medline]
COULONDRE, C., J. H. MILLER, P. J. FARABAUGH, and W. GILBERT, 1978 Molecular basis of base substitution hotspots in Escherichia coli. Nature 274:775-780.[Medline]
D'ONOFRIO, G., D. MOUCHIROUD, B. AISSANI, C. GAUTIER, and G. BERNARDI, 1991 Correlation between the compositional properties of human genes, codon usage, and amino-acid composition of proteins. J. Mol. Evol. 32:504-510.[Medline]
DURET, L., D. MOUCHIROUD, and M. GOUY, 1994 HOVERGEN: a database of homologous vertebrate genes. Nucleic Acids Res. 22:2360-2365.
DURET, L., D. MOUCHIROUD, and C. GAUTIER, 1995 Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores. J. Mol. Evol. 40:308-317.[Medline]
EYRE-WALKER, A., 1993 Recombination and mammalian genome evolution. Proc. R. Soc. Lond. Ser. B Biol. Sci. 252:237-243.[Medline]
EYRE-WALKER, A., 1998 Problems with parsimony in sequences of biased base composition. J. Mol. Evol. 47:686-690.[Medline]
EYRE-WALKER, A., 1999 Evidence of selection on silent site base composition in mammals: potential implications for the evolution of isochores and junk DNA. Genetics 152:675-683.
EYRE-WALKER, A. and L. D. HURST, 2001 The evolution of isochores. Nat. Rev. Genet. 2:549-555.[Medline]
FRANCINO, M. P. and H. OCHMAN, 1999 Isochores result from mutation not selection. Nature 400:30-31.[Medline]
FRYXELL, K. J. and E. ZUCKERKANDL, 2000 Cytosine deamination plays a primary role in the evolution of mammalian isochores. Mol. Biol. Evol. 17:1371-1383.
FULLERTON, S. M., A. BERNARDO CARVALHO, and A. G. CLARK, 2001 Local rates of recombination are positively correlated with GC content in the human genome. Mol. Biol. Evol. 18:1139-1142.
GALTIER, N. and P. BOURSOT, 2000 A new method for locating changes in a tree reveals distinct nucleotide polymorphism vs. divergence patterns in mouse mitochondrial control region. J. Mol. Evol. 50:224-231.[Medline]
GALTIER, N. and M. GOUY, 1998 Inferring pattern and process: maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis. Mol. Biol. Evol. 15:871-879.[Abstract]
GALTIER, N. and D. MOUCHIROUD, 1998 Isochore evolution in mammals: a human-like ancestral structure. Genetics 150:1577-1584.
GALTIER, N., G. PIGANEAU, D. MOUCHIROUD, and L. DURET, 2001 GC-content evolution in mammalian genomes: the biased gene conversion hypothesis. Genetics 159:907-911.
HARTL, L. D., E. N. MORIYAMA, and S. A. SAWYER, 1994 Selection intensity for codon bias. Genetics 138:227-234.[Abstract]
HOLMQUIST, G. P., 1992 Chromosome bands, their chromatin flavors, and their functional features. Am. J. Hum. Genet. 51:17-37.[Medline]
HUGHES, S., D. ZELUS, and D. MOUCHIROUD, 1999 Warm-blooded isochore structure in Nile crocodile and turtle. Mol. Biol. Evol. 16:1521-1527.[Abstract]
JABBARI, K. and G. BERNARDI, 1998 CpG doublets, CpG islands and Alu repeats in long human DNA sequences from different isochore families. Gene 224:123-128.[Medline]
KABACK, D. B., 1996 Chromosome-size dependent control of meiotic recombination in humans. Nat. Genet. 13:20-21.[Medline]
KRICKER, M. C., J. W. DRAKE, and M. RADMAN, 1992 Duplication-targeted DNA methylation and mutagenesis in the evolution of eukaryotic chromosomes. Proc. Natl. Acad. Sci. USA 89:1075-1079.
LANDER, E. S., L. M. LINTON, B. BIRREN, C. NUSBAUM, and M. C. ZODY et al., 2001 Initial sequencing and analysis of the human genome. Nature 409:860-921.[Medline]
LI, W. H., 1993 Unbiased estimation of the rates of synonymous and non-synonymous substitutions. J. Mol. Evol. 36:96-99.[Medline]
MCQUEEN, H. A., G. SIRIACO, and A. P. BIRD, 1998 Chicken micro-chromosomes are hyperacetylated, early replicating, and gene rich. Genome Res. 8:621-630.
MOUCHIROUD, D. and G. BERNARDI, 1993 Compositional properties of coding sequences and mammalian phylogeny. J. Mol. Evol. 37:109-116.[Medline]
MOUCHIROUD, D. and C. GAUTIER, 1988 High codon-usage changes in mammalian genes. Mol. Biol. Evol. 5:192-194.[Medline]
MOUCHIROUD, D., G. D'ONOFRIO, B. AISSANI, G. MACAYA, and C. GAUTIER et al., 1991 The distribution of genes in the human genome. Gene 100:181-187.[Medline]
NAGYLAKI, T., 1983 Evolution of a finite population under gene conversion. Proc. Natl. Acad. Sci. USA 80:6278-6281.
NEKRUTENKO, A. and W. H. LI, 2000 Assessment of compositional heterogeneity within and between eukaryotic genomes. Genome. Res. 10:1986-1995.
PERNA, N. T. and T. D. KOCHER, 1995 Unequal base frequencies and the estimation of substitution rate. Mol. Biol. Evol. 12:359-361.
PERRY, J. and A. ASHWORTH, 1999 Evolutionary rate of a gene affected by chromosomal position. Curr. Biol. 9:987-989.[Medline]
PIGANEAU, G., D. MOUCHIROUD, L. DURET, and C. GAUTIER, 2002 Expected relationship between the silent substitution rate and the GC content: implications for the evolution of isochores. J. Mol. Evol. 54:129-133.[Medline]
ROBINSON, M., C. GAUTIER, and D. MOUCHIROUD, 1997 Evolution of isochores in rodents. Mol. Biol. Evol. 14:823-828.[Abstract]
SAWYER, S. A. and D. L. HARTL, 1992 Population genetics of polymorphism and divergence. Genetics 132:1161-1176.[Abstract]
SHETTY, S., D. K. GRIFFIN, and J. A. GRAVES, 1999 Comparative painting reveals strong chromosome homology over 80 million years of bird evolution. Chromosome Res. 7:289-295.[Medline]
SMIT, A. F., 1999 Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr. Opin. Genet. Dev. 9:657-663.[Medline]
SMITH, N. G. and A. EYRE-WALKER, 2001 Synonymous codon bias is not caused by mutation bias in G+C-rich genes in humans. Mol. Biol. Evol. 18:982-996.
VARLET, I., M. RADMAN, and P. BROOKS, 1990 DNA mismatch repair in Xenopus egg extracts: repair efficiency and DNA repair synthesis for all single base-pair mismatches. Proc. Natl. Acad. Sci. USA 87:7883-7887.
VENTER, J. C., M. D. ADAMS, E. W. MYERS, P. W. LI, and R. J. MURAL et al., 2001 The sequence of the human genome. Science 291:1304-1351.
WOLFE, K. H., P. M. SHARP, and W. H. LI, 1989 Mutation rates differ among regions of the mammalian genome. Nature 337:283-285.[Medline]
This article has been cited by other articles:
![]() |
J. L. Chojnowski and E. L. Braun Turtle isochore structure is intermediate between amphibians and other amniotes Integr. Comp. Biol., October 1, 2008; 48(4): 454 - 462. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bullaughey, M. Przeworski, and G. Coop No effect of recombination on the efficacy of natural selection in primates Genome Res., April 1, 2008; 18(4): 544 - 554. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Karro, M. Peifer, R. C. Hardison, M. Kollmann, and H. H. von G |







