Genetics, Vol. 162, 1825-1835, December 2002, Copyright © 2002

Accelerated Protein Evolution and Origins of Human-Specific Features: FOXP2 as an Example

Jianzhi Zhanga, David M. Webba, and Ondrej Podlahaa
a Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109

Corresponding author: Jianzhi Zhang, University of Michigan, 3003 Nat. Sci. Bldg., 830 N. University Ave., Ann Arbor, MI 48109., jianzhi{at}umich.edu (E-mail)

Communicating editor: S. YOKOYAMA


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS AND DISCUSSION
*APPENDIX
*LITERATURE CITED

Genes responsible for human-specific phenotypes may have been under altered selective pressures in human evolution and thus exhibit changes in substitution rate and pattern at the protein sequence level. Using comparative analysis of human, chimpanzee, and mouse protein sequences, we identified two genes (PRM2 and FOXP2) with significantly enhanced evolutionary rates in the hominid lineage. PRM2 is a histone-like protein essential to spermatogenesis and was previously reported to be a likely target of sexual selection in humans and chimpanzees. FOXP2 is a transcription factor involved in speech and language development. Human FOXP2 experienced a >60-fold increase in substitution rate and incorporated two fixed amino acid changes in a broadly defined transcription suppression domain. A survey of a diverse group of placental mammals reveals the uniqueness of the human FOXP2 sequence and a population genetic analysis indicates possible adaptive selection behind the accelerated evolution. Taken together, our results suggest an important role that FOXP2 may have played in the origin of human speech and demonstrate a strategy for identifying candidate genes underlying the emergences of human-specific features.


IN spite of the relative young age of our species, we have many distinct morphological, physiological, and behavioral features that are not found in apes, most notably, bipedalism, a large brain, susceptibility to AIDS, speech, and higher-order cognitive function (BOYD and SILK 2000 Down; MCCONKEY et al. 2000 Down; VARKI 2000 Down; GAGNEUX and VARKI 2001 Down). Understanding how and why these and other features unique to humans evolved is a key to disclosing the mystery of human origins and is of substantial medical importance (GIBBONS 1998 Down; MCCONKEY et al. 2000 Down; VARKI 2000 Down). Fortunately, most of the genetic bases of these features lie somewhere in the ~3 billion nucleotides of our genome, a huge, albeit limited, pool in which to look for answers. GAGNEUX and VARKI 2000 Down recently reviewed genetic differences between humans and great apes. Although many genetic changes that have occurred in the human lineage have been found, including chromosomal fusion, gene duplication, gene deletion/inactivation, nucleotide substitution, and change in gene expression, very few, if any, of these changes have been linked to specific phenotypes important to the origin and well being of our species (GIBBONS 1998 Down; GAGNEUX and VARKI 2001 Down). With the availability of the human draft genome sequence, accumulation of ape DNA sequences, and rapid advances in molecular technology, calls have been made for systematic searches for genes that make us human (GIBBONS 1998 Down; MCCONKEY et al. 2000 Down).

We tackle this problem by comparing the rate of protein sequence evolution in the human lineage (since the human-chimpanzee split) with that in nonhuman mammals. This comparison is useful because phenotype-affecting genetic modifications can be subject to positive Darwinian selection, under which the rate of amino acid substitution can be greatly enhanced (NEI and KUMAR 2000 Down). A change in substitution rate may also result when the function of a protein shifts so that the selective pressure is either enhanced or relaxed (NEI and KUMAR 2000 Down). In the following, we report identification of two genes with significant rate enhancements in the hominid lineage and discuss their relevance to the origins of human-specific features.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS AND DISCUSSION
*APPENDIX
*LITERATURE CITED

Database search:
In our design of the rate comparison, orthologous protein sequences from humans (Homo sapiens), chimpanzees (Pan troglodytes), and, as an outgroup, mice (Mus musculus) are used (Fig 1A). Use of mice rather than primates for the outgroup makes the estimate of the substitution rate less subjective to sampling errors because a long-term average is obtained. Also many more genes have been sequenced and functionally characterized for the mouse than for any other nonhuman mammal. It has been suggested that the average amino acid substitution rate is higher in rodents than in primates (GU and LI 1992 Down; but see EASTEAL et al. 1995 Down). This will likely make our detection of accelerated human protein evolution more conservative. Here we focus on orthologous genes because a change in substitution rate after gene duplication (LYNCH and CONERY 2000 Down) would complicate our analysis. Ideally, no gene duplication should be allowed in any branches of the tree of human, chimpanzee, and mouse (Fig 1A). However, duplications occurring in branches 5 and 2 have virtually no effects on our results, as we are largely concerned with branches 1, 3, and 4. Duplications in branch 4, or the rodent-specific duplications, have only small effects because a basal substitution rate in mammals can still be estimated relatively accurately. All annotated gene sequences in GenBank were screened to find cases satisfying the above criteria. Specifically, all annotated chimpanzee gene sequences were retrieved from the GenBank. The translated protein sequences were BLASTed against the GenBank database to find the closest human and mouse sequences. Various sources of information and analyses, including previous evolutionary analyses of the genes (CHEN and LI 2001 Down), functional data, UniGene search, human/mouse homology maps, and phylogenetic analysis, were used to determine that the sequences are orthologous and that no gene duplications have occurred in branches 1 and 3 of the tree in Fig 1A. Nevertheless, it is possible that some cases may still have undetected duplications in branch 1 or 3 or may include paralogous genes, due to incomplete genome sequences of human and mouse and limited genetic information on the chimpanzee. This did not have serious effects on our results because we were interested mainly in the very few cases showing significant rate changes; additional experiments and analyses could be conducted after initial identification of candidate genes.



View larger version (30K):
In this window
In a new window
Download PPT slide
 
Figure 1. In search of accelerated protein evolution in the human lineage. (A) A gene tree of orthologous human, chimpanzee, and mouse proteins. Branches are named by the numbers. Only cases with no gene duplications in branches 1 and 3 are considered in this work. (B) Frequency distribution of {lambda}, the acceleration index for human proteins. Two of the 120 analyzed proteins have no substitutions in any branches and are not included in the distribution.

Obtaining new chimpanzee sequences:
In addition to the sequences retrieved from GenBank, we sequenced the coding regions of five chimpanzee genes for which the orthologous human and mouse sequences were available in GenBank. The five genes are BRCA2, CATSPER, FOXP2, RNASE4, and RNH. PCR primers were designed following the known human sequences and the chimpanzee genes were amplified by PCR and sequenced in both directions using automated DNA sequencer.

Rate analysis:
The obtained protein sequences were aligned using Clustal X (THOMPSON et al. 1997 Down) and gaps were removed before rate analysis. Aligned proteins with lengths (before removal of gaps) of <100 amino acids were discarded. For each protein, the numbers of amino acid substitutions in branches 1, 2, and 3 + 4 are denoted by h, c, and m, respectively (Fig 1A). These numbers were derived from branch length estimates of the tree of orthologous human, chimpanzee, and mouse proteins. The branch lengths were estimated using the neighbor-joining method (SAITOU and NEI 1987 Down). Several distance measures were used, including the protein p-distance, Poisson distance, and gamma distance with the shape parameter of 2.0 (equivalent to Dayhoff distance; NEI and KUMAR 2000 Down). The results were found to be similar and p-distance results are presented as this distance is associated with a relatively low variance. Primates and rodents diverged ~90 million years ago (MYA; KUMAR and HEDGES 1998 Down; ARCHIBALD et al. 2001 Down; NEI et al. 2001 Down) and humans separated from chimpanzees ~5.5 MYA (CHEN and LI 2001 Down; STAUFFER et al. 2002 Down). An acceleration index for the human lineage (branch 1) in comparison to the mammalian lineage before the human-chimpanzee split (branch 3 + 4) is defined by {lambda} = (h/5.5)/[m/(2 x 90 - 5.5)] = 31.7h/m. In other words, if a protein evolves with a constant rate (i.e., {lambda} = 1), the number of amino acid substitutions in branch 3 + 4 (m) is expected to be 31.7 times greater than that in branch 1 (h). Given h and m, the tail probability in a binomial distribution of B(h + m, 0.03056) is computed for testing the statistical significance of rate enhancement in the human lineage. Here, 0.03056 is from 5.5/180, the time span for branch 1, relative to that for branches 1 + 3 + 4. Similarly, an acceleration index for the chimpanzee lineage is defined by {kappa} = (c/5.5)/[m/(2 x 90 - 5.5)] = 31.7c/m.

Computer simulation:
To determine the frequency of type-I error (false-positive results) in the binomial test described above, we conducted a computer simulation. In the simulation, a constant substitution rate is used for branches 1, 3, and 4. Let this rate be r substitutions per amino acid site per million years (MY). Substitution rate variation among sites does not affect the simulation result, as r can also be regarded as the average substitution rate over the entire sequence. Given the length of a protein (n amino acids), the number of substitutions in branch 1 is a Poisson random variable with mean = 5.5nr and that for the branches 3 + 4 is a Poisson variable with mean = 174.5nr. These two random numbers were generated by computer and the binomial test was performed to see if the null hypothesis of rate constancy could be rejected. Such simulations were repeated 5000 times for each given parameter of nr. CHEN and LI (2001) estimated that the average substitution rate between humans and chimpanzees is r = 0.013/(11 MY) = 0.00118 substitutions per amino acid site per million years. The average length for the 120 proteins examined in this study is ~n = 350 amino acids. Thus, average nr is ~350 x 0.00118 = 0.413 substitutions per sequence per million years for orthologous proteins of humans and chimpanzees. In fact, the average nr for the 120 genes, which was 0.323, may also be computed from the Appendix. Considering that nr varies from 0 to 1.41 for the 120 genes, our simulation was conducted under a wide range of nr, from 0.04 to 4.

FOXP2 DNA sequencing and analysis:
All 17 exons of the FOXP2 gene from the chimpanzee, pygmy chimpanzee, gorilla, and orangutan were PCR amplified and sequenced in both directions. The orthologous human (accession no. AF337817) and mouse (accession nos. AY079003 and NT_023632) sequences were obtained from GenBank. The orthology of the FOXP2 sequences was confirmed by phylogenetic analysis and observation of expected levels of synonymous nucleotide distances. Parsimony (FITCH 1971 Down) and distance-based Bayesian (ZHANG and NEI 1997 Down) methods were used to infer numbers of synonymous and nonsynonymous nucleotide substitutions (NEI and KUMAR 2000 Down) in the FOXP2 gene tree of the above six species.

To determine the variability of the amino acid positions in which humans experienced substitutions, part of exon 7 of FOXP2 was PCR amplified and sequenced in both directions from an additional 24 mammals and the chicken (see Fig 3). The same region was also sequenced in 32 human individuals to determine the polymorphism at the aforementioned amino acid positions.



View larger version (41K):
In this window
In a new window
Download PPT slide
 
Figure 2. Amino acid sequence alignment of FOXP2 genes from human, great apes, and mouse. Dots represent identical residues to the human sequence, while dashes represent gaps. The two amino acid substitutions in humans (in boldface type) are both in exon 7. The FOX domain is boxed and the broadly defined transcription repression domain is underlined with the zinc-finger domain italicized. The alignment of the poly(Q) region is tentative.



View larger version (55K):
In this window
In a new window
Download PPT slide
 
Figure 3. Uniqueness of the human FOXP2 sequence. Shown here are the amino acids from the chicken and 29 mammals at the two positions where humans experienced substitutions.

For population genetic analysis, 8679 nucleotides in intron 6 and 1305 nucleotides in intron 7 of the FOXP2 gene were sequenced in both directions in 10 human individuals. All singletons were confirmed from a second PCR reaction and sequencing. Nucleotide diversity ({pi}) and Watterson's {theta} were computed as described in TAJIMA 1989 Down. TAJIMA's (1989) and FU and LI's (1993) tests were conducted using 50,000 coalescent simulations. To test the neutral evolution hypothesis for the polymorphic data of FOXP2, we compiled available data on worldwide polymorphisms in other noncoding regions of the human genome that are at least 3000 nucleotides long and are not known to be under selection. Six data sets were found and the Hudson-Kreitman-Aguadé (HKA) test (HUDSON et al. 1987 Down) was used to compare FOXP2 with these neutral sequences. DnaSP (ROZAS and ROZAS 1999 Down) was used for all population genetic analyses.


*  RESULTS AND DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS AND DISCUSSION
*APPENDIX
*LITERATURE CITED

Identification of proteins with accelerated evolution in the hominid lineage:
Following the criteria set in the above section, we identified 115 genes from GenBank and obtained 5 additional genes from our laboratory that were suitable for the rate analysis. Fig 1B shows the distribution of the acceleration index {lambda} for the 120 genes. Results from each of the 120 genes are given in the Appendix. The mean {lambda} is 1.13 ± 0.54 and the median is 0.39. The distribution is skewed because no amino acid substitutions are found in the human lineage in about one-third (39/120 = 0.325) of the genes examined. A majority of the genes have {lambda} < 3.2. Only two genes have {lambda} significantly >1 (P < 0.003 and P < 0.001, respectively; binomial test). Since 120 tests were conducted, it was necessary to evaluate whether there were false-positive cases. For this, we conducted a computer simulation. As described in the above section, our simulations were designed to examine the type-I error of the binomial test. The results suggest that the expected number of false-positive cases is <<1 for our sample of 120 genes (Table 1). Thus, our positive detection is unlikely due to statistical artifact.


 
View this table:
In this window
In a new window

 
Table 1. Type-I error of the binomial test of rate constancy

The two positive cases, PRM2 and FOXP2, are listed in Table 2. PRM2 (protamine 2) is a DNA-binding protein that replaces histones in spermatogenesis. It has been shown to evolve rapidly in humans and chimpanzees and was suggested to be a likely target of sexual selection (WYCKOFF et al. 2000 Down). Thus, it is not unexpected that PRM2 is identified in our analysis. However, the fact that both human and chimpanzee lineages experienced accelerated evolution ({lambda} and {kappa} are both significantly >1) suggests that the type of selection on PRM2 is probably not unique to humans. In contrast, FOXP2 has the highest {lambda} (63.4) of all genes examined, while {kappa} is 0 (Table 2), suggesting hominid-specific acceleration. We thus focus our analysis on FOXP2 in the remainder of the article.


 
View this table:
In this window
In a new window

 
Table 2. Proteins with significantly enhanced rates of evolution in the human lineage

Enhanced substitution rate of human FOXP2:
FOXP2 belongs to the winged helix/forkhead class of transcription factors (LAI et al. 2001 Down; SHU et al. 2001 Down). It is expressed in multiple fetal and adult tissues with a high expression in certain regions of the fetal brain (LAI et al. 2001 Down; SHU et al. 2001 Down). Mutations in the gene cause a severe speech and language disorder in affected individuals despite their adequate intelligence and opportunity for language acquisition, suggesting that FOXP2 is specifically involved in speech development (LAI et al. 2001 Down). FOXP2 is a conserved protein, with only three amino acid differences (and a 1-amino-acid insertion/deletion) between human and mouse in its entire length of 715 amino acids (Fig 2). We sequenced the coding regions of the FOXP2 gene from the chimpanzee, pygmy chimpanzee, gorilla, and orangutan and determined that two of the three aforementioned substitutions occurred in the hominid lineage and no substitutions occurred in chimpanzees (Fig 2). As indicated in Table 2, the acceleration in the evolution of human FOXP2 is statistically significant. This significance is also obtained (P = 0.001–0.006) when we consider ranges of divergence times for the human-chimpanzee split at 4.0–7.0 MYA (CHEN and LI 2001 Down; BRUNET et al. 2002 Down; STAUFFER et al. 2002 Down) and the primate-rodent split at 80–110 MYA (KUMAR and HEDGES 1998 Down; ARCHIBALD et al. 2001 Down; NEI et al. 2001 Down).

The two amino acid substitutions in the human lineage are a Thr-to-Asn change at position 303 and an Asn-to-Ser change at position 325, both in exon 7. These substitutions are located in a broadly defined transcription repression domain (SHU et al. 2001 Down; Fig 2), so it is possible that they affect the binding of FOXP2 with regulatory sequences of its target genes. If these substitutions are important to speech development, they should be fixed in normal humans and not be found in nonhuman organisms. Indeed, these substitutions are shared by all 32 normal humans surveyed (9 African Americans, 10 Caucasians, 9 Asians, and 4 Amerindians), but by none of the 29 nonhuman species examined. These species include a bird and 28 placental mammals from 12 representative orders (Fig 3). Interestingly, the Asn-to-Ser substitution also occurred independently in carnivores, suggesting that this substitution alone is not sufficient for the origin of speech and language.

Driving forces behind the accelerated evolution of human FOXP2:
It would be interesting to identify the driving force behind the two amino acid substitutions and the accelerated evolution of human FOXP2. There are three possibilities: enhanced mutation rate, relaxed purifying selection, and positive selection. Because synonymous nucleotide changes are usually immune to selection, the rate of synonymous substitutions can be used to measure the mutation rate (NEI and KUMAR 2000 Down). Using parsimony, we determined the number of synonymous substitutions in each branch of the FOXP2 gene tree of five hominoids and mouse (Fig 4). It can be seen that the number of synonymous substitutions in the human lineage (two) is smaller than that in the two chimpanzee lineages (three and four, respectively). The number of synonymous substitutions per MY is also smaller in the human lineage (2/5.5 MY = 0.36) than in the lineage before the human-chimpanzee separation ([2.5 + 4.5 + 127.5]/[90 MY x 2 - 5.5 MY] = 0.77 for the branches linking node A and mouse; see Fig 4). Thus, there is no indication of enhanced mutation rate at FOXP2 in the human lineage. This conclusion is strengthened as the true number of synonymous substitutions is likely to be higher than the parsimony estimate for the long branch leading to the mouse, but not for the short branches within hominoids. Use of Bayesian estimates of ancestral sequences confirmed this result. Furthermore, the ratio of nonsynonymous substitutions to synonymous substitutions in the human lineage (2/2 = 1; see Fig 4) is significantly greater than the ratio in the branches linking node A and mouse (1/[2.5 + 4.5 + 127.5] = 0.007; P < 0.002, Fisher's exact test; ZHANG et al. 1997 Down), suggesting that the rate difference is due to a difference in selection. It is unlikely, however, that the functional constraint and purifying selection on FOXP2 has been relaxed in humans, as mutations show severe deleterious effects (LAI et al. 2001 Down). Consistent with the existence of strong purifying selection, no amino acid polymorphisms in FOXP2 were found in a survey of 48 humans (NEWBURY et al. 2002 Down). Thus, positive selection remains as the most likely cause of the accelerated evolution of human FOXP2.



View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 4. Nucleotide substitutions in the evolution of FOXP2. The number of nonsynonymous substitutions followed by the number of synonymous substitutions is given for each branch. The numbers of amino acid changes equal those of nonsynonymous substitutions. The amino acid change in the mouse lineage was from Asp to Glu at position 80 and the one in the orangutan lineage was from Ala to Val at position 6.

We noted, however, that the rate ratio of nonsynonymous to synonymous substitutions per site is not >1 in the human FOXP2 lineage. This is likely due to the fact that FOXP2 is an overall conserved protein and many sites are under purifying selection. Under such circumstances, population genetic data may provide useful information on the evolutionary force. We therefore sequenced 9984 nucleotides in introns 6 and 7 of the FOXP2 gene from 10 humans (3 African-Americans, 3 Caucasians, 3 Asians, and 1 Amerindian) and one chimpanzee (Table 3). Introns 6 and 7 are adjacent to exon 7, where the two amino acid substitutions occurred in humans (Fig 2). By tight linkage to exon 7, these intron sequences may preserve information on the fixation process of the amino acid changes. For comparison, we also compiled available data on worldwide polymorphisms in other noncoding regions of the human genome that are at least 3000 nucleotides long and are not known to be under selection. We found that the level of polymorphism is lower in FOXP2 introns than in any other neutral noncoding regions examined (Table 4). An HKA neutrality test comparing the intra- and interspecific sequence variations between loci (HUDSON et al. 1987 Down) yielded a very significant result when FOXP2 introns were compared with all other regions combined (P < 0.00001; Table 4). When these regions were compared individually with FOXP2, all indicated a lower-than-expected polymorphism in FOXP2 and four out of six cases showed statistical significance (Table 4). Mutation-rate variation among loci would not result in significant HKA test results (HUDSON et al. 1987 Down). Population demographic changes cannot explain them either, because they would have affected all loci in a similar way (HUDSON et al. 1987 Down). Rather, these comparisons suggest background selection and/or selective sweeps. Here background selection refers to purifying selection on deleterious mutations in tightly linked exons and selective sweep refers to quick fixation of advantageous mutations in these exons. These events, if recent enough, can lead to a reduced present-day polymorphism in introns 6 and 7 (MAYNARD SMITH and HAIGH 1974 Down; CHARLESWORTH et al. 1993 Down). Consistent with the HKA test results, Tajima's D (-1.36, P = 0.076) and Fu and Li's F* (-1.81, P = 0.064) are both negative for the FOXP2 intron data, although they are only marginally significant. Note that these tests are conservative as a recombination rate of zero was assumed in the coalescent simulation.


 
View this table:
In this window
In a new window

 
Table 3. Allelic sequences of 10 humans at polymorphic sites of FOXP2 introns 6 and 7


 
View this table:
In this window
In a new window

 
Table 4. Intra- and interspecific DNA sequence variations in noncoding regions of the human genome

If the nonneutral pattern of introns 6 and 7 is due to background selection, the selection intensity must be high, because weak background selection is known to be ineffective in reducing the polymorphic level. This suggests that the adjacent exons must be under strong functional constraints with no relaxed purifying selection, which would imply that positive selection is the only possible explanation for the accelerated protein evolution. If a relatively recent selective sweep caused the low polymorphism, at least one of the two amino acid changes in exon 7 must be advantageous because no other amino acid substitutions occurred in the evolution of human FOXP2 and no other functional genes are located within 100 kb of FOXP2 exon 7. Taken together, unless positive selection is invoked, one cannot explain the accelerated evolution of FOXP2 protein and low polymorphism of introns simultaneously. The finding that FOXP2 is critical to speech and language development (LAI et al. 2001 Down) does not by itself demonstrate the role of this gene in the origin of human speech, because the function of FOXP2 could have remained unchanged during human evolution while other speech-related genes changed. However, the revelation of significant acceleration and positive selection in human FOXP2 suggests functional and fitness relevance of the two amino acid substitutions and provides support for the role of this gene in the evolution of speech and language. Interestingly, the notion of selection is consistent with the belief that the origin of language is an adaptation (PINKER and BLOOM 1990 Down; BOYD and SILK 2000 Down). In the future, it would be interesting to examine the exact functional effects of the two amino acid substitutions of human FOXP2 by in vitro assays of protein function as well as characterization of human phenotypes of reverse mutations.

If the lower-than-expected nucleotide diversity in FOXP2 introns suggested by HKA tests and D and F* statistics is indeed a result of a relatively recent selective sweep, the sweep probably occurred no earlier than 0.5 N generations ago, because the signal of a sweep is unlikely to last longer than that (SIMONSEN et al. 1995 Down). Here N is the effective population size of humans and is generally thought to have been ~10,000 (TAKAHATA 1993 Down). Thus, the sweep would have occurred no earlier than 5000 generations, or ~100,000 years, ago. This estimate is within the wide window of 40,000 years to 4 MYA during which human languages are believed to have emerged (BOYD and SILK 2000 Down). A paleo-population genetic study (LAMBERT et al. 2002 Down) may more accurately define the timing and process of the two amino acid substitutions in humans.

Perspective:
In this study we focused on identification of proteins with accelerated evolution in the hominid lineage. Other strategies that may also be used in the search for genetic bases of uniquely human features include identifying human genes that are under positive selection, human-specific gene duplications, deletions or deactivations, and changes in gene expression (GAGNEUX and VARKI 2001 Down; ENARD et al. 2002 Down). Different from these methods, our approach is useful when the phenotype-affecting genetic changes are simple amino acid substitutions. Our computer simulation showed that unless the substitution rate per sequence (nr) is high, our rate-constancy test is quite conservative. While this property somewhat reduces the power of our approach, it also makes our claims more secure. In other words, the positively identified cases will have a high chance to be biologically meaningful. At present, only a small number of chimpanzee genes have been sequenced, and only 120 genes, or ~0.35% of the genome, have been analyzed here. As the chimpanzee genome sequencing project (FUJIYAMA et al. 2002 Down) proceeds, many more genes affecting uniquely human features may be found by this and other methods.


*  FOOTNOTES

Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos. AY143178, AY143179, AY143180, AY143181 and AF539547, AF539548, AF539549, AF539550. Back


*  ACKNOWLEDGMENTS

We thank S. Hinshaw, A. Rooney, P. Tucker, S. Yokoyama, Y. Zhang, and the University of Michigan Museum of Zoology Mammal Division for providing DNA and animal tissue samples. We thank M. Nei, P. Tucker, and two anonymous reviewers for their comments on earlier versions of the manuscript. This work was supported by a startup fund of the University of Michigan to J.Z.

Manuscript received August 15, 2002; Accepted for publication September 6, 2002.


*  APPENDIX
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS AND DISCUSSION
*APPENDIX
*LITERATURE CITED


 
View this table:
In this window
In a new window

 
APPENDIX Evolutionary rate analysis of 120 orthologous proteins of human, chimpanzee, and mouse


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS AND DISCUSSION
*APPENDIX
*LITERATURE CITED

ARCHIBALD, J. D., A. O. AVERIANOV, and E. G. EKDALE, 2001  Late Cretaceous relatives of rabbits, rodents, and other extant eutherian mammals. Nature 414:62-65.[Medline]

BOYD, B., and J. B. SILK, 2000 How Humans Evolved. W. W. Norton, New York.

BRUNET, M., F. GUY, D. PILBEAM, H. T. MACKAYE, and A. LIKIUS et al., 2002  A new hominid from the Upper Miocene of Chad, Central Africa. Nature 418:145-151.

CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303.[Abstract]

CHEN, F. C. and W. H. LI, 2001  Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. Am. J. Hum. Genet. 68:444-456.[Medline]

EASTEAL, S., C. COLLET and D. BETTY, 1995 The Mammalian Molecular Clock. R. G. Landes, Austin, TX.

ENARD, W., P. KHAITOVICH, J. KLOSE, S. ZOLLNER, and F. HEISSIG et al., 2002  Intra- and interspecific variation in primate gene expression patterns. Science 296:340-343.[Abstract/Free Full Text]

FITCH, W. M., 1971  Toward defining the course of evolution: minimum change for a specific tree topology. Syst. Zool. 20:406-416.

FU, Y. X. and W. H. LI, 1993  Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]

FUJIYAMA, A., H. WATANABE, A. TOYODA, T. D. TAYLOR, and T. ITOH et al., 2002  Construction and analysis of a human-chimpanzee comparative clone map. Science 295:131-134.[Abstract/Free Full Text]

FULLERTON, S. M., J. BOND, J. A. SCHNEIDER, B. HAMILTON, and R. M. HARDING et al., 2000  Polymorphism and divergence in the beta-globin replication origin initiation region. Mol. Biol. Evol. 17:179-188.[Abstract/Free Full Text]

GAGNEUX, P. and A. VARKI, 2001  Genetic differences between humans and great apes. Mol. Phylogenet. Evol. 18:2-13.[Medline]

GIBBONS, A., 1998  Which of our genes makes us human? Science 281:1432-1434.[Free Full Text]

GU, X. and W. H. LI, 1992  Higher rates of amino acid substitution in rodents than in humans. Mol. Phylogenet. Evol. 1:211-214.[Medline]

HARRIS, E. E. and J. HEY, 1999  X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96:3320-3324.[Abstract/Free Full Text]

HUDSON, R. R., M. KREITMAN, and M. AGUADE, 1987  A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.[Abstract/Free Full Text]

KAESSMANN, H., F. HEISSIG, A. VON HAESELER, and S. PAABO, 1999  DNA sequence variation in a non-coding region of low recombination on the human X chromosome. Nat. Genet. 22:78-81.[Medline]

KUMAR, S. and S. B. HEDGES, 1998  A molecular timescale for vertebrate evolution. Nature 392:917-920.

LAI, C. S., S. E. FISHER, J. A. HURST, F. VARGHA-KHADEM, and A. P. MONACO, 2001  A forkhead-domain gene is mutated in a severe speech and language disorder. Nature 413:519-523.[Medline]

LAMBERT, D. M., P. A. RITCHIE, C. D. MILLAR, B. HOLLAND, and A. J. DRUMMOND et al., 2002  Rates of evolution in ancient DNA from Adelie penguins. Science 295:2270-2273.[Abstract/Free Full Text]

LYNCH, M. and J. S. CONERY, 2000  The evolutionary fate and consequences of duplicate genes. Science 290:1151-1155.[Abstract/Free Full Text]

MAYNARD SMITH, J. and J. HAIGH, 1974  The hitch-hiking effect of a favorable gene. Genet. Res. 23:23-35.[Medline]

MCCONKEY, E. H., R. FOUTS, M. GOODMAN, D. NELSON, and D. PENNY et al., 2000  Proposal for a human genome evolution project. Mol. Phylogenet. Evol. 15:1-4.[Medline]

NEI, M., and S. KUMAR, 2000 Molecular Evolution and Phylogenetics. Oxford University Press, New York.

NEI, M., P. XU, and G. GLAZKO, 2001  Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms. Proc. Natl. Acad. Sci. USA 98:2497-2502.[Abstract/Free Full Text]

NEWBURY, D. F., E. BONORA, J. A. LAMB, S. E. FISHER, and C. S. LAI et al., 2002  FOXP2 is not a major susceptibility gene for autism or specific language impairment. Am. J. Hum. Genet. 70:1318-1327.[Medline]

PINKER, S. and P. BLOOM, 1990  Natural language and natural selection. Behav. Brain Sci. 13:707-784.

ROZAS, J. and R. ROZAS, 1999  DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.[Abstract/Free Full Text]

SAITOU, N. and M. NEI, 1987  The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.[Abstract]

SHU, W., H. YANG, L. ZHANG, M. M. LU, and E. E. MORRISEY, 2001  Characterization of a new subfamily of winged-helix/forkhead (Fox) genes that are expressed in the lung and act as transcriptional repressors. J. Biol. Chem. 276:27488-27497.[Abstract/Free Full Text]

SIMONSEN, K. L., G. A. CHURCHILL, and C. F. AQUADRO, 1995  Properties of statistical tests of neutrality for DNA polymorphism data. Genetics 141:413-429.[Abstract]

STAUFFER, R. L., A. WALKER, O. A. RYDER, M. LYONS-WEILER, and S. B. HEDGES, 2002  Human and ape molecular clocks and constraints on paleontological hypotheses. J. Hered. 92:469-474.[Abstract/Free Full Text]

TAJIMA, F., 1989  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.[Abstract/Free Full Text]

TAKAHATA, N., 1993  Allelic genealogy and human evolution. Mol. Biol. Evol. 10:2-22.[Abstract]

THOMPSON, J. D., T. J. GIBSON, F. PLEWNIAK, F. JEANMOUGIN, and D. G. HIGGINS, 1997  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25:4876-4882.[Abstract/Free Full Text]

VARKI, A., 2000  A chimpanzee genome project is a biomedical imperative. Genome Res. 10:1065-1070.[Free Full Text]

WYCKOFF, G. J., W. WANG, and C. I WU, 2000  Rapid evolution of male reproductive genes in the descent of man. Nature 403:304-309.[Medline]

YU, N., Z. ZHAO, Y. X. FU, N. SAMBUUGHIN, and M. RAMSAY et al., 2001  Global patterns of human DNA sequence variation in a 10-kb region on chromosome 1. Mol. Biol. Evol. 18:214-222.[Abstract/Free Full Text]

ZHANG, J. and M. NEI, 1997  Accuracies of ancestral amino acid sequences inferred by the parsimony, likelihood, and distance methods. J. Mol. Evol. 44(Suppl. 1):S139-S146.

ZHANG, J., S. KUMAR, and M. NEI, 1997  Small-sample tests of episodic adaptive evolution: a case study of primate lysozymes. Mol. Biol. Evol. 14:1335-1338.[Medline]

ZHAO, Z., L. JIN, Y. X. FU, M. RAMSAY, and T. JENKINS et al., 2000  Worldwide DNA sequence variation in a 10-kilobase noncoding region on human chromosome 22. Proc. Natl. Acad. Sci. USA 97:11354-11358.[Abstract/Free Full Text]

ZIETKIEWICZ, E., V. YOTOVA, M. JARNIK, M. KORAB-LASKOWSKA, and K. K. KIDD et al., 1998  Genetic structure of the ancestral population of modern humans. J. Mol. Evol. 47:146-155.[Medline]




This article has been cited by other articles:


Home page
Mol Biol EvolHome page
G. Coop, K. Bullaughey, F. Luca, and M. Przeworski
The Timing of Selection at the Human FOXP2 Gene
Mol. Biol. Evol., July 1, 2008; 25(7): 1257 - 1259.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. L. Kim and Y. Satta
Population Genetic Analysis of the N-Acylsphingosine Amidohydrolase Gene Associated With Mental Activity in Humans
Genetics, March 1, 2008; 178(3): 1505 - 1515.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. A. Bakewell, P. Shi, and J. Zhang
More genes underwent positive selection in chimpanzee evolution than in human evolution
PNAS, May 1, 2007; 104(18): 7489 - 7494.
[Abstract] [Full Text] [PDF]


Home page
J. Neurosci.Home page
S. A. White, S. E. Fisher, D. H. Geschwind, C. Scharff, and T. E. Holy
Singing Mice, Songbirds, and More: Models for FOXP2 Function and Dysfunction in Human Speech and Language
J. Neurosci., October 11, 2006; 26(41): 10376 - 10379.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
P. C. Sabeti, S. F. Schaffner, B. Fry, J. Lohmueller, P. Varilly, O. Shamovsky, A. Palma, T. S. Mikkelsen, D. Altshuler, and E. S. Lander
Positive natural selection in the human lineage.
Science, June 16, 2006; 312(5780): 1614 - 1620.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. Varki and T. K. Altheide
Comparing the human and chimpanzee genomes: Searching for needles in a haystack
Genome Res., December 1, 2005; 15(12): 1746 - 1758.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
Y.-q. Wang, Y.-p. Qian, S. Yang, H. Shi, C.-h. Liao, H.-K. Zheng, J. Wang, A. A. Lin, L. L. Cavalli-Sforza, P. A. Underhill, et al.
Accelerated Evolution of the Pituitary Adenylate Cyclase-Activating Polypeptide Precursor Gene During Human Origin
Genetics, June 1, 2005; 170(2): 801 - 806.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
D. M. Webb and J. Zhang
FoxP2 in Song-Learning Birds and Vocal-Learning Mammals
J. Hered., May 1, 2005; 96(3): 212 - 216.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
G. H. Perry, B. C. Verrelli, and A. C. Stone
Comparative Analyses Reveal a Complex History of Molecular Evolution for Human MYH16
Mol. Biol. Evol., March 1, 2005; 22(3): 379 - 382.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
E. J. Vallender and B. T. Lahn
Positive selection on the human genome
Hum. Mol. Genet., October 1, 2004; 13(suppl_2): R245 - R254.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
T. Kitano, Y.-H. Liu, S. Ueda, and N. Saitou
Human-Specific Amino Acid Changes Found in 103 Protein-Coding Genes
Mol. Biol. Evol., May 1, 2004; 21(5): 936 - 944.
[Abstract] [Full Text] [PDF]


Home page
J. Neurosci.Home page
I. Teramitsu, L. C. Kudo, S. E. London, D. H. Geschwind, and S. A. White
Parallel FoxP1 and FoxP2 Expression in Songbird and Human Brain Predicts Functional Interaction
J. Neurosci., March 31, 2004; 24(13): 3152 - 3163.
[Abstract] [Full Text] [PDF]


Home page
J. Neurosci.Home page
S. Haesler, K. Wada, A. Nshdejan, E. E. Morrisey, T. Lints, E. D. Jarvis, and C. Scharff
FoxP2 Expression in Avian Vocal Learners and Non-Learners
J. Neurosci., March 31, 2004; 24(13): 3164 - 3175.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
S. Li, J. Weidenfeld, and E. E. Morrisey
Transcriptional and DNA Binding Activity of the Foxp1/2/4 Family Is Modulated by Heterotypic and Homotypic Protein Interactions
Mol. Cell. Biol., January 15, 2004; 24(2): 809 - 822.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. Zhang
Evolution of the Human ASPM Gene, a Major Determinant of Brain Size
Genetics, December 1, 2003; 165(4): 2063 - 2070.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Zhang
Paleomolecular biology unravels the evolutionary mystery of vertebrate UV vision
PNAS, July 8, 2003; 100(14): 8045 - 8047.
[Full Text] [PDF]


Home page
Genome ResHome page
M. W. Karaman, M. L. Houck, L. G. Chemnick, S. Nagpal, D. Chawannakul, D. Sudano, B. L. Pike, V. V. Ho, O. A. Ryder, and J. G. Hacia
Comparative Analysis of Gene-Expression Patterns in Human and African Great Ape Cultured Fibroblasts
Genome Res., July 1, 2003; 13(7): 1619 - 1630.
[Abstract] [Full Text] [PDF]