- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Mezey, J. G.
- Articles by Wagner, G. P.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Mezey, J. G.
- Articles by Wagner, G. P.
Is the Genotype-Phenotype Map Modular?: A Statistical Approach Using Mouse Quantitative Trait Loci Data
Jason G. Mezey1,a, James M. Cheverudb, and Günter P. Wagneraa Department of Ecology and Evolutionary Biology, Center for Computational Ecology, Yale University, New Haven, Connecticut 06520-8106
b Department of Anatomy and Neurobiology, Washington University School of Medicine, St. Louis, Missouri 63110
Corresponding author: Günter P. Wagner, Department of Ecology and Evolutionary Biology, Yale University, POB 208106, New Haven, CT 06520-8106., gunter.wagner{at}yale.edu (E-mail)
Communicating editor: Z-B. ZENG
| ABSTRACT |
|---|
Various theories about the evolution of complex characters make predictions about the statistical distribution of genetic effects on phenotypic characters, also called the genotype-phenotype map. With the advent of QTL technology, data about these distributions are becoming available. In this article, we propose simple tests for the prediction that functionally integrated characters have a modular genotype-phenotype map. The test is applied to QTL data on the mouse mandible. The results provide statistical support for the notion that the ascending ramus region of the mandible is modularized. A data set comprising the effects of QTL on a more extensive portion of the phenotype is required to determine if the alveolar region of the mandible is also modularized.
IT has been hypothesized that characters that collectively serve a common functional role should be genetically integrated and relatively independent from the rest of the body (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The study of ![]()
|
In this study we propose an additional method for assessing the modularity of a portion of the phenotype using QTL data. We suggest two statistics that measure the two aspects of modularity: integration by pleiotropic effects and parcelation, i.e., fewer than expected pleiotropic couplings with the rest of the phenotype (![]()
![]()
| THE UTILITY OF QTL DATA FOR EXPLORING THE GENOTYPE-PHENOTYPE MAP |
|---|
Hypotheses about the structure of the genotype-phenotype map are statements about the statistical distribution of mutation effects among phenotypic traits. Ideally, data to test these hypotheses should directly sample the distribution of the effects of individual mutations. Data on the distribution of individual mutation effects, however, is difficult to produce. QTL data provide a reasonable alternative for testing ideas about the structure of the genotype-phenotype map. One limitation when using QTL data, however, is that it reflects both the distribution of individual mutation effects as well as the fixation probabilities of the alleles. In this section, we discuss some of the problems of utilizing QTL data to analyze the structure of the genotype-phenotype map.
QTL are defined as genes influencing quantative traits (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The marker locus technique identifies loci where different alleles have become fixed in divergent lines. Whether a particular allele becomes fixed in a line depends on two probabilities: (1) the probability that a mutation with this effect will occur, and (2) the probability of fixation given that the mutation has occurred. The first probability is relevant for estimating the distribution of mutation effects. It is this statistical distribution that is predicted to be modular. The second probability is determined by the history of drift and selection effects in the study populations. Of the mutations that were segregating in the original base population and the mutations that occurred during selection, some are more likely to be fixed than others. For example, if a line used in a QTL study is produced by selecting for larger character values, mutations with large positive effects on the character are more likely to be fixed. The result is that marker locus QTL data are a "filtered" sample of the mutational distribution.
For the purposes of this study, we are interested in marker locus studies where multiple traits were measured and a large number of QTL were identified. In this study, we reanalyze the data set of ![]()
| STATISTICS FOR MODULARITY |
|---|
Modularity of a set of traits can be defined by two attributes (![]()
T. Ideally, the number of traits in the subset is much smaller than the total set of observables. We define two statistics for assessing the modularity of T1, one to measure integration within the set and one to measure parcelation between trait sets. We call the total number of QTL that have effects on at least two traits n. Of these n QTL, we call m the number of QTL that affect at least one trait in T1, m < n. Note that we only consider QTL with effects on at least two traits in T when calculating these statistics.
Integration statistic:
To measure integration of the set T1 consisting of k traits, we consider only the m QTL with an effect on at least one trait in the set T1. The integration statistic MI is based on the total number of traits in T1 affected by the m QTL compared to the maximum number of traits these QTL could affect in T1, which is km:

The statistic is scaled to the unit interval [0, 1]. The extreme values MI = 1 and MI = 0 correspond to the maximum and minimum amounts of integration of the traits in T1. For example, the most completely integrated case would be one where all m QTL with at least one effect on T1 have effects on all k traits in T1 such that the total number of effects is km. This corresponds to MI = 1 (Fig 2A). At the other extreme, each QTL with at least one effect on T1 affects only a single trait. In this case, each trait in T1 changes independently as a result of an allele substitution. This corresponds to the lowest possible amount of integration such that MI = 0 (Fig 2A).
|
Parcelation statistic:
For measuring parcelation, we consider all n QTL in the sample with effects on more than one trait. Parcelation is defined as a relative lack of pleiotropic effects between two sets of traits. Parcelation is thus a relation between any two nonoverlapping sets of traits T1 and T2. These two sets can be a subset T1 of all traits T and its complement, i.e., all the traits in T that are not in T1. Pearson's
2 is used to compare the observed number of traits affected by each QTL in T1 and T2 to the expected number based on the marginal distribution of effects among all QTL and traits. For each QTL j, the observed number of traits affected in T1 and T2 is symbolized by Oj1 and Oj2, and the expected number of traits affected is calculated as Ej1 = p1Sj and Ej2 = p2Sj. Where Sj is the total number of effects of QTL j, and p1 and p2 are the relative frequencies with which a trait in T1 or T2 is affected by any QTL in the total sample,

The parcelation statistic MP therefore reads

Intuitively, this statistic measures the degree to which QTL preferentially affect traits in either T1 or T2. The null hypothesis is that the effects of a QTL are randomly distributed among the traits of the two trait sets. Rejecting the null hypothesis means that QTL effects tend to be clustered within each trait set. In genetic terms, this means that the QTL are "character specific," i.e., their effects tend to be limited to a set of traits. This was tested in ![]()
Application and test distribution:
To illustrate how these statistics are utilized, consider a study where n QTL have been identified that have effects on at least two of N observed phenotypic traits. Let us assume that k of these traits describe a complex T1 that performs some common function. The information about the functional significance of T1 must be available from some independent source, such as a study of the functional morphology of the traits. We want to test whether T1 is more modular than expected for a random set of traits. To do this, we first determine the values of the statistics for T1 and then the cumulative mass function (cmf) of MI and MP with respect to all the possible subsets of k traits taken from the total N traits. If both the values of MI and MP are greater than the critical value (using
= 0.05 for this study), the null hypothesis is rejected; i.e., the modularity of the set is greater than expected for a randomly chosen set of traits.
The "shape" of the cmf for the statistics MI and MP will depend on the distribution of pleiotropic effects among the QTL. Optimally, the value of MI and MP for all possible sets of k traits should be determined to calculate the cmfs exactly. This is the procedure adopted in our study. However, if the values of the Eij are generally >5 and if no or only a few QTL have a number of effects >k or N - k, then the sampling distribution MP will approximately follow a
2 distribution with 2n d.f.
We do not in general expect the marginal distributions of the statistics MI and MP to be independent. If they are not independent, the probability of rejecting the null hypothesis for both statistics for a random set of k traits is somewhere between 0.0025 (for independent MI and MP) and 0.05 (for a correlation of 1.0). Regardless of the correlation structure, we still consider a functional unit to be significantly modular if we can reject the null hypothesis for both statistics. It is expected that the correlation of the statistics will not be useful for comparing the power of the test for different data sets as the distribution associated with an alternative hypothesis (how traits of the unit T1 are assigned) will not necessarily be comparable between data sets.
It should be noted that for cases where we are assessing the integration and parcelation of more than one unit of a single data set, the units should not have many traits in common. If two units have many traits in common, we would expect similar results (rejection or nonrejection) for both units. We must therefore be cautious when assessing hierarchical and overlapping sets of traits for their degree of integration or parcelation. In this case, the two units of the mandible have only a single trait in common.
| RESULTS |
|---|
For this study, we assessed the modularity of functional sets of the mandible of the mouse Mus musculus. A QTL study performed by ![]()
![]()
![]()
![]()
|
|
Ascending ramus:
The ascending ramus is described by k = 12 of the 21 traits in this study. These traits are affected by m = 32 QTL of the 37 that have effects on at least 2 traits. The maximum number of traits these QTL could affect is 12 x 32 = 384. The observed number of effects on ascending ramus traits is 98 and thus the integration statistic is MI = 0.188. The cmf of the integration statistics for k = 12 yields a P value of 0.0102, which is significant at the 2% level. This value reflects the probability that this or greater levels of integration would have been observed if the traits were selected at random.
Of the total 160 effects of the QTL, 98 affected traits of the ascending ramus. The probability that a trait affected by a QTL is in this region (p1) is therefore 0.6125. Given this probability and the individual effects of the QTL, the parcelation statistic is calculated to be 77.650. This corresponds to a P value of 0.0003. We conclude that the ascending ramus is both more integrated and more parceled than expected by chance. It thus fulfills the criteria for a modularized unit (of the part of the phenotype that is described by the set of observed traits).
Alveolar region:
Of the 21 traits observed, k = 10 are ascribed to the alveolar region. These traits are affected by m = 28 QTL with at least 2 effects. The maximum number of effects on the alveolar region is 280. The number of effects on the alveolar region traits is 71, which leads to an integration statistic of MI = 0.171. This value is similar to that obtained for the ascending ramus. The cmf of the statistic, however, reveals that this value is not significantly different from a random sample of traits (P = 0.0968) at the 5% level.
Of the total 160 effects of the QTL, 71 affected the alveolus such that p1, the a priori probability of an alveolar trait affected by a QTL, is 0.444. Given this probability and the individual effects of the QTL, the parcelation statistic is 68.688. This corresponds to a P value of <10-4. The alveolar region is thus a part of the phenotype that is well parceled from the rest of the mandible but not more integrated than expected by chance. The alveolar region can thus be seen as a collection of quasi-independent traits rather than an integrated character complex.
| DISCUSSION |
|---|
There are two aspects to the test proposed in this article. On the one hand, it is a test for the nonrandomness of the distribution of QTL effects on a subset of traits. As such, this test does not address which evolutionary forces may have caused the nonrandom pattern. If, however, the test is performed on a set of traits that a priori have been classified as describing a functional unit, it is in fact an indirect test of the hypothesis that natural selection has shaped the genotype-phenotype map to modularize functionally integrated sets of characteristics. A rejection of the null hypothesis is considered as evidence in support of the hypothesis of the imitatory epigenotype (![]()
![]()
![]()
![]()
For the data set of ![]()
![]()
A limitation of our tests is that they only take into account whether a QTL has a significant effect on a trait or not. Ideally one would like to take into account the magnitude of the effects to assess the degree of integration and/or parcelation. The problem, though, with including the magnitude of the effects is that most of the effects are small and thus the confidence intervals are relatively large. This implies that there is little actual information added to the analysis if the measured effect is taken into account other than the fact that the effect is greater than zero.
As explained previously, every QTL data set represents a filtered sample of mutations that may occur. That is, a QTL data set not only reflects the probability of a mutation occurring but also the fixation probability of a mutation during the preparation of the lines used in the QTL study. Since different QTL study conditions may produce different probabilities of fixation, it is possible that the tests may produce a significant result for one set of lines but may not with another set of lines. Consider, for instance, a functional complex that is modular. A group of mutations that occur in this system are likely to reflect this modular structure. However, a QTL study protocol may end up fixing a filtered sample of these mutations that no longer have a modular distribution. Because QTL study conditions could also have the opposite effect, making a nonmodular unit appear modular, it is important to test data from other QTL studies using different inbred lines before drawing a final conclusion concerning the modularity of a structure. Since multiple QTL data sets do not presently exist for the mouse mandible, we can only interpret the present results as consistent with the hypothesis that the ascending ramus is a modular unit.
The genetic structure of the mouse mandible has been studied in the past with quantitative genetic methods (![]()
![]()
![]()
![]()
![]()
![]()
![]()
The comparison of genetic correlations and QTL data on the genetic architecture of characters is complicated by two factors. At the one hand, the genetic correlations reflect not only the pattern of mutational effects but also the frequencies of segregating alleles in the population in which the genetic correlations were estimated. Genetic correlations are as much a property of the population composition as they are a property of the genetic architecture of the traits. Furthermore, a low genetic correlation can indicate one of two things: it could be caused by a relative lack of pleiotropic effects among the traits or a pattern of positive and negative pleiotropic effects that cancel each other. There is no way to distinguish between these two scenarios by analyzing genetic correlations. Our results and that reported in ![]()
It should be noted that although the results from the QTL analysis are consistent with the hypothesis that natural selection may result in modular structures, results from this test alone are not sufficient to actually infer the action of natural selection in creating a modular pattern. To more directly investigate the possible role of natural selection in causing the association between functional and genetic integration, it is necessary to analyze data on the evolution of the genotype-phenotype map and the evolution of character function. However, with analyses of the sort described in this study, we can now begin to examine the architecture of genotype-phenotype maps and provide a foundation for such studies.
| FOOTNOTES |
|---|
1 Present address: Department of Biological Science, Florida State University, Tallahassee, FL 32306. ![]()
| ACKNOWLEDGMENTS |
|---|
The authors thank Junhyong Kim and David Houle for critically reading the manuscript. We further thank Homayoun Bagheri, Chi-Hua Chiu, Ashley Carter, Christian Pazmandi, Scott Rifkin, Max Shpak, and Terri Williams for stimulating discussions on the topic of this article. The support by National Science Foundation grants IBN-9905403, DEB-9419992, and DEB-9726433 is gratefully acknowledged.
Manuscript received November 5, 1999; Accepted for publication May 24, 2000.
| LITERATURE CITED |
|---|
ATCHLEY, W. R. and B. K. HALL, 1991 A model for development and evolution of complex morphological structures. Biol. Rev. 66:101-157[Medline].
ATCHLEY, W. R., A. A. PLUMMER, and B. RISKA, 1985 Genetics of mandible form in the mouse. Genetics 111:555-577
ATCHLEY, W. R., D. E. COWELY, C. VOGL, and T. MCLELLAN, 1992 Evolutionary divergence, shape change and genetic correlation structure in the rodent mandible. Syst. Biol. 41:196-221.
BAILEY, D. W., 1956 A comparison of genetic and environmental principal components of morphogenesis in mice. Growth 20:63-74[Medline].
BONNER, J. T., 1988 The Evolution of Complexity. Princeton University Press, Princeton, NJ.
CHEVERUD, J. M., 1982 Phenotypic, genetic and environmental morphological integration in the cranium. Evolution 36:499-516.
CHEVERUD, J. M., 2000 The genetic architecture of pleiotropic relations and differential epistasis, in The Character Concept in Evolutionary Biology, edited by G. P. WAGNER. Academic Press, San Diego (in press).
CHEVERUD, J. M., E. P. ROUTMAN, and D. K. IRSHICK, 1997 Pleiotropic effects of individual gene loci on mandibular morphology. Evolution 51:2004-2014.
KNOTT, S. A. and C. S. HALEY, 1992 Aspects of maximum likelihood methods for the mapping of quantitative trait loci in line crosses. Genet. Res. 60:139-151.
LANDER, E. S. and D. BOTSTEIN, 1989 Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185-199
LIU, B., 1998 Statistical Genomics. CRC Press LLC, Boca Raton, FL.
LYNCH, M., and B. WALSH, 1998 Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA.
OLSON, C., and R. MILLER, 1958 Morphological Integration. The University of Chicago Press, Chicago.
PENROSE, L. S., 1935 The detection of autosomal linkage in data which consist of pairs of brothers and sisters of unspecified parentage. Ann. Eugen. 6:133-138.
RAFF, R. A., 1996 The Shape of Life: Genes, Development, and the Evolution of Animal Form. University of Chicago Press, Chicago.
RIEDL, R., 1978 Order in Living Organisms. Wiley Press, New York.
ROUTMAN, E. P., and J. M. CHEVERUD, 1994 Individual genes underlying quantitative traits: molecular and analytical methods, pp. 593606 in Molecular Ecology and Evolution: Approaches and Applications, edited by B. SCHIERWATER, B. STREIT, G. P. WAGNER and R. DESALLE. Birkhäuser, Boston/New York.
SAX, K., 1923 The association of size differences with seed-coat pattern and pigmentation in Phasolus vulgaris.. Genetics 8:552-560
SCHWENK, K., 2000 Functional units and their evolution, in The Character Concept in Evolutionary Biology, edited by G. P. WAGNER. Academic Press, San Diego (in press).
STURTEVANT, A. H., 1913 The linear arrangement of six sex-linked factors in Drosophila, as shown by their model of association. J. Exp. Zool. 14:43-59.
WAGNER, G. P., 1996 Homologues, natural kinds and the evolution of modularity. Am. Zool. 26:36-43.
WAGNER, G. P. and L. ALTENBERG, 1996 Complex adaptations and the evolution of evolvability. Evolution 50:967-976.
ZENG, Z., 1993 Theoretical basis of segregation of multiple linked gene effects on mapping quantitative trait loci. Proc. Natl. Acad. Sci. USA 90:10972-10976
This article has been cited by other articles:
![]() |
C. J. Vinyard and B. A. Payseur Of "mice" and mammals: utilizing classical inbred mice to study the genetic architecture of function and performance in mammals Integr. Comp. Biol., September 1, 2008; 48(3): 324 - 337. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Sherwood, D. L. Duren, L. M. Havill, J. Rogers, L. A. Cox, B. Towne, and M. C. Mahaney A Genomewide Linkage Scan for Quantitative Trait Loci Influencing the Craniofacial Complex in Baboons (Papio hamadryas spp.) Genetics, September 1, 2008; 180(1): 619 - 628. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. P. Klingenberg, L. J. Leamy, and J. M. Cheverud Integration and Modularity of Quantitative Trait Locus Effects on Geometric Shape in the Mouse Mandible Genetics, April 1, 2004; 166(4): 1909 - 1921. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Mezey and D. Houle Comparing G Matrices: Are Common Principal Components Informative? Genetics, September 1, 2003; 165(1): 411 - 425. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. Albertson, J. T. Streelman, and T. D. Kocher Genetic Basis of Adaptive Shape Differences in the Cichlid Head J. Hered., July 1, 2003; 94(4): 291 - 301. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Beldade, K. Koops, and P. M. Brakefield Modularity, individuality, and evo-devo in butterfly wings PNAS, October 29, 2002; 99(22): 14262 - 14267. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Murren, N. Pendleton, and M. Pigliucci Evolution of phenotypic integration in Brassica (Brassicaceae) Am. J. Botany, April 1, 2002; 89(4): 655 - 663. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. P. Klingenberg, L. J. Leamy, E. J. Routman, and J. M. Cheverud Genetic Architecture of Mandible Shape in Mice: Effects of Quantitative Trait Loci Analyzed by Geometric Morphometrics Genetics, February 1, 2001; 157(2): 785 - 802. [Abstract] [Full Text] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Mezey, J. G.
- Articles by Wagner, G. P.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Mezey, J. G.
- Articles by Wagner, G. P.


= 1. Each QTL with an effect on T2 has an effect on one in T2. The value of the integration statistic is therefore zero: MI(T2) =
= 0. (B and C) Illustrations of the parcelation statistic MP: (B) For all the QTL in this sample, the expected number of effects is 1(Eji = pTiSj = 2 x 0.5 = 1.0). For QTL 1 and QTL 2 the observed values are 2 and 0 (Oj1 = 2 and Oj2 = 0, where j = 1 or 2). For QTL 3 and QTL 4, the observed values are also 2 and 0 (Oj1 = 0 and Oj2 = 2, where j = 3 or 4). Hence the value of MP =
+
+ 




