- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Toomajian, C.
- Articles by Kreitman, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Toomajian, C.
- Articles by Kreitman, M.
A Method for Detecting Recent Selection in the Human Genome From Allele Age Estimates
Christopher Toomajian2,a, Richard S. Ajiokac, Lynn B. Jorded, James P. Kushnerc, and Martin Kreitmana,ba Committee on Genetics, University of Chicago, Chicago, Illinois 60637
b Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637
c Division of Hematology/Oncology, University of Utah, Salt Lake City, Utah 84112
d Department of Human Genetics, University of Utah, Salt Lake City, Utah 84112
Corresponding author: Christopher Toomajian, University of Southern California, 835 W. 37th St., SHS 172, Los Angeles, CA 90089-1340., cmtoomaj{at}alumni.uchicago.edu (E-mail)
Communicating editor: W. STEPHAN
| ABSTRACT |
|---|
Mutations that have recently increased in frequency by positive natural selection are an important component of naturally occurring variation that affects fitness. To identify such variants, we developed a method to test for recent selection by estimating the age of an allele from the extent of haplotype sharing at linked sites. Neutral coalescent simulations are then used to determine the likelihood of this age given the allele's observed frequency. We applied this method to a common disease allele, the hemochromatosis-associated HFE C282Y mutation. Our results allow us to reject neutral models incorporating plausible human demographic histories for HFE C282Y and one other young but common allele, indicating positive selection at HFE or a linked locus. This method will be useful for scanning the human genome for alleles under selection using the haplotype map now being constructed.
THE ability to detect mutations under positive selection while they are still segregating in populations is an exciting prospect because these are variants that determine the fitness difference among individuals. Some of these variants may be found among human disease alleles that segregate at unexpectedly high frequencies. For example, the case has been made for positive selection for alleles at cystic fibrosis transmembrane conductance regulator (![]()
![]()
![]()
![]()
This article deals with DNA polymorphism on different scales (base pairs, kilobase pairs, megabase pairs) so it helps to clarify our terminology. We study alleles defined at the base pair level and additional variable sites linked to this focal polymorphism at the megabase pair level. We focus on the derived, or novel, allele that is produced when a DNA mutation occurs and for convenience may refer to this as a "mutation" or simply as "the allele." Since allele often implies a single base pair or locus (approximate kilobase pair), we refer to the set of homologous sequences (approximate megabase pair) carrying a particular derived allele (base pair) as an "allele class." Each sequence in this allele class, a copy of the allele, will be identical at the site defining the class, but may or may not be identical to one another at other linked sites. "Haplotype" refers to the particular configuration of the polymorphisms at two or more variable sites, such as occurs on each of two homologs of a diploid chromosome. Thus, an allele class may contain more than one different haplotype. We use the term "haplotype sharing" as a quantitative measure of the extent to which two or more sequences in an allele class resemble each other at linked sites.
Positive selection can be inferred for an allele on the basis of the relationship between its age and frequency in a population. In the absence of selection, higher-frequency alleles are expected to be older than lower-frequency alleles (![]()
![]()
![]()
![]()
As an alternative, one can include LD information from haplotypes to estimate allele age, taking a genealogical view of LD and uncovering historical patterns of recombination that can reflect an allele's age (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
To avoid these problems, an allele's age can be estimated by the decay of ancestral haplotype sharing (DHS; ![]()
Our method to test whether alleles at a region of interest are compatible with models of neutral evolution can be summarized as follows. Collect haplotype data from a population using markers that flank a region of interest (e.g., a gene or exon) and span up to 1 cM. Sort these haplotypes on the basis of the alleles they carry at the region of interest. Estimate the ages of these alleles with the DHS method from the observed decay of haplotype sharing at the flanking markers within this haplotype set and compared to the background LD and allele frequencies found at the marker loci on the remaining haplotypes. Compare the frequency of the alleles at the region of interest with the estimated ages of these alleles to identify young alleles at unexpectedly high frequencies. Because natural selection specifically discriminates between alternative alleles at a locus, multiple alleles help resolve whether a pattern is due to selection or to processes that act more generally in regions, such as heterogeneity in recombination rate, or, in populations, such as genetic drift and demographic history. Evaluate the compatibility of the observed relationship between allele age and frequency in the region to neutral evolutionary models by simulating haplotypes that include an allele at the region of interest and at flanking markers and by estimating the age of these alleles using the DHS method. These simulations model uncertainty in the genealogy of alleles and provide an appropriate statistical comparison for the observed alleles. Compare the age of observed alleles with the distribution of ages for simulated alleles at the same frequency produced under different demographic models. Alleles that are younger than the vast majority or all of the simulated alleles are unlikely to occur by chance under the neutral models and indicate the possible role of selection.
As a demonstration, we applied this method to alleles at the human HFE gene. HFE (![]()
![]()
![]()
![]()
![]()
Our method benefits from the implementation of three approaches that overcome some of the difficulties of previous methods in detecting patterns characteristic of recent positive selection: (1) estimating allele age by the DHS method; (2) testing empirical allele age estimates against neutral models that include realistic demographic parameters with the use of coalescent simulations; and (3) comparing results between eight HFE alleles that act as a control in interpreting patterns for any individual allele.
| MATERIALS AND METHODS |
|---|
DNA samples:
Samples were collected from Utah and surrounding states (![]()
SNP typing:
We typed a subset of highly informative SNPs at the HFE locus (![]()
![]()
1 kb to determine the SNP genotype (![]()
Inference of the 11-kb sequence:
We used pedigrees to resolve haplotypes for the SNP alleles and place these haplotypes in the context of the broader marker haplotypes. This process was simplified in individuals heterozygous for the C282Y mutation, because they almost always carry the common C282Y haplotype (22). SNP haplotypes for the non-C282Y-bearing chromosome could then be resolved. Eight non-C282Y chromosomes had been fully sequenced (![]()
![]()
Estimating allele age:
We used DHSMAP v. 1.04 software (![]()
|
Our pedigree samples are biased for the C282Y allele. For the purpose of estimating allele age, we constructed a sample that includes 140 independent non-C282Ychromosomes and 11 randomly chosen independent C282Y chromosomes. This constructed random sample corrects for the C282Y allele bias, so that its frequency is
7%, the estimate for the population from which these samples were drawn (![]()
Coalescent simulations:
Simulations were performed by modifying MS, a C program from R. Hudson, which incorporates recombination, population size changes, and population subdivision (![]()
![]()
Estimating the ages of simulated alleles:
When a comparison was made between simulated and HFE SNP alleles, the age estimate based only on SSRs was used. When the assumption of constant population size is relaxed, the expected TMRCA for two randomly drawn alleles will change and affect the expected variance in SSR allele size. We empirically estimated this deviation by measuring the average variance in allele size of a SSR in a simulated sample that had undergone population size changes and comparing it to the constant population size expectation. To mirror the observed variance in allele size for each SSR in our simulations of demographic models, we adjusted the mutation rate to compensate for the change in the TMRCA for two randomly drawn samples.
| RESULTS |
|---|
We generated the data needed to estimate the ages of several alleles at the HFE locus by two steps: (1) determine individual haplotypes for the SNPs that segregate in an 11-kb region encompassing the HFE locus in a Caucasian sample (![]()
![]()
![]()
![]()
![]()
|
|
Estimating allele ages:
The above data allowed us to determine the age of alleles in the HFE locus, including HFE C282Y, by assessing the decay of haplotype sharing to more distant sites. We selected eight SNP alleles for age estimation by DHSMAP, a program that implements the DHS method (![]()
Table 2 shows the frequency of these alleles and their age estimates for a range of assumed SSR mutation rates (10-510-3). Our best estimate for the age of C282Y is 138 generations, with a range from 88 to 156. When interpreting the frequency of an allele in light of its estimated age, the comparison with many presumably neutral alleles provides a control for the effects of population demography and heterogeneity in mutation and recombination rates in the genomic region under study. Relative to other alleles, C282Y is indeed young, given its frequency. However, the 7633A allele produced an even more remarkable result, as it is at higher frequency and has a younger age estimate (78 generations). The next youngest allele after 7633A and C282Y is 4600G at 529 generations, with the remaining alleles
1000 generations or older when estimating SSR mutation rates from their variance in allele size (Table 2, row A). Deviations from the one-step mutation model of SSRs, which is assumed in estimating these mutation rates, would mean that our mutation rate estimates may be high, although the true value would likely still be within the range of values we consider. Age estimates depended on the assumed marker mutation rate because DHSMAP estimates age from the effect on haplotype sharing of both recombination and mutation. Alleles with much haplotype sharing (7633A) were less sensitive to misspecification of mutation rate than those with little haplotype sharing (H63D, Table 2 rows D, F, and H). Also, the difference between ages estimated with mutation rates 10-4 and 10-5 were small compared to the difference for 10-4 and 10-3. If the true mutation rate for the SSRs is closer to 10-3, then age estimates decrease. This makes the young age estimates of 7633A and C282Y more extreme but is probably unlikely since all HFE alleles <30% would appear fairly young (TMRCA of <550 generations).
|
Test of neutrality using coalescent simulations:
Having estimated young ages for two common alleles at HFE, we tested whether these two alleles were compatible with models that assume no positive selection. To assess the likelihood under neutrality of age estimates for individual HFE alleles, we simulated haplotypes under a range of parameter values (SSR mutation rate, recombination rate, and different demographic scenarios) using coalescent theory (program available upon request). The coalescent simulations model the genealogy of the entire population and account for uncertainty in the number of lineages present in past generations for particular alleles (![]()
Simulation replicates from a constant population size model included at least 1000 representatives in each of 41 allele-frequency classes (from 6/151 to 46/151). Fig 3 displays the probability distribution for each allele-frequency class as percentile values of allele age. Points corresponding to the ages and frequencies of alleles at HFE are overlaid for comparison. Both the C282Y and 7633A alleles are at the tail of the distribution for their frequency classes and are thus significantly younger than expected under this neutral model (P < 0.01 and P < 0.001, respectively). In random population samples, different numbers of chromosomes carrying C282Y or 7633A will be found due to sampling variance. Given our age estimates for these alleles, at least nine copies of C282Y in a sample of 151 chromosomes must be found for the age of this allele to fall below the 2.5 percentile value of the simulated haplotypes, and at least nine or six copies of 7633A must be found for the age of this allele to fall below the 1 and 2.5 percentile value, respectively, of the simulated haplotypes. Also, the reported P values should not be taken at face value since multiple tests have been performed. These tests are not independent, so the standard Bonferroni correction is too conservative and not appropriate. Nevertheless, we note that while the C282Y result would not remain significant after this correction for multiple tests, the 7633A result would.
|
Alternative demographic models:
We simulated samples under different demographic models (Table 3) to explore their effects on the estimation of allele ages by DHSMAP. The significance of HFE allele ages under the various demographic models is indicated by the empirical cumulative probability distributions of ages for alleles in three different frequency classes (Fig 4). Under the simple exponential growth model, the age distributions shifted toward higher values and the significance of the C282Y estimate increased. This is expected, since LD should be less in a fast-growing population (![]()
![]()
|
|
The significance of the young age estimate for C282Y is also insensitive to error in the estimate of recombination rates throughout the region studied. Fig 4 shows the empirical cumulative probability distribution produced when samples were simulated assuming the genetic distances between all markers were only half of the values used to estimate allele ages by DHS (1/2
). In this case, the age of the C282Y allele is borderline significant (P < 0.06). However, the low recombination rate used implies a genetic distance between human histocompatibility-A (HLA-A) and HFE that lies below the 95% confidence interval estimated from recombination events observed between these loci (![]()
![]()
Comparison with other HFE alleles:
The estimated age of 7633A is significantly younger than expected on the basis of its frequency for all models considered. Only a very extreme founder effect would produce such young age estimates with any likelihood. Yet no evidence exists for a severe founder effect on the basis of estimated ages of other HFE alleles. For example, Fig 4B includes the age of the 709T allele, found at the same frequency as allele 7633A. It is older than 97% of the ages estimated for simulated alleles in this frequency class under the recent bottleneck model. Fig 4C gives the example of the H63D allele at 16% frequency. This allele is not consistent with either bottleneck model tested, with
95 and 99.5% of simulated alleles having younger ages under an old bottleneck and a recent bottleneck, respectively. If a recent and severe founding effect occurred in this population, then the old age estimated by DHS at H63D would suggest that it was preferentially preserved through the founding bottleneck. Thus, the comparison of a broad range of alleles from the HFE locus and their patterns of haplotype-sharing decay can jointly determine likely demographic parameters of this sample and detect particular alleles that depart significantly from the expectations of neutral demographic models.
| DISCUSSION |
|---|
Our investigation of the empirical relationship between the decay of haplotype sharing around different HFE alleles and their frequency supports the hypothesis that HFE C282Y has increased in frequency due to selection and detects an additional selected haplotype in the HLA-HFE region. Multilocus LD estimated from the decay of haplotype sharing provides a clear indication of which haplotypes have been affected by natural selection, and our method may be able to indicate recent positive selection too subtle to be detected by current tests that emphasize levels of variation rather than allele association (![]()
![]()
Selective events that are detectable by haplotype sharing may also be detected using methods that focus on the reduction in variation in an allele class (e.g., the haplotype test of ![]()
Highly polymorphic SSRs in a haplotype test may not be as informative as in a haplotype-sharing test because the high mutation rate of SSRs will rapidly increase the number of variants within an allele class. As an example, the SSRs we use are quite variable, and there is not an overwhelming absence of variability in the C282Y allele class. Our study included SSRs not because they increase variability, but because they were the markers that had already been typed in the appropriate samples. Without SSRs, more SNPs would probably be needed to provide the same amount of information on how far ancestral haplotypes extended along chromosomes, but their lower mutation rate would provide a clearer picture of ancestral haplotype extent and result in more accurate age estimates. Our method could also be applied to previously ascertained SNPs, but doing the same with the haplotype test may be problematic because it would require, impractically, many ascertained markers to detect a significant reduction in variation. In some cases, detecting a significant difference in SNP variability between selected vs. neutral allele classes may be more difficult than detecting a significant difference in the level of haplotype sharing.
The age of HFE C282Y:
Our study emphasized the comparison of haplotype sharing for multiple observed SNP alleles and alleles produced in neutral coalescent simulations, so that no conclusions were based on absolute ages alone. Still, comparisons with two previous studies that have used the observed decay of LD to estimate the C282Y allele age show that the DHS method performs well. Our estimate (138 generations) fell within the 95% confidence interval (27161 generations) for the estimate (59 generations) of ![]()
![]()
![]()
Aside from the technical limitations of these age estimates, directly interpreting the young age of an allele, given its frequency, as clear evidence of positive selection is inappropriate without formal statistical testing. Therefore, the fact that the DHS method produces estimates of the TMRCA of an allele class rather than its true age is inconsequential. Because both observed and simulated haplotypes were evaluated by the same method, the conclusions from our statistical testing are robust to any biases inherent in the way we estimate allele age. Inaccuracies in allele age estimates do not affect our ability to evaluate the contribution of natural selection and population demography to the current frequencies of young alleles. We used the DHS method to measure the decay of LD because it is designed to use information from several linked markers, can account for the dependence across loci within a haplotype, and is an improvement over other multilocus LD methods that do not (![]()
![]()
![]()
The accuracy of genetic distances is important in all methods of estimating the age of an allele on the basis of patterns of LD. The genetic distances we used in our HFE example (Table 1) relied on the measured recombination rates in the HFE region, where the rate centromeric to HFE is estimated to be one-fifth the rate telomeric (![]()
![]()
Alternative demographic models:
Recent data on human variation indicate that complex demographic histories should be used as null models in the analysis of human population data (![]()
![]()
In a study of LD in 19 random genomic regions, ![]()
![]()
HFE and natural selection:
Independent data on HLA haplotypes suggest that the HFE 7633A allele has hitchhiked on a particularly large chromosomal region. The 7633A mutation is located in an intron and has no known effect on HFE levels or function. The inferred ancestral allele at HLA-B for the HFE 7633A allele class is B8. Most Caucasians that carry HLA-B8 share alleles on a common haplotype that extends throughout the HLA region and includes HLA A1. This extended haplotype has reached 10% frequency in northern European populations while remaining more or less intact (![]()
![]()
![]()
![]()
![]()
Direct positive selection on the C282Y mutation is a viable possibility, given what is known about its biological effect on HFE function (![]()
![]()
![]()
It should be possible to apply our method described here to several regions within a potentially selected long-range haplotype such as the HFE 7633A-HLA A1-B8 example above to find evidence of the specific target of selection. However, it is not always clear how results should be compared between loci. For example, we estimated the age of alleles at the HLA-A locus and the adjacent SSR D6S265 (data not shown). In each case, the alleles estimated to be ancestral on the C282Y and 7633A haplotypes either were the youngest of the set of alleles at the locus or appeared young given their frequency, consistent with the hypothesis that these two haplotypes have been affected by positive selection. Although at neither locus was the ancestral allele age for either haplotype younger than the age estimated for the respective HFE allele, it would not be appropriate to conclude that in both cases the target of selection is an allele at the HFE gene. The locus spacing of this study is more suited to producing accurate age estimates for alleles at HFE, with many close and highly polymorphic markers surrounding the gene. SSRs pose additional problems since one cannot assume that all sequences in an allele class are descended from a unique mutation event. The nature of allele designations at the HLA-A locus (our alleles are determined by serotype) and its high polymorphism level also make comparisons with HFE alleles difficult. Further work and the inclusion of additional markers in the analysis may provide more resolution to this question.
The possible action of negative selection on an allele must also be considered when explaining an allele age lower than expected given its frequency. Selection on an allele reduces the expected age of the allele given its frequency regardless of the direction of selection (![]()
![]()
![]()
![]()
Conclusion:
We have described a general method for testing human LD data vs. neutral theory predictions about the expected relationship between allele age and frequency under demographic models that can be simulated with the coalescent approach. With an effort now underway to determine the haplotype structure for the entire human genome, methods such as ours are likely to have widespread application. Humans have almost certainly undergone strong recent selection for many different traits, not the least of which is resistance to infectious disease, and methods to detect positively selected haplotypes might provide positional information about the selected allele, not unlike LD mapping itself. Although in most cases our method will not allow identification of the specific locus under positive selection, the number of loci in many cases will be small enough to allow a candidate to be identified. The human data that will be available soon, and the application of methods to detect selection, will dramatically improve our understanding of both human history and molecular evolutionary processes acting in natural populations.
| FOOTNOTES |
|---|
2 Present address: Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089-1340. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank A. Strahs and M. S. McPeek for providing the source code and assistance in implementing DHSMAP and R. Hudson for providing computer code used in the coalescent simulations. Thanks go to R. Hudson, E. Stahl, J. Comeron, M. Antezana, and M. S. McPeek for helpful discussions on the specific form of the simulations and tests of the algorithm's performance. Two reviewers provided helpful comments on the manuscript. This work was supported by National Institutes of Health (NIH) grants GM39355 to M.K., GM-59290 to L.B.J., R01 DK20630-21 and M01 RR 00064 to R.S.A. and J.P.K., and a National Science Foundation Doctoral Dissertation Improvement Grant (DEB-0073297) to C.T. and M.K. C.T. was partially supported by a Howard Hughes Medical Institute predoctoral fellowship and by NIH training grant no. T32 GM07197.
Manuscript received January 8, 2003; Accepted for publication May 21, 2003.
| LITERATURE CITED |
|---|
AJIOKA, R. S., L. B. JORDE, J. R. GRUEN, P. YU, and D. DIMITROVA et al., 1997 Haplotype analysis of hemochromatosis: evaluation of different linkage-disequilibrium approaches and evolution of disease chromosomes. Am. J. Hum. Genet. 60:1439-1447.[Medline]
ANDOLFATTO, P., J. D. WALL, and M. KREITMAN, 1999 Unusual haplotype structure at the proximal breakpoint of In(2L)t in a natural population of Drosophila melanogaster.. Genetics 153:1297-1311.
ARDLIE, K. G., L. KRUGLYAK, and M. SEIELSTAD, 2002 Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 3:299-309.[Medline]
BANDELT, H. J., P. FORSTER, B. C. SYKES, and M. B. RICHARDS, 1995 Mitochondrial portraits of human populations using median networks. Genetics 141:743-753.[Abstract]
BEUTLER, E., V. J. FELITTI, J. A. KOZIOL, N. J. HO, and T. GELBART, 2002 Penetrance of 845G
A (C282Y) HFE hereditary haemochromatosis mutation in the USA. Lancet 359:211-218.[Medline]
BUGAWAN, T. L., W. KLITZ, A. BLAIR, and H. A. ERLICH, 2000 High-resolution HLA class I typing in the CEPH families: analysis of linkage disequilibrium among HLA loci. Tissue Antigens 56:392-404.[Medline]
BULAJ, Z. J., L. M. GRIFFEN, L. B. JORDE, C. Q. EDWARDS, and J. P. KUSHNER, 1996 Clinical and biochemical abnormalities in people heterozygous for hemochromatosis. New Engl. J. Med. 335:1799-1805.
CARRINGTON, M., 1999 Recombination within the human MHC. Immunol. Rev. 167:245-256.[Medline]
EDWARDS, C. Q., L. M. GRIFFEN, D. GOLDGAR, C. DRUMMOND, and M. H. SKOLNICK et al., 1988 Prevalence of hemochromatosis among 11,065 presumably healthy blood donors. N. Engl. J. Med. 318:1355-1362.[Abstract]
FAIRBANKS, V. F., 2000 Hemochromatosis: population genetics, pp. 4250 in Hemochromatosis: Genetics, Pathophysiology, Diagnosis and Treatment, edited by J. C. BARTON and C. Q. EDWARDS. Cambridge University Press, Cambridge, UK.
FEDER, J. N., A. GNIRKE, W. THOMAS, Z. TSUCHIHASHI, and D. A. RUDDY et al., 1996 A novel MHC class I-like gene is mutated in patients with hereditary haemochromatosis. Nat. Genet. 13:399-408.[Medline]
FEDER, J. N., D. M. PENNY, A. IRRINKI, V. K. LEE, and J. A. LEBRÓN et al., 1998 The hemochromatosis gene product complexes with the transferrin receptor and lowers its affinity for ligand binding. Proc. Natl. Acad. Sci. USA 95:1472-1477.
GUO, S. W. and M. XIONG, 1997 Estimating the age of mutant disease alleles based on linkage disequilibrium. Hum. Hered. 47:315-337.[Medline]
HUDSON, R. R., 1990 Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7:1-44.
HUDSON, R. R., K. BAILEY, D. SKARECKY, J. KWIATOWSKI, and F. J. AYALA, 1994 Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster.. Genetics 136:1329-1340.[Abstract]
JEFFREYS, A. J., L. KAUPPI, and R. NEUMANN, 2001 Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex. Nat. Genet. 29:217-222.[Medline]
KIDD, K. K., 1987 Phenylketonuria: population genetics of a disease. Nature 327:282-283.[Medline]
KIMURA, M. and T. OHTA, 1973 The age of a neutral mutant persisting in a finite population. Genetics 75:199-212.
KIRBY, D. A. and W. STEPHAN, 1995 Haplotype test reveals departure from neutrality in a segment of the white gene of Drosophila melanogaster.. Genetics 141:1483-1490.[Abstract]
LIU, J. S., C. SABATTI, J. TENG, B. J. B. KEATS, and N. RISCH, 2001 Bayesian analysis of haplotypes for linkage disequilibrium mapping. Genome Res. 11:1716-1724.
LUCOTTE, G., 2001 Frequency analysis and allele map in favor of the Celtic origin of the C282Y mutation of hemochromatosis. Blood Cells Mol. Dis. 27:549-556.[Medline]
MALFROY, L., M. P. ROTH, M. CARRINGTON, N. BOROT, and A. VOLZ et al., 1997 Heterogeneity in rates of recombination in the 6-Mb region telomeric to the human major histocompatibility complex. Genomics 43:226-231.[Medline]
MARUYAMA, T., 1974 The age of an allele in a finite population. Genet. Res. 23:137-143.[Medline]
MAYNARD SMITH, J. and J. HAIGH, 1974 The hitch-hiking effect of a favourable gene. Genet. Res. 23:23-35.[Medline]
MCPEEK, M. S. and A. STRAHS, 1999 Assessment of linkage disequilibrium by the decay of haplotype sharing, with application to fine-scale genetic mapping. Am. J. Hum. Genet. 65:858-875.[Medline]
MERRYWEATHER-CLARKE, A. T., J. J. POINTON, J. D. SHEARMAN, and K. J. H. ROBSON, 1997 Global prevalence of putative haemochromatosis mutations. J. Med. Genet. 34:275-278.
MOTULSKY, A. G., 1964 Hereditary red cell traits and malaria. Am. J. Trop. Med. Hyg. 13:147-158.
NORDBORG, M. and S. TAVARÉ, 2002 Linkage disequilibrium: what history has to tell us. Trends Genet. 18:83-90.[Medline]
OHTA, T. and M. KIMURA, 1973 A model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a finite population. Genet. Res. 22:201-204.[Medline]
PICHON, L., G. CARN, P. BOURIC, T. GIFFON, and B. CHAUVEL et al., 1996 Structural analysis of the HLA-A/HLA-F subregion: precise localization of two new multigene families closely associated with the HLA class I sequences. Genomics 32:236-244.[Medline]
PRICE, P., C. WITT, R. ALLCOCK, D. SAYER, and M. GARLEPP et al., 1999 The genetic basis for the association of the 8.1 ancestral haplotype (A1, B8, DR3) with multiple immunolopathological diseases. Immunol. Rev. 167:257-274.[Medline]
PRITCHARD, J. K. and M. PRZEWORSKI, 2001 Linkage disequilibrium in humans: models and data. Am. J. Hum. Genet. 69:1-14.[Medline]
PRZEWORSKI, M., R. R. HUDSON, and A. DI RIENZO, 2000 Adjusting the focus on human variation. Trends Genet. 16:296-302.[Medline]
RANNALA, B. and J. P. REEVE, 2001 High-resolution multipoint linkage-disequilibrium mapping in the context of a human genome sequence. Am. J. Hum. Genet. 69:159-178.[Medline]
REICH, D. E., and D. B. GOLDSTEIN, 1999 Estimating the age of mutations using variation at linked markers, pp. 129138 in Microsatellites: Evolution and Applications, edited by D. B. GOLDSTEIN and C. SCHLÖTTERER. Oxford University Press, Oxford.
REICH, D. E., M. CARGILL, S. BOLK, J. IRELAND, and P. C. SABETI et al., 2001 Linkage disequilibrium in the human genome. Nature 411:199-204.[Medline]
RISCH, N., D. DE LEON, L. OZELIUS, P. KRAMER, and L. ALMASY et al., 1995 Genetic analysis of idiopathic torsion dystonia in Ashkenazi Jews and their recent descent from a small founder population. Nat. Genet. 9:152-159.[Medline]
ROMEO, G., M. DEVOTO, and L. J. V. GALIETTA, 1989 Why is the cystic fibrosis gene so frequent? Hum. Genet. 84:1-5.[Medline]
ROTTER, J. I. and J. M. DIAMOND, 1987 What maintains the frequencies of human genetic diseases? Nature 329:289-290.[Medline]
SABETI, P. C., D. E. REICH, J. M. HIGGINS, H. Z. P. LEVINE, and D. J. RICHTER et al., 2002 Detecting recent positive selection in the human genome from haplotype structure. Nature 419:832-837.[Medline]
SALTER-CID, L., A. BRUNMARK, Y. LI, D. LETURCQ, and P. A. PETERSON et al., 1999 Transferrin receptor is negatively modulated by the hemochromatosis protein HFE: implications for cellular iron homeostasis. Proc. Natl. Acad. Sci. USA 96:5434-5439.
SERRE, J. L., B. SIMON-BOUY, E. MORNET, B. JAUME-ROIG, and A. BALASSOPOULOU et al., 1990 Studies of RFLP closely linked to the cystic fibrosis locus throughout Europe lead to new considerations in population genetics. Hum. Genet. 84:449-454.[Medline]
SLATKIN, M., 1994 Linkage disequiibrium in growing and stable populations. Genetics 137:331-336.[Abstract]
SLATKIN, M. and G. BERTORELLE, 2001 The use of intraallelic variability for testing neutrality and estimating population growth rate. Genetics 158:865-874.
SLATKIN, M. and B. RANNALA, 2000 Estimating allele age. Annu. Rev. Genomics Hum. Genet. 1:225-249.[Medline]
TERMIJTELEN, A., J. D'AMARO, J. J. VAN ROOD, and G. M. T. SCHREUDER, 1995 Linkage disequilibrium in HLA cannot be explained by selective recombination. Tissue Antigens 46:387-390.[Medline]
TERWILLIGER, J. D., 1995 A powerful method for the analysis of linkage disequilibrium between trait loci and one or more polymorphic marker loci. Am. J. Hum. Genet. 56:777-787.[Medline]
THOMAS, W., A. FULLAN, D. B. LOEB, E. E. MCCLELLAND, and B. R. BACON et al., 1998 A haplotype and linkage disequilibrium analysis of the hereditary hemochromatosis gene region. Hum. Genet. 102:517-525.[Medline]
TOOMAJIAN, C. and M. KREITMAN, 2002 Sequence variation and haplotype structure at the human HFE locus. Genetics 161:1609-1623.
VALDES, A. M., M. SLATKIN, and N. B. FREIMER, 1993 Allele frequencies at microsatellite loci: the stepwise mutation model revisited. Genetics 133:737-749.[Abstract]
WORWOOD, M., R. RAHA CHOWDHURY, K. J. H. ROBSON, J. POINTON, and J. D. SHEARMAN et al., 1997 The HLA A1-B8 haplotype extends 6 Mb beyond HLA-A: associations between HLA-A, B, F and 15 microsatellite markers. Tissue Antigens 50:521-526.[Medline]
This article has been cited by other articles:
![]() |
P. Pfaffelhuber, A. Lehnert, and W. Stephan Linkage Disequilibrium Under Genetic Hitchhiking in Finite Populations Genetics, May 1, 2008; 179(1): 527 - 537. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Storz, M. Baze, J. L. Waite, F. G. Hoffmann, J. C. Opazo, and J. P. Hayes Complex Signatures of Selection and Gene Conversion in the Duplicated Globin Genes of House Mice Genetics, September 1, 2007; 177(1): 481 - 500. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Zeng, S. Mano, S. Shi, and C.-I Wu Comparisons of Site- and Haplotype-Frequency Methods for Detecting Positive Selection Mol. Biol. Evol., July 1, 2007; 24(7): 1562 - 1574. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Nair, D. Nash, D. Sudimack, A. Jaidee, M. Barends, A.-C. Uhlemann, S. Krishna, F. Nosten, and T. J. C. Ander |





