Genetics, Vol. 150, 1341-1348, December 1998, Copyright © 1998

Detecting Linkage Disequilibrium in Bacterial Populations

Bernhard Haubolda, Michael Travisanoa, Paul B. Raineya, and Richard R. Hudsonb
a Department of Plant Sciences, University of Oxford, Oxford OX1 3RB, United Kingdom
b Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92717

Corresponding author: Bernhard Haubold, Max-Planck-Institut für Chemische Ökologie, Tatzendpromenade 1a, D-07745 Jena, Germany., haubold{at}ice.mpg.de (E-mail).

Communicating editor: P. L. FOSTER


*  ABSTRACT
*TOP
*ABSTRACT
*THE TRADITIONAL METHOD OF...
*COMPUTING THE VARIANCE OF...
*RESULTS AND DISCUSSION
*LITERATURE CITED

The distribution of the number of pairwise differences calculated from comparisons between n haploid genomes has frequently been used as a starting point for testing the hypothesis of linkage equilibrium. For this purpose the variance of the pairwise differences, VD, is used as a test statistic to evaluate the null hypothesis that all loci are in linkage equilibrium. The problem is to determine the critical value of the distribution of VD. This critical value can be estimated either by Monte Carlo simulation or by assuming that VD is distributed normally and calculating a one-tailed 95% critical value for VD, L, L = E(VD) + 1.645 , where E(VD) is the expectation of VD, and Var(VD) is the variance of VD. If VD (observed) > L, the null hypothesis of linkage equilibrium is rejected. Using Monte Carlo simulation we show that the formula currently available for Var(VD) is incorrect, especially for genetically highly diverse data. This has implications for hypothesis testing in bacterial populations, which are often genetically highly diverse. For this reason we derive a new, exact formula for Var(VD). The distribution of VD is examined and shown to approach normality as the sample size increases. This makes the new formula a useful tool in the investigation of large data sets, where testing for linkage using Monte Carlo simulation can be very time consuming. Application of the new formula, in conjunction with Monte Carlo simulation, to populations of Bradyrhizobium japonicum, Rhizobium leguminosarum, and Bacillus subtilis reveals linkage disequilibrium where linkage equilibrium has previously been reported.


BACTERIA might be called "facultative sexuals" because they can exchange genetic material through conjugation, transformation, and transduction, but genetic exchange is not a part of their reproductive mode. Just how frequently recombination takes place in bacteria has been a topic of debate since the first major study of bacterial population genetics, in which Escherichia coli genomes were assumed to recombine frequently leading to linkage equilibrium (MILKMAN 1973 Down). SELANDER and LEVIN 1980 Down showed that this assumption was incorrect and that E. coli populations consisted of many asexual clones evolving in genetic isolation from all other clones comprising the species (cf. MARUYAMA and KIMURA 1980 Down, but see GUTTMAN and DYKHUIZEN 1994 Down). During the 1980s this clonal model was thought to hold for all bacterial populations until ISTOCK et al. 1992 Down reported that a local population of Bacillus subtilis was in linkage equilibrium and argued that this resulted from frequent mixis. In addition to B. subtilis, linkage equilibrium has been reported for Neisseria gonorrhoeae (O'ROURKE and STEVENS 1993 Down), subpopulations of Rhizobium (SOUZA et al. 1992 Down; MAYNARD SMITH et al. 1993 Down; BOTTOMLEY et al. 1994 Down; STRAIN et al. 1995 Down), Burkholderia cepacia (WISE et al. 1995 Down), Helicobacter pylori (GO et al. 1996 Down), and fluorescent Pseudomonas (HAUBOLD and RAINEY 1996 Down).

The conclusion of linkage equilibrium reached in these studies is based on the variance of the distribution of the number of pairwise differences (VD) among bacterial isolates that have been subjected to genetic analysis at multiple loci. VD can be compared to a critical value obtained under the null hypothesis that all loci are in linkage equilibrium. This approach was first developed by BROWN et al. 1980 Down, who applied it to allozyme data from wild barley, Hordeum spontaneum. WHITTAM et al. 1983 Down pioneered its use in bacterial population genetics, and more recently this method served as the basis for an extensive comparative study of bacterial population structure (MAYNARD SMITH et al. 1993 Down).

There are two methods of calculating a critical value for VD. (1) The null distribution of VD can be simulated on a computer, and (2) assuming the null distribution of VD is normal, a critical value can be calculated by the well-known method of adding x standard deviations to E(VD). But, as it is not known whether the null distribution of VD is normal, Monte Carlo simulation has recently emerged as the preferred way for testing linkage equilibrium in bacterial populations (SOUZA et al. 1992 Down; WISE et al. 1995 Down; HAUBOLD and RAINEY 1996 Down). However, this approach is computationally intensive and many workers have preferred to use the simplifying assumption of normality for hypothesis testing. In this case the correct test depends above all on an accurate estimator of the variance of VD.


*  THE TRADITIONAL METHOD OF COMPUTING THE VARIANCE OF VD
*TOP
*ABSTRACT
*THE TRADITIONAL METHOD OF...
*COMPUTING THE VARIANCE OF...
*RESULTS AND DISCUSSION
*LITERATURE CITED

Suppose we have n sampled haploid individuals, arbitrarily numbered from 1 to n, that have been genetically assayed at q loci. Let dij denote the number of loci at which individuals i and j differ. Then the variance of pairwise differences is by definition equal to

(1)
where

(2)

The distribution of VD depends on how replicate samples would be generated. In this article, we assume that replicate samples are generated by randomly shuffling the original alleles among the sampled haplotypes. In this way, the numbers of alleles and the frequencies of the alleles at individual loci are exactly the same in each replicate as in the original sample, but there is no statistical association of alleles on haplotypes except that which arises by chance. This shuffling method is the method suggested by SOUZA et al. 1992 Down. The distribution of VD under this randomization is taken to be our null distribution. We note that the distribution of VD would be slightly different if sampling were done with replacement. Under our randomization scheme the expectation of VD is

(3)
where

(4)
and where pij is the frequency in the sample of the ith allele at the jth locus. We note that hj is an unbiased estimator of the population genetic diversity.

BROWN et al. 1980 Down suggested that the one-tailed 95% critical value for VD could be calculated assuming that the distribution of VD is normal. Thus they estimated this critical value by

(5)
where Var(VD)old is an estimate of the variance of VD calculated as

(6)
where

(7)

(BROWN et al. 1980 Down).

In the next section we derive a formula for the variance of VD under the randomization scheme of SOUZA et al. 1992 Down and show that (6) is inappropriate for calculating the variance of VD under these circumstances.


*  COMPUTING THE VARIANCE OF VD
*TOP
*ABSTRACT
*THE TRADITIONAL METHOD OF...
*COMPUTING THE VARIANCE OF...
*RESULTS AND DISCUSSION
*LITERATURE CITED

In this section we obtain an exact expression for the variance of VD under the shuffling of alleles across individuals (the sampling without replacement method; see also HUDSON 1994 Down). In the following, dij denotes the random number of loci at which individual i and j differ in a shuffled sample. First we write VD in terms of sij, the number of loci at which individuals i and j are identical. Noting that sij = q - dij, it follows that

(8)
where

(9)

Because under the randomization scheme that we are considering is a constant, it follows that

(10)
where E denotes expectation under the randomization scheme.

We now proceed to derive expressions for each of the terms on the right-hand side of the last line of Equation 10. Let xk be an indicator variable, equal to one if individual 1 and individual 2 are identical at locus k, and zero otherwise. Then

(11)
and

(12)
where {phi}k is the probability that two randomly chosen individuals are identical at locus k. For our case,

(13)
where pmk is the frequency of the mth allele at the kth locus in the original sample, and the sum is over all alleles at locus k. Similarly,

(14)

To calculate E(s4ij), we write

(15)

To arrive at the last line, we have used the fact that an indicator variable to any power is equal to the indicator variable itself. (For example, x4k = xk.) We have also made use of the fact that xk is independent of xj, for j != k. We show later that the double, triple, and quadruple sums on the last line of (15) can be written as single sums and products of single sums of terms involving powers of the {phi}i's.

Similarly, to calculate the other terms in (10) we define zk to be one if individuals 3 and 4 are identical at locus k and zero otherwise, and we define yk to be one if individuals 1 and 3 are identical at locus k. It follows that

(16)
where {Delta}k is the probability that individuals 1 and 2 are identical at locus k and individuals 3 and 4 are also identical at this locus. Recall that alleles are assigned to individuals randomly without replacement, so

Similarly,

(17)
where {Gamma} k is the probability that individuals 1, 2, and 3 are identical to each other at locus k,

One can now calculate Var(VD) using (10) together with (15), (16), and (17).

We can write the results in a way that does not require double, triple, or quadruple sums. For example, note that

In a similar fashion, the other multiple sums can be reduced to terms involving the following single sums:

After some manipulation, the result is

(18)

Finally, we define an ~95% critical value as

(19)


*  RESULTS AND DISCUSSION
*TOP
*ABSTRACT
*THE TRADITIONAL METHOD OF...
*COMPUTING THE VARIANCE OF...
*RESULTS AND DISCUSSION
*LITERATURE CITED

To convince ourselves of the correctness of the above algebra and to demonstrate the inadequacy of Var(VD)old we used Monte Carlo simulations. Eleven artificial samples were constructed in the following way: The first data set containing 100 strains and 10 loci with five alleles at each locus was constructed from 96 strains of genotype

and one each of genotype




The second data set was made up of 88 strains of the major genotype and 3 strains of each of the minor genotypes and so on until a data set of maximum genetic diversity was reached consisting of 20 strains of each genotype. In this way we obtained artificial data sets with genetic diversities ranging from 0.078 to 0.8, which represent the range of genetic diversities to which the test developed by BROWN et al. 1980 Down has been applied. The completely linked artificial data sets were then unlinked by one round of resampling without replacement.

For each sample, Var(VD)old and Var(VD) were computed (using Equation 6 and Equation 10, respectively). In addition, the randomization method suggested by SOUZA et al. 1992 Down was applied to each sample. That is, the alleles at each locus were shuffled randomly (resampling without replacement) and VD calculated for each of 10,000 such shuffled samples. This allowed the calculation of the simulated sampling variance of VD, Var(VD)MC.

When Var(VD)old was compared with Var(VD)MC, it was found that the two values diverged dramatically for input matrices of high genetic diversity (Figure 1). This causes similar divergence between true and estimated critical values (data not shown) and has implications for testing linkage equilibrium in bacterial populations that will be discussed later. Clearly, Equation 6 should not be used. No discrepancies were found between Var(VD)MC and the variance calculated with Equation 10 (see Figure 1).



View larger version (15K):
In this window
In a new window
Download PPT slide
 
Figure 1. Comparison between three methods of computing the variance of VD,Var(VD). Single random input matrices with 100 strains and 10 loci, each locus with the same genetic diversity (as described in the text), were analyzed and Var(VD) computed according to Equation 6 ({bullet}), by resampling 10,000 times without replacement ({triangleup}), or by using Equation 10 ({square}).

The usefulness of (19) for hypothesis testing depends on whether the distribution of VD is approximately normal under our null hypothesis of linkage equilibrium with replicates being produced by shuffling of alleles on haplotypes. For multilocus data sets there are three variables that may influence the shape of the distribution of VD, the number of loci, the degree of diversity at each locus, and the number of strains. We investigated the effect of these three variables on the skewness of the distribution of VD through Monte Carlo simulation by calculating g1 as a measure of skewness from sets of resampled VD values,

(20)
where m3 and m2 are the second and third moment of the distribution of VD (SOKAL and ROHLF 1981 Down, p. 114). For a normal distribution g1 = 0; a positive g1 indicates skewness to the right, a negative g1, skewness to the left. We found that the distribution of VD always had positive skewness, that is, at the upper extreme of the distribution, slightly more values lie beyond the normal critical values (Figure 2). This was not affected by the number of loci (data not shown). In contrast, the degree of genetic diversity at each locus had a strong effect on the shape of the distribution. On the whole, the greater the genetic diversity, the closer the distribution was to normality, but this relation was not linear with the strongest changes occurring at the extreme values of mean genetic diversity (; Figure 3). Sample size also had a strong effect on skewness. In general, the larger the sample, the closer the sampling distribution of VD approached normality (Table 1).



View larger version (11K):
In this window
In a new window
Download PPT slide
 
Figure 2. Cumulative probability plot of 2500 resampled VD values. The values expected if the distribution was normal (-) and those observed ({circ}) diverge at both extremes of the distribution, although for testing the hypothesis of linkage equilibrium only the positive skew apparent in the high cumulative probability values is of interest. The resampled artificial input data set consisted of 100 strains and 10 loci, each with a genetic diversity of 0.558.



View larger version (10K):
In this window
In a new window
Download PPT slide
 
Figure 3. Skewness of the distribution of VD (g1) as a function of mean genetic diversity (). Single random input matrices with 100 strains and 10 loci, each locus with the same genetic diversity, were resampled 10,000 times without replacement.


 
View this table:
In this window
In a new window

 
Table 1. Relationship between skewness and the probability of exceeding normal critical values for various levels of significance

Given that the distribution of VD has positive skewness even for large samples, we investigated the effect of this deviation from normality on hypothesis testing. Data sets consisting of between 15 and 480 strains and 10 loci, each with genetic diversity of 0.444, were resampled to calculate the frequency with which VD exceeded the critical values that would be obtained if the distribution of VD was normal. Even for small data sets the discrepancy was slight. For instance, with 15 strains 6.69% of the resampled VD values exceeded the 5% normal critical value (Table 1). For a sample of 480 strains the discrepancy between 5.13% and 5.0% was negligible. Note that the probabilities of exceeding the normal critical values were always slightly too large, as would be expected from the positive skewness of the distribution of VD. For real data this means that whenever a sample has been diagnosed as being in linkage equilibrium, the same conclusion would be reached by Monte Carlo simulation. Further, the more time consuming it becomes to test the hypothesis of linkage equilibrium due to large sample size, the more useful our formula becomes. This is because the sampling distribution of VD approaches normality for large samples.

Several recent reports of panmixis in bacteria have used the observed variance of pairwise differences (VD) as a test statistic. Panmixis was concluded if the critical value of VD was greater than the observed value of VD (MAYNARD SMITH et al. 1993 Down; BOTTOMLEY et al. 1994 Down; DUNCAN et al. 1994 Down; STRAIN et al. 1995 Down; GO et al. 1996 Down). The original method to calculate the critical value was devised for plant populations, which are only moderately diverse [e.g., (H. spontaneum) = 0.145 (BROWN et al. 1980 Down)], compared to bacterial populations (cf. Table 1). In this study we showed by Monte Carlo simulation that high genetic diversity leads to an artificial inflation of Var(VD)old (Figure 1). This problem was overcome by rederiving Var(VD) (Equation 10; Figure 1).

Bacterial populations:
To test the usefulness of this derivation in the study of bacterial population genetics, we investigated published allozyme data for the ECOR collection of E. coli (OCHMAN and SELANDER 1984 Down), which is a well-known example of a clonal population (MILLER and HARTL 1986 Down). In addition, data sets from Bradyrhizobium japonicum, B. subtilis, and Rhizobium leguminosarum were included in the analysis, because for these populations claims of linkage equilibrium have been based on incorrect formulas for the variance of VD. Finally, an allozyme data set from N. gonorrhoeae was reexamined, as this taxon is considered a prime example of a sexual bacterial population (MAYNARD SMITH et al. 1993 Down; O'ROURKE and STEVENS 1993 Down).

Generally we observed that bacterial populations are highly diverse ( = 0.311 to 0.691; Table 2) and that the genetic diversity varies strongly between loci (standard deviation = 0.178 to 0.304; Table 2). Further, the distribution of VD displayed positive skewness in all cases, as observed in the simulations (Table 2).


 
View this table:
In this window
In a new window

 
Table 2. Assessment of multilocus structure in bacterial populations and comparison of old and new estimators of the critical values of VD

E. coli:
As expected from previous work (MILLER and HARTL 1986 Down), the electrophoretic types of the ECOR collection of E. coli are in linkage disequilibrium when the critical value obtained through the Monte Carlo process, LMC, is compared to VD(LMC < VD; Table 2). Further, Lnew (= 2.592) is a good estimator of LMC (= 2.608), while Lold (= 2.985) not only overestimates the critical value of VD, but would also lead to the spurious conclusion that E. coli is in linkage equilibrium as Lold > VD (Table 2).

B. japonicum:
BOTTOMLEY et al. 1994 Down reported linkage equilibrium for a B. japonicum population represented by 17 electrophoretic types. This claim is clearly rejected by Monte Carlo simulation, which shows significant linkage for this population (LMC = 2.593 < VD = 3.985; Table 2). The same conclusion is reached by comparing Lnew (= 2.557) with VD. Surprisingly, VD also exceeds Lold, on which the original claim of linkage equilibrium had been based. This discrepancy is resolved if Lold is calculated on the basis of the biased estimator

rather than on the unbiased estimator (Equation 4) employed in this study. Using hbj, Lold = 3.996, which is slightly greater than VD = 3.985. This result is due to the large difference between biased and unbiased estimators of the genetic diversity per locus in a sample consisting of only 17 ETs.

R. leguminosarum:
STRAIN et al. 1995 Down obtained evidence of linkage disequilibrium in their U.K. population of R. leguminosarum by using Monte Carlo simulation, but H0 was not rejected on the basis of Lold. We obtained the same result, reinforcing the inappropriateness of Lold for hypothesis testing. We further found that Lnew (= 2.911) was again a good alternative to the lengthy calculations necessary for obtaining LMC (= 2.967; Table 2) through simulation. STRAIN et al. 1995 Down also analyzed groups I + III + IX and I + III of their R. leguminosarum U.K. population and reported linkage equilibrium for both subpopulations. We found that H0 is rejected for groups I + III + IX and I + III on the basis of LMC and Lnew (Table 2).

B. subtilis:
DUNCAN et al. 1994 Down reported linkage equilibrium for the 50 electrophoretic types of B. subtilis contained in the B and D subdivisions of their sample. In contrast, we found that the combined electrophoretic types of groups B and D display strong linkage (Table 2) with VD (= 4.128) far exceeding LMC (= 2.422) and Lnew (= 2.397). Group D on its own is also not in linkage equilibrium with LMC and Lnew < VD, but note that as for E. coli, R. leguminosarum, and B. japonicum, application of Lold would lead to an inappropriate conclusion of linkage equilibrium. We further concluded on the basis of LMC (= 2.664) and Lnew (= 2.605) that group B is indeed in linkage equilibrium (Table 2).

N. gonorrhoeae:
This group of bacteria is the best established example of a bacterial population in linkage equilibrium. An extensive allozyme data set comprising 228 isolates has been published and reported to be in linkage equilibrium (MAYNARD SMITH et al. 1993 Down; O'ROURKE and STEVENS 1993 Down). Moreover, N. gonorrhoeae is naturally competent and frequently encounters different genotypes of the taxon due to the sexual habits of its host. As expected, we found that this population is in linkage equilibrium according to LMC (= 1.837); Lnew (= 1.831) gave the same result, further confirming the usefulness of this algebraic confidence limit (Table 2).

For all the bacterial populations tested, LMC and Lnew agreed well. This contrasted with the strong divergence of Lold from LMC, which led to conflicting conclusions about the genetic structure of E. coli, B. japonicum, R. leguminosarum, and B. subtilis. Using computer simulations, MAYNARD SMITH 1994 Down showed that a recombination rate only 20 times the rate of mutation was sufficient to unlink bacterial genomes. The detection of linkage disequilibrium in the soil-dwelling populations of B. japonicum, R. leguminosarum, and B. subtilis presented in this article indicates that the recombination rates in these groups are probably very low. This has also been found experimentally for B. subtilis (ROBERTS and COHAN 1995 Down).

We conclude that past attempts to detect linkage disequilibrium in haploid multilocus data sets through the computation of a critical value for VD were based on an erroneous formula for the variance of VD. The correct formula for Var(VD) communicated in this article forms the basis of a simple test of linkage. Furthermore, we find that VD is approximately normally distributed (especially for large samples). Hence the algebraic test proposed here is a useful alternative to Monte Carlo simulation in cases where simulation is deemed too expensive or time consuming. A computer program written in FORTRAN77, which implements both the algebraic as well as the Monte Carlo test, can be obtained from B.H. upon request.


*  ACKNOWLEDGMENTS

We thank J. Maynard Smith for first drawing our attention to the problem of testing linkage equilibrium from mismatch data and for helpful discussion. Thanks are also due to P. J. Bottomley for providing the Rhizobium allozyme data, and to T. S. Whittam and two anonymous reviewers for comments on the manuscript. This work was supported by grants from the Royal Society, Oxford University and the Biotechnology and Biological Sciences Research Council (United Kingdom).

Manuscript received April 2, 1998; Accepted for publication August 21, 1998.


*  LITERATURE CITED
*TOP
*ABSTRACT
*THE TRADITIONAL METHOD OF...
*COMPUTING THE VARIANCE OF...
*RESULTS AND DISCUSSION
*LITERATURE CITED

BOTTOMLEY, P. J., H.-H. CHENG, and S. R. STRAIN, 1994  Genetic structure and symbiotic characteristics of a Bradyrhizobium population recovered from a pasture soil. Appl. Environ. Microbiol. 60:1754-1761[Abstract/Free Full Text].

BROWN, A. H. D., M. W. FELDMAN, and E. NEVO, 1980  Multilocus structure of natural populations of Hordeum spontaneum.. Genetics 96:523-536[Abstract/Free Full Text].

DUNCAN, K. E., N. FERGUSON, K. KIMURA, X. ZHOU, and C. ISTOCK, 1994  Fine-scale genetic and phenotypic structure in natural populations of Bacillus subtilis and Bacillus licheniformis: implications for bacterial evolution and speciation. Evolution 48:2002-2025.

GO, M. F., V. KAPURA, D. Y. GRAHAM, and J. M. MUSSER, 1996  Population genetic analysis of Helicobacter pylori by multilocus enzyme electrophoresis: extensive allelic diversity and recombinational population structure. J. Bacteriol. 178:3934-3938[Abstract/Free Full Text].

GUTTMAN, D. S. and D. E. DYKHUIZEN, 1994  Clonal divergence in Escherichia coli as a result of recombination, not mutation. Science 266:1380-1383[Abstract/Free Full Text].

HAUBOLD, B. and P. B. RAINEY, 1996  Genetic and ecotypic structure of a fluorescent Pseudomonas population. Mol. Ecol. 5:747-761.

HUDSON, R. R., 1994  Analytical results concerning linkage disequilibrium in models with genetic transformation and conjugation. J. Evol. Biol. 7:535-548.

ISTOCK, C. A., K. E. DUNCAN, N. FERGUSON, and X. ZHOU, 1992  Sexuality in a natural population of bacteria: Bacillus subtilis challenges the clonal paradigm. Mol. Ecol. 1:95-103[Medline].

MARUYAMA, T. and M. KIMURA, 1980  Genetic variability and effective population size when local extinction and recolonization of subpopulations are frequent. Proc. Natl. Acad. Sci. USA 77:6710-6714[Abstract/Free Full Text].

MAYNARD SMITH, J., 1994  Estimating the minimum rate of genetic transformation in bacteria. J. Evol. Biol. 7:525-534.

MAYNARD SMITH, J., N. H. SMITH, C. G. DOWSON, and B. G. SPRATT, 1993  How clonal are bacteria? Proc. Natl. Acad. Sci. USA 90:4384-4388[Abstract/Free Full Text].

MILKMAN, R., 1973  Electrophoretic variation in Escherichia coli from natural sources. Science 182:1024-1026[Abstract/Free Full Text].

MILLER, R. D. and D. L. HARTL, 1986  Biotyping confirms a nearly clonal population structure in Escherichia coli.. Evolution 40:1-12.

OCHMAN, H. and R. K. SELANDER, 1984  Standard reference strains of Escherichia coli from natural populations. J. Bacteriol. 157:690-693[Abstract/Free Full Text].

O'ROURKE, M. and E. STEVENS, 1993  Genetic structure of Neisseria gonorrhoeae populations: a non-clonal pathogen. J. Gen. Microbiol. 139:2603-2611[Medline].

ROBERTS, M. S. and F. M. COHAN, 1995  Recombination and migration rates in natural populations of Bacillus subtilis and Bacillus mojavensis.. Evolution 49:1081-1094.

SELANDER, R. K. and B. R. LEVIN, 1980  Genetic diversity and structure in Escherichia coli populations. Science 210:545-547[Abstract/Free Full Text].

SOKAL, R. R., AND F. J. ROHLF, 1981 Biometry, Ed. 2. W. H. Freeman, New York.

SOUZA, V., T. T. NGUYEN, R. R. HUDSON, D. PIÑERO, and R. E. LENSKI, 1992  Hierarchical analysis of linkage disequilibrium in Rhizobium populations: evidence for sex? Proc. Natl. Acad. Sci. USA 89:8389-8393[Abstract/Free Full Text].

STRAIN, S. R., T. S. WHITTAM, and P. J. BOTTOMLEY, 1995  Analysis of genetic structure in soil populations of Rhizobium leguminosarum recovered from the USA and the UK. Mol. Ecol. 4:105-114.

WHITTAM, T. S., H. OCHMAN, and R. K. SELANDER, 1983  Multilocus genetic structure in natural populations of Escherichia coli.. Proc. Natl. Acad. Sci. USA 80:1751-1755[Abstract/Free Full Text].

WISE, M. G., L. J. SHIMKETS, and J. V. MCARTHUR, 1995  Genetic structure of a lotic population of Burkholderia (Pseudomonas) cepacia.. Appl. Environ. Microbiol. 61:1791-1798[Abstract].




This article has been cited by other articles:


Home page
Appl. Environ. Microbiol.Home page
T. Tomita, B. Meehan, N. Wongkattiya, J. Malmo, G. Pullinger, J. Leigh, and M. Deighton
Identification of Streptococcus uberis Multilocus Sequence Types Highly Associated with Mastitis
Appl. Envir. Microbiol., January 1, 2008; 74(1): 114 - 124.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
T. Wang, Y. Su, and G. Chen
Population Genetic Variation and Structure of the Invasive Weed Mikania micrantha in Southern China: Consequences of Rapid Range Expansion
J. Hered., January 1, 2008; 99(1): 22 - 33.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
J. A. Castillo and J. T. Greenberg
Evolutionary Dynamics of Ralstonia solanacearum
Appl. Envir. Microbiol., February 15, 2007; 73(4): 1225 - 1238.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
S. Tanriverdi, A. Markovics, M. O. Arslan, A. Itik, V. Shkap, and G. Widmer
Emergence of Distinct Genotypes of Cryptosporidium parvum in Structured Host Populations
Appl. Envir. Microbiol., April 1, 2006; 72(4): 2507 - 2513.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
E. M. Goss, M. Kreitman, and J. Bergelson
Genetic Diversity, Recombination and Cryptic Clades in Pseudomonas viridiflava Infecting Natural Populations of Arabidopsis thaliana
Genetics, January 1, 2005; 169(1): 21 - 35.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Das, S. Mohanty, and W. Stephan
Inferring the Population Structure and Demography of Drosophila ananassae From Multilocus Data
Genetics, December 1, 2004; 168(4): 1975 - 1985.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Johansson, J. Farlow, P. Larsson, M. Dukerich, E. Chambers, M. Bystrom, J. Fox, M. Chu, M. Forsman, A. Sjostedt, et al.
Worldwide Genetic Relationships among Francisella tularensis Isolates Determined by Multiple-Locus Variable-Number Tandem Repeat Analysis
J. Bacteriol., September 1, 2004; 186(17): 5808 - 5818.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
A. Koehler, H. Karch, T. Beikler, Thomas. F. Flemmig, S. Suerbaum, and H. Schmidt
Multilocus sequence analysis of Porphyromonas gingivalis indicates frequent recombination
Microbiology, September 1, 2003; 149(9): 2407 - 2415.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
J. Vogel, P. Normand, J. Thioulouse, X. Nesme, and G. L. Grundmann
Relationship between Spatial and Genetic Distance in Agrobacterium spp. in 1 Cubic Centimeter of Soil
Appl. Envir. Microbiol., March 1, 2003; 69(3): 1482 - 1487.
[Abstract] [Full Text] [PDF]


Home page
Am J Trop Med HygHome page
P. DURAND, Y. MICHALAKIS, S. CESTIER, B. OURY, M.C. LECLERC, M. TIBAYRENC, and F. RENAUD
SIGNIFICANT LINKAGE DISEQUILIBRIUM AND HIGH GENETIC DIVERSITY IN A POPULATION OF PLASMODIUM FALCIPARUM FROM AN AREA (REPUBLIC OF THE CONGO) HIGHLY ENDEMIC FOR MALARIA
Am J Trop Med Hyg, March 1, 2003; 68(3): 345 - 349.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
T. Coenye and J. J. LiPuma
Population structure analysis of Burkholderia cepacia genomovar III: varying degrees of genetic recombination characterize major clonal complexes
Microbiology, January 1, 2003; 149(1): 77 - 88.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
B. Haubold, J. Kroymann, A. Ratzka, T. Mitchell-Olds, and T. Wiehe
Recombination and Gene Conversion in a 170-kb Genomic Region of Arabidopsis thaliana
Genetics, July 1, 2002; 161(3): 1269 - 1278.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Microbiol.Home page
T. de Meeus, F. Renaud, E. Mouveroux, J. Reynes, G. Galeazzi, M. Mallie, and J. M. Bastide
Genetic Structure of Candida glabrata Populations in AIDS and Non-AIDS Patients
J. Clin. Microbiol., June 1, 2002; 40(6): 2199 - 2206.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. Suerbaum, M. Lohrengel, A. Sonnevend, F. Ruberg, and M. Kist
Allelic Diversity and Recombination in Campylobacter jejuni
J. Bacteriol., April 15, 2001; 183(8): 2553 - 2559.
[Abstract] [Full Text]


Home page
MicrobiologyHome page
M. G. Lorenz and J. Sikorski
The potential for intraspecific horizontal gene exchange by natural genetic transformation: sexual isolation among genomovars of Pseudomonas stutzeri
Microbiology, December 1, 2000; 146(12): 3081 - 3090.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R.-C. Yang
Zygotic Associations and Multilocus Statistics in a Nonequilibrium Diploid Population
Genetics, July 1, 2000; 155(3): 1449 - 1458.
[Abstract] [Full Text]


Home page
GeneticsHome page
J. M. Smith
The Detection and Measurement of Recombination From Sequence Data
Genetics, October 1, 1999; 153(2): 1021 - 1027.
[Abstract] [Full Text] [PDF]