Genetics, Vol. 157, 149-161, January 2001, Copyright © 2001

Polymorphism at the Ribosomal DNA Spacers and Its Relation to Breeding Structure of the Widespread Mushroom Schizophyllum commune

Timothy Y. Jamesa, Jean-Marc Moncalvoa, Sean Li1,a, and Rytas Vilgalysa
a Department of Botany, Duke University, Durham, North Carolina 27708-0338

Corresponding author: Timothy Y. James, Department of Botany, Box 90338, Duke University, Durham, NC 27708-0338., tyj2{at}duke.edu (E-mail)

Communicating editor: S. YOKOYAMA


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The common split-gilled mushroom Schizophyllum commune is found throughout the world on woody substrates. This study addresses the dispersal and population structure of this fungal species by studying the phylogeny and evolutionary dynamics of ribosomal DNA (rDNA) spacer regions. Extensive sampling (n = 195) of sequences of the intergenic spacer region (IGS1) revealed a large number of unique haplotypes (n = 143). The phylogeny of these IGS1 sequences revealed strong geographic patterns and supported three evolutionarily distinct lineages within the global population. The same three geographic lineages were found in phylogenetic analysis of both other rDNA spacer regions (IGS2 and ITS). However, nested clade analysis of the IGS1 phylogeny suggested the population structure of S. commune has undergone recent changes, such as a long distance colonization of western North America from Europe as well as a recent range expansion in the Caribbean. Among all spacer regions, variation in length and nucleotide sequence was observed between but not within the tandem rDNA repeats (arrays). This pattern is consistent with strong within-array and weak among-array homogenizing forces. We present evidence for the suppression of recombination between rDNA arrays on homologous chromosomes that may account for this pattern of concerted evolution.


UNDERSTANDING genes that are important in determining a species' distribution is a principal goal of ecological genetics. A related question asks how a gene is distributed throughout a species' range. With this latter information in hand, it becomes possible to speculate how life history traits influence the spatial distribution of a gene. We have applied this approach in studying the distribution of genes throughout the range of the cosmopolitan mushroom, Schizophyllum commune Fr. Species with cosmopolitan distributions are atypical, and understanding what genetic and life history traits are important in maintaining the global success of these species may suggest basic principles in understanding other species' distributions. Our primary question has been to resolve whether or not long-distance spore dispersal is important in determining the population genetics and widespread distribution of this common fungus.

S. commune is a model organism of the class Basidiomycota that includes the mushroom-forming, rust, and smut fungi. Most research on S. commune has focused on its complex mating system in which haploid individuals must be heteroallelic at two unlinked factors (the A and B incompatibility loci) for mating to occur. Interpopulation compatibility tests have demonstrated that S. commune is a single biological species (RAPER et al. 1958 Down). The species has been found as a wood decomposer of over 150 genera of flowering plants (COOKE 1961 Down) and is well documented as a wound parasite of trees and as a human pathogen of minor but increasing importance (RIHS et al. 1996 Down).

The global distribution of several S. commune genes has been previously determined. The first genes studied were the A and B incompatibility loci. Through extensive genetic crosses, RAPER et al. 1958 Down determined that the numerous mating types (alleles) were randomly distributed throughout the world. The random distribution of mating types in S. commune was generally accepted as the result of widespread dispersal of S. commune spores. Although long-distance airborne dispersal of fungal spores is well documented in literature (PADY and KAPICA 1955 Down; HIRST and HURST 1967 Down; INGOLD 1971 Down), such dispersal may not be necessary to explain the mating type distributions of S. commune. This is primarily because incompatibility loci in fungi are strongly influenced by a form of negative frequency-dependent selection that selects for rare mating types and tends to maintain a large number of mating types at near-equal frequencies (MAY et al. 1999 Down). This same selective pressure tends to equilibrate allele frequencies among populations (WRIGHT 1939 Down; ZAMBINO et al. 1997 Down).

Recently, results from allozyme markers (JAMES et al. 1999 Down) have challenged the notion that long-distance dispersal affects the population genetic structure of S. commune. These results demonstrated that most geographically disjunct populations are genetically near-isolated and hence experience little exchange through spores. Allelic data such as those derived from allozymes are often used to infer patterns of population structure and gene flow among samples. One drawback from such an approach is that historical processes, such as founder or vicariant events, can be difficult to disentangle from patterns resulting from a current genetic drift/gene flow equilibrium (FELSENSTEIN 1982 Down; NEIGEL 1997 Down). As an alternative to protein electrophoretic data, DNA sequence variation has the advantage that the history of the alleles or haplotypes can be inferred through phylogenetic methods. By knowing the phylogenetic history of the alleles at a locus, there is promise of separating the past demographic events among a group of populations from current patterns of migration (NEIGEL 1997 Down; TEMPLETON 1998 Down).

In the fungi, the ribosomal RNA coding cistron (rDNA) has been widely utilized for molecular systematic studies (e.g., WHITE et al. 1990 Down; MONCALVO et al. 2000 Down) as well as population genetics (e.g., RAMSFIELD et al. 1996 Down; CARBONE et al. 1999 Down). At the rDNA locus the amount of variation among copies within a single species is often quite dramatically lower than that between species, a pattern termed concerted evolution (DOVER 1982 Down; ARNHEIM 1983 Down). In S. commune, the rDNA is composed of ~120 tandemly repeated units on the eighth largest chromosome (DONS and WESSELS 1980 Down; ASGEIRSDOTTIR et al. 1994 Down). As in many other fungi, the 5S rDNA gene is also present within the repeated rDNA unit and is found between the 25S and 18S genes (Fig 1A). Two noncoding regions exist in each repeat, the internal transcribed spacer (ITS) and the intergenic spacer (IGS).



View larger version (16K):
In this window
In a new window
Download PPT slide
 
Figure 1. (A) Schematic of rDNA locus in Schizophyllum commune. Boxes indicate coding regions. Half arrows indicate primers used for PCR and sequencing. Unpublished primer sequences were as follows: LR20R, 5'-GTGAGACAGGTTAGTTTTACCCT; LR12R, 5'-GAACGCCTCTAAGTCAGAATCC; 5SRNA, 5'-ATCAGACGGGATGCGGT; 5SRNAR, 5'-ACQGCATCCCGTCTGAT; 5SRNASC, 5'-GGGATGCGGTGCTTTC; INVSR1R, 5'-ACTGGCAGAATCAACCAGGTA; IGS2R1, 5'-AACATTGCAAGCGACCGGCAGTT; IGS2R2, 5'-GGTTTTGGCTGACTTTGATTTTG; IGS2R3, 5'-CACGAGTTTTATGGCCACTG; IGS2R4, 5'-CAGTACTAACAGTCCTTGGTA. (B) Spectrum of polymorphism among the three rDNA spacer regions depicted as a sliding window of size 25 bp and step size of 8 bp. {pi} describes the average number of pairwise substitutions at a particular site among the global population. The abscissa represents the position along the spacer region about which the sliding window is centered.

This study describes rDNA spacer variation in S. commune throughout the species' entire geographic range. We explore the species' population genetic structure and compare the rDNA data with the data that exist for other loci, namely the population structure of the mating incompatibility and allozyme loci. This study builds on our understanding of patterns of gene flow in S. commune by employing a larger and more diverse sample and by using a phylogenetic approach that has the potential of separating population structure from population history. These data also illuminate how patterns of concerted evolution progress in isolated populations that have not undergone speciation.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Sampling strategy:
A subset of 80 strains from the global S. commune collection used by RAPER et al. 1958 Down was provided courtesy of R. C. Ullrich (University of Vermont). This sample consisted of 68 homokaryotic (genetically haploid) and 12 dikaryotic (genetically diploid) strains. Additional fruiting bodies were collected from natural substrates, primarily during the years 1995–1999, and were used to derive 98 mycelial cultures from germinated spores. Last, 16 fruiting bodies of diverse geographic origin were obtained as specimens from the herbaria at Uppsala (UPS), Sweden and Leiden, The Netherlands (Rijksherbarium). Details on the origin of strains and fruiting body collections are available from the authors.

DNA techniques:
Tissue for DNA extraction was obtained from S. commune strains by scraping mycelia from week-old cultures grown on either potato dextrose agar (Difco, Detroit) or a medium consisting of malt extract (1.5%), yeast extract (0.3%), glucose (0.5%), and agar (1.5%). Mycelia were dehydrated in a Speed-vac concentrator (Savant Instruments, Farmingdale, NY) and ground to a fine powder using a dissecting needle and microcentrifuge pestle. Total genomic DNA was isolated from the ground mycelia following ZOLAN and PUKKILA 1986 Down. Herbarium specimens were extracted in a similar manner using ~0.1 g of fruiting body tissue.

The structure of the rDNA locus in S. commune and the location and orientation of primers for PCR are shown graphically in Fig 1A, with unpublished primer sequences provided in the legend. The primers for the ITS region (ITS1 and ITS4) are reported in WHITE et al. 1990 Down. For the IGS1 locus, the primers LR20R and 5SRNA were used for PCR amplification, and internal primers LR12R and 5SRNASC were employed for automated sequencing. Primers annealing to the 5S and 18S rDNA genes (5SRNAR and INVSR1R) were used to amplify the IGS2 region, and these, as well as internal primers (IGS2R1-4), were used to obtain nearly complete IGS2 sequences (~2.5 kbp).

Amplification of the ITS and IGS1 loci was performed using reaction conditions and thermocycling parameters described in VILGALYS and HESTER 1990 Down. For the IGS2 locus, amplification cycles were modified such that annealing was at 57° and extension for 75 sec at 72°. Amplicons were purified with ULTRAfree-MC centrifugal columns (Millipore Corp., Beford, MA) and sequenced using dye terminator chemistries on ABI 373 or 377 DNA sequencers (Perkin Elmer, Norwalk, CT). Restriction analysis of amplicons was conducted using the enzyme HinfI in a total reaction volume of 15 µl following the manufacturer's protocol (Promega, Madison, WI).

To separate different IGS1 haplotypes within heterogeneous PCR reactions, amplicons from both dikaryons and fruiting bodies were cloned into the PCR2.1 vector (Invitrogen, San Diego). A single recombinant bacterial colony was chosen for each sample and placed directly into a 25-µl volume PCR reaction lacking any template. The primers LR20R and 5SRNA were used to reamplify and sequence these clones from PCR2.1.

Sequences for IGS1 are deposited in GenBank under accession nos. AF249392, AF249586. Accession numbers for ITS are AF249364, AF249365, AF249366, AF249367, AF249368, AF249369, AF249370, AF249371, AF249372, AF249373, AF249374, AF249375, AF249376, AF249377, AF249378, AF249379, AF249380, AF249381, AF249382, AF249383, AF249384, AF249385, AF249386, AF249387, AF249388, AF249389, AF249390, AF249391, and numbers for IGS2 are AF249355, AF249356, AF249357, AF249358, AF249359, AF249360, AF249361, AF249362, AF249363.

Phylogenetic analyses:
Sequences were aligned by eye using the program GeneDoc (NICHOLAS and NICHOLAS 1997 Down) and imported into the phylogenetic software package PAUP* (SWOFFORD 1998 Down). For the IGS1 locus, the 195 sequences were used to generate an alignment of 671 bp. An alignment of 626 bp was produced for 27 sequences from the ITS region, and the IGS2 data set employed nine samples aligned into a 2704 bp matrix. At IGS1, sequence alignment was mostly unambiguous; however, several regions were difficult to align and were excluded for the purpose of phylogenetic analysis, leaving 506 aligned characters. Many gaps existed in the alignment of IGS1 and were considered to be an additional character state for phylogenetic analyses. When gaps were greater than a single nucleotide long, these gaps were recoded such that each apparently unique insertion/deletion event was considered a separate character state. For the ITS, gaps in the alignment were also considered a fifth character state, whereas gapped regions were excluded from the analysis of IGS2 due to the uncertainty in the alignment of the repetitive elements.

Phylogenies were inferred using the maximum parsimony criterion. For IGS1 and ITS regions heuristic searches were performed to find the most parsimonious phylogenies, employing random sequence addition to find initial trees and using the tree-bisection and reconnection (TBR) algorithm for branch swapping (SWOFFORD 1993 Down). For the IGS1 data, 100 heuristic searches were performed, saving at most 100 most parsimonious trees per search, and for the ITS region 1000 searches were performed, saving all most parsimonious trees. Exhaustive searching of all possible topologies was used to find most parsimonious trees for IGS2. Bootstrap confidence measures for phylogenetic nodes were generated using 100 replicate resamplings, each subject to a heuristic search with TBR branch swapping. For ITS and IGS2 data sets, all most parsimonious trees were saved for each bootstrap replicate. For the IGS1 data set, a maximum of 50 trees were saved for each replicate. Homoplasy (convergent or parallel evolution) was measured on the molecular phylogenies through the consistency index (CI) statistic (FARRIS 1989 Down). This statistic provides a ratio of the minimum number of character changes along a phylogeny given the data to the actual number of changes along the phylogeny as reconstructed by the parsimony criterion.

Measures of population structure and nucleotide diversity:
Nucleotide diversities, or the average number of differences per site between two homologous sequences ({pi}) were calculated using the program DNASP (ROZAS and ROZAS 1997 Down) following equation 10.5 of NEI 1988 Down. Diversities were calculated both for the global population as a whole as well as separately for the three major clades (see RESULTS). Coding and gapped regions were excluded from the diversity computations.

Wright's FST was used to describe what proportion of the total genetic variance at a locus is due to differences among populations (WRIGHT 1951 Down). Sequences were grouped by sample origin into eight populations: Africa, Asia, Australasia, the Caribbean, Central America, Europe, North America, and South America. These populations are nearly the same as those in previous studies (RAPER et al. 1958 Down; JAMES et al. 1999 Down). FST can range from 1.0, in which all of the variation is among populations, to 0.0, in which all populations appear homogeneous. We calculated FST via an AMOVA analysis using the software package ARLEQUIN (EXCOFFIER et al. 1992 Down; SCHNEIDER et al. 1997 Down).

Interpreting analyses of molecular variation (e.g., FST) as a reflection of gene flow relies on the assumption that associations of haplotypes with geography result from a gene flow/genetic drift equilibrium among populations. Alternatively, nested clade analyses can be used to discriminate between phylogeographic patterns resulting from historical events (e.g., past fragmentation, range expansion, colonization) and those due to recurrent gene flow (TEMPLETON et al. 1995 Down; TEMPLETON 1998 Down). These analyses utilize a phylogeny and the geographical distribution of haplotypes or clades of haplotypes. Nested clade analysis (NCA) was performed using the software GEODIS v. 2.0 (POSADA et al. 2000 Down) on subsets of the IGS1 network (phylogeny) for which there was a low probability (P < 0.05) of containing any convergent mutations (described in TEMPLETON et al. 1992 Down). Nested clade analysis divides a phylogeny into hierarchical subsets of n step clades, where n is an integer equal to the number of observed mutations connecting the haplotypes of a clade. For each clade there is a geographic focal point about which all individuals are centered. Two quantitative measures of how haplotypes within a clade are geographically dispersed were suggested by TEMPLETON et al. 1995 Down. DC (X) describes the average distance of each haplotype within clade X from the geographic center of clade X. Dn (X) similarly describes the average distance of each haplotype within clade X from the geographical center of the next more inclusive clade within which clade X is nested. Values significantly larger or smaller than that expected for no geographic association of haplotypes can be tested via permutation of population assignments to haplotypes (TEMPLETON et al. 1995 Down). Haplotypes were assigned to the same eight populations defined for FST estimation. Clade distances were calculated using a distance matrix among populations. Contiguous locations (e.g., North and Central America) were assigned a distance value of one. Locations separated by Atlantic, Pacific, or Indian Oceans were assigned an arbitrary distance value of three. Although not ideal, the use of approximate values in the distance matrix instead of geographic coordinates was necessitated because population samples were collected from diverse localities within geographically broad populations.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Molecular variation within the spacer regions:
Preliminary sampling showed the ITS region to possess considerably less sequence variation than the IGS1 region. Therefore IGS1 was selected for more intensive sampling. Of 195 strains sampled for IGS1, 143 unique haplotypes were identified. The average number of substitutions per site ({pi}) among any two random samples from the global population was 0.044 ± 0.002. Of the 506 aligned positions at IGS1, 172 (or 33.9%) displayed nucleotide polymorphisms or polymorphic insertion/deletion events (indels). Of the variable positions, 80 (or 46.5%) of these were unique to a single sample, (i.e., singletons). In contrast, of the 27 sequences obtained for ITS, only 31 positions (or 5.0%) were variable, and the corresponding value of {pi} was 0.007 ± 0.001. For the nine IGS2 sequence samples, 252 of 2176 sites (or 11.6%) were variable, with {pi} = 0.049 ± 0.004. Sequence diversity was higher at the 5' half of IGS2 ({pi} = 0.065 ± 0.007) relative to that of the 3' half ({pi} = 0.040 ± 0.004). Moreover, only two sites were variable in the 200-bp region directly upstream of the 18S rDNA. The spectrum of polymorphism along each of the three spacer regions (measured as {pi} along a sliding window of 25 positions; Fig 1B) contrasts the high diversity of intergenic spacers with both coding regions and the ITS.

Regions of invariant sequence within the IGS1 spacer were also uncovered. For instance, all sequences possessed a "TATA box" at nucleotides -29 to -25 relative to the 5S rRNA gene. The "TATA box" of S. commune is in the same position as the consensus sequence of Laccaria bicolor (MARTIN et al. 1999 Down) and Neurospora crassa (SELKER et al. 1986 Down), and is found as an upstream element of the 5S rDNA in all organisms studied to date (VENKATESWARLU et al. 1991 Down).

In the IGS2 spacer, beginning ~40 bp downstream from the 5S rRNA gene, a large region (~400 bp in length) of highly repetitive DNA was encountered. This consisted of numerous 7-bp repeats, with the repeat TCAGTA(G/A) encountered most frequently. Between 15 and 40 of these repeats were found in each strain interspersed among nonrepetitive DNA. Larger repetitive elements (19 bp) primarily in tandem arrangement were found near the middle of the IGS2 spacer, and 4–8 imperfect copies of this element were observed among strains. No large (i.e., >50 bp) repeats at any spacer region were found by dot plot analysis.

Sequence data revealed the ITS1 region to be ~150 bp and the ITS2 region to be ~240 bp. Both ITS spacers displayed minimal among-strain length heterogeneity (<4 bp per spacer). In contrast, the IGS1 spacers demonstrated much greater length heterogeneity between strains, ranging in size from 280 to 340 bp. The size of the IGS2 region also varied between strains and was determined to be 2400–2500 bp from DNA sequencing.

Concerted evolution of rDNA spacers:
Heterogeneity was observed in sequence chromatograms generated by direct sequencing of IGS1 amplicons from many dikaryotic individuals. This indicated that these genetically diploid samples possessed more than one IGS1 haplotype of differing length, causing a reading shift in the sequence chromatograms. In contrast, sequence chromatograms for monokaryotic or haploid individuals always produced distinct, unambiguous data, suggesting that all or most of the copies within these strains were of equal length. Length variation at IGS2 also appeared confined to dikaryotic individuals and absent within monokaryons. Amplification products of the IGS2 region using the primers 5SRNAR and IGS2R4 often produced double-banded products for dikaryotic individuals (Fig 2). At the ITS region, no length or sequence variation was ever observed among the amplification products of dikaryons.



View larger version (51K):
In this window
In a new window
Download PPT slide
 
Figure 2. Extreme spacer length variation near the 5' end of IGS2 detected using PCR primers 5SRNAR and IGS2R4. Strains are grouped into geographic origin (eastern and western hemispheres). A shows monokaryotic strains that appear to have only a single length variant, whereas in B dikaryotic strains frequently possess two variants.

After cloning IGS1 amplicons from dikaryotic samples into PCR2.1, typically a single cloned haplotype was sequenced. For one sample in which two clones were sequenced, two different haplotypes were found (1-94-alpha and 1-94-beta Costa Rica; arrows in Fig 3).



View larger version (37K):
In this window
In a new window
Download PPT slide
 
Figure 3. Phylogeny based on IGS1 sequence data. Heuristic searches found 9700 equally parsimonious trees. One of the trees is shown, using parsimony to describe branch lengths (total length of the tree = 285 steps; CI = 0.795). This portion of the tree covers the western hemisphere group. Sequences are labeled with the strain or collection name, followed by geographical origin. Bootstrap values are shown above nodes; *, short internal node with >90% bootstrap support. Sequences that are placed contradictory to their geographic origin are shown as white-on-black text. The group of identical sequences indicated by a star is the set of haplotypes shown by nested clade analysis to be an internal clade with significantly small geographic dispersion (see Fig 6A).

Since sequence heterogeneity appeared largely restricted to between-array rather than within-array variation, i.e., confined to different homologous chromosomes, we investigated whether recombination was suppressed within this large gene region. The cross between strains Belize#1 and Ecuador1.2 involved two monokaryons, each of whose chromosomal arrays appear homogenous at IGS1. However, Belize#1 and Ecuador1.2 arrays differ in the size of the IGS1 regions, are in two different clades in the IGS1 phylogeny, and differ in the presence of a HinfI restriction site within the spacer. Progeny (n = 74) were isolated from this cross and were genotyped at the HinfI restriction site. All progeny genotypes were uniallelic, with 41 progeny identical to the Belize#1 parental type and 33 identical to the Ecuador1.2 type. In summary, there was no evidence from this cross for recombination between ribosomal arrays on different chromosomes.

Phylogeographic patterns in spacer evolution:
Phylogenetic reconstruction of the IGS1 data revealed three primary groupings, each supported with 100% bootstrap confidence. One of the groupings corresponded to samples from North America (NAM) and Central America (NAM clade; Fig 3). However, only one out of seven samples from western North America were found in this group. A second group consisted of samples from South America (SAM) and the Caribbean, and included some isolates from Florida and North Carolina (SAM clade; Fig 3). The final group contained the large majority of the samples from the eastern hemisphere (EAS clade; Fig 4). This last group also included most samples from western North America and Wisconsin. Besides the placement of the western North American samples, a lack of geographic concordance with phylogeny occurred in other instances. For example, some isolates from Germany, South Africa, and Jordan grouped within the North and Central American clade (Fig 3). These geographic discrepancies could be most easily explained by recent migration events.



View larger version (35K):
In this window
In a new window
Download PPT slide
 
Figure 4. Second portion of the IGS1 phylogeny showing the eastern hemisphere (EAS) group.

Although the samples generally grouped by geographic origin, there is little evidence of geographic substructure within each of the three major groups. There were, however, two subgroups in the EAS clade that clustered by geography (Fig 4). One subgroup consisted of many samples from Australia and Papua New Guinea, and was defined by three unique changes. The other subgroup consisted of western North American strains intermixed with European samples. None of the samples from western North America were identical with those from Europe; however, some of these samples differed by only a single base change.

Phylogenetic analysis of the ITS alignment recovered the same three major geographic groupings as seen at IGS1 (Fig 5A). Due to the low level of variation among sequences, bootstrap values for these groupings were very low. Despite this, the variable sites were very consistent with the phylogeny, as indicated by a high CI of 0.969. The western North American sample grouped with the eastern hemisphere samples, as observed in the IGS1 tree.



View larger version (25K):
In this window
In a new window
Download PPT slide
 
Figure 5. (A) Phylogeny based on ITS sequence data. Presented is one of three most parsimonious trees, 32 steps long (CI = 0.969). (B) Phylogeny based on IGS2 sequence data. Exhaustive searching recovered one most parsimonious tree, 305 steps long (CI = 0.964).

Phylogeographic patterns of sequence evolution at IGS2 resembled those seen with IGS1 and ITS regions (Fig 5B). Despite the limited sampling, the same three geographic clades were well supported. The large amount of IGS2 variation also resolved a very well supported and consistent (CI = 0.964) phylogeny.

Although the three spacer regions showed topological congruence, they differed completely in the rank order of sequence diversity for each of the three major clades (also apparent in relative phylogenetic branch lengths). For each spacer phylogeny, a different clade displayed the highest levels of nucleotide polymorphism as measured by substitutions per site ({pi}). At IGS1, the rank-order of sequence variation followed EAS ({pi} = 0.018 ± 0.001) > NAM ({pi} = 0.015 ± 0.001) > SAM ({pi} = 0.013 ± 0.002). The SAM clade also displayed the lowest IGS2 polymorphism, with rankings NAM ({pi} = 0.027 ± 0.010) > EAS ({pi} = 0.024 ± 0.007) > SAM ({pi} = 0.011 ± 0.003). Finally, a reversal of IGS1 polymorphism was observed at ITS, where SAM ({pi} = 0.008 ± 0.010) > NAM ({pi} = 0.006 ± 0.002) > EAS ({pi} = 0.003 ± 0.001).

Population structure and dispersal:
The extensive sampling of IGS1 sequences allowed estimation of FST, or the proportion of genetic variation due to differences among populations. The estimate of global FST derived using AMOVA was 0.487, implicating strong geographic population subdivision within S. commune. The IGS1 FST value is higher than that estimated from allozyme data (0.214; JAMES et al. 1999 Down; Table 1) and contradicts the low level of among-population differentiation inferred from the distribution of mating types (FST = 0.008 and 0.010 for the A and B mating loci, respectively; RAPER et al. 1958 Down; Table 1).


 
View this table:
In this window
In a new window

 
Table 1. FST estimates among global populations of Schizophyllum commune at selected loci

While estimates of FST rely on equilibrium population dynamics, NCA do not and are capable of detecting nonequilibrium historical associations between populations. Two of the three major geographic clades (SAM and EAS) were subject to NCA. Within the SAM clade (see Fig 3), there is little phylogenetic structure. A parsimonious network of these sequences is unambiguous and results in many tip clades emanating from a single internal haplotype (Fig 6A). NCA shows that the internal haplotype, as well as the one-step clade it is contained within (1-12), shows significantly (P < 0.05) small geographic dispersion (or clade distance, Dc) relative to the distribution of Dc values generated by permuting population assignments of haplotypes (Fig 6B). In addition, since the internal clade 1-12 shows lower clade distance than the average of the clade distances for the tip clades that have presumably descended from it [i.e., DcI-DcT significantly low (P < 0.05)], the inference key presented in TEMPLETON 1998 Down suggests a contiguous range expansion within the SAM clade. Since the internal clade was largely restricted to northern South America, it is inferred that a range expansion into the Caribbean as well as North America and Argentina has occurred from this ancestral haplotype.



View larger version (29K):
In this window
In a new window
Download PPT slide
 
Figure 6. (A) Parsimony network of IGS1 haplotypes within the South American (SAM) clade. Strain designations enclosed by rounded boxes possess the same haplotype. Solid-line boxes surround one-step clades, and dashed-line boxes show two-step clades. Numbering indicates the clade designations utilized by nested clade analysis (e.g., 1-5 is the fifth one-step clade). The large internal haplotype (star in Fig 3) represented by six strains is surrounded by a boldface, rounded box. Strains from the Caribbean are shown in italic; strains from North America are underlined; the remaining strains are from South America. Small circles indicate inferred but not sampled haplotypes. (B) Clade distances (Dc) and nested clade distances (Dn) generated by GeoDis for the one-step clades nested within clade 2-3. User-defined geographic distances among populations were as follows: North America and the Caribbean were considered one distance unit apart, South America and the Caribbean were one distance unit, and North and South America were two distance units. Also shown are the differences between average clade distances for interior vs. tip clades (DcI - DcT) and the corresponding value for nested clade distances (DnI - DnT). Superscripts indicate significantly small (S) or large (L) distances. The chain of inference follows the key provided in the Appendix of TEMPLETON 1998 Down. In words, contiguous range expansion within Clade 2-3 is suggested because clade 1-12 shows significantly small geographic dispersion and it is the only interior clade (presumably ancestral) in clade 2-3.

For the EAS clade (Fig 4), much greater sequence variation exists and such variation causes ambiguity in network estimation. Regardless, significant historical patterns were found in this group. Clearly, because western North American haplotypes are nested within the European clade (Fig 4), a long distance colonization of America from Europe is suggested. Such a colonization is supported by NCA because the clade distances (Dc) of the clades containing western North American haplotypes are significantly low (P < 0.05) but their nested clade distances (Dn) are significantly high (results not shown). Such reversals between Dc and Dn for a clade generally indicate dispersal events (TEMPLETON 1998 Down).


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The data presented here further explore the geographic scale and composition of breeding populations in S. commune. The sequence data show three genetically discrete populations that appear to be geographically broad. More precisely, most of the eastern hemisphere forms one phylogenetic group; most of North America, including Central America, comprises the second group; and the third group contains the majority of the South American and Caribbean samples. Although our extensive haplotype sampling has been able to reveal instances of rare migration (Fig 3 and Fig 4), these migrations may have occurred only in the recent history of the species.

We also investigated the evolution of all rDNA spacer regions to understand how population subdivision affects the amount and pattern of sequence variation at loci evolving under different rates of nucleotide substitution. The IGS displayed high levels of nucleotide diversity in contrast to the minimal variation observed at the ITS. However, all rDNA spacer regions displayed the same three geographic groupings using phylogenetic methods.

Global distribution of genetic elements—selection vs. drift:
S. commune is a common, cosmopolitan species for which neither morphology, mating compatibility tests, nor distributions of mating types have differentiated global populations. In contrast, neutral marker loci have shown that very strong genetic differentiation exists among geographic populations (Table 1). The random distribution of mating types among otherwise genetically divergent S. commune populations can most easily be explained as a demonstration of strong balancing selection on mating loci. This selection prevents differentiation and genetic drift of mating type frequencies despite the lack of significant interpopulation gene flow (JAMES et al. 1999 Down).

Both the lack of mating type differentiation among continents and the lack of mating incompatibility among intercontinental populations in S. commune may have a general genetic basis. Many biological species of mushroom fungi are broadly distributed over multiple continents (HORAK 1983 Down; VILGALYS and SUN 1994 Down; PETERSON and MCCLENEGHAN 1997 Down and references therein) and display high levels of mating type diversity with little or no geographical structuring (WHITEHOUSE 1949 Down; ULLRICH 1977 Down). A model of mating compatibility evolution among allopatric populations of basidiomycete fungi should consider that two populations that become geographically isolated may not differentiate at mating loci due to balancing selection that slows the loss of mating types, allowing equivalent sets of mating types to be maintained between the two populations (ZAMBINO et al. 1997 Down; MAY et al. 1999 Down). Since the mating type loci themselves additionally function in sexual development (HISCOCK and KUES 1999 Down), it is likely that populations that do not possess the same sets of mating types would show a high degree of incompatibility due to nonfunctional interactions between mating types that have evolved independently.

The patterns of phylogeography at S. commune IGS1 were similar to those derived from allozyme markers (JAMES et al. 1999 Down), i.e., geographically broad but genetically divergent populations. In this study, the IGS1 data provide much more detail in defining populations than allozyme loci due to the ability to produce a robust phylogeny. Additionally, phylogenetic methods were capable of detecting in S. commune the signatures of colonization and range expansion that could not be addressed using allele frequency-based estimates of population structure (see RESULTS).

Among loci of S. commune that are believed to be evolving in a neutral or near-neutral fashion, not all show population differentiation, e.g., sorbitol dehydrogenase (Table 1). Menadione reductase and isocitrate dehydrogenase also show this pattern of allozyme variation in S. commune (JAMES et al. 1999 Down), and all three loci additionally share reduced allelic diversity. Similar patterns have been observed in Drosophila melanogaster in which several loci possess a single common allele present in all populations and only a few additional rare alleles at low frequency (SINGH and RHOMBERG 1987 Down). Strong purifying selection at a locus could be responsible for this lack of observable genetic differentiation among populations, by the elimination of new variants of suboptimal fitness.

While the patterns of phylogeography suggest a large geographical component to population subdivision, a climatological component may also be involved. For example, the fact that four of six samples from southern Florida, a subtropical climate, group with samples from the Caribbean Sea and South America in the IGS1 phylogeny (Fig 3), suggests that the similar climate and habitat may be more important in determining population structure than geographical distance. A similar example in the Mediterranean can be found in the grouping of Moroccan samples with those of European origin (Fig 4). These results suggest that S. commune may disperse frequently over marine areas devoid of habitat. If such long distance dispersal is possible, why then are global populations differentiated? One hypothesis appeals to a historical process in which the former distribution of S. commune was much narrower and disjunct, allowing the differentiation of the three major lineages. Subsequent range expansions in combination with long-distance dispersal would then create the phylogeographic patterns we have observed, namely three genetically divergent lineages whose distributions generally, but not entirely, correspond with their geographic origin. This hypothesis is supported from the results of the nested clade analysis that suggest continuous range expansion to have occurred in at least one of the three major lineages (SAM clade; Fig 6). In addition, because the SAM clade contains the least amount of sequence polymorphism at two of the three spacers, the SAM range expansion may be of a relatively recent origin. Further support for the nonequilibrium status of S. commune populations exists in the high number of long-distance dispersal events recorded in the IGS1 phylogeny (Fig 3 and Fig 4). Such migrations appear to be more frequent, or more recent, than the restricted dispersal needed to create the deep phylogenetic divergences between the three major geographic clades. If the range expansion to cosmopolitan has been rather recent, then the process of population homogenization through gene flow may yet be incomplete. Our observations that S. commune thrives in secondary forest growth and human-disturbed sites suggest that hypotheses considering range expansion, and possibly intercontinental human-mediated dispersal, merit further attention.

Concerted evolution of S. commune rDNA spacers:
The large level of haplotype diversity at the ribosomal spacers appears to be entirely due to differences between individuals rather than differences among the tandem repeats within an array. Because haploid strains appear fixed for a single haplotype, yet many haplotypes exist within populations, a strong argument is made for the occurrence of intrachromosomal homogenization of elements in a single array and for a lack of recombination between the rDNA arrays of homologous chromosomes at meiosis. Our failure to detect recombinants between rDNA arrays in one test cross provides additional support for recombination suppression. In addition, the high consistency of the IGS1 phylogeny (CI = 0.790) argues against recombination events within this region. Such events would produce haplotypes that contained nucleotides that had not coevolved and therefore the nucleotide characters would not be consistent with a single phylogeny. The suppression of recombination within rDNA arrays in other mushroom species has been observed (CASSIDY et al. 1984 Down; LIOU 2000 Down), but recombination has been observed between arrays of the mushroom L. bicolor (SELOSSE et al. 1996 Down; MARTIN et al. 1999 Down). The observation of within-rDNA array homogenization combined with repression of homologous recombination between arrays is also reminiscent of recombination of rDNA in Saccarhomyces cerevisiae in which meiotic crossovers between sister chromatids are highly favored over nonsister exchanges (PETES and PUKKILA 1995 Down).

Our data support the notion that the nucleotide substitution rate is higher at the IGS2 region than at the IGS1 region in S. commune, which is in turn higher than that for the ITS spacer region. Data from the IGS regions suggest that the process of intrachromosomal homogenization of repeats on the same array occurs faster than the collective substitution rate of all the sites at IGS1. In turn, nucleotide substitution occurs faster than interchromosomal homogenization can make populations monomorphic for a single IGS1 haplotype. For the ITS region, the homogenization of arrays also happens faster than the substitution rate, the latter of which appears to progress almost as slowly as genetic drift and fixation of haplotypes within populations.


*  FOOTNOTES

1 Present address: Department of Biochemistry, University of Illinois, Urbana, IL 61801. Back


*  ACKNOWLEDGMENTS

We thank Robert C. Ullrich for providing the S. commune strains from the collection of J. R. Raper. Thanks are also due to Cathie Aime, Omon Isikhuehmen, Karen Nakasone, David Porter, Martin Schnittler, and Sebastian Verduiin for providing additional material. John Wares, Stuart McDaniel, Cymon Cox, and Austen Ganley provided critical comments on earlier drafts. Two anonymous reviewers are also thanked for suggestions to improve the manuscript. Steve Leathers provided valuable laboratory assistance in sequencing efforts. This work was funded by a National Science Foundation Grant to R.V. (DEB-9408347) and a Sigma Xi GIAR to T.J.

Manuscript received May 15, 2000; Accepted for publication October 6, 2000.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ARNHEIM, N., 1983 Concerted evolution of multigene families, pp. 38–61 in Evolution of Genes and Proteins, edited by M. NEI and R. K. KOEHN. Sinauer Associates, Boston.

ÁSGEIRSDÓTTIR, S. A., F. H. J. SCHUREN, and J. G. H. WESSELS, 1994  Assignment of genes to pulse-field separated chromosomes of Schizophyllum commune.. Mycol. Res. 98:689-693.

CARBONE, I., J. B. ANDERSON, and L. M. KOHN, 1999  Patterns of descent in clonal lineages and their multilocus fingerprints are resolved with combined gene genealogies. Evolution 53:11-21.

CASSIDY, J. R., D. MOORE, B. C. LU, and P. J. PUKKILA, 1984  Unusual organization and lack of recombination in the ribosomal RNA genes of Coprinus cinereus.. Curr. Genet. 8:607-613.

COOKE, W. B., 1961  The genus Schizophyllum.. Mycologia 53:575-599.

DONS, J. J. M. and J. G. H. WESSELS, 1980  Sequence organization of the nuclear DNA of Schizophyllum commune.. Biochim. Biophys. Acta 607:385-396[Medline].

DOVER, G. A., 1982  Molecular drive: a cohesive mode of species evolution. Nature 295:111-117.

EXCOFFIER, L., P. E. SMOUSE, and J. M. QUATTRO, 1992  Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131:479-491[Abstract].

FARRIS, J. S., 1989  The retention index and the rescaled consistency index. Cladistics 5:417-419.

FELSENSTEIN, J., 1982  How can we infer geography and history from gene frequencies? J. Theor. Biol. 96:9-20[Medline].

HIRST, J. M. and G. W. HURST, 1967  Long-distance spore transport. Symp. Soc. Gen. Microbiol. 17:307-344.

HISCOCK, S. J. and U. KÜES, 1999  Cellular and molecular mechanisms of sexual incompatibility in plants and fungi. Int. Rev. Cytol. 193:165-295[Medline].

HORAK, E., 1983  Mycogeography in the South Pacific region: Agaricales, Boletales. Austr. J. Bot. Suppl. Ser. 10:1-41.

INGOLD, C. T., 1971 Fungal Spores: Their Liberation and Dispersal. Oxford University Press, London.

JAMES, T. Y., D. PORTER, J. L. HAMRICK, and R. VILGALYS, 1999  Evidence for limited intercontinental gene flow in the cosmopolitan mushroom, Schizophyllum commune.. Evolution 53:1665-1677.

LIOU, S.-R., 2000 Evolutionary genetics of speciation in basidiomycetes: genetic studies of reproductive isolation in Pleurotus djamor/calyptratus complex. Ph.D. Thesis, Duke University, Durham, NC.

MARTIN, F., M.-A. SELOSSE, and F. LE TACON, 1999  The nuclear rDNA intergenic spacer of the ectomycorrhizal basidiomycete Laccaria bicolor: structural analysis and allelic polymorphism. Microbiology 145:1605-1611[Medline].

MAY, G., F. SHAW, H. BADRANE, and X. VEKEMANS, 1999  The signature of balancing selection: fungal mating compatibility gene evolution. Proc. Natl. Acad. Sci. USA 96:9172-9177[Abstract/Free Full Text].

MONCALVO, J.-M., F. M. LUTZONI, S. A. REHNER, J. JOHNSON, and R. VILGALYS, 2000  Phylogenetic relationships of agaric fungi based on nuclear large subunit ribosomal DNA sequences. Syst. Biol. 49:278-305[Medline].

NEI, M., 1988 Molecular Evolutionary Genetics. Columbia University Press, New York.

NEIGEL, J. E., 1997  A comparison of alternative strategies for estimating gene flow from genetic markers. Annu. Rev. Ecol. Syst. 28:105-128.

NICHOLAS, K. B., and H. B. NICHOLAS, JR., 1997 GeneDoc: analysis and visualization of genetic variation. http://www.cris.com/~Ketchup/genedoc.shtml.

PADY, S. M. and L. KAPICA, 1955  Fungi in air over the Atlantic Ocean. Mycologia 47:34-50.

PETERSON, R. H. and S. C. MCCLENEGHAN, 1997  Reports on long-distance sexual compatibility in Agaricales. Nord. J. Bot. 17:419-432.

PETES, T. D. and P. J. PUKKILA, 1995  Meiotic sister chromatid recombination. Adv. Genet. 33:41-62[Medline].

POSADA, D., K. A. CRANDALL, and A. R. TEMPLETON, 2000  GeoDis: a program for the cladistic nested analysis of the geographical distribution of genetic haplotypes. Mol. Ecol. 9:487-488[Medline].

RAMSFIELD, T. D., E. M. BECKER, S. M. RATHLEF, Y. TANG, and T. C. VRAIN et al., 1996  Geographic variation of Chondrostereum purpureum detected by polymorphisms in the ribosomal DNA. Can. J. Bot. 74:1919-1929.

RAPER, J. R., G. S. KRONGELB, and M. G. BAXTER, 1958  The number and distribution of incompatibility factors in Schizophyllum commune.. Am. Nat. 92:221-232.

RIHS, J. D., A. A. PADHYE, and C. B. GOOD, 1996  Brain abscess caused by Schizophyllum commune: an emerging basidiomycete pathogen. J. Clin. Microbiol. 34:1628-1632[Abstract].

ROZAS, J. and R. ROZAS, 1997  DnaSP version 2.0: a novel software package for extensive molecular population genetics analysis. Comput. Appl. Biosci. 13:307-311.

SCHNEIDER, S., J.-M. KUEFFER, D. ROESSLI and L. EXCOFFIER, 1997 ARLEQUIN: a software for population genetic data analysis. Ver 1.1. Genetics and Biometry Lab, Department of Anthropology, University of Geneva, Geneva.

SELKER, E. U., E. MORZYCKA-WRÓBLEWSKA, J. N. STEVENS, and R. L. METZENBERG, 1986  An upstream signal is required for in vitro transcription of Neurospora 5S RNA genes. Mol. Gen. Genet. 205:189-192[Medline].

SELOSSE, M.-A., G. COSTA, C. DI BATTISTA, F. L. TACON, and F. MARTIN, 1996  Meiotic segregation and recombination of the intergenic spacer of the ribosomal DNA in the ectomycorrhizal basidiomycete Laccaria bicolor.. Curr. Genet. 30:332-337[Medline].

SINGH, R. S. and L. R. RHOMBERG, 1987  A comprehensive study of genic variation in natural populations of Drosophila melanogaster. II. Estimates of heterozygosity and patterns of geographic differentiation. Genetics 117:255-271[Abstract/Free Full Text].

SWOFFORD, D. L., 1993 PAUP: Phylogenetic Analysis Using Parsimony. 3.1.1. User's manual. Smithsonian Institution, Washington, DC.

SWOFFORD, D. L., 1998 PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sinauer Associates, Sunderland, MA.

TEMPLETON, A. R., 1998  Nested clade analysis of phylogeographic data: testing hypotheses about gene flow and population history. Mol. Ecol. 7:381-397[Medline].

TEMPLETON, A. R., K. A. CRANDALL, and C. F. SING, 1992  A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping and DNA sequence data. III. Cladogram estimation. Genetics 132:619-633[Abstract].

TEMPLETON, A. R., E. ROUTMAN, and C. A. PHILIPS, 1995  Separating population structure from population history: a cladistic analysis of the geographical distribution of mitochondrial DNA haplotypes in the tiger salamander, Ambystoma tigrinum.. Genetics 140:767-782[Abstract].

ULLRICH, R. C., 1977  Natural distribution of incompatibility factors in basidiomycetous fungi. Mycologia 69:714-719.

VENKATESWARLU, K., S.-W. LEE, and R. N. NAZAR, 1991  Conserved upstream elements in plant 5S ribosomal RNA encoding genes. Gene 105:249-253[Medline].

VILGALYS, R. and M. HESTER, 1990  Rapid genetic identification and mapping of enzymatically amplified ribosomal DNA from several Cryptococcus species. J. Bacteriol. 172:4238-4246[Abstract/Free Full Text].

VILGALYS, R. and B. L. SUN, 1994  Ancient and recent patterns of geographic speciation in the oyster mushroom Pleurotus revealed by phylogenetic analysis of ribosomal DNA sequences. Proc. Natl. Acad. Sci. USA 91:4599-4603[Abstract/Free Full Text].

WHITE, T. J., T. D. BRUNS, S. B. LEE and J. W. TAYLOR, 1990 Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics, pp. 315–321 in PCR Protocols: A Guide to Methods and Applications, edited by M. A. INNIS, D. H. GELFAND, J. J. SNINSKY and T. J. WHITE. Academic Press, New York.

WHITEHOUSE, H. L. K., 1949  Multiple-allelomorph heterothallism in the fungi. New Phytol. 48:212-244.

WRIGHT, S., 1939  The distribution of self-sterility alleles in populations. Genetics 24:538-552[Free Full Text].

WRIGHT, S., 1951  The genetical structure of populations. Ann. Eugen. 15:323-354.

ZAMBINO, P., J. V. GROTH, L. LUKENS, J. R. GARTON, and G. MAY, 1997  Variation at the b mating type locus of Ustilago maydis.. Phytopathology 87:1233-1239[Medline].

ZOLAN, M. E. and P. J. PUKKILA, 1986  Inheritance of DNA methylation in Coprinus cinereus.. Mol. Cell. Biol. 6:195-200[Abstract/Free Full Text].




This article has been cited by other articles:


Home page
MycologiaHome page
H. Kauserud, K. Shalchian-Tabrizi, and C. Decock
Multilocus sequencing reveals multiple geographically structured lineages of Coniophora arida and C. olivacea (Boletales) in North America.
Mycologia, September 1, 2007; 99(5): 705 - 713.
[Abstract] [Full Text] [PDF]


Home page
MycologiaHome page
P. B. Matheny, E. C. Vellinga, N. L. Bougher, O. Ceska, P.-A. Moreau, M. A. Neves, and J. F. Ammirati
Taxonomy of displaced species of Tubaria
Mycologia, July 1, 2007; 99(4): 569 - 585.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. R.D. Ganley and T. Kobayashi
Highly efficient concerted evolution in the ribosomal DNA repeats: Total rDNA repeat variation revealed by whole-genome shotgun sequence data
Genome Res., February 1, 2007; 17(2): 184 - 191.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
N. G. Miller and S. F. McDaniel
Bryophyte dispersal inferred from colonization of an introduced substratum on Whiteface Mountain, New York
Am. J. Botany, August 1, 2004; 91(8): 1173 - 1182.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
G. I. Zervakis, J.-M. Moncalvo, and R. Vilgalys
Molecular phylogeny, biogeography and speciation of the mushroom species Pleurotus cystidiosus and allied taxa
Microbiology, March 1, 2004; 150(3): 715 - 726.
[Abstract] [Full Text] [PDF]


Home page
MycologiaHome page
H. Kauserud
Widespread vegetative compatibility groups in the dry-rot fungus Serpula lacrymans
Mycologia, March 1, 2004; 96(2): 232 - 239.
[Abstract] [Full Text] [PDF]


Home page
MycologiaHome page
H. Kauserud and T. Schumacher
Regional and local population structure of the pioneer wood-decay fungus Trichaptum abietinum
Mycologia, May 1, 2003; 95(3): 416 - 425.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
I. Stehlik
Glacial history of the alpine herb Rumex Nivalis (Polygonaceae): a comparison of common phylogeographic methods with nested clade analysis
Am. J. Botany, December 1, 2002; 89(12): 2007 - 2016.
[Abstract] [Full Text] [PDF]


Home page
MycologiaHome page
Q. Shen, D. M. Geiser, and D. J. Royse
Molecular phylogenetic analysis of Grifola frondosa (maitake) reveals a species partition separating eastern North American and Asian isolates
Mycologia, May 1, 2002; 94(3): 472 - 482.
[Abstract] [Full Text] [PDF]