Genetics, Vol. 166, 935-945, February 2004, Copyright © 2004

Genomic Duplication, Fractionation and the Origin of Regulatory Novelty

Richard J. Langhama, Justine Walsha, Molly Dunn1,b, Cynthia Ko1,b, Stephen A. Goff1,b, and Michael Freelinga
a Department of Plant and Microbial Biology, University of California, Berkeley, California 94720
b Torrey Mesa Research Institute, Syngenta, San Diego, California 92121

Corresponding author: Michael Freeling, Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720., freeling{at}nature.berkeley.edu (E-mail)

Communicating editor: V. SUNDARESAN


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Having diverged 50 MYA, rice remained diploid while the maize lineage became tetraploid and then fractionated by losing genes from one or the other duplicate region. We sequenced and annotated 13 maize genes (counting the duplicate gene as one gene) on one or the other of the pair of homeologous maize regions; 12 genes were present in one cluster in rice. Excellent maize-rice synteny was evident, but only after the fractionated maize regions were condensed onto a finished rice map. Excluding the gene we used to define homeologs, we found zero retention. Once retained, fractionation (loss of functioning DNA sequence) could occur within cis-acting gene space. We chose a retained duplicate basic leucine zipper transcription factor gene because it was well marked with big, exact phylogenetic footprints (CNSs). Detailed alignments of lg2 and retained duplicate lrs1 to their rice ortholog found that fractionation of conserved noncoding sequences (CNSs) was rare, as expected. Of 30 CNSs, 27 were conserved. The 3 unexpected, missing CNSs and a large insertion support subfunctionalization as a reflection of fractionation of cis-acting gene space and the recent evolution of lg2's novel maize leaf and shoot developmental functions. In general, the principles of fractionation and consolidation work well in making sense of maize gene and genomic sequence data.


E. B. LEWIS 1951 Down postulated a scheme by which a novel gene may arise following a gene or genome duplication. He based this hypothesis on cases of linked, diversified paralogous genes in Drosophila and maize. The Lewis scheme has much case support (e.g., OHNO 1970 Down; LI 1997 Down; TRUE and CARROLL 2002 Down). The theoretical expectation for an average duplicate gene pair is that one will be lost (HALDANE 1933 Down). FORCE et al. 1999 Down suggested a neutral scheme to explain cases in which duplicate gene retention seemed to be higher than expected. In this scheme, duplicates are retained because each partner gene loses a dispensable cis-acting function such that the ancestral function is now subfunctionalized into two genes. One consequence of subfunctionalization is that duplicates become fixed in a population so that they remain a potential substrate for novelty (LYNCH and FORCE 2000 Down). Another consequence of subfunctionalization is that the loss of a cis-acting function should reflect a change in DNA sequence or sequence arrangement. Indeed, the problem with testing Lewis models and subfunctionalization models experimentally is that measuring function of presumptive cis-acting gene space is experimentally laborious. Given the assumption that functional cis-acting gene space should be conserved over evolutionary time, just as are exons, then a new measure of gene function is possible: conserved noncoding sequence (CNS) patterns. CNSs are phylogenetic footprints that are so large and/or exact that only two orthologous genes from suitably diverged species are required to measure them with confidence, as has been done for mouse-human (HARDISON et al. 1997 Down; DUBCHAK et al. 2000 Down; HARDISON 2000 Down) and also for maize-rice (KAPLINSKY et al. 2002 Down).

The terms "fractionation" and "consolidation" are used with specific meaning in this article. No matter how grand the duplication event—be it genomic, segmental, or genic—the immediate result is two DNA sequences (paralogs) where there used to be one; at this point the neutral process of fractionation begins. Fractionation is mutation leading to the loss of redundant function by any of several processes: randomization by substitution of neutral base pairs, deletion, insertion, copy over by simple sequence repeats (SSRs), and similar processes. However, fractionation applies only to situations in which duplication of a cis-acting unit of function has occurred, such as duplication of a gene (cistron) or duplication of a cis-acting part of a gene that confers some specific component of function above that of being necessary for gene function per se. Examples of such specific cis-acting function are organ specificity or late onset. Given duplication, fractionation can cause functional loss that does not remove the full complement of cis-acting function. In other words, fractionation is the DNA-level cause of the loss of one of the postduplication paralogs predicted by HALDANE 1933 Down and the mutational cause of the loss of specific cis-acting function (subfunctionalization) predicted by FORCE et al. 1999 Down. Fractionation involves change in DNA sequence. Unlike function, DNA sequence can be measured in routine fashion.

Fractionation and consolidation are useful concepts when dealing with the consequences of duplicate genomes, chromosomal segments, or individual genes. For the duplicated segment, genes make useful markers. For one extreme, diagrammed in Fig 1, each gene is lost from one or the other of the two 100% syntenous (homeologous) chromosomes such that they eventually have zero sequence in common. Synteny can be seen only by consolidating (FREELING 2001 Down) the two fractionated homeologs into a predicted ancestral sequence. This 100% fractionation example is essentially our result for the 13-gene maize duplicated region studied here, as will be shown. The alternative extreme is that each duplicate gene in the segment is retained in the duplicated ancestor, leading to 0% fractionation. In this case, consolidation is not necessary to reconstruct a likely ancestor. The concepts of fractionation and consolidation also apply intragenically. Instead of beginning with a chromosomal segment marked with a series of exon clusters (genes), one begins with a duplicated gene space marked by exons and CNSs, ideally marking cis-acting gene space with easily monitored islands of near-identical sequence surrounded by random, potentially functionless sequence. Thus, measuring fractionation of CNSs or disruption of a CNS pattern by insertion or deletion could provide DNA-level evidence underlying subfunctionalization (FORCE et al. 1999 Down). We use the concepts of fractionation and consolidation at the chromosomal level and within the gene. We could not find CNSs between genes in our grass chromosomal segment, and so we could not apply fractionation and consolidation to intergenic regions. Interestingly, fractionation at the chromosomal level leads to loss of genes (fractionation of the information content of a segment over two homeologous segments) whereas fractionation within genes can lead to retention of duplicates, each encoding complementing, but partial, function.



View larger version (48K):
In this window
In a new window
Download PPT slide
 
Figure 1. One extreme outcome of duplication followed by fractionation. In this case, the two resulting homeologs have been 100% fractionated. They share zero sequences in common. Only by consolidating the homeologs into a putative ancestor can the perfect synteny of these sequences be evidenced (FREELING 2001 Down). Consolidation is a mental exercise, not a mechanism. In reality, some duplicates are retained, meaning that fractionation is <100%. These principles apply to genes on a chromosome, cis-acting sites that may act from a distance, and specific cis-acting regulatory sites that may exist within a gene's space.

The grasses, Poaceae, are particularly important for fractionation research because the common subfamilies of grass turn out to be diverged for a useful amount of time—not too much and not too little—for applying CNS analyses (KAPLINSKY et al. 2002 Down; INADA et al. 2003 Down) and because the well-studied grass, maize, is the descendant of a tetraploidy event. The grasses are a monophyletic family. The ancestor to the major subfamilies of grass lived ~50 million years ago (MYA; KELLOGG 2001 Down) and was "diploid" in the sense that rice and maize's tribal relative, sorghum, is also diploid (DEVOS and GALE 2000 Down). Maize and rice represent different subfamilies and are about as diverged as grasses can be, if the basal subfamilies are ignored. It just happened that this 50-MYA branch point in the grass lineage was recent enough so that large phylogenetic footprints—noncoding sequences that are conserved because of some function—are so strongly conserved that they are about as identical in sequence as are bits of orthologous exon. However, divergence was far enough in the past to assure that each functionless nucleotide would randomize. This leaves maize-rice CNSs as islands of conservation surrounded by unalignable randomness (KAPLINSKY et al. 2002 Down; INADA et al. 2003 Down). While every genome evidences large-scale or whole-genome duplication in its history, as is the case for both Arabidopsis (in an ancestor to crucifers: BLANC et al. 2000 Down; PATERSON et al. 2000 Down; VISION et al. 2000 Down; SIMILLION et al. 2002 Down) and the grass ancestor (GOFF et al. 2002 Down), maize is special in having a comparatively recent tetraploid ancestor. Maize has been described as a descendant of a tetraploidy event happening ~11 MYA (GAUT and DOEBLEY 1997 Down) relative to the maize-rice branch at ~50 MYA (KELLOGG 2001 Down) and an intratribal maize-sorghum branch at ~16 MYA (GAUT and DOEBLEY 1997 Down). There has been enough retention of duplicates in maize so that almost all large chromosomal regions have a clear homeologous region(s) elsewhere in the maize genome (AHN and TANKSLEY 1993 Down; WILSON et al. 1999 Down; DEVOS and GALE 2000 Down). Estimates of maize gene duplicate retention vary from ~70% (AHN and TANKSLEY 1993 Down) to 14% (FREELING 2001 Down).

KAPLINSKY et al. 2002 Down have shown that stringent local alignments of orthologous genes from maize and rice often uncover conserved patches of sequence in noncoding DNA (CNSs), conservations reflecting positive selection. A recent analysis of 52 additional maize-rice gene spaces (INADA et al. 2003 Down) found that 73% of plant genes have at least one CNS, as strictly defined. The average was about three CNSs per grass gene. The length of these CNSs averaged ~20 bp, but occasionally could be >80 bp in length. Upstream regulatory genes were considerably more CNS-rich than were enzyme-encoding genes. To examine the concept of fractionation within a single gene space, we chose the most CNS-rich gene, with ~30 CNSs, among ~200 genes for which we found published or unpublished CNS data: maize liguleless2 (lg2) and its genomic duplicate, liguleless related sequence1 (lrs1).

The lg2 gene in maize encodes a basic leucine zipper protein that is necessary to specify an exact sheath-blade transition in the maize leaf (WALSH et al. 1998 Down) and to specify a timely transition from vegetative to flowering when the shoot apical meristem is founding tassel (male flower) branches (WALSH and FREELING 1999 Down). The phenotypes of homozygotes for lg2 deletions support these functions. The lg2 gene maps to chromosome 3.06; homeolog lrs1 maps to chromosome 8 near umc7 (data not shown). The deduced protein sequence encoded by these two genes is nearly identical. Our project begins with sequencing two maize bacterial artificial chromosomes (BACs), each containing a lg2-lrs1 duplicate, and annotating them individually. We then compared each of these sequences to the orthologous rice sequence contributed by the Rice Genome Project (RGP).


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Maize BAC sequence, assembly, and annotation:
Maize inbred B73 HindiIII BAC library ZMMBBb filters were purchased from Clemson University Genomics Institute (CUGI) and screened by hybridization with an lg2 cDNA probe that hybridizes to lg2 or lrs1. DNA was isolated from BAC clone 249I19, called lg2-BAC, and from BAC clone 240N14, called lrs1-BAC, and the presence of an entire lg2 or lrs1 was confirmed using PCR. DNA from lg2-BAC and lrs1-BAC was sheared and subcloned into pBluescript shotgun libraries. Average insert size was 1.5 kb. Subclones were sequenced from both ends to approximately seven times coverage. Bases were called by Phred (EWING et al. 1998 Down). Vector and Escherichia coli DNA was screened using CrossMatch (Phil Green, University of Washington) and reads were assembled into contigs using Phrap version 0.990329 (Phil Green, University of Washington).

Southern hybridizations:
Genomic: Maize B73 genomic Southerns were performed under high stringency (65° in 0.2x SSPE; 0.2% SDS). A variety of restriction enzymes were used to estimate a minimal number of fragments hybridizing to our exon probes (the probes were genomic gene space containing all or most exons of lg2, lrs1, and unk4 from lrs1-BAC or hypro1 from lrs1-BAC). Probes for the lg2 and lrs1 pair were used as a control for a bona fide retained duplicate. Gels were probed, stripped, and reprobed with the first probe to control for loss of template.

BAC: The CUGI BAC clone 222A1 was identified per methods presented above. Southern analysis using either 5' or 3' lg2 exon probes, which also hybridized with lrs1, was used to determine that this clone lacked the 5'-most lrs1 HindiIII restriction fragment, but carried the 3'-end of the lrs1 gene. (The lrs1 BAC is missing just the 3'-end of the gene.) Hybridizations using exon probes from genes on the lg2-BAC were performed at moderate stringency (65° in 0.5x SSPE; 0.2% SDS) in an effort to visualize a possible additional genomic copy downstream of lrs1, a region not represented on the original lrs1-BAC of Fig 2.



View larger version (26K):
In this window
In a new window
Download PPT slide
 
Figure 2. A 13-gene region of the maize/rice genome. Virtual alignment of an annotated RGP PAC sequence (AP003287) with two maize BAC contig collections is shown: maize lrs1-BAC (AY211534, seven contigs with blue numbers from 1 to 7 from left to right, covering 53,425 bp) and maize lg2-BAC (AY211535, six contigs with red numbers from 1 to 6 from left to right, covering 23,962 bp). The two homeologous regions in maize were identified because of the retention of the lrs1/lg2 gene pair; this retained duplicate is enclosed in a box. The top line is rice chromosome drawn approximately to scale. The middle line anchors genes that we annotated on the maize lrs1-BAC, and the bottom line anchors genes annotated on the maize lg2-BAC. Except for the magenta rice gene, a ferredoxin gene annotated only because of synteny, all rice genes were annotated by RGP; those with experimental support are solid green or black and those that are known parts of transposons or are hypothetical only are represented as outlines. Maize contigs containing transposons only, which constituted most of the BAC sequences, were ignored after they failed to hybridize to any portion of the rice PAC. Those that did hybridize reflected an annotated gene. The blue and red numbers relate these gene-carrying contigs to the order of contigs as they appear in GenBank.

CNS discovery:
Maize-rice CNSs were found using the BLAST-2-sequence parameters given in KAPLINSKY et al. 2002 Down with the following additions. For CNS discovery in the lg2 and lrs1 gene spaces (accession nos. AY180106 and AY180107), hits that were on the opposite strands, hits that moved >50 bp from the expectation of positional conservation, and hits that were >75% mono- or dinucleotide simple sequence repeats were removed from the graphic. This was done only to aid in visualization of the underlying order of CNS sequence and position.

In our search for CNSs that might have existed between grass genes, we needed to adapt our definition of a potential hit in order not to call isolated 15/15 exact homologies that might have occurred by chance alone. We excluded any single hit below an e-value equivalent to 17/17 exact match and demanded a conserved cluster of two or more 15/15 hits in the same orientation. We disregarded retrotransposon gene hits and hits to mono- or dinucleotide SSRs. Under these conditions, we found zero CNSs detached or spaced away from a cluster of exons. In our (unsuccessful) search for unexpected, intergenic maize-rice CNSs or other homologies, we expanded our rice P1-derived artificial chromosome (PAC) sequence AP003287 with both adjacent chromosome 1 overlapping rice PACs AP003794 and AP003681.

Maize lrs1 and lg2 gene space sequence and annotation:
The lg2 and lrs1 were contained on multiple nonoverlapping contigs, as identified in Fig 2. PCR using the BACs as templates was used to piece together the contigs. PCR products were sequenced at the University of California at Berkeley Sequencing Facility. The lrs1-BAC did not contain the 3' end of lrs1 (bp 10,327–10,850 of AY180107). This finishing sequence is from a previously identified lrs1-containing genomic clone obtained from a maize inbred B73 genomic library (our unpublished results). The exons of both lg2 (AY190106) and lrs1 (AY180106, called "lg2-like" by GenBank) were experimentally determined using a complete cDNA sequence from maize LG2-mRNA, accession no. AF036949.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Fractionation and consolidation of a 13-gene segment of grass chromosome:
We chose maize inbred B73 BACs containing lg2 (GenBank accession no. AY211535; cDNA sequence was available) and its genomic duplicate, lrs1 (AY211534); these were sequenced to seven times coverage and assembled into contigs as described in MATERIALS AND METHODS. The resulting maize contigs were individually assessed by BLASTx (ALTSCHULE et al. 1990 Down) and GeneMark.hmm (LUKASHIN and BORODOVSKY 1998 Down) using the Caenorhabditis elegans model. The results are diagrammed on the lg2-BAC and lrs1-BAC tracks of Fig 2. Most maize contigs appeared to be composed entirely of transposons. These are not plotted in Fig 1. A total of 8 genes were called from lrs1-BAC contigs and a total of 5 from lg2-BAC contigs, which sums to the lg2/lrs1 pair plus 11 unpaired genes. Independently, the contigs, whether or not we annotated genes on them, were virtually assembled onto a single rice PAC (AP003287). Of the 12 genes we annotated in maize (counting lg2 and lrs1 as one gene), 11 were present in a cluster and annotated by the RGP (rice track of Fig 1). The rice annotation did not lead to new genes being discovered in maize. In addition to the 11 maize genes anchored to rice, our maize-rice comparison found an exon-like homology in the RGP PAC at 71,700–72,198: a ferredoxin gene missed by RGP and by ourselves. This brings the total count of genes in both maize and rice to 12. All of these genes are ~85% (75–89%) identical in sequence over at least 75% of the exon (see supplemental Fig 1 available at http://www.genetics.org/supplemental/). Of the 5 known rice genes, all were in maize. Of the 7 rice genes with experimental evidence for existence, but unknown function (unks), 6 were found in maize; unk3 was missing. Of the 9 hypothetical rice genes (hypros), maize carried hypro1. Since hypothetical genes are just that, we do not count a missing "hypro" as an unexpected event. Were it not for the exceptional unk3, which is unexpectedly missing in both maize homeologs, consolidation of the fractionated lg2-BAC and the fractionated lrs1-BAC would yield perfect synteny for this region of grass chromosome.

An exceptional maize rab7A-related gene fragment was supported by convincing BLASTx hits, but was not present in the rice PAC; 11 of 12 maize genes were present. Rab genes are part of multigene families of small GTPases. There are several rabA genes in rice (data not shown).

The single maize genes we found orthologous to the rice genes on PAC AP003287 gave exon identities from 75 to 89%. This degree of conservation is consistent with recent, post-tetraploidy function (see supplemental Fig 1 available at http://www.genetics.org/supplemental/), 11 MY of no selection being adequate to greatly degenerate identities. As for function today, we have genetic and expression evidence for maize lg2 function only.

Where did the 11 fractionated maize genes go? An unequivocal answer to this question for this or any individual maize case study is simply not possible because the maize genome is not sequenced and because use of stringent Southern hybridization data is flawed. When an expected genomic fragment is not found using Southerns, then chances are high (not proved) that the gene is indeed gone. The flaw is, when Southerns do find one or more fragments in the genome in addition to the gene in question, interpretation is equivocal; perhaps there are paralogs, or particularly conserved gene regions among paralogs, or simply spurious hybridizations, or a fragment of a gene that is functionally dead. For this reason, our best attempt to address this "where did the genes go?" question requires some explanation.

Consider the extreme alternative: every newly duplicated gene or gene cluster in maize has a reasonably high probability of having moved physically to another unlinked location over the last 11 MY. Such an explanation simply could not explain our data because a near-complete ancestral genome—11 of 12 genes—was left behind at the expected locus on the homeologs (Fig 2). Only selection could account for this, and selection for this one complete function would not exist if other unlinked copies also function. Thus, logic alone leads to a likely conclusion: the missing genes are functionally inert. However, this is an argument, not a proof. The genes on the lrs1-BAC surround the genes on the lg2-BAC. The easiest way to account for this with a single chromosomal aberration would be to evoke an inversion or short-range movement that would place the "missing" genes on the lg2-BAC on the other side (to the left in Fig 2) of the lrs1-BAC; this rearrangement would be within the lrs1 chromosome. To test for this local movement coincidence, we went back to the CUGI B73 BAC library and found an additional lrs1-specific BAC, clone 222A1, that meets with lrs1-BAC at lrs1, which together span 230 kbp. This new ~100-kbp BAC was grown and isolated as were the original BACs, and Southern analysis was used to determine that the clone lacked the 5'-most lrs1 HindiIII restriction fragment (leaving ~7.3 kb of lrs1 at one end). This positions the new BAC as running to the left of the lrs1 BAC diagrammed in Fig 2, adding ~93 kbp of chromosomal sequence, as diagrammed in Fig 3. As a positive control, the hypro1 and chitinaseB probes were also used to hybridize to the lg2-BAC from which they came. The autoradiographic results of this hybridization experiment are shown in Fig 3: the missing genes are not within this extra 93 kbp to the left of lrs1, but are detected, as expected, on the lg2-BAC (Fig 3, right lanes). We did not address rearrangements that might have placed the missing genes at a distance >93 kbp, so these data also are only supportive, not conclusive.



View larger version (46K):
In this window
In a new window
Download PPT slide
 
Figure 3. Two autoradiograms of the same Southern blot in which DNA was restricted with BamHI, separated, blotted, and probed with an exon-rich gene fragment present on the lg2-BAC (lane 3, 249I19) but fractionated from the lrs1-BAC (lane 2, 240N14) and an additional BAC (lane 1, 222A1) that extends the lrs1-BAC-marked chromosome ~90 kbp 3' of the lrs1 gene. The relationship of lrs1-BACs 1 and 2 is diagrammed; scale is approximate. The 3'-end of the lrs1-BAC (2), as diagrammed, includes the 3' of lrs1 added during finishing. (A) Hybridization was to the entire coding sequence of hypro1, this being the closest gene to lg2 on the lg2-BAC. (B) The blot in A was stripped and reprobed with a probe covering the entire coding sequence of chitinaseB; this is potentially the furthest gene from lg2 on the lg2-BAC. Both blots were hybridized at moderate stringency (65° in 0.5x SSPE; 0.2% SDS) in an effort to visualize a possible genomic duplicate. The lanes carried either BAC (1, 2, and 3 as represented by the diagram) or B73 whole-genome (G) DNA. Note that neither of the lrs1-containing BACs (1 and 2) carried these lg2 region genes. However, the lg2-BAC did carry these genes (lane 3), as expected. Were this blot exposed longer, one or two "bands" would be seen in the B73 whole-genome DNA lanes, denoted G. The central DNA ladder is for sizing.

Even given the flaws of genomic Southern searches for missing genes, we went hunting for two: unk4 and hypro1. The results are available from supplemental Fig 2 at http://www.genetics.org/supplemental/. unk4 exon sequence was used to probe B73 Southern blots, which were then washed under stringent conditions; no second fragment evidencing a potentially duplicate gene was found using several enzymes. This negative result constitutes strong but not conclusive support for loss or randomization of the lrs1-homeolog of the unk4 gene. We found a second hybridizing fragment when hypro1 was used as probe, and the lg2/lrs1 duplicate was found as a control, as expected. Taken at face value, this positive result supports retention of this gene. However, since false positives are expected, the hypro1 result is difficult to interpret correctly.

A search for phylogenetic footprint markers in maize intergenic space:
Were grasses like mammals, then we could hope to find maize-rice CNSs (large, exact phylogenetic footprints) between genes, somehow acting over several to many kilobases to affect activity of a region of a chromosome (LOOTS et al. 2002 Down and references therein). A significant percentage of CNSs found between humans and mouse are located between genes, and a significant fraction of these are structured as matrix attachment regions (GLAZKO et al. 2003 Down). A putative scaffold attachment region has been shown to be a phylogenetic footprint between sorghum, a close maize relative, and rice (AVRAMOVA et al. 1998 Down).

Given these successes, we used the sequence libraries comprising our two maize BACs as queries and "blasted" these onto the known orthologous rice PAC and the two adjacent RGP rice PAC clones as subjects. We used "find CNS" bl2seq conditions (KAPLINSKY et al. 2002 Down) known to find all hits of e-value equal to or greater than a 15/15 exact nucleotide match. We looked for a pattern of two or more small hits or one large hit in any area of a contig not containing exons or not farther than a few kilobases from called exons. While most of our 11 genes certainly had CNSs very close or within 2.5 kb of exons, there were no distant or intergenic CNSs. Therefore, we have no intragenic markers to check for fractionation of intergenic regions following the tetraploidy event.

Fractionation of sequence markers within a single gene's space:
We chose our BACs because we knew that lg2 and lrs1 were retained duplicate genes and because we knew that lg2/lrs1 was particularly CNS rich even among upstream regulatory grass genes, with ~30 individual CNSs identified (INADA et al. 2003 Down for lrs1). We predicted that these CNSs would serve as gene-space markers with which to test intrageneic fractionation following the tetraploidy event. Exons cannot fractionate without inactivating gene function. However, some cis-acting sequences might fractionate if they were not essential to basal gene function (FORCE et al. 1999 Down; LYNCH and FORCE 2000 Down). Using KAPLINSKY et al. 2002 Down computational definition of grass CNSs, we found 30 CNSs, including 7 CNSs >40 bp long. These 30 CNSs are identified by the colored, "parallel" lines of Fig 4. CNSs composed of >75% mono- or dinucleotide SSRs were discarded, and alignments that change strand (that were inverted) were also excluded for Fig 4.



View larger version (64K):
In this window
In a new window
Download PPT slide
 
Figure 4. Two independent blast comparisons plotted coordinately: maize gene lg2 with its rice ortholog and maize gene lrs1 with the same rice ortholog. The purple alignment line of lg2CNS17 represents how sequences near 5' exons align. Exons were identified in genomic DNA for all three genes using the complete cDNA of LG2-mRNA (AF036949) and then masked. Bl2seq conditions were modified from those of KAPLINSKY et al. 2002 Down. The BLAST result is represented by the solid, multicolored lines connecting the maize lg2 (Zm LG2) and rice lrs1 (Os LRS1) gene diagrams. The identity match is indicated parenthetically. A color connects CNSs that are essentially the same. A dotted line reflects a maize lg2-lrs1 CNS retention, but just below the 15/15 cutoff. A yellow highlight denotes those rare CNSs that are fractionated. The insertion into lg2 promoter is noted.

It is important to note that the solid lines of Fig 4 denote CNSs that are derived from two independent maize-rice pairwise blasts: lg2-rice ortholog (Fig 4, top pair) and lrs1-rice ortholog (Fig 4, bottom pair). The two results of these two alignments are plotted onto the single rice ortholog gene space (Fig 4, center sequence) that they have in common. The overall result is that almost all of the intragenic lg2/lrs1 CNSs have been "conserved," not fractionated, over the 11 MY following the tetraploidy event.

To evaluate the data of Fig 4 in an informed way, it is important to calculate the expectation for conservation of neutral sequences when the common ancestor lived 11 MYA, the approximate time of the maize tetraploidy event. Any 15-bp positionally conserved sequence comparing maize and rice (ancestor 50 MYA) has about four chances in a million of being carried over in the absence of selection (KAPLINSKY et al. 2002 Down; INADA et al. 2003 Down). For maize-maize (ancestor 11 MYA), the expectation of carryover is 13%. (Both calculations assume a uniform 7 x 10-9 base substitutions/neutral base pair/year.) Therefore, the lower e-value CNSs of Fig 4 (like lg2CNS27, 23/27) have a calculated 15% chance of existing on both homeologs of maize without selection, and more significant CNSs (like lg2CNS1, 72/75) have a much higher expectation for conservation whether selected for or not because they can degrade in significance and still be called CNSs. The maize duplication event occurred so recently that there has not been enough time to randomize functionless sequences by base substitution alone. Therefore, it is safest to assume that "conservation" of CNSs between the maize homeologous genes lg2 and lrs1 is expected whether these CNSs function or not. For us to maximize our chance of seeing this expected conservation (carryover), we manually inspected each sequence when the original data indicated that a CNS had fractionated. The dotted lines of Fig 4 denote that manual inspection did indeed find an alignable, but degraded, sequence. This situation is exemplified by the dotted extension of yellow lg2CNS24 (18/19) that does exist in the expected position in lrs1. So, if base substitution were the only fractionation mechanism, the general expectation is that maize-maize CNS patterns will appear to be conserved, but are actually carryover from the common ancestor. Of course, base substitution is not the only mechanism of fractionation, as will be discussed.

On a background of expected conservation, the exceptions are outstanding. Fig 4 denotes four exceptions to conservation: a 1.4-kbp insertion in the promoter region of lg2 is visible because it moves almost all 5' lg2 CNSs upstream. Additionally, three CNSs are fractionated, as denoted by yellow highlighting. These three are the large (76/88) promoter lg2CNS3 that is not present in maize lrs1 and the smaller lrs1CNSs—CNS16 (38/48) and CNS23 (17/17)—that are not present in lg2. "Not present" in this case means that no amount of imagination could find an alignment anywhere within the gene space. As with the fractionation results involving genes on our chromosomes (Fig 2), fractionation within the lg2/lrs1 gene behaves like a qualitative character: a CNS is either retained or fractionated.

Observation of Fig 4 reveals that the well-studied maize lg2 gene is the more unique and divergent gene of the lg2-lrs1 pair. The insertion in lg2 is particularly striking, as is the loss of an 88-bp promoter CNS. The divergence evidenced in CNS pattern, combined with genetic studies in rice and maize, which will be discussed, support the hypothesis that the LG2 regulatory function in maize is newly evolved, a case in support of the Lewis scheme for the evolution of novelty. Since the maize lrs1 gene has lost two CNSs, and the lg2 has lost one, the involvement of subfunctionalization becomes a reasonable hypothesis.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The concepts of fractionation and consolidation have worked well in our efforts to reconstruct the evolutionary history of a 13-gene segment of a grass chromosome and also to reconstruct the evolution of a regulatory gene with a novel function. We attempted to find markers between the genes of our maize BACs by looking for CNSs with the rice orthologous chromosome. Unlike the situation in mammals, all big phylogenetic footprints were associated closely with exons, so we could not test fractionation of any sort of long-range, cis-regulatory function.

Our annotation of maize BAC contigs for gene content identified 13 genes, and 12 of these were present in one chromosomal region of rice. We found one new gene in the region by synteny only; this gene eluded annotation by ourselves in maize and also by the Rice Genome Project. In general, virtual assembly of maize BAC contigs using a bit of finished rice genome worked efficiently. Except for the gene we chose as a retained duplicate, lg2/lrs1, none of the other 12 "experimental" maize genes we found in this single grass region were retained in the homeologous maize BACs (Fig 2) or in an adjacent lrs1-BAC (Fig 3). Hybridization evidence strongly supports loss for one of these "missing genes" and it can be reasonably argued that the average missing gene must be lost or randomized, not moved elsewhere in the genome. Even so, our data support only the contention that genes fractionated from our region are actually missing from the genome. However, this case of 0% retention (not counting the lg2/lrs1 gene pair that was a given) is not unequivocal in itself, and it would be wrong to generalize from it.

Zero percent duplicate retention for maize is obviously too low. On the other hand, the estimate of 70% retention is probably too high. The AHN and TANKSLEY 1993 Down estimate is based on Southern hybridization data, and there are many situations in which a fractionated gene might be evidenced as a false positive using such hybridization data: the existence of paralogs (duplications) that precede the duplication event, the existence of some very conserved regions among otherwise distant paralogs, the existence of spurious hybridizations of high GC or SSR regions, the existence of dead gene fragments, and the like. Two other case studies do not use genomic hybridization data, and each also yields a low estimate of retained duplicates for maize. The first involves identification and mapping of a gene family in both rice and maize (SENTOKU et al. 1999 Down). In this study, seven knox class I homeobox genes were identified by homeobox sequence and map position in rice and then related to sequence and map positions of the most homologous (orthologous) genes in maize: only one of these seven was retained as a duplicate in maize (rs1/gn1), which computes to ~14% retention (FREELING 2001 Down). In a case study similar to our own in that both homeologs from maize were used in the comparison, ILIC et al. 2003 Down(and data shared with us before publication) found that of eight genes present in the reconstructed maize ancestor, only one was retained as a duplicate; this computes to 12.5% retention. These workers found one result that is of special interest: partially fractionated genes. That is, cases in which one of the retained pair of genes was an obvious pseudogene with lowered nucleotide identity. This result wreaks havoc with attempts to measure anything meaningful using positive results of genomic hybridization experiments. On the basis of these three case studies only, we conclude that maize is well along on the path toward diploidy, as predicted by HALDANE 1933 Down. So far, case studies found 14, 12.5, and 0% (this study) retention. More case studies are needed.

Given that fractionation of recently duplicated maize chromosomes has been extensive, future research should be particularly careful not to misinterpret nonsyntenic results when both homeologs are not included in the study. A case in point is the recently published comparison of gene content from two different genotypes in homologous regions of maize chromosome 9 near bz1 (FU and DOONER 2002 Down). In their study, one cultivar had genes missing in the region as compared to the other. Fu and Dooner did not sequence the homeologous bz1 region where these missing genes might have been. This omission leaves open the possibility that multiple editions of post-tetraploidy fractionation in maize might have continued into modern times into the teosinte (wild maize) gene pool from which the various races of maize were selected by humans. In other words, multiple fractionation outcomes from the maize tetraploidy event may have generated diverse gene contents and may conceivably be generating maize diversity even today. In any case, our results do not detract from FU and DOONER'S 2002 Down novel hypothesis that maize heterosis might be explained at the level of gene content. In general, given the high gene fractionation expectation for maize, any random stretch of maize chromosome is expected to be incomplete. Both or all homeologs must be sequenced, and the results condensed into a putative ancestor before suggesting that maize genes have unexpectedly gone missing.

In mammals, there is solid evidence for the existence of CNSs that act on more than one gene and often from a distance >10 kbp (LOOTS et al. 2002 Down; GLAZKO et al. 2003 Down). We found no such intergenic CNSs in our maize BACs. The only convincing CNSs or patterns of phylogenetic footprints between either homeologous maize chromosome segment individually and its orthologous rice chromosome occurred between exons of the same gene or within a few kilobases of them. Therefore, we have no chromosomal regional markers to use to test for fractionation following the tetraploidy event. This result would be a surprise if we thought that CNS research results found in mouse-human comparisons would tend to hold up in grasses. In fact, while ~35% of noncoding gene space in mammals is CNS, only ~2% of grass gene space is conserved (INADA et al. 2003 Down). Indeed, 27% of grass genes have no CNSs at all. Perhaps the sort of gene regulation that explains CNS function is utilized far more intensively in higher animals than in higher plants (INADA et al. 2003 Down).

The ~30 CNSs characterizing both maize lg2/rice and maize lrs1/rice (Fig 4) provided an adequate amount of marker detail for this gene's space. This TGA1a-type basic leucine zipper transcription factor gene is not the average gene. Rather, it is the most CNS rich of any grass gene we have measured (INADA et al. 2003 Down; our unpublished results). The general conclusion that CNS position and sequence tend to be retained in both homeologs is an inescapable deduction from Fig 4. The calculations reported in the RESULTS support the conclusion that the rate of neutral base-pair substitution traditionally assigned to the grass family is not high enough to randomize nucleotide sequence over only 11 MY. So, if base substitution were the only mechanism of fractionation, then our general result of maize-maize sequence conservation is not surprising. It is clear that other mechanisms of fractionation operated in the maize lineage during the last 11 MY. The mechanism that completely fractionated our 13-gene grass chromosomal segment in <11 MY, for example, appears to have been deletion. Copy over by simple sequence repeats is another possible fractionation mechanism, as is any mechanism involving transposon insertion or excision (e.g., "scrambling"; KLOECKENER-GRUISSEM and FREELING 1995 Down). Base substitution is not the only mechanism for fractionation. If a mechanism has an average target greater than a single base pair, then there is the complication that fractionation would remove a linked group of elements. For example, the average deletion in maize could be in the kilobase range; if so, it would make an inefficient intragenic fractionation mechanism because removing a neutral CNS would tend to remove an essential CNS or exon, and lethality would result. To the extent that a mutation mechanism acts on small (1–300 bp) targets, it would mediate intragenic fractionation as well as chromosomal fractionation. At present, we have no quantitative estimates of expected rates of any sort of mutation in the grasses except base substitution, but that does not imply that base substitution is the primary mutational agent.

Intragenic fractionation did occur in the percentage range: two shorter CNSs are missing from lrs1 and one particularly significant CNS is missing from maize lg2 (Fig 4). These CNSs are fractionated even when we manually looked for any remnant of alignable sequence anywhere in the gene space. The fractionated CNSs were unexpected only because we do not know rates of any sort of mutational mechanism except base substitution; they appear to have been deleted or copied over, not randomized 1 bp at a time.

The lg2 insertion is a gross change of gene content (Fig 4). The nucleotides within this 1.4-kb insertion are not structured like known transposons and do not exist anywhere in the rice genome; there is inadequate maize or sorghum (a tribal relative) sequence to hope to find an origin for this post-tetraploidy-inserted sequence (our unpublished results). We predict that this insertion, and perhaps the rare CNSs that were fractionated, conditioned a change of expression of this transcription factor that somehow evolved into a new, specific leaf function. In other words, the lg2 gene and the specific LG2 functions appear to have evolved recently, and after the tetraploidy event, in general support of the LEWIS 1951 Down scheme.

Were lg2 truly a novel gene that evolved just a few million years ago, it would explain the peculiar distribution of known liguleless genes in maize and rice. The absence of a ligule in a grass plant is readily observed because of an upright, leaves-up stature. In maize, this phenotype is saturated: of 27 independently screened liguleless mutants, 18 are lg1 alleles, 9 are lg2 alleles, and 0 map elsewhere (HARPER and FREELING 1996 Down; D. BRAUN and J. WALSH, unpublished results). All liguleless mutants reported in rice map to 4L, which is where rice lg1 is located (KAPLINSKY et al. 2002 Down). There are no liguleless mutants mapping to 8L in maize, the site of lrs1, and there are also no reports of a liguleless mutant mapping near lrs1 (or away from lg1 in rice 4L) in rice (our unpublished observations). In conclusion, the evidence is strong that lg2 transcription factor regulation has been modified and co-opted (TRUE and CARROLL 2002 Down) to a new, ligule-specific function at some point or points after tetraploidy.

Little is known about the exact phylogenetic branch point in the tribe Andropogoneae where this tetraploidy event/lg2 origin took place. Recent sequence data (MATHEWS et al. 2002 Down) have been subjected to Bayesean analysis of base substitution frequencies from several genes in many tribes constructed from a 50% node consensus of 95,000 trees. The genus Elionuris, classified in the Rottboelliinae subtribe by CLAYTON and RENVOIZE 1986 Down, grouped with maize and Tripsacum (known to be a genus very close to Zea). Other Rottboelliinae grouped in various places in the tree and not together. The maize/Tripsacum lineage tetraploidy branch point is not placed. Only now are the leaves and ligules of these tropical grasses being observed by collectors in search of evidence of a LG2 function. Mapping the lg2 insertion and lg2/lrs1 fractionated CNSs in Rottboelliinae species onto the tribal phylogenetic tree should be illuminating. Since we have an anti-LG2 antibody that localizes maize LG2 to the adaxial layers of the primordial ligule (J. WALSH and M. FREELING, unpublished results), mapping this immunolocalization as a character onto the tribal phylogenetic tree should also be illuminating.

The function of the lg2 insertion, the fractionated CNSs, or any CNS is not known. We use CNSs as conserved markers only. The CNS/insertion fractionation data of Fig 4 provide a logically sound starting point for analyses to find these functions using conventional molecular genetics. Given these functions, it should be possible to reconstruct the regulatory history of an ancestral basic leucine zipper transcription factor gene as it evolved to specify the Liguleless2 function. Given the complexity of the lg2/lrs1 grass gene, with its many CNS markers, mapping insertions and deletions and finding fractionated CNSs permits a computational parsing of gene space that would be difficult or impossible to do in any other way.


*  FOOTNOTES

Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos. AY211535, AY211534, AY180106, and AY180107. Back
1 Present address: Syngenta Biotechnology, 3054 Cornwallis Rd., Research Triangle Park, NC 27713. Back


*  ACKNOWLEDGMENTS

We thank Damon Lisch and an anonymous reviewer for thoughtful comments, Randall Tyers for help finishing the lg2 and lrs1 sequences, and Nancy Nelson for expert editorial help. Funding was provided by the University of California, Berkeley-Syngenta strategic alliance to M.F. and National Institutes of Health grant 2RO1-GM42610 to M.F.

Manuscript received June 20, 2003; Accepted for publication September 17, 2003.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AHN, S. and S. D. TANKSLEY, 1993  Comparative linkage maps of rice and maize genomes. Proc. Natl. Acad. Sci. USA 90:7980-7984.[Abstract/Free Full Text]

ALTSCHULE, S. F., W. GISH, W. MILLER, E. W. MYERS, and D. J. LIPMAN, 1990  Basic local alignment search tools. J. Mol. Biol. 215:403-410.[CrossRef][Medline]

AVRAMOVA, Z., A. TIKHONOV, M. CHEN, and J. L. BENNETZEN, 1998  Matrix attachment regions and the structural colinearity in the genomes of two grass species. Nucleic Acids Res. 26:761-767.[Abstract/Free Full Text]

BLANC, G., A. BARAKAT, R. GUYOT, R. COOKE, and M. DELSENY, 2000  Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell 12:1093-1101.[Abstract/Free Full Text]

CLAYTON, W. D., and S. RENVOIZE, 1986 Genera Graminum: Grasses of the World. Kew Bulletin Additional Series XIII, Royal Botanical Gardens, Kew, Her Majesty's Stationary Office, London.

DEVOS, K. M. and M. D. GALE, 2000  Genome relationships: the grass model in current research. Plant Cell 12:637-646.[Abstract/Free Full Text]

DUBCHAK, I., M. BRUDNO, G. G. LOOTS, L. PACHTER, and C. MAYOR et al., 2000  Active conservation of noncoding sequences revealed by three-way species comparisons. Genome Res. 10:1304-1306.[Abstract/Free Full Text]

EWING, B., L. HILLIER, M. C. WENDL, and P. GREEN, 1998  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8:175-185.[Abstract/Free Full Text]

FORCE, A. M., M. LYNCH, F. B. PICKETT, A. AMORES, and Y.-L. YAN, 1999  Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151:1531-1545.[Abstract/Free Full Text]

FREELING, M., 2001  Grasses as a single genetic system: reassessment 2001. Plant Physiol. 125:1191-1197.[Free Full Text]

FU, H. and H. DOONER, 2002  Intraspecific violation of genetic colinearity and its implications in maize. Proc. Natl. Acad. Sci. USA 99:9573-9578.[Abstract/Free Full Text]

GAUT, B. S. and J. F. DOEBLEY, 1997  DNA sequence evidence for the segmental allotetraploid origin of maize. Proc. Natl. Acad. Sci. USA 94:6809-6814.[Abstract/Free Full Text]

GLAZKO, G. V., E. V. KOONIN, I. B. ROGOZINE, and S. A. SHABALINA, 2003  A significant fraction of conserved noncoding DNA in human and mouse consists of predicted matrix attachment regions. Trends Genet. 19:119-124.[CrossRef][Medline]

GOFF, S. A., D. RICKE, T.-H. LAN, G. PRESTING, and R. WANG et al., 2002  A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296:92-100.[Abstract/Free Full Text]

HALDANE, J. B. S., 1933  The part played by recurrent mutation in evolution. Am. Nat. 67:5-19.[CrossRef]

HARDISON, R. C., 2000  Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet. 16:369-372.[CrossRef][Medline]

HARDISON, R. C., J. OELTJEN, and W. MILLER, 1997  Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res 7:959-966.[Free Full Text]

HARPER, L. and M. FREELING, 1996  Interactions of liguleless1 and liguleless2 function during ligule induction in maize. Genetics 144:1871-1882.[Abstract]

ILIC, K., P. J. SANMIGUEL, and J. L. BENNETZEN, 2003  A complex history of rearrangement in an orthologous region of the maize, sorghum and rice genomes. Proc. Natl. Acad. Sci USA 100:12265-12270.[Abstract/Free Full Text]

INADA, C. D., A. BASHIR, C. LEE, B. C. THOMAS, and C. KO et al., 2003  Conserved noncoding sequences in the grasses. Genome Res. 13:2030-2041.[Abstract/Free Full Text]

KAPLINSKY, N. J., D. M. BRAUN, J. PENTERMAN, S. A. GOFF, and M. FREELING, 2002  Utility and distribution of conserved noncoding sequences in the grasses. Proc. Natl. Acad. Sci. USA 99:6147-6151.[Abstract/Free Full Text]

KELLOGG, E. A., 2001  Evolutionary history of the grasses. Plant Physiol. 125:1198-1205.[Free Full Text]

KLOECKENER-GRUISSEM, B. and M. FREELING, 1995  Transposon-induced promoter scrambling: a mechanism for the evolution of new alleles. Proc. Natl. Acad. Sci. USA 92:1836-1840.[Abstract/Free Full Text]

LEWIS, E. B., 1951  Pseudoallelism and gene evolution. Cold Spring Harbor Symp. Quant. Biol. 16:159-174.[Abstract/Free Full Text]

LI, W.-H., 1997 Molecular Evolution. Sinauer Associates, Sunderland, MA.

LOOTS, G. G., I. OVCHARENKI, L. PACHTER, I. DUBCHAK, and E. RUBIN, 2002  rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res. 12:832-839.[Abstract/Free Full Text]

LUKASHIN, A. V. and M. BORODOVSKY, 1998  GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 26:1107-1115.[Abstract/Free Full Text]

LYNCH, M. and A. FORCE, 2000  The probability of duplicate gene preservation by subfunctionalization. Genetics 154:459-473.[Abstract/Free Full Text]

MATHEWS, S., R. E. SPANGER, R. J. MASON-GAMER, and E. A. KELLOGG, 2002  Phylogeny of Andropogoneae inferred from phytochrome B, GBSSI and NDHF. Int. J. Plant Sci. 163:441-450.[CrossRef]

OHNO, S., 1970 Evolution by Gene Duplication, Springer-Verlag, Berlin.

PATERSON, A. H, J. E. BOWERS, X. D. BUROW, C. G. ELSIK, and C.-X. JIANG et al., 2000  Comparative genomics of plant chromosomes. Plant Cell 12:1523-1539.[Abstract/Free Full Text]

SENTOKU, M., S. YATUKA, N. KURATA, Y. ITO, and H. KITANO et al., 1999  Regional expression of the rice Kn1-type homeobox gene family during embryo, shoot and flower development. Plant Cell 11:1651-1664.[Abstract/Free Full Text]

SIMILLION, C., K. VANDEPOELE, M. C. VAN MONTAGU, M. ZABEAU, and Y. VAN DE PEER, 2002  The hidden duplication past of Arabidopsis thaliana.. Proc. Natl. Acad. Sci. USA 99:13627-13632.[Abstract/Free Full Text]

TRUE, J. R. and S. B. CARROLL, 2002  Gene co-option in physiological and morphological evolution. Annu. Rev. Cell Dev. Biol. 18:53-90.[CrossRef][Medline]

VISION, T. D., D. B. BROWN, and S. D. TANKSLEY, 2000  The origins of genomic duplications in Arabidopsis.. Science 290:2114-2117.[Abstract/Free Full Text]

WALSH, J. and M. FREELING, 1999  The liguleless2 gene in maize functions during the transition from the vegetative to the reproductive shoot apex. Plant J. 19:489-495.[CrossRef][Medline]

WALSH, J., C. A. WATERS, and M. FREELING, 1998  The maize gene liguleless2 encodes a basic leucine zipper protein involved in the establishment of the blade-sheath boundary. Genes Dev. 12:208-218.[Abstract/Free Full Text]

WILSON, W. A., S. E. HARRINGTON, W. L. WOODMAN, M. LEE, and M. E. SORRELLS et al., 1999  Inferences on the genome structure of progenitor maize through comparative analysis of rice, maize and the domesticated panicoids. Genetics 153:453-473.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
Plant Physiol.Home page
M. G. Sawchuk, T. J. Donner, P. Head, and E. Scarpella
Unique and Overlapping Expression Patterns among Members of Photosynthesis-Associated Nuclear Gene Families in Arabidopsis
Plant Physiology, December 1, 2008; 148(4): 1908 - 1924.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
J. T. Huang and H. K. Dooner
Macrotransposition and Other Complex Chromosomal Restructuring in Maize by Closely Linked Transposons in Direct Orientation
PLANT CELL, August 1, 2008; 20(8): 2019 - 2032.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
J. A. Schlueter, B. E. Scheffler, S. Jackson, and R. C. Shoemaker
Fractionation of Synteny in a Genomic Region Containing Tandemly Duplicated Genes across Glycine max, Medicago truncatula, and Arabidopsis thaliana
J. Hered., March 2, 2008; (2008) esn010v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. Liu, C. Vitte, J. Ma, A. A. Mahama, T. Dhliwayo, M. Lee, and J. L. Bennetzen
A GeneTrek analysis of the maize genome
PNAS, July 10, 2007; 104(28): 11844 - 11849.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. A. Schlueter, B. E. Scheffler, S. D. Schlueter, and R. C. Shoemaker
Sequence Conservation of Homeologous Bacterial Artificial Chromosomes and Transcription of Homeologous Genes in Soybean (Glycine max L. Merr.)
Genetics, October 1, 2006; 174(2): 1017 - 1028.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. H. Paterson, M. Freeling, and T. Sasaki
Grains of knowledge: Genomics of model cereals
Genome Res., December 1, 2005; 15(12): 1643 - 1650.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
G. Haberer, S. Young, A. K. Bharti, H. Gundlach, C. Raymond, G. Fuks, E. Butler, R. A. Wing, S. Rounsley, B. Birren, et al.
Structure and Architecture of the Maize Genome
Plant Physiology, December 1, 2005; 139(4): 1612 - 1624.
[Abstract] [Full Text] [PDF]


Home page
ANN BOT (LOND)Home page
J. F. HANCOCK
Contributions of Domesticated Plant Studies to our Understanding of Plant Evolution
Ann. Bot., November 1, 2005; 96(6): 953 - 963.
[Abstract] [Full Text] [PDF]