- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- A corrigendum has been published
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Kramer, E. M.
- Articles by Irish, V. F.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Kramer, E. M.
- Articles by Irish, V. F.
Molecular Evolution of Genes Controlling Petal and Stamen Development: Duplication and Divergence Within the APETALA3 and PISTILLATA MADS-Box Gene Lineages
Elena M. Kramera, Robert L. Doritb, and Vivian F. Irishaa Departments of Molecular, Cellular and Developmental Biology
b Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut 06520
Corresponding author: Vivian F. Irish, Department of Molecular, Cellular and Developmental Biology, Yale University, P.O. Box 208104, New Haven, CT 06520-8104, vivian.irish{at}yale.edu (E-mail).
Communicating editor: E. MEYEROWITZ
| ABSTRACT |
|---|
The specification of floral organ identity in the higher dicots depends on the function of a limited set of homeotic genes, many of them members of the MADS-box gene family. Two such genes, APETALA3 (AP3) and PISTILLATA (PI), are required for petal and stamen identity in Arabidopsis; their orthologs in Antirrhinum exhibit similar functions. To understand how changes in these genes may have influenced the morphological evolution of petals and stamens, we have cloned twenty-six homologs of the AP3 and PI genes from two higher eudicot and eleven lower eudicot and magnolid dicot species. The sequences of these genes reveal the presence of characteristic PI- and AP3-specific motifs. While the PI-specific motif is found in all of the PI genes characterized to date, the lower eudicot and magnolid dicot AP3 homologs contain distinctly different motifs from those seen in the higher eudicots. An analysis of all the available AP3 and PI sequences uncovers multiple duplication events within each of the two gene lineages. A major duplication event in the AP3 lineage coincides with the base of the higher eudicot radiation and may reflect the evolution of a petal-specific AP3 function in the higher eudicot lineage.
FLOWERS are a defining characteristic of the angiosperms. The typical hermaphroditic angiosperm flower contains both sterile and reproductive organs. These organs are generally organized into whorls, with a particular organ type arising from a single node on the axis of a determinate floral meristem. The flowers of the model species Arabidopsis thaliana display the typical higher eudicot floral organization. The first and second whorls of the flower contain the sterile sepals and petals, respectively. The third whorl contains the stamens, the male reproductive structures which produce pollen. The female reproductive structures, the carpels, arise in the fourth whorl and contain the ovules. Numerous variations on this basic floral architecture exist within the angiosperms. These differences in organ number, structure and phyllotaxy are critical morphological characters in the study of angiosperm systematics.
The evolution of angiosperm floral diversity has been a subject of considerable study. While the stamens and carpels are thought to have each evolved only once, it is widely accepted that the sterile organs have evolved many times within the angiosperms, although the details of these events are unresolved (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The isolation of floral homeotic mutants in Arabidopsis and Antirrhinum has provided an inroad into the dissection of the genetic mechanisms underlying floral diversity and the derivation of floral organs (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
While many of the currently characterized MADS domain proteins appear to function as homodimers, the AP3 and PI gene products are thought to act together as a heterodimeric transcription factor. This conclusion is supported by the finding that both the AP3 and PI gene products are required for DNA binding and nuclear localization (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The conservation of B group functions across the angiosperms is being addressed through studies of AP3 and PI homologs in several higher eudicot species. In general, the expression patterns of these genes are all quite similar to those seen in Arabidopsis (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Since the petals of the higher eudicots are thought to be homologous, it is not surprising that all the higher eudicots studied to date appear to have a conserved petal developmental pathway. In order to examine how independent petal derivation events may be reflected in the petal developmental program, it is necessary to examine species whose petals are not homologous to those of the higher eudicots. Accordingly, we have cloned AP3 and PI homologs from eleven lower eudicot and magnolid dicot species in an effort to understand the evolution both of these gene lineages and of the pathways of petal specification. We present a phylogenetic analysis of the B group genes which indicates that the path of B group gene evolution is more complex than previously thought. Our analysis suggests that there are, in fact, two paralogous AP3 lineages in the higher eudicots: one represented by the well-studied AP3 ortholog group and the other containing the tomato AP3 paralog TM6 and several related genes. The data suggest that these two lineages are the result of a gene duplication event which occurred after the divergence of the Buxaceae in the lower eudicots but before the diversification of the higher eudicots. Sequence analysis reveals that the AP3-like genes of the lower eudicots and magnolid dicots are actually more similar to the members of the TM6 lineage than they are to the higher eudicot AP3 lineage. Although we have also identified duplication events in the PI lineage, none appear to date back to the base of the higher eudicots. In addition, the PI homologs we have isolated display a greater overall conservation of sequence than do the AP3 lineage members. Based on these observations, we present a model for the evolution of the B group gene lineage and discuss how duplication and divergence in this gene lineage may have influenced the evolution of petals.
| MATERIALS AND METHODS |
|---|
Species sampled and sources of plant material:
The species included in this analysis are given in Table 1 Table 2 Table 3, along with family membership, general collection information and GenBank accession numbers for the gene sequences. Throughout the text we have followed the taxonomic designations of ![]()
![]()
|
|
|
Cloning and analysis:
For each species, total RNA was prepared using Trizol (GIBCO BRL, Gaithersburg, MD) from whole flower buds collected across a range of developmental stages. Poly-A mRNA was extracted from total RNA using Magnetight Oligo (dT) particles (Novagen, Madison, WI). Single-stranded cDNA was synthesized by priming with the oligonucleotide 5'-CCGGATCCTCTAGAGCGGCCGC(T)17 from 500 ng of poly-A RNA. This poly-T primer was used with a second primer with the sequence 5'-GGGGTACCAA(C/T)(A/C)GICA(A/G)GTIACITA(T/C)TCIAAG(A/C)GI(A/C)G-3' in a polymerase chain reaction (PCR) to amplify MADS-box-containing cDNAs (primary PCR reaction). PCR analysis was performed in 100 µl of PCR buffer (10 mM Tris pH 8.3, 50 mM KCl, 1.5 mM MgCl2) containing 50 pmol and 20 pmol of 5' and 3' primer, respectively, 200 µmol of each deoxyribonucleotide triphosphate, and 2.5 units of AmpliTaq Gold polymerase (Perkin Elmer, Foster City, CA). Amplification began with a Taq-activation step of 12 min at 95°C, followed by 10 cycles of 20 sec denaturing at 95°, 30 sec annealing at 38° and 1 min extension at 72°. The program was completed by 30 cycles of 20 sec denaturing at 95°, 30 sec annealing at 42° and an extension time of 1 min at 72°. Amplified products were analyzed on a 1% agarose gel, revealing one or more distinct fragments of
0.6 kb. The reactions were directly cloned using the TA and TOPO-TA Cloning kits (Invitrogen, Carlsbad, CA). Clones were analyzed based on size and all fragments over 0.6 kb were sequenced using fluorescent sequencing methods by the Keck Foundation Biotechnology Resource Laboratory at Yale University.
In Papaver nudicaule, Dicentra eximia, Ranunculus bulbosus and Pachysandra terminalis, 3' primers targeted to AP3- or PI-specific C-terminal motifs were used in conjunction with the degenerate 5' primer to specifically amplify AP3 and PI homologs from the primary PCR reaction (same conditions as above). The PI-specific primer has the sequence 5'-TGIA(A/G)(A/G)TTIGGITGIA(A/T)(T/G)GGITG and the AP3-specific primer has the sequence 5'-CIAGICGIAG(A/G)TC(A/G)T. For these species, clones from both the primary PCR reaction as well as the AP3- and PI-specific secondary amplifications were analyzed. The complete 3' sequence of the clones generated with the AP3- or PI-specific primers was obtained using 3' RACE (3' RACE primer sequences available upon request). The complete cDNA sequences of PnAP3-2 and PnPI-1 were obtained using the 5' RACE System for Rapid Amplification of cDNA Ends, Version 2.0 (Gibco BRL, Gaithersburg, MD), in conjunction with several different oligonucleotides (sequences available upon request).
For the cloning of LeAP3 from Lycopersicon esculentum, a Solanaceae AP3- specific primer with the sequence 5'-A(A/G)IGC(A/G)AAIGTIGTIAT(A/G)TC was designed from the C-terminal consensus D(I/L)TTFAL. This 3' primer was used in conjunction with the degenerate 5' MADS-box primer to specifically amplify LeAP3. The complete 3' sequence was obtained using 3' RACE (primer sequence available upon request).
Phylogenetic analysis:
The sequences of all of the published B group representatives were obtained from GenBank (see Table 2 and Table 3 for accession numbers). Protein sequences were aligned using CLUSTALW and refined by hand, taking both nucleotide and amino acid sequences into consideration (see Table 4 and Table 5). The alignments were further modified (for phylogenetic analysis) by encoding gaps as single characters in a supplemental data matrix. In this approach, gaps are treated as single events, thus preventing their overweighting (based on gap length) in the subsequent phylogenetic analyses.
We generated parsimony trees based on the aligned protein sequences using the PAUP 4.0* package (Phylogenetic Analysis Using Parsimony, Version 61, used by permission of the author, ![]()
Distance matrices were derived from the PI and AP3 data-sets under a mean character difference criterion. Amino acid substitutions were weighted in accordance with the BLO-SUM substitution matrix. Trees were subsequently generated via the Neighbor-Joining algorithm (NJ) as implemented in PAUP. The resulting trees represent the "minimum evolution" networks connecting the sequences. Bootstrap values for resolved nodes are derived from 1000 replicate runs, again using the NJ algorithm.
For both the parsimony and distance analyses, the AP3 trees were rooted using six PI sequences (GLO, PI, DaPI, LtPI, PhPI and OsMADS2); conversely, six AP3 sequences (AP3, DEFA, SLM3, MfAP3, PhAP3 and CpAP3) were used to root the PI trees. These choices for outgroups were based, first of all, on the fact that AP3 and PI are known to be paralogous lineages and are, therefore, each other's natural outgroup. Secondly, while the AP3 and PI orthologs can be reasonably aligned to each other throughout their entire length, unambiguous alignments cannot be generated between the B group sequences and the remaining members of the MADS-box gene family.
Similar analyses were conducted using nucleotide data sets aligned in accordance with the protein alignments (data not shown, see RESULTS).
| RESULTS |
|---|
Cloning of B-group genes from lower eudicots and magnolid dicots:
Our strategy for cloning B group gene members from various lower eudicot and magnolid dicot species depends on the presence of a highly conserved sequence in the putative DNA-binding
-helix of the MADS-box domain (![]()
![]()
![]()
We used Syringa vulgaris (lilac), a higher eudicot, as a positive control to test the primary PCR reaction and cloning strategy. Both AP3 and PI orthologs were isolated from Syringa and their sequences showed very high similarity to that of the other Asterid B group representatives. For each lower eudicot and magnolid dicot species, we analyzed 10 to 36 clones (Table 1). It should be noted that in several species, an exhaustive survey has not been undertaken (<10 clones analyzed, Table 1). In the case of Lycopersicon, this is due to the fact that we were only interested in establishing the sequence of one gene (see below). Analyses of the Papaveraceae and Ranunculaceae were initiated with Papaver californicum and Delphinium ajacis but subsequent analyses were carried out with more convenient representative species (P. nudicaule, R. bulbosus and Caltha palustris).
A small number of the sequenced clones obtained from the primary PCR reactions proved not to be MADS-box gene representatives (5/171 total clones sequenced) and two were found to be members of the AGAMOUS-like family (one from P. nudicaule and one from Delphinium ajacis). Conceptual translation of all of the other cDNAs showed that they encode B group-type MADS-box gene products (164/171 total clones sequenced). We assigned the novel genes to the AP3 or PI classes based on the overall sequence similarity to the known AP3 and PI representatives and on the presence of specific diagnostic sites in the MADS, K and C domains. AP3 and PI-like proteins can be distinguished from one another at MADS-box residues 29, 35 and 47. Within the K box, PI homologs possess a highly conserved sequence KHExL (Table 4, residues 88 to 92). The comparable sequence in the K box of the AP3 homologs is (H/Q)YExM (Table 5, residues 85 to 89). We also found that the C-terminal portions of the predicted proteins contained diagnostic motifs for each lineage (see below). Each unique cDNA was named using the first letter of the genus and species from which it was isolated followed by either AP3 or PI, depending on its sequence similarity.
Sequence and phylogenetic analysis of PI homologs:
We have identified a total of twelve new PI-like genes which have been cloned from nine species (Table 2). In most of the species surveyed, only one PI-like gene was identified, while in P. nudicaule, R. bulbosus and Piper magnificum, two distinct PI-like genes were found. The sequences of the predicted products of the new PI clones align well with the previously studied higher eudicot and monocot representatives. Particularly striking is an approximately twenty amino acid region at the C-terminal end of the predicted proteins that displays extremely high conservation (Figure 3A). This domain, which we will refer to as the PI motif, has a core consensus sequence of MPFxFRVQPxQPNLQE. Four of the positions are completely invariant and the remainder show very strong conservation of chemical characteristics. Overall, the PI motif appears to be a strongly hydrophobic domain, although several charged and polar amino acids are present within the region. The motif bears no strong similarity to any known structural elements and a BLAST search for similar sequences in GenBank yields only PI homologs.
There are only two instances of marked divergence in sequence or structure among the newly isolated PI homologs. One is found in the predicted products of the D. eximia (Fumariaceae) and P. nudicaule (Papaveraceae) genes, DePI and PnPI-1, both of which contain an approximately 20 amino acid insertion upstream of the PI motif as well as a 10 to 12 amino acid addition at the C-terminal end of the protein (Figure 3A). These novel regions are characterized by stretches of A/C-rich repetitive DNA sequence. At the nucleotide level, the two cDNAs clearly align with one another through these novel regions, sharing 58% identity in the 100 bp upstream of the PI motif and 70% identity in the downstream region. It is likely that the event(s) that produced these insertions occurred before the last common ancestor of the Papaveraceae and Fumariaceae, which are sister families within the Papaverales. The second example of major sequence deviation within the PI homologs is seen in the P. nudicaule gene PnPI-2. The PnPI-2 cDNA encodes a truncated protein of only 164 amino acids, as opposed to the usual length of 180 to 210 amino acids. Alignment of the PnPI-1 and PnPI-2 nucleotide sequences reveals that the similarity between the PnPI-1 and PnPI-2 transcripts is quite high (80% identity) up to the point of the stop codon in PnPI-2, after which the similarity declines considerably.
Parsimony and distance-based phylogenies of the PI sequences were produced by analysis of the complete PI protein data set using several AP3-like sequences as the outgroup. The parsimony analysis resulted in 8 trees of equivalent length, from which a 50% majority rule consensus tree with a consistency index of 0.972 was derived, shown in Figure 4A. The topology of the parsimony tree is similar in many ways to that of the distance-based tree shown in Figure 4B. In both trees, all of the higher eudicot PI sequences are grouped into a single clade. The distance and parsimony trees do differ in the placement of the PnPI-1/PnPI-2 paralog pair relative to the other Ranunculid PI orthologs. One unexpected result is the position of the P. magnificum gene, PmPI-2, as sister to the higher eudicot clade in the parsimony analysis. This is not the expected position of a representative of a basal magnolid dicot. The other Piperaceae representatives, PmPI-1 and PhPI, are located at the base of the tree. In diverging from its paralog, PmPI-2 may have independently acquired sequence motifs characteristic of the higher eudicot PI representatives. In the distance analysis, PmPI-2 is placed at the base of the tree, closer to the other Piperaceae representatives. We note that the low bootstrap support seen for certain nodes reflects the restricted number of characters on which the node is based, rather than on the existence of alternative, better supported topologies. Analysis of a nucleotide PI data set yielded trees which displayed a significant number of unresolved polytomies due largely to the effects of saturation (data not shown). Where resolution was obtained using the nucleotide sequences, the structure of the consensus tree did not differ from that found using the protein sequences.
Sequence and phylogenetic analysis of the AP3 homologs:
The higher eudicot L. esculentum (tomato) has been previously found to contain an AP3 paralog, TM6, which is considered to be orthologous to the Solanum tuberosum gene PD2 (![]()
![]()
![]()
We have also cloned eleven AP3-like genes from a total of nine lower eudicot and magnolid dicot species (Table 3). P. terminalis and P. nudicaule were the only species that we found to have two AP3-like genes. The Ranunculid and magnolid dicot AP3 representatives show many differences when compared to the higher eudicot AP3 orthologs. Foremost among these are the considerable differences in the sequences of the predicted C-termini. The higher eudicot AP3 gene products display a highly conserved C-terminal motif with the consensus sequence D(L/I)TTFALLE (euAP3 motif, Figure 3B). The majority of the AP3 clones from the Ranunculidae and the magnolid dicots have a completely different predicted C-terminal sequence with the consensus YGxHDLRLA (paleoAP3 motif, Figure 3B). The predicted product of an AP3-like gene from Zea mays, SILKY-1, also displays the paleoAP3 motif with only one amino acid difference (C. PADILLA and R. SCHMIDT, personal communication). The paleoAP3 motif is slightly divergent, but recognizable, in PtAP3-1 and PtAP3-2 from P. terminalis. Interestingly, the paleoAP3 motif aligns very well with the C-terminal end of the predicted TM6 and PD2 proteins (Figure 3B). Similarity to the paleoAP3 motif is also seen in the product of a putative B group-related gene which has been isolated from the Pteridophyte (fern) Ceratopteris (![]()
![]()
One other motif that we recognized in the new AP3 gene products is a region with similarity to the PI motif. The Ranunculid AP3 representatives have a region upstream of the paleoAP3 motif with the consensus sequence FxFRLQPSQPNLH. This is very similar to the core of the PI motif consensus sequence (Figure 3B). When all of the AP3-like gene products are aligned, the PI motif-derived sequence can be observed in almost all of the AP3-like proteins. While this sequence has been very highly conserved in the PI lineage, in the AP3 lineage it has diverged to differing degrees. TM6 and PD2 show a much greater conservation of the PI motif than do the other higher eudicot AP3 gene products. The PI motif is particularly divergent in RAD1 and RAD2 from Rumex, while in CMB2, an AP3 paralog from Dianthus, a truncation has eliminated most of the PI motif-derived sequence. In the Michelia and Peperomia AP3 representatives, the PI motif sequence is unrecognizable.
Figure 5A and Figure B, show the results of parsimony and distance analysis of the complete AP3 protein data set. The parsimony analysis produced 27 trees of equivalent length, from which we generated a 50% majority rule consensus tree with a consistency index of 0.89. The topmost clade in both the parsimony and distance trees contains a grouping of the higher eudicot Brassicaceae, Asterid and Caryophyllid AP3 orthologs (Figure 5A). The members of this clade will be referred to as the euAP3 lineage. The second main clade in both trees is made up of several other higher eudicot AP3-like genes: TM6, PD2, AsAP3 and CMB2. The orthology of PD2 and TM6 has been suggested previously (![]()
![]()
In the parsimony analysis, the remaining AP3-like genes of the lower eudicots and magnolid dicots constitute a paraphyletic group at the base of the tree. The two very similar Pachysandra genes are paired together and are the most closely related to the higher eudicot clades. In contrast, the distance analysis defines a clade composed of PtAP3-1, PtAP3-2 and the Ranunculid and Magnoliaceae sequences. This result highlights the similarities between the Pachysandra and other lower eudicot sequences. In both trees, PnAP3-1 and PcAP3 from P. nudicaule and P. californicum, respectively, are paired together with very high certainty, suggesting that they define a paralogous lineage. Similar to the PI analysis, the low bootstrap support found for several nodes reflects the limited number of characters on which the node is based. All resolved nodes shown in Figure 5A were present in all equally parsimonious trees. As with PI, a nucleotide AP3 data set was analyzed, but the resulting trees were highly unresolved (data not shown). Where resolution was obtained using the nucleotide sequences, the structure did not differ from that found using the protein sequences.
If the character of each representative's predicted C-terminal motif is mapped on Figure 5A, we see that the paleoAP3 motif is present throughout the basal paraphyly and along the TM6 lineage. We will refer to the lower dicot and magnolid dicot genes which exhibit the paleoAP3 motif as the paleoAP3 lineage. The euAP3 motif is synapomorphic for the clade defining what we have called the euAP3 lineage. Diagnostic amino acid character states from throughout the protein sequence can be mapped onto the tree as well. Although the C-termini of the predicted PtAP3-1 and PtAP3-2 proteins resemble the paleoAP3 motif, in other positions the sequence is more similar to the euAP3 genes. These euAP3-like positions include 54Leu, 55His, 72Leu, 148-Asn and 150Ile. The euAP3 clade is also defined by several synapomorphic residues, aside from the C-terminal euAP3 motif. These changes include 2Gly
Ala, 7Glu
Gln, 28Ile
Leu, 115Asp
Ser/Cys, 144His
Lys and 226Phe
Leu. Interestingly, the primitive states of positions 2 and 7 in the MADS box are identical to the character state in the PI lineage members.
| DISCUSSION |
|---|
Morphological evolution proceeds by changes in developmental pathways due to alterations in the structure and regulation of particular developmental genes. The organ identity genes of the A, B, and C classes are logical starting points for a study of changing floral morphologies. The B group genes, in particular, appear to be a likely site of developmental flexibility in floral evolution. While A and C group members might be expected to show very high levels of constraint due to their pleiotropic roles in floral meristem identity and determinacy, as well as in organ identity, the functions of the B group genes appear to be limited to establishing organ identity. Moreover, the fact that the petals and stamens display enormous morphological plasticity within the angiosperms may be directly reflected as changes in the B group genes. These predictions are supported by substitution rate analysis, which shows that the AP3/PI group is evolving 2040% faster than are all of the other plant MADS-box genes (![]()
In this study we present the isolation of twenty-six new B group genes from a total of thirteen species distributed throughout the angiosperms. One of the most striking findings of the analysis of these genes is the high frequency of gene duplications within both the AP3 and PI lineages. We have identified several key ancestral duplications, in addition to what appear to be more recent duplication events. Pinpointing the exact time at which these duplications have occurred is difficult, however. The functional redundancy that results from having multiple gene copies can serve to release constraint on the sequence of the paralogs. Thus, although a high sequence similarity between paralogs would seem to suggest a relatively recent duplication event, low sequence similarity could reflect either an ancient duplication or the rapid divergence of a paralogous lineage. If we can identify orthologs of both duplication products in more than one species, then it is clear that the duplication event occurred before the last common ancestor of the species in question.
The B group gene ancestor and the AP3/PI duplication event:
The AP3 and PI genes have previously been shown to be members of closely paralogous gene lineages (![]()
![]()
![]()
![]()
All of the species which we have thoroughly surveyed contain both an AP3 and a PI representative, suggesting that the duplication event which produced these two lineages predates the diversification of the angiosperms. Initially after the duplication event, both of the products would have exhibited the PI motif, the characteristic MADS-box residues and, possibly, the paleoAP3 motif. In the PI lineage, the PI motif and the MADS-box characters appear to have been highly conserved throughout while the paleoAP3 motif was lost, possibly by a single truncation event. In contrast, along the AP3 lineage, the PI motif diversifies dramatically, along with additional changes throughout the protein (Figure 6).
The PI lineage:
The high degree of conservation of the PI motif throughout the members of the PI lineage suggests that it has a critical function. The requirement of the C terminus for in vivo function has been demonstrated for the products of several plant MADS-box genes, including AP3 (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Many other duplication events have followed the creation of the separate AP3 and PI lineages. In the PI lineage, five different paralog pairs have been identified, in Petunia hybrida, R. bulbosus, P. nudicaule, P. magnificum and Oryza sativa. The phylogenetic analysis suggests that these are the products of independent duplication events which occurred after the last common ancestor of any of the species included in this analysis. In two of these species, P. hybrida and O. sativa, there is some evidence for functional divergence of the paralogs (![]()
![]()
![]()
![]()
![]()
The AP3 lineage:
We have defined two distinct AP3 lineages which appear to be present throughout the higher eudicots. These are clearly the products of an ancient duplication event that occurred before the diversification of the higher eudicots. We cannot rule out the possibility that this duplication event occurred very early in the angiosperms and we have simply not detected the members of the euAP3 lineage in the lower eudicots or magnolid dicots. The sequence and phylogenetic analyses do not support an extremely early duplication, however. Most notably, the Pachysandra PtAP3-1 and PtAP3-2 genes appear to represent a mosaic of paleoAP3 characters and characters of the euAP3 and TM6 lineages of the higher eudicots. Although the C-terminal regions of the predicted products of the Pachysandra genes bear more similarity to the paleoAP3 motif, other positions in the protein sequences ally them with the higher eudicot genes. Based on these observations, we propose that the duplication which produced the euAP3 and TM6 lineages from a paleoAP3 ancestor occurred after the last common ancestor of the Buxaceae and the higher eudicots but before the diversification of the major higher eudicot subclasses (Figure 7).
We see the results of additional taxon-specific duplication events in the AP3 lineages of Brassica oleracea, Rumex acetosa, and P. terminalis. The AP3 paralog pairs of these species all show relatively high levels of sequence similarity. We have also identified several more divergent paralogous AP3 lineages. One such lineage is defined by PcAP3 and PnAP3-1 from P. californicum and P. nudicaule, respectively. Other than being florally expressed, the role of these Papaver genes is unknown. Evidence for a functionally divergent AP3 paralog is seen in the Medicago sativa gene, NMH7 (![]()
![]()
Implications of the AP3 duplication event and subsequent divergence:
While the PI lineage appears to be highly conserved, the AP3 lineage has experienced significant diversification, some of which clearly correlates with duplication events. If the fern CRM3 gene is, in fact, a representative of the B group ancestral lineage which possesses a paleoAP3 motif, then this small region has been conserved, to some degree, from the Pteridophytes, dating back some 400 mya, up through the lower Rosids, which first appeared approximately 80 mya (![]()
![]()
There are many possible ways in which the functions of the euAP3, paleoAP3 and TM6 lineage members may differ. The euAP3 lineage members function primarily in the establishment of petal and stamen identity. The role played by euAP3 representatives in stamen identity is likely to be the ancestral function of these genes since stamens are thought to have evolved only once, before the diversification of the angiosperms. In contrast, the function of euAP3 lineage members in petal identity may be more recently acquired, reflecting a de novo evolution of petals at the base of the higher eudicot radiation. Similarly, it seems likely that the paleoAP3 lineage members also function in stamen identity, but each time that petals were derived, whether from stamens or from bracts, the paleoAP3 ortholog(s) may have been recruited to new roles. In the Ranunculidae, for example, andropetals are thought to have been derived many times, even within single families (![]()
![]()
These results reveal a complex pattern of gene duplication and divergence in the AP3 and PI lineages. While it appears that the euAP3 motif is found in all of the higher eudicot AP3 homologs that are known to play a role in determining petal and stamen identity, it is the paleoAP3 motif that defines the AP3 homologs of the lower eudicots and magnolid dicots. To understand what changes in function may underlie the sequence divergence observed in the euAP3 lineage relative to the ancestral paleoAP3 lineage, investigation of the function of the paleoAP3 lineage members in the lower eudicots and magnolid dicots will be necessary. These and other comparative developmental approaches are critical to defining the relationships between gene duplication, gene expression and the morphological diversification that is the hallmark of the angiosperm radiation.
| ACKNOWLEDGMENTS |
|---|
The authors thank the members of the IRISH lab for helpful discussions. We are very grateful to C. PADILLA, R. SCHMIDT, M. BARRIER and M. PURUGGANAN for providing unpublished sequence information. We also thank DAVID SWOFFORD for kindly allowing us to use Version 61 of PAUP 4.0*. This work was supported by grants from the National Science Foundation (IBN 9630916) and United States Department of Agriculture (97 01286) to V.F.I., a National Science Foundation graduate fellowship to E.M.K. and the Yale Science Development Fund to R.L.D.
Manuscript received December 24, 1997; Accepted for publication March 2, 1998.
| APPENDIX |
|---|
|
|
| LITERATURE CITED |
|---|
AINSWORTH, C., S. CROSSLEY, V. BUCHANAN-WOLLASTON, M. THANGAVELU, and J. PARKER, 1995 Male and female flowers of the dioecious plant sorrel show different patterns of MADS box gene expression. Plant Cell 7:1583-1598.
ANGENENT, G. C., M. BUSSCHER, J. FRANKEN, J. N. M. MOL, and A. J. VAN TUNEN, 1992 Differential expression of two MADS box genes in wild-type and mutant petunia flowers. Plant Cell 4:983-993.
ANGENENT, G. C., J. FRANKEN, M. BUSSHER, L. COLOMBO, and A. J. VAN TUNEN, 1993 Petal and stamen formation in petunia is regulated by the homeotic gene fbp1. Plant J. 4:101-112.
BENNETT, M. D. and I. J. LEITCH, 1995 Nuclear DNA amounts in angiosperms. Ann. Bot. 76:113-176.
BENNETT, M. D. and J. B. SMITH, 1976 Nuclear DNA amounts in angiosperms. Philos. Trans. R. Soc. Lond. B Biol. Sci. 274:227-274.
BENNETT, M. D. and J. B. SMITH, 1991 Nuclear DNA amounts in angiosperms. Philos. Trans. R. Soc. Lond. B Biol. Sci. 334:309-345.
BOWMAN, J. L., D. R. SMYTH, and E. M. MEYEROWITZ, 1989 Genes directing flower development in Arabidopsis.. Plant Cell 1:37-52.
BOWMAN, J. L., D. R. SMYTH, and E. M. MEYEROWITZ, 1991 Genetic interactions among floral homeotic genes of Arabidopsis.. Development 112:1-20.
CARPENTER, R. and E. S. COEN, 1990 Floral homeotic mutations produced by transposon-mutagenesis in Antirrhinum majus.. Genes Dev. 4:1483-1493.
CARR, S. M. and V. F. IRISH, 1997 Floral homeotic gene expression defines developmental arrest stages in Brassica oleracea L. vars. botrytis and italica. Planta 201:179-188.
CHASE, M. W., D. E. SOLTIS, R. G. OLMSTEAD, D. MORGAN, and D. H. LES et al., 1993 Phylogenetics of seed plants: an analysis of nucleotide sequences from the plastid gene rbcL. Ann. Miss. Bot. Gard. 80:528-580.
CHUNG, Y.-Y., S.-R. KIM, H.-G. KANG, Y.-S. NOH, and M. PARK et al., 1995 Characterization of two rice MADS box genes homologous to GLOBOSA.. Plant Sci. 109:45-56.
COEN, E. S. and E. M. MEYEROWITZ, 1991 The war of the whorls: genetic interactions controlling flower development. Nature 353:31-37.
CRANE, P. R., E. M. FRIIS, and K. R. PEDERSEN, 1995 The origin and early diversification of angiosperms. Nature 374:27-33.
CRONQUIST, A., 1981 An Integrated System of Classification of Flowering Plants. Columbia University Press, New York.
CRONQUIST, A., 1988 The Evolution and Classification of Flowering Plants. The New York Botanical Garden, New York.
DAHLGREN, R. M. T., H. T. CLIFFORD and P. F. YEO, 1984 The Families of the Monocotyledons. Springer Verlag, New York.
DAVIES, B. and Z. SCHWARZ-SOMMER, 1994 Control of floral organ identity by homeotic MADS-box transcription factors. Results Probl. Cell Differ. 20:235-258.
DAVIES, B., A. DI ROSA, T. ENEVA, H. SAEDLER, and H. SOMMER, 1996 Alteration of tobacco floral organ identity by expression of combinations of Antirrhinum MADS-box genes. Plant J. 10:663-677.
DOYLE, J. J., 1994 Evolution of a plant homeotic multigene family: toward connecting molecular systematics and molecular developmental genetics. Syst. Biol. 43:307-328.
DRINNAN, A. N., P. R. CRANE and S. B. HOOT, 1994 Patterns of floral evolution in the early diversification of non-magnoliid dicotyledons (eudicots), pp. 93122 in Early Evolution of Flowers, edited by P. K. ENDRESS and E. M. FRIIS. Springer-Verlag, New York.
ENDRESS, P. K., 1994 Floral structure and evolution of primitive angiosperms: recent advances. Plant Syst. Evol. 192:79-97.
FRIIS, E. M. and P. K. ENDRESS, 1990 Origin and evolution of angiosperm flowers. Adv. Bot. Res. 17:99-162.
GARCIA-MAROTO, F., F. SALAMINI, and W. ROHDE, 1993 Molecular cloning and expression patterns of the Deficiens homologous gene St-Deficiens from Solanum tuberosum.. Plant J. 4:771-780.
GOTO, K. and E. M. MEYEROWITZ, 1994 Function and regulation of the Arabidopsis floral homeotic gene PISTILLATA. Genes Dev. 8:1548-1560.
HANSEN, G., J. ESTRUCH, H. SOMMER, and A. SPENA, 1993 NTGLO: a tobacco homologue of the GLOBOSA floral homeotic gene of Antirrhinum majus. cDNA sequence and expression pattern. Mol. Gen. Genet. 239:310-312.
HARDENACK, S., D. YE, H. SAEDLER, and S. GRANT, 1994 Comparison of MADS box gene expression in developing male and female flowers of the dioecious plant white campion. Plant Cell 6:1775-1787.
HEARD, J. and K. DUNN, 1995 Symbiotic induction of a MADS-box gene during development of alfalfa root nodules. Proc. Natl. Acad. Sci. USA 92:5273-5277.
HILL, T., C. D. DAY, S. C. ZONDLO, A. G. THACKERAY, and V. F. IRISH, 1998 Discrete spatial and temporal cis-acting elements regulate transcription of the Arabidopsis floral homeotic gene APETALA3.. Development 125:1711-1721.
IRISH, V. F. and E. M. KRAMER, 1998 Genetic and molecular analysis of angiosperm flower development. Adv. Bot. Res. 28:197-230.
IRISH, V. F. and Y. T. YAMAMOTO, 1995 Conservation of floral homeotic gene function between Arabidopsis thaliana and Antirhinum majus.. Plant Cell 7:1635-1644.
JACK, T., L. L. BROCKMAN, and E. M. MEYEROWITZ, 1992 The homeotic gene APETALA3 of Arabidopsis thaliana encodes a MADS box and is expressed in petals and stamens. Cell 68:683-697.
KOMAKI, M., K. OKADA, E. NISHINO, and Y. SHIMURA, 1988 Isolation and characterization of novel mutants of Arabidopsis thaliana defective in flower development. Development 104:195-203.
KRIZEK, B. A. and E. M. MEYEROWITZ, 1996 Mapping the protein regions responsible for the functional specificities of the Arabidopsis MADS domain organ-identity proteins. Proc. Natl. Acad. Sci. USA 93:4063-4070.
KUSH, A., A. BRUNELLE, D. SHEVELL, and N.-H. CHUA, 1993 The cDNA sequence of two MADS box proteins in petunia. Plant Physiol. 102:1051-1052.
MA, H., M. F. YANOFSKY, and E. M. MEYEROWITZ, 1991 AGL1-AGL6, an Arabidopsis gene family with similarity to floral homeotic and transcription factor genes. Genes Dev. 5:484-495.
MCGONIGLE, B., K. BOUHIDEL, and V. F. IRISH, 1996 Nuclear localization of the Arabidopsis APETALA3 and PISTILLATA homeotic gene products depends on their simultaneous expression. Genes Dev. 10:1812-1821.
MEYEROWITZ, E. M., J. L. BOWMAN, L. L. BROCKMAN, G. N. DREWS, and T. JACK et al., 1991 A genetic and molecular model for floral development in Arabidopsis thaliana.. Development 1:157-167.
MUNSTER, T., J. PAHNKE, A. DI ROSA, J. T. KIM, and W. MARTIN et al., 1997 Floral homeotic genes were recruited from homologous MADS-box genes preexisting in the common ancestor of ferns and seed plants. Proc. Natl. Acad. Sci. USA 94:2415-2420.
NORMAN, A., M. RUNSWICK, R. POLLOCK, and R. TREISMAN, 1988 Isolation and properties of cDNA clones encoding SRF, a transcription factor that binds to the c-fos serum response element. Cell 55:989-1003.
PNUELI, L., M. ABU-ABEID, D. ZAMIR, W. NACKEN, and Z. SCHWARZ-SOMMER et al., 1991 The MADS box gene family in tomato: temporal expression during floral development, conserved secondary structures and homology with homeotic genes from Antirrhinum and Arabidopsis.. Plant J. 1:255-266.
POLLOCK, R. and R. TREISMAN, 1991 Human SRF-related proteins: DNA-binding properties and potential regulatory targets. Genes Dev. 5:2327-2341.
PURUGGANAN, M. D., 1997 The MADS-box floral homeotic gene lineages predate the origin of seed plants: phylogenetic and molecular clock estimates. J. Mol. Evol. 45:392-396.
PURUGGANAN, M. D., S. D. ROUNSLEY, R. J. SCHMIDT, and M. F. YANOFSKY, 1995 Molecular evolution of flower development: diversification of the plant MADS-box regulatory gene family. Genetics 140:345-356.
QIU, Y.-L., M. W. CHASE, D. H. LES, and C. R. PARKS, 1993 Molecular phylogenetics of the Magnoliidae: cladistic analyses of nucleotide sequences of the plastid gene rbcL. Ann. Miss. Bot. Gard. 80:587-606.
RIECHMANN, J. L. and E. M. MEYEROWITZ, 1997 MADS domain proteins in plant development. J. Biol. Chem. 378:1079-1101.
RIECHMANN, J. L., B. A. KRIZEK, and E. M. MEYEROWITZ, 1996a Dimerization specificity of Arabidopsis MADS domain homeotic proteins APETALA1, APETALA3, PISTILLATA, and AGAMOUS. Proc. Natl. Acad. Sci. USA 93:4793-4798.
RIECHMANN, J. L., M. WANG, and E. M. MEYEROWITZ, 1996b DNA-binding properties of Arabidopsis MADS domain homeotic proteins APETALA1, APETALA3, PISTILLATA and AGAMOUS. Nucleic Acids Res. 24:3134-3141.
SAMACH, A., S. E. KOHALMI, P. MOTTE, R. DATLA, and G. W. HAUGHN, 1997 Divergence of function and regulation of class B floral organ identity genes. Plant Cell 9:559-570.
SCHWARZ-SOMMER, Z., I. HUE, P. HUIJSER, P. J. FLOR, and R. HANSEN et al., 1992 Characterization of the Antirrhinum floral homeotic MADS-box gene deficiens: evidence for DNA binding and autoregulation of its persistent expression throughout flower development. EMBO J. 11:251-263.
SHORE, P. and A. D. SHARROCKS, 1995 The MADS-box family of transcription factors. Eur. J. Biochem. 229:1-13.
SOMMER, H., J.-P. BELTRÁN, P. HUIJSER, H. PAPE, and W.-E. LÖNNIG et al., 1990 Deficiens, a homeotic gene involved in the control of flower morphogenesis in Antirrhinum majus: the protein shows homology to transcription factors. EMBO J. 9:605-613.
STEWART, W. N., and G. W. ROTHWELL, 1993 Paleobotany and the Evolution of Plants, Ed. 2, Cambridge University Press, Cambridge.
SWOFFORD, D., 1993 Phylogenetic Analysis using Parsimony. Illinois Natural History Survey, Champaign, IL.
TAKHTAJAN, A., 1991 Evolutionary Trends in Flowering Plants. Columbia University Press, New York.
THEISSEN, G. and H. SAEDLER, 1995 MADS-box genes in plant ontogeny and phylogeny: Haekel's `biogenetic law' revisited. Curr. Opin. Gen. Dev. 5:628-639.
THEISSEN, G., J. T. KIM, and H. SAEDLER, 1996 Classification and phylogeny of the MADS-box multigene family suggest defined roles of MADS-box gene subfamilies in the morphological evolution of eukaryotes. J. Mol. Evol. 43:484-516.
TROBNER, W., L. RAMIREZ, P. MOTTE, I. HUE, and P. HUIJSER et al., 1992 Globosaa homeotic gene which interacts with deficiens in the control of antirrhinum floral organogenesis. EMBO J. 11:4693-4704.
VAN DER KROL, A. R. and N.-H. CHUA, 1993 Flower development in petunia. Plant Cell 5:1195-1203.
VAN DER KROL, A. R., A. BRUNELLE, S. TSUCHIMOTO, and N. H. CHUA, 1993 Functional analysis of petunia floral homeotic mads box gene PMADS1. Genes Dev. 7:1214-1228.
This article has been cited by other articles:
![]() |
F. Jabbour, C. Damerval, and S. Nadot Evolutionary Trends in the Flowers of Asteridae: Is Polyandry an Alternative to Zygomorphy? Ann. Bot., August 1, 2008; 102(2): 153 - 165. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. H. Leseberg, C. L. Eissler, X. Wang, M. A. Johns, M. R. Duvall, and L. Mao Interaction study of MADS-domain proteins in tomato J. Exp. Bot., May 17, 2008; (2008) ern094v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Drea, L. C. Hileman, G. de Martino, and V. F. Irish Functional analyses of genetic pathways controlling petal specification in poppy Development, December 1, 2007; 134(23): 4157 - 4166. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Piwarzyk, Y. Yang, and T. Jack Conserved C-Terminal Motifs of the Arabidopsis Proteins APETALA3 and PISTILLATA Are Dispensable for Floral Organ Identity Function Plant Physiology, December 1, 2007; 145(4): 1495 - 1505. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. P. Ronse De Craene Are Petals Sterile Stamens or Bracts? The Origin and Evolution of Petals in the Core Eudicots Ann. Bot., September 1, 2007; 100(3): 621 - 630. [Abstract] [Full Text] [PDF] |
||||



