Genetics, Vol. 149, 677-692, June 1998, Copyright © 1998

Molecular Organization of the 20S Proteasome Gene Family from Arabidopsis thaliana

Hongyong Fua, Jed H. Doellinga, Cassandra S. Arendtb, Mark Hochstrasserb, and Richard D. Vierstraa
a Cellular and Molecular Biology Program and the Department of Horticulture, University of Wisconsin, Madison, Wisconsin 53706
b Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637

Corresponding author: Richard D. Vierstra, Department of Horticulture, 1575 Linden Drive, University of Wisconsin-Madison, Madison, WI 53706, vierstra{at}facstaff.wisc.edu (E-mail).

Communicating editor: D. PREUSS


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The 20S proteasome is the proteolytic complex in eukaryotes responsible for degrading short-lived and abnormal intracellular proteins, especially those targeted by ubiquitin conjugation. The 700-kD complex exists as a hollow cylinder comprising four stacked rings with the catalytic sites located in the lumen. The two outer rings and the two inner rings are composed of seven different {alpha} and ß polypeptides, respectively, giving an {alpha}7/ß7/ß7/{alpha}7 symmetric organization. Here we describe the molecular organization of the 20S proteasome from the plant Arabidopsis thaliana. From an analysis of a collection of cDNA and genomic clones, we identified a superfamily of 23 genes encoding all 14 of the Arabidopsis proteasome subunits, designated PAA-PAG and PBA-PBG for Proteasome Alpha and Beta subunits A–G, respectively. Four of the subunits likely are encoded by single genes, and the remaining subunits are encoded by families of at least 2 genes. Expression of the {alpha} and ß subunit genes appears to be coordinately regulated. Three of the nine Arabidopsis proteasome subunit genes tested, PAC1 ({alpha}3), PAE1 ({alpha}5) and PBC2 (ß3), could functionally replace their yeast orthologs, providing the first evidence for cross-species complementation of 20S subunit genes. Taken together, these results demonstrate that the 20S proteasome is structurally and functionally conserved among eukaryotes and suggest that the subunit arrangement of the Arabidopsis 20S proteasome is similar if not identical to that recently determined for the yeast complex.


PROTEIN degradation plays an integral role in cell physiology and development by removing abnormal proteins and important short-lived regulators. One abundant intracellular protease that has been implicated in various catabolic processes is the 20S proteasome, a 700-kD multisubunit protease with broad specificity (COUX et al. 1996 Down; HOCHSTRASSER 1996 Down; SCHMIDT and KLOETZEL 1997 Down). The 20S proteasome (also called the prosome, multicatalytic proteinase, and macropain) is present in the cytoplasm and nucleus of eukaryotes and is closely related to protease complexes in some archaebacteria and eubacteria (LOWE et al. 1995 Down; MAUPIN-FURLOW and FERRY 1995 Down; TAMURA et al. 1995 Down).

Whereas its exact function in prokaryotes is unclear, a number of critical functions have been ascribed to the 20S proteasome in eukaryotes. Most important is its role in ubiquitin-dependent proteolysis. Here, association of the 20S complex with a 19S regulatory complex creates the ATP-dependent 26S proteasome which degrades proteins covalently modified with one or more ubiquitins (COUX et al. 1996 Down; HOCHSTRASSER 1996 Down; VIERSTRA 1996 Down). The 20S proteasome also participates in the breakdown of certain proteins in the absence of ubiquitination, in proteolytic processing of precursor proteins, and in the production of antigenic peptides presented by the major histocompatibility class (MHC) I pathway in mammals and other metazoans (COUX et al. 1996 Down; HOCHSTRASSER 1996 Down). In mammalian cells, the 20S complex can also associate with an additional ring-shaped complex called PA28 or the 11S regulator, which dramatically enhances its in vitro proteolytic activity (KNOWLTON et al. 1997 Down). The importance of the 20S proteasome is reflected by the fact that all but one [the exception being PRE9 ({alpha}3) (EMORI et al. 1991 Down)] of the 14 yeast 20S proteasome subunits are essential (COUX et al. 1996 Down; HOCHSTRASSER 1996 Down).

In recent years, the organization and structure of the archaeal, yeast, and mammalian 20S proteasomes have been examined in considerable detail (LOWE et al. 1995 Down; GROLL et al. 1997 Down; SCHMIDT and KLOETZEL 1997 Down). The particle exists as a hollow cylinder ~148 Å in length and ~113 Å in width. It is created by the assembly of four stacked rings; the two peripheral rings are each composed of seven {alpha} subunits, whereas the two central rings are each composed of seven ß subunits. The holocomplex contains three chambers with narrow entrances at each end restricting access. The central chamber is fashioned solely from ß subunits and contains the catalytic sites of the protease complex. In the archaeon Thermoplasma acidophilum, the 20S proteasome has a simple subunit composition, containing only a single {alpha}-type and a single ß-type subunit (ZWICKL et al. 1992 Down; LOWE et al. 1995 Down). However, in the yeast Saccharomyces cerevisiae and in animals, the particle is more complex, containing seven distinct {alpha}-type and seven distinct ß-type subunits in each particle (CHEN and HOCHSTRASSER 1995 Down; GROLL et al. 1997 Down; SCHMIDT and KLOETZEL 1997 Down). The T. acidophilum ß subunit and many of the yeast and human ß subunits are synthesized as proproteins that are proteolytically processed during assembly of the particle. For the T. acidophilum ß subunit and three of the yeast and human ß subunits (ß1, ß2, and ß5), this processing exposes a Thr residue at the N-terminus which forms part of a novel catalytic site that includes the N-terminal amino group (SEEMULLER et al. 1995 Down; LOWE et al. 1995 Down).

In contrast to our understanding of the particles from other kingdoms, little is known about the organization and structure of the plant 20S proteasome. The complex has been detected and purified from several sources including potato, tobacco, mungbean, pea, spinach and wheat, typically using its large size as one purification criterion (KREMP et al. 1986 Down; SCHLIEPHACKE et al. 1991 Down; SKODA and MALEK 1992 Down; OZAKI et al. 1992 Down; FUJINAMI et al. 1994 Down; P. H. HATFIELD and R. D. VIERSTRA, unpublished results). All appear to have a subunit composition similar to 20S proteasomes from animals and yeast, as determined by two-dimensional sodium dodecyl sulfate (SDS)-PAGE. Several of these subunits are antigenically related to those in mammals (SCHLIEPHACKE et al. 1991 Down; OZAKI et al. 1992 Down). Likewise, electron microscopy of plant 20S proteasomes showed a similar barrel-shaped structure comprising four stacked rings. Genes encoding three subunits, two {alpha} and one ß, have been described from Arabidopsis thaliana, with substantial amino acid sequence similarity to those from other eukaryotes (GENSCHIK et al. 1992 Down, GENSCHIK et al. 1994 Down; SHIRLEY and GOODMAN 1993 Down). An Arabidopsis mutant bearing a deletion in one of these {alpha} subunits is phenotypically normal, suggesting that a second locus encoding this subunit exists (SHIRLEY and GOODMAN 1993 Down).

Given the importance of protein degradation to many phases of the plant life cycle, we expect the 20S proteasome has numerous essential functions in plants, especially with respect to its role in ubiquitin-dependent proteolysis (VIERSTRA 1996 Down). As a first step in elucidating these functions, an extensive analysis of the molecular organization and structure of the 20S proteasome from A. thaliana was initiated. Here we describe a set of Arabidopsis genes that encodes the full complement of 20S proteasome subunits. Conservation of the polypeptide sequences indicates that the Arabidopsis 20S complex is organized similar to those from T. acidophilum, yeast, and animals. In fact, several of the Arabidopsis subunits can functionally replace their yeast orthologs in vivo.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Identification of Arabidopsis genes encoding 20S proteasome subunits:
A collection of Arabidopsis sequences encoding 20S proteasome subunits was identified from the A. thaliana ecotype Columbia databases (AtDB, Stanford University; http://genome-www.stanford.edu/Arabidopsis/) using the Blast program. The search first used peptide sequences of the 14 yeast S. cerevisiae 20S proteasome subunits as queries (HOCHSTRASSER 1996 Down). The nucleotide sequences of the Arabidopsis DNAs derived from this search were then used to explore the Arabidopsis expressed sequence tag (EST) and genomic nucleotide databases. After redundant entries were excluded, 106 Arabidopsis 20S proteasome DNA sequences were assembled: one genomic and three cDNA sequences had been published previously and corresponded to PAD1 (TASG64; GENSCHIK et al. 1992 Down), PAF1 (AtPSM30; SHIRLEY and GOODMAN 1993 Down), and PBF1 (FAFP98 and TASG39.20; GENSCHIK et al. 1994 Down); 91 were ESTs; and 11 were genomic sequences identified from various A. thaliana chromosome sequencing projects.

Based on their relationship to the 14 yeast 20S proteasome subunits, the Arabidopsis sequences were classified into 14 groups (Table 1) corresponding to the seven {alpha} ({alpha}1–{alpha}7) and seven ß (ß1–ß7) subunits. EST clones with the longest 5' untranslated region (UTR) from each group (obtained from Arabidopsis Biological Resource Center (ABRC) at Ohio State University) were sequenced in their entirety. ESTs containing the entire presumptive coding region were available from each group, except PAD2 and PAF2. Apparently full-length cDNA clones for these two were isolated from the Kieber and Ecker 1–2 kbp size-selected cDNA library (ARBC Stock Number CD4-14) by PCR using Pfu polymerase (Stratagene, La Jolla, CA). Sequence analyses were performed using programs from the University of Wisconsin Genetics Computer Group (UW-GCG) software package (DEVERAUX et al. 1984 Down). Accession numbers for the 21 Arabidopsis cDNAs encoding the 20S proteasome subunits described here are PAA1 (AF043518), PAA2 (AF043519), PAB1 (AF043520), PAC1 (AF043521), PAD1 (AF043522), PAD2 (AF043523), PAE1 (AF043524), PAE2 (AF043525), PAF1 (AF043526), PAF2 (AF043527), PAG1 (AF043528), PBA1 (AF043529), PBB1 (AF043530), PBB2 (AF043531), PBC1 (AF043532), PBC2 (AF043533), PBD1 (AF043534), PBD2 (AF043535), PBE1 (AF043536), PBF1 (AF043537), and PBG1 (AF043538). Coding sequences for PAB2 and PAC2 were derived from genomic sequences (accession numbers: AC002986 and Z97338, respectively).


 
View this table:
In this window
In a new window

 
Table 1. Subunits of the Arabidopsis 20S proteasome

RNA and genomic DNA gel blot analysis:
Total RNA was extracted from A. thaliana ecotype Wassilewskija (WS) and purified by LiCl precipitation (RAPP et al. 1992 Down). Seedlings grown two weeks on Gamborg B-5 Media (GIBCO-BRL, Gaithersburg, MD) in 0.8% agar (GM agar) under continuous light or dark provided the plant material for green seedlings, roots (light-grown) and etiolated seedlings (dark-grown). Plants grown in soil under continuous light provided the stems, cauline leaves, siliques and flowers. RNA samples (7 µg) were size fractionated by electrophoreses in 1.2% agarose-formaldehyde gels and transferred onto Zeta-Probe membranes (Bio-Rad, Hercules, CA).

Total genomic DNA was isolated as described (CONE et al. 1989 Down) from 2-wk-old A. thaliana ecotype WS grown on GM agar under continuous light. DNA samples (5 µg) were digested with BglII, EcoRI or EcoRV, size-fractionated by electrophoresis in 0.8% agarose gels, and transferred onto Zeta-Probe membranes. Specific probes to the various {alpha} and ß subunit genes were amplified by PCR from the corresponding cDNAs using vector-specific primers, and the products were gel-purified. The PCR products were radiolabeled by incorporation of [{alpha}-32P] dCTP during primer extension with random oligonucleotides. Hybridization of the probes to the membrane-bound DNA or RNA was performed at 65° in 0.5 M sodium phosphate, 7% SDS, 1 mM Na4EDTA. High stringency wash conditions were 65° in 0.5x SSC and 0.1% SDS. Low stringency wash conditions were 65° in 3x SSC and 0.1% SDS (20x SSC = 3 M NaCl and 0.3 M Na3citrate). Following the washes, the blots were subjected to autoradiography.

To help interpret the fragmentation patterns from the DNA gel blots for PAA1, PAC1, PAD1, PAE2, PAF1, PAG1, PBA1, PBB1, PBC1, PBD1, PBD2, PBE1 and PBF1, the presence or absence of BglII, EcoRI, and EcoRV restriction sites was determined. Specific gene sequences were PCR amplified from genomic DNA of A. thaliana ecotype WS using 5' and 3' gene-specific primers near the start and stop codons, respectively. The amplified fragments were digested with the three restriction enzymes individually and size-fractionated by electrophoresis in 0.8% agarose gels; their electrophoretic patterns were compared with those obtained with the uncut fragments.

Construction of URA3, TRP1, and LEU2 plasmids bearing various wild-type yeast and Arabidopsis 20S subunit genes:
The URA3-plasmids bearing yeast PRE3 (ß1), PRE1 (ß4), PRS3 (ß6), and PRE4 (ß7) for covering corresponding gene deletions in MHY1031, MHY1028, MHY1057, and MHY1029 were made as follows: To construct pRS316-PRE3, a 1.4-kbp BamHI-XhoI fragment from p15E3 (from W. HEINEMEYER, Universität Stuttgart, Germany) was subcloned into BamHI/SalI-digested pRS316 (SIKORSKI and HIETER 1989 Down). The p15E3 plasmid carries a 1.4-kbp NruI-SnaBI yeast genomic DNA fragment encompassing PRE3 (ENENKEL et al. 1994 Down) subcloned between the BamHI and XhoI sites of pRS315 (SIKORSKI and HIETER 1989 Down). pDP83.PRE1-PRE4 was made by subcloning a 1.4-kbp HindIII/BamHI PRE4-bearing fragment from p15E4 (from W. HEINEMEYER) into the similarly digested pDP83.E1-L8 plasmid (HEINEMEYER et al. 1991 Down); pDP83.E1-L8 [CEN14, URA3] carries a genomic PRE1 fragment. The PRS3 gene was amplified from yeast genomic DNA using PCR with the following primers: AGCTTGGAGTAGGCATT and AGGTACAGCACGGAAGAT. The 1.4-kbp PCR product was cloned into pCR2.1 (Invitrogen, Carlsbad, CA) and the insert was subsequently excised with EcoRI and subcloned into pRS316 (SIKORSKI and HIETER 1989 Down) to generate pRS316-PRS3. Other URA3-based plasmids carrying yeast 20S proteasome genes were described previously (CHEN and HOCHSTRASSER 1995 Down; ARENDT and HOCHSTRASSER 1997 Down).

For controls in the 5-fluoroorotic acid (FOA) assays, genomic DNA fragments for DOA5 ({alpha}5), PRE3 (ß1), PUP1 (ß2), PUP3 (ß3), PRE1 (ß4), DOA3-His6 (ß5), PRS3 (ß6), and PRE4 (ß7) were cloned into either the LEU2-plasmid YEplac181 or the TRP1-plasmid YCplac22 (GIETZ and SUGINO 1988 Down) to generate YCplac22-DOA5, YCplac22-PRE3, YCplac22-PUP1, YEplac181-PUP3, YCplac22-PRE1, YCplac22-DOA3-His6, YCplac22-PRS3 and YEplac181-PRE4, respectively. The PRE1 fragment was isolated as a 1.15-kbp EcoRI/HindIII fragment from pDP83.E1-L8. The PRS3 fragment was isolated as an EcoRI fragment from pRS316-PRS3. The PRE4-bearing HindIII/BamHI fragment was from p154E (see above).

The predicted full-length coding sequences including the 3' UTR for the various A. thaliana 20S subunits were isolated from the corresponding cDNAs by PCR using Pfu polymerase. The 5' primers were designed to add an NdeI site at the putative start codons and the 3' primers were designed to add an EcoRI site to the 3' UTR. The products were cloned into NdeI/EcoRI sites of a modified pRS424 plasmid described previously (FU et al. 1998 Down); it contains the 552-bp promoter sequence of yeast MCB1/SUN1, followed by NdeI and EcoRI cloning sites. All constructions were verified as correct by DNA sequence analysis.

Yeast strain construction:
S. cerevisiae strains used in this study are listed in Table 3. All the strains are congenic with MHY501 (CHEN et al. 1993 Down). MHY787, MHY991, MHY996, and MHY784 bearing chromosomal deletions of DOA5 ({alpha}5), PUP1 (ß2), PUP3 (ß3), and DOA3 (ß5), respectively, and harboring the corresponding wild-type gene on a URA3-plasmid were described previously (CHEN and HOCHSTRASSER 1995 Down; ARENDT and HOCHSTRASSER 1997 Down). The MHY1029 strain carrying a chromosomal deletion of PRE4 (ß7) was made as follows. Diploid MHY606 cells were transformed with the pre4{Delta}::HIS3 allele described by HILT et al. 1993 Down, and histidine prototrophs were selected. Putative heterozygotes were sporulated and subjected to tetrad analysis. A strain that yielded tetrads displaying 2:2 segregation of viability (with all viable segregants being His-) was then transformed with pDP83.PRE1-PRE4, and uracil prototrophs were selected. Following sporulation and tetrad dissection, Ura+/His+ segregants were identified. To create a pre1::LYS2 disruption allele, a 5.5-kbp LYS2 fragment was excised with ClaI from pUB39 (SPENCE et al. 1995 Down), the 5'-overhangs were filled in using T4 DNA polymerase, and the fragment was then ligated into pDP83.E1-L8 at the unique Ecl136II site within the PRE1 sequence. In the resulting plasmid pDP83pre1::LYS2, the LYS2 coding sequence is in the opposite orientation relative to the PRE1 sequence. The pre1::LYS2 allele was isolated from pDP83pre1::LYS2 on a 6.7-kbp NarI/PstI fragment that was then transformed into MHY606 cells. The resulting lysine prototrophs were screened for a single insertion in the PRE1 locus using the same strategy as that described for pre4{Delta}/+ heterozygotes. A pre1::LYS2/+ heterozygote was transformed with pDP83.PRE1-PRE4, uracil prototrophs were selected, and, following sporulation and tetrad dissection, Ura+ Lys+ segregants were identified resulting in the strain MHY1028.


 
View this table:
In this window
In a new window

 
Table 2. Amino acid sequence homology (% identity/similarity) of the Arabidopsis 20S {alpha} ({alpha}1/PAA-{alpha}7/PAG) and ß (ß1/PBA-ß7/PBG) subunitsa,b


 
View this table:
In this window
In a new window

 
Table 3. Yeast Strains

To produce a null allele of PRE3, the HIS3 gene was PCR amplified with primers, TGTCATTTTCACTTTTCCACTCGCAACGGAATCCGGTGGCCTCTTGGCCTCCTCTAG and CTTGCTTACGAAATTCCCTTCTAGGATACTTTGTCGTTCTAACAGTCGTTCAGAATGACACG. The primers were designed to amplify the entire HIS3 gene and flanked by sequences complementary to the 5' and 3' coding region of PRE3; the amplified fragments were transformed into MHY606 cells. The resulting heterozygote (made by P. CHEN), was transformed with pRS316-PRE3, uracil prototrophs were selected, and, following sporulation and tetrad dissection, Ura+/His+ segregants were identified, yielding MHY1031. An exactly analogous PCR-based deletion strategy was used to create the prs3-{Delta}2::HIS3 and pre9-{Delta}2::HIS3 null alleles. The primers used for making prs3-{Delta}2::HIS3 were GAGAGTAGCAAGACTATTGAACTATAAAGTTAAACAAAATATGGCTCTTGGCCTCCTCTAGand TTCTTTTTATACTATGATATGTATGCATTAATCTCTTTTTAGCTCTCGTTCAGAATGACACG. The MHY1057 strain was derived from a prs3{Delta}/+ heterozyote transformed with pRS316-PRS3. The primers used for amplifying pre9-{Delta}2::HIS3 were CATGGGTTCCAGAAGATACGATTCCAGGACAACAATTTTTCTCCCCCTCTTGGCCTCCTCTAG and GGCTCTCGAGCGATTCCGATCTTGAAATTTGCGCATCGTTCAGAATGACACG. MHY606 transformants were screened for the deletion by colony PCR, and, following sporulation, tetrads were dissected. The tetrads all contained four viable spores, with two His+ and two His- segregants. The His+ segregants (MHY1069) grew slightly slower at 30° than those His-. The expected structure at the PRE9 locus was confirmed by colony PCR of several complete tetrads. These data confirm the original observation that the PRE9 gene is not required for viability (EMORI et al. 1991 Down).

Yeast media and techniques:
Yeast rich (YPD) and minimal synthetic media (SD) were prepared as described previously (KAISER et al. 1994 Down). For complementation of amino acid-analog sensitivity, arginine and phenylalanine were omitted from the SD media and their analogs canavanine and p-fluorophenylalanine were added to 3 and 25 µg/ml, respectively. 5-fluoroorotic acid (FOA) was added to the SD media at a concentration of 1 g/l. Standard methods were used for genetic manipulation of yeast (KAISER et al. 1994 Down). Yeast transformation was carried out as described by GIETZ et al. 1995 Down, and cultures were incubated at 30°.

FOA and amino acid analog sensitivity assays:
Assays for complementation of essential genes using FOA resistance were performed as described by ARENDT and HOCHSTRASSER 1997 Down. Yeast strains bearing chromosomal deletions for the various 20S subunit genes and harboring the corresponding wild-type yeast genes on a URA3-plasmid, were transformed with plasmids bearing yeast or Arabidopsis 20S subunits genes. Transformants were patched on solid YPD media, incubated overnight to allow random loss of either plasmid, and then streaked on solid SD media containing FOA.

Complementation of the amino acid analog sensitivity of the yeast pre9{Delta} ({alpha}3) strain was performed as previously described (FU et al. 1998 Down). Yeasts were grown in liquid SD media to late logarithmic phase. Cultures were washed once with SD medium (minus arginine and phenylalanine), resuspended in the same medium to A600 = 1.0, and spotted in a 10-fold dilution series onto solid SD medium supplemented with canavanine and p-fluorophenylalanine.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Identification of Arabidopsis genes encoding subunits of the 20S proteasome:
As a first step in the analysis of the 20S proteasome from Arabidopsis, we initiated a detailed characterization of the corresponding genes. Using the 14 genes encoding the full complement of polypeptides from the yeast 20S proteasome (COUX et al. 1996 Down; HOCHSTRASSER 1996 Down) as queries, we identified 23 related sequences in the various nucleic acid sequence databases of A. thaliana, ecotype Columbia. Three of the genes corresponded to the two {alpha} subunits and the one ß subunit previously described by GENSCHIK et al. 1992 Down, GENSCHIK et al. 1994 Down and SHIRLEY and GOODMAN 1993 Down. The remaining twenty, which represented related but new loci, were 18 partially sequenced cDNAs obtained from expressed-sequence-tag (EST) databases and two genomic sequences deposited by the A. thaliana genome sequencing projects. [The search also detected related sequences in Genebank databases from other plant species such as rice, corn, soybean, spinach, alfalfa, tomato, and tobacco.] All but two of the cDNAs contained the entire coding region. For the two partial cDNAs [PAD2 and PAF2 (see Table 1)], the full coding sequences were obtained subsequently from an Arabidopsis cDNA library by PCR.

The cDNAs avaliable for the 21 proteasome genes (the exception are PAB2 and PAC2 which were identified only by genomic sequences) were sequenced in their entirety and used to derive the predicted full-length amino acid sequence of the corresponding proteins. Coding regions of PAB2 and PAC2 were assembled by alignment with cDNAs of their respective Arabidopsis paralogs, PAB1 and PAC1. The predicted start codon for most {alpha} and ß subunit genes were identified as the 5' most ATG codon adjacent to sequence conserved in orthologs from other species. In-frame stop codons were found immediately upstream of these presumptive start codons. For PAA1, PAG1, PBD1, the start codon was assigned as the 5' most ATG beyond the conserved region. No in-frame stop codons were present upstream of these putative start codons. For PBG1, the predicted start codon was assigned based on a comparison of the coding sequence with that of the propeptide present within its yeast and mammalian orthologs (e.g., HILT et al. 1993 Down).

Pairwise comparisons of derived amino acid sequences revealed that the 23 Arabidopsis 20S proteasome subunit genes cluster into two groups corresponding to the {alpha} and ß families detected in other organisms (COUX et al. 1996 Down; HOCHSTRASSER 1996 Down). The Arabidopsis {alpha} and ß families are weakly but significantly related to each other (20–30% amino acid similarity over specific regions of the corresponding proteins) further supporting the proposal that the {alpha} and ß families share a common progenitor (ZWICKL et al. 1992 Down; COUX et al. 1994 Down). Phylograms of the sequence comparisons showed that one to two genes in each Arabidopsis family are closely related to each of the seven {alpha} and seven ß subunit genes present in yeast (Figure 1). The phylograms adopted seven branches for both the {alpha} and ß subunits; each branch contained a single yeast {alpha} or ß subunit gene and one or more Arabidopsis genes. In all cases, the Arabidopsis {alpha} and ß subunits in each branch were more similar to specific subunits in yeast and other organisms than to other {alpha} or ß subunits in Arabidopsis (Figure 1 and data not shown). As examples, the {alpha}5 subunit (PAE1) is 66–78% similar to the {alpha}5 subunits from yeast, humans, Drosophila melanogaster, and Caenorhabditis elegans but only 48–57% similar to the other six Arabidopsis {alpha} subunits, whereas, the ß3 subunit (PBC2) is 58–66% similar to the ß3 subunits from yeast, humans, and C. elegans but only 29–45% similar to the other six Arabidopsis ß subunits. From this analysis, we conclude that this Arabidopsis collection encodes the full complement of 20S proteasome subunits with clear matches to each of its putative yeast and human orthologs. It is remotely possible that the 20S proteasome from plants has additional subunits not found in the animal and yeast complexes. However, comparisons of the SDS-PAGE profiles of the plant and animal 20S proteasomes suggest a similar polypeptide composition (KREMP et al. 1986 Down; SCHLIEPHACKE et al. 1991 Down; SKODA and MALEK 1992 Down; OZAKI et al. 1992 Down; FUJINAMI et al. 1994 Down; P. H. HATFIELD and R. D. VIERSTRA, unpublished results).



View larger version (30K):
In this window
In a new window
Download PPT slide
 
Figure 1. —Phylograms showing the amino acid sequence relationships of the complete collection of 20S proteasome subunits from A. thaliana with the 14 subunits that comprise the 20S proteasome from yeast (S. cerevisiae): (A) {alpha} subunits; (B) ß subunits. Yeast subunits designations {alpha}1–{alpha}7 and ß1–ß7 were based on the position of the subunits within the crystal structure of the yeast 20S complex (GROLL et al. 1997 Down). Phylograms were created using the UW-GCG computer programs Paupsearch and Paupdisplay, using the bootstrap analysis option. Distance along the horizontal axis separating two sequences is proportional to the divergence between the sequences. Protein sequence comparisons (% identity/similarity) of the Arabidopsis subunits to their yeast orthologs are indicated to the right.

On the basis of their homology with yeast subunits (Figure 1), the Arabidopsis collection was designated using the new systematic nomenclature for the yeast 20S proteasome which was organized from the crystal structure of the particle (GROLL et al. 1997 Down). The protein subunits, referred to as {alpha}1–{alpha}7 and ß1–ß7, likely assume the same positions in the particle as the equivalent yeast subunits based on our cross-species complementation data (see below). The corresponding genes were named using the protocol recommended for A. thaliana (MEINKE and KOORNNEEF 1997 Down). They are referred to as AtPAA-G and AtPBA-G (for A. thaliana Proteasome Alpha subunits A–G and Proteasome Beta subunits A–G) with numerical suffixes added to designate various members in each gene family. Table 1 relates the various Arabidopsis subunits with their presumptive orthologs from yeast and human. For five of the Arabidopsis subunits ({alpha}7, ß1, and ß5–ß7), a single gene was identified. For each of the remaining nine ({alpha}1–{alpha}6 and ß2–ß4), two independent genes sharing substantial similarity were detected; the average identity among the nine related pairs is 86 and 95% at the nucleotide and amino acid sequence levels, respectively. As examples, PAF1 and PAF2, encoding the {alpha}6 subunit, share 87 and 92% identity at the nucleotide and amino acid sequence levels, respectively, and PBB1 and PBB2 genes encoding the ß2 subunit, shared 88 and 98% identity at the nucleotide and amino acid sequence levels, respectively. Significant divergence at the nucleotide level and subsequent genomic DNA blot analysis (see below) indicate that the members in each group of paralogs represent independent loci and not polymorphic alleles in A. thaliana.

proteasome {alpha} subunits:
Thirteen of the Arabidopsis genes encode the seven {alpha}-type subunits. An amino acid sequence alignment and pairwise comparisons of sequence homology using a representative member of each of the seven {alpha} subunits are presented in Figure 2 and Table 2. Amino acid sequence identities/similarities within the family range from 34/44 to 47/56% with several regions exhibiting high conservation to corresponding subunits in other eukaryotes and to the T. acidophilum {alpha} subunit (LOWE et al. 1995 Down; GROLL et al. 1997 Down). The most conserved region is near the N-terminus [residues 9–33 in {alpha}1/PAA1 (Figure 2)], which assumes an {alpha}-helical structure necessary for assembly and/or subsequent stabilization of appropriate {alpha} subunit/{alpha} subunit contacts in the T. acidophilum and yeast complexes (LOWE et al. 1995 Down; GROLL et al. 1997 Down). The invariant Tyr residue at the N-terminal side of the {alpha}-helix, which plays a crucial role in this contact, is present in all the Arabidopsis {alpha} subunits [Tyr9 in PAA1 ({alpha}1) (Figure 2)].



View larger version (109K):
In this window
In a new window
Download PPT slide
 
Figure 2. —Derived amino acid sequence alignment of the representative members of the {alpha}-subunit gene family (PAA-PAG) from A. thaliana. The alignment was created using the computer program BoxShade 2.7 (Netserve@embl-heidelberg.de; Bioinformatics Group, Lausanne, Switzerland). Identical and similar residues are in reverse type and shaded boxes, respectively. The bracket indicates the N-terminal {alpha}-helical region required in the corresponding yeast subunits for interactions among the {alpha} subunits; the line shows the putative NLS sequence; and the double line identifies the amino acids that border the pore of the T. acidophilum {alpha}-subunit ring (LOWE et al. 1995 Down; GROLL et al. 1997 Down). The Tyr residue essential for assembly of the {alpha} ring is indicated by the arrowhead. ({bullet}) marks a collection of conserved Gly residues present in many {alpha} and ß subunits (COUX et al. 1994 Down). Sequences encoding subunit {alpha}4/PAD1 and {alpha}6/PAF1 were first described by GENSCHIK et al. 1992 Down and SHIRLEY and GOODMAN 1993 Down and previously designated TASG64 and AtPSM30, respectively. The amino acids sequence for the {alpha} subunit from T. acidophilum (TaAlpha) is included for comparison (ZWICKL et al. 1992 Down).

The pore in the seven-membered {alpha} ring of the yeast complex has not been resolved (GROLL et al. 1997 Down). However, in T. acidophilum, a 13-Å opening is lined by bulky hydrophobic residues [Tyr126 (LOWE et al. 1995 Down)]. Bulky aromatic hydrophobic residues are present at or near the same position in many of the Arabidopsis subunits (Tyr, Phe and/or Trp), suggesting that the pore in the plant complex is also hydrophobic (Figure 2). Several Arabidopsis {alpha} subunits (especially {alpha}6/PAF1 and PAF2), like their yeast orthologs, contain long C-terminal extensions. In the yeast 20S complex, these extensions project from the surface of the {alpha}-subunit disc, potentially providing important contact sites for accessory factors (GROLL et al. 1997 Down). A sequence potentially involved in nuclear localization of the yeast {alpha} subunits (TANAKA et al. 1990 Down) is present in the Arabidopsis counterparts [KKVPDK (residues 54–59) in {alpha}1/PAA1 (Figure 2)], suggesting a similar role in nuclear import. A number of contextually conserved Gly residues have been detected in numerous {alpha} subunits from a variety of species (COUX et al. 1994 Down). Most of these are also present in the Arabidopsis subunits [residues 20, 36, 41, 77, 135, 142, 158 and 169 in {alpha}1/PAA1 (Figure 2)].

proteasome ß subunits:
Ten of the Arabidopsis 20S proteasome genes encode the seven ß-type subunits. An amino acid sequence alignment and pairwise comparisons of sequence homology using a representative member of each of the seven ß subunits are presented in Figure 3 and Table 2. Consistent with results from other species (COUX et al. 1996 Down), the ß subunits are generally less related amongst themselves than are the {alpha} subunits with amino acid sequence identities/similarities ranging from 18/30 to 33/45%.



View larger version (102K):
In this window
In a new window
Download PPT slide
 
Figure 3. —Derived amino acid sequence alignment of the representative members of the ß-subunit gene family (PBA-PBG) from A. thaliana. The alignment was created using the computer program BoxShade 2.7. Identical and similar residues are in reverse type and shaded boxes, respectively. The N-terminal sequences predicted to be removed during the maturation of the subunits, PBA1 (ß1), PBB1 (ß2), PBE1-PBG1 (ß5–ß7) are indicated in lower case letters. The residues predicted to comprise part of the protease active site for subunits PBA1 (ß1), PBB1 (ß2), and PBE1 (ß5) are indicated by arrowheads; the double line identifies the amino acids that border the pore of the T. acidophilum ß-subunit ring (LOWE et al. 1995 Down; GROLL et al. 1997 Down); {blacksquare}, positions of amino acid residues that correspond to the S1 substrate specificity pocket in yeast PRE3 (ß1), PUP1 (ß2), and DOA3 (ß5) subunits; *, residues in subunit ß5 that possibly interact with the inhibitor lactacystin; {bullet}, a collection of conserved Gly residues present in many {alpha} and ß subunits (COUX et al. 1994 Down). Sequences encoding subunit PBF1 (ß6) were previously identified by GENSCHIK et al. 1994 Down and formly designated FAFP98. The amino acids sequence for the ß subunit from T. acidophilum (TaBeta) is included for comparison (ZWICKL et al. 1992 Down). The numbering of TaBeta starts with the internal Thr (Thr1; {diams}) released after removal of the 8-amino-acid prosequence.

The single ß subunit of T. acidophilum and five of the seven ß subunits in yeast and humans are synthesized as proproteins that are proteolytically processed before generating the catalytically active holoenzyme complex (SEEMULLER et al. 1995 Down; CHEN and HOCHSTRASSER 1996 Down; SCHMIDTKE et al. 1996 Down; ARENDT and HOCHSTRASSER 1997 Down). The ß3 (PUP3/C10) and the ß4 (PRE1/C7) subunits are not processed from their initial translation products (GROLL et al. 1997 Down; SCHMIDT and KLOETZEL 1997 Down). In yeast, prosequence cleavage is accomplished by the active ß1, ß2, and/or ß5 subunits working autocatalytically or in trans to the other ß subunits in the ring. N-terminal sequences related in length to the yeast and human prosequences are present in the Arabidopsis ß1, ß2, ß5–ß7 orthologs, suggesting that they are also synthesized as precursors that then require processing (Figure 3 and data not shown). The alignments predict that the Arabidopsis ß1, ß2, and ß5 subunits are cleaved between a Gly/Thr pair [e.g., Gly12/Thr13 in PBA1 (ß1)], thus liberating the Thr active site (see below). For PBF1 (ß6), the predicted cleavage site is between Glu4 and His5 (instead of Asp/His in yeast ß6) and for PBG1 (ß7), it is between Arg23 and Thr24 [identical to its human counterpart and similar to the Asn/Thr cleavage site in yeast ß7 (CHEN and HOCHSTRASSER 1995 Down; SCHMIDTKE et al. 1996 Down; GROLL et al. 1997 Down)]. Assuming these cleavage sites are correct, prosequences of 12 (ß1), 39 (ß2), 57 (ß5), 4 (ß6), and 23 (ß7) amino acids would be released.

Processing of the T. acidophilum ß subunit and the catalytically active yeast and human subunits ß1 (PRE3/Y), ß2 (PUP1/Z), and ß5 (DOA3/X) is necessary to expose a Thr1 residue at their N-termini, which then becomes organized into a novel active site involving the free {alpha}-amino group (SEEMULLER et al. 1995 Down; CHEN and HOCHSTRASSER 1996 Down). In the T. acidophilum complex, this amino group works in a catalytic tetrad that also contains the Thr1 hydroxyl group, Glu17, and the {epsilon} amino group of a proximal Lys33. Glu17 appears to maintain the correct orientation of Lys33 relative to Thr1. Three residues adjacent to Thr1 (Ser129, Asp166, and Ser169) in the three-dimensional complex also may be involved in forming the catalytic site (SEEMULLER et al. 1996 Down). From sequence alignments with the comparable T. acidophilum and yeast subunits, we propose that a similar protease active site is present in the ß1, ß2, and ß5 subunits of Arabidopsis. For instance in PBA1 (ß1), all six of the contextually essential residues are evident, including the Thr and Lys residues at positions 13 and 45, an invariant Asp at position 29 (which could substitute for the Glu residue), and the Ser/Asp/Ser sequence at positions 142, 179 and 182 (Figure 3). The ß3 (PBC1 and 2), ß4 (PBD1 and 2), and ß6 (PBF1) subunits lack the N-terminal Thr residue whereas subunit ß7 (PBG1) does have the conserved Thr but lacks the essential Lys, suggesting that these subunits, like their yeast and human counterparts (SCHMIDT and KLOETZEL 1997 Down), are not proteolytically active.

For the yeast 20S proteasome, mutagenesis studies indicate that subunit PRE3 (ß1) carries the peptidylglutamyl peptide hydrolyzing (acidic) activity, DOA3 (ß5) carries the chymotrypsin-like (hydrophobic) activity, and PUP1 (ß2) carries the trypsin-like (basic) activity (CHEN and HOCHSTRASSER 1996 Down; ARENDT and HOCHSTRASSER 1997 Down; HEINEMEYER et al. 1993 Down, HEINEMEYER et al. 1997 Down). Based on sequence similarity, we expect that the Arabidopsis counterparts, PBA1 (ß1), PBB1 and 2 (ß2), and PBE1 (ß5), will have similar peptidase specificities. In each, the predicted substrate specificity (S1) pocket [residues 32, 43, 47, 57, 61 and 65 in PBA1 (ß1)] is nearly identical to the pocket identified in the corresponding yeast ortholog (GROLL et al. 1997 Down) (Figure 3). Covalent inhibition of the yeast ß5 subunit by lactacystin involves hydrogen bonding with Thr1, Thr21, Met45, and Gly47 residues in the S1 pocket (GROLL et al. 1997 Down). Identically situated residues are present in Arabidopsis ß5 subunit PBE1 (except for Thr21, which is replaced by a Ser (Figure 3)], suggesting that lactacystin inhibits the Arabidopsis 20S proteasome as well. Like the {alpha}-ring pore, the pores of the ß rings in T. acidophilum and yeast are surrounded by one or more bulky hydrophobic residues (LOWE et al. 1995 Down; GROLL et al. 1997 Down). A similar hydrophobic patch is present in all the Arabidopsis ß subunits (Figure 3). Many of the Arabidopsis ß subunits also contain a number of contextually conserved Gly residues [residues 17, 23, 59, 114, 120, 130, 141 and 143 in PBA1 (ß1) (Figure 3)] present in the ß subunits from a variety of other species (COUX et al. 1994 Down).

Genomic DNA gel blot analysis:
To estimate the number of genes encoding each 20S proteasome subunit, genomic DNA from the A. thaliana ecotype WS was digested with one of three restriction endonucleases (BglII, EcoRI or EcoRV) and subjected to DNA gel blot analysis using probes specific for each of the 14 proteasome subunit gene families [PAA-PAG ({alpha}1–{alpha}7) and PBA-PBG (ß1–ß7)]. Following hybridization, the blots were washed at low stringency to enable detection of all closely related fragments (Figure 4A and Figure B) and then at high stringency to identify fragments specific to the probe (data not shown). To help assign genomic fragments to individual genes, specific DNAs were PCR amplified from A. thaliana genomic DNA and similarly digested with the same set of restriction endonucleases (data not shown).



View larger version (96K):
In this window
In a new window
Download PPT slide
 
Figure 4. —Genomic DNA gel blot analysis of genes encoding various Arabidopsis 20S proteasome subunits. DNA from A. thaliana ecotype WS was isolated, digested singly with BglII (B), EcoRI (E) or EcoRV (V), and size-fractionated by electrophoresis in 0.8% agarose gels. The DNA was transferred to Zeta-Probe membranes and hybridized to the gene-specific probes encoding various {alpha} and ß subunits as indicated in MATERIALS AND METHODS. (A and B) Hybridization patterns following low stringency washes using {alpha}-subunit and ß-subunit specific probes, respectively. Each band marked by a diamond represents a genomic DNA fragment that corresponds to the gene-specific probe used in that panel.

This analysis was consistent with the amino acid sequence alignments, indicating that many of the proteasome subunits are encoded by multiple genes in Arabidopsis. For the 6 {alpha}-subunits and the three ß subunits that are encoded by two identified cDNAs (Figure 1 and Table 1), DNA gel blot patterns agreed with the presence of two or more genes in the WS ecotype (Figure 4A and Figure B). For the PAA-PAF ({alpha}1–{alpha}6) subunit families, the DNA gel blots were consistent with only two genes. For the remaining 3 subunits encoded by the identified cDNAs [PBB (ß2), PBC (ß3) and PBD (ß4)], additional bands were detected that could not be unequivocally attributed to the two identified genes, suggesting the possibility of a third gene. For four of the five subunits where only a single cDNA was identified [PAG1 ({alpha}7), PBA1 (ß1), PBF1 (ß6), and PBG1 (ß7) (Figure 1)], the patterns were compatible with the presence of a single gene (Figure 4A and Figure B). For the other subunit represented by a single cDNA, PBE1 (ß5), additional bands were detected in the DNA gel blots suggesting that a close relative of PBE1 exists but has not yet been isolated.

RNA gel blot analysis of 20S proteasome subunits:
RNA gel blot analysis examined the expression patterns of the {alpha} and ß subunit genes in Arabidopsis. Blots of total RNA were probed with cDNAs for three of the {alpha}-subunits [PAA1 ({alpha}1), PAC1 ({alpha}3), and PAG1 ({alpha}7)] and three of the ß-subunits [PBD2 (ß4), PBE1 (ß5), and PBG1 (ß7)] and washed at high stringency (0.5x SSC and 65°) to restrict hybridization to closely related mRNAs (as determined by DNA gel blot analysis). Hybridization to a ß-tubulin cDNA and the polyubiquitin gene UBQ3 was also tested to confirm the integrity of the mRNAs (SUN and CALLIS 1997 Down). As shown in Figure 5, all seven RNA preparations contained intact rRNAs as detected by ethidium bromide staining. With the exception of the RNA from cauline leaves, near equal levels of ß-tubulin mRNA were apparent in all samples. mRNA for UBQ3 also was present in all tissues with a distribution that matched that reported previously by SUN and CALLIS 1997 Down; the levels were highest in etiolated seedlings and siliques, intermediate in roots and stems, and lowest in light-grown seedlings, cauline leaves, and flowers. In addition to detecting the UBQ3 mRNA, the UBQ3 probe also weakly detected mRNAs of different sizes that likely represent other A. thaliana ubiquitin genes (CALLIS et al. 1995 Down).



View larger version (72K):
In this window
In a new window
Download PPT slide
 
Figure 5. —RNA gel blot analysis of Arabidopsis 20S proteasome subunit gene expression. Total RNA isolated from light-grown seedlings (S), dark grown seedlings (E), roots (R), stems (St), cauline leaves (C), siliques (Si) and flowers (F) was size-fractionated on 1.2% agarose-formaldehyde gels, transferred to Zeta-Probe membrane and hybridized to the gene-specific probe as indicated. EtBR, duplicate RNA samples visualized by ethidium bromide staining. DNA probes include the following: PAA1, PAC1, and PAG1, which encode {alpha}1, {alpha}3 and {alpha}7 subunits, respectively; PBD2, PBE1 and PBG1, which encode ß4, ß5 and ß7 subunits, respectively; ß-TUB that encodes ß-tubulin; and UBQ3 that encodes a tetrameric polyubiquitin.

When probed with each of the six 20S proteasome subunit genes, corresponding mRNAs of the expected size were detected in the RNA prepared from each tissue (Figure 5). However, compared to those for ß-tubulin and UBQ3, the signal intensities obtained with the proteasome probes were substantially lower, suggesting lower levels of mRNA (Figure 5 and data not shown). The mRNA levels for each subunit were roughly similar in all tissues except siliques and flowers where slightly higher levels were seen relative to those of ß-tubulin. Despite the low levels of ß-tubulin and UBQ3 mRNA, mRNA for the six proteasome subunit genes could be detected in the RNA isolated from cauline leaves.

Arabidopsis {alpha}3, {alpha}5, and ß3 subunits can functionally replace their yeast orthologs:
Amino acid sequence conservation between the Arabidopsis 20S proteasome subunits and their counterparts in other organisms suggested that the Arabidopsis subunit families assume identical locations in the particle and perform similar functions (Figure 1). To examine whether this primary sequence homology could be translated into functional homology, we tested in yeast whether nine of the A. thaliana subunits ({alpha}3, {alpha}5, and ß1–ß7) could functionally replace their corresponding yeast orthologs (Table 1). For the eight subunits tested that are essential for yeast [{alpha}5/DOA5, ß1/PRE3, ß2/PUP1, ß3/PUP3, ß4/PRE1, ß5/DOA3, ß6/PRS3, and ß7/PRE4 (HOCHSTRASSER 1996 Down)], yeast strains were constructed which bear a chromosomal deletion of the respective proteasome subunit gene but express the corresponding wild-type gene from a URA3-plasmid (Table 3). High copy TRP1-plasmids bearing the various Arabidopsis 20S proteasome subunits were then introduced into these haploid yeast deletion strains by transformation, and the presence of the plasmid-born copy of the yeast gene was selected against by plating on 5-fluoroorotic acid (FOA) which is toxic to cells expressing Ura3. Streaked transformants that formed colonies on FOA plates were those in which the Arabidopsis 20S proteasome gene was able to rescue the loss of the yeast subunit gene.

All eight tested yeast strains bearing the 20S proteasome subunit genes on a URA3-plasmid could grow on FOA-containing media when transformed with a TRP1- or LEU2-plasmid containing the corresponding yeast subunit but not with a corresponding empty plasmid (Figure 6A). When the eight corresponding Arabidopsis subunits genes were tested similarly, only plasmids bearing PAE1 ({alpha}5) or PBC2 (ß3) rescued colony growth on FOA-containing media for yeast strains bearing deletions in the orthologous genes; PAE1 ({alpha}5) rescued the loss of DOA5 ({alpha}5) and PBC2 rescued the loss of PUP3 (ß3) (Figure 6B). However, the colony growth rate of the yeast deletion strains harboring either Arabidopsis PAE1 or PBC1 was slower than the rate of those containing the corresponding yeast genes, suggesting that the Arabidopsis {alpha}5 and ß3 subunits only partially restored yeast 20S proteasome function (Figure 6B and data not shown). In contrast, TRP1-plasmids bearing PBA1 (ß1), PBB1 (ß2), PBD1 (ß4), PBE1 (ß5), PBF1 (ß6), and PBG1 (ß7) failed to restore viability to yeast strains missing the orthologous yeast subunits (PRE3, PUP1, PRE1, DOA3, PRS3, and PRE4, respectively) (Figure 6B).




View larger version (97K):
In this window
In a new window
Download PPT slide
 
Figure 6. —Functional complementation of yeast 20S proteasome subunit genes with orthologs from A. thaliana. For the essential yeast subunit genes DOA5 ({alpha}5), PRE3 (ß1), PUP1 (ß2), PUP3 (ß3), PRE1 (ß4), DOA3 (ß5), PRS3 (ß6), and PRE4 (ß7), tests were performed with yeast strains bearing a chromosomal deletion for the corresponding 20S subunit gene and containing the wild-type gene on a URA3-plasmid. These yeast strains were transformed with an empty TRP1- or LEU2-plasmid (Vector Laboratories, Burlingame, CA) or one expressing the corresponding yeast (A) or Arabidopsis (B) subunit gene. The transformants were patched on YPD medium, incubated overnight, then streaked on FOA-containing media, and grown for 6 days at 30°. (C) Complementation of the amino-acid-analog growth sensitivity of a yeast strain bearing a chromosomal deletion of PRE9 ({alpha}3) with its Arabidopsis ortholog PAC1. Wild-type (PRE9) yeast or the pre9{Delta} strain was transformed with either an empty TRP1-plasmid (Vector) or one expressing Arabidopsis PAC1 ({alpha}3) or Arabidopsis PAE1 ({alpha}5). Cultures from the various transformants were resuspended to similar cell densities and spotted in a 10-fold dilution series on SD medium containing canavanine and p-fluorophenylalanine. The cells were grown for 7 days at 30°. Cell densities of the resuspended cultures prior to plating on the analog-containing media are shown on the right.

To demonstrate that successful complementation of doa5{Delta} and pup3{Delta} by PAE1 and PBC2 was specific for these Arabidopsis genes, we attempted to rescue the two yeast strains with other Arabidopsis subunit genes. Using the same FOA selection, none of the six other Arabidopsis {alpha} and ß subunits genes restored viability to the yeast doa5{Delta} and pup3{Delta} strains. PAC1 encoding an Arabidopsis {alpha}3 subunit was unable to complement the loss of DOA5 ({alpha}5) (Figure 7A). Likewise, PBA1, PBB1, PBD1, PBE1, and PBF1 encoding the Arabidopsis ß1, ß2, ß4, ß5, and ß6 subunits, respectively, were unable to complement the loss of PUP3 (ß3) (Figure 7B). To examine the possibility that the Arabidopsis ß3 subunit could complement deletions of other yeast ß subunits besides PUP3 (ß3), we attempted to rescue viability of the yeast pup1{Delta} (ß2), pre1{Delta} (ß4), doa3{Delta} (ß5), and prs3{Delta} (ß6) strains with PBC2. None produced viable colonies on FOA-containing media, indicating that Arabidopsis PBC2 (ß3) can exchange only with its yeast ortholog (data not shown).



View larger version (56K):
In this window
In a new window
Download PPT slide
 
Figure 7. —Specificity of functional complementation of 20S subunit genes between Arabidopsis and yeast. To detemine whether functional replacement of yeast DOA5 ({alpha}5) and PUP3 (ß3) is specific to the corresponding A. thaliana subunit genes, PAE1 and PBC2, respectively, complementation of the various subunit deletion strains was tested with other Arabidopsis {alpha} and ß subunit genes. (A) Complementation of doa5{Delta} containing wild-type DOA5 on a URA3-plasmid, with empty TRP1-plasmid (Vector) or one expressing Arabidopsis PAC1 ({alpha}3) or PAE1 ({alpha}5). (B) Complementation of pup3{Delta} containing wild-type PUP3 on a URA3-plasmid, with an empty TRP1-plasmid (Vector) or one expressing either Arabidopsis PBA1 (ß1), PBB1 (ß2), PBC2 (ß3), PBD1 (ß4), PBE1 (ß5), or PBF1 (ß6). In all tests, the transformants were patched on YPD medium, incubated overnight, then streaked on FOA-containing media, and grown for six days at 30°.

Although the yeast {alpha}3 subunit encoded by PRE9 is not essential, a haploid yeast strain containing a chromosomal deletion of PRE9 (Table 3) is sensitive to the amino acid analogs canavanine and p-fluorophenylalanine (EMORI et al. 1991 Down and Figure 6C). Resistance to the analogs could be restored by extrachromosomal expression of Arabidopsis PAC1 ({alpha}3) on a centromeric low copy plasmid (Figure 6C). When adjusting for cell number before plating, the growth rate of the yeast pre9{Delta} strain containing PAC1 was indistinguishable from that of wild-type yeast. Complementation of the analog sensitivity of pre9{Delta} was subunit-specific since introduction of another Arabidopsis {alpha} subunit gene PAE1 ({alpha}5) did not restore growth as compared to Arabidopsis PAC1 (Figure 6C). In fact, expression of PAE1 appeared to slightly reduce growth beyond that seen for pre9{Delta}.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Here we describe a collection of 23 genes encoding the full complement of subunits within the A. thaliana 20S proteasome. As in other eukaryotes, these genes can be organized into the {alpha} and ß subunit families encoding the seven distinct {alpha} subunits and seven distinct ß subunits of the 20S particle. Sequence alignments with orthologs from T. acidophilum, yeast, and animals indicate substantial amino acid conservation of the subunits occurs across kingdoms. Like others (ZWICKL et al. 1992 Down), we found that the various {alpha} and ß subunits from Arabidopsis are more similar in amino acid sequence to corresponding subunits in other organisms than to different Arabidopsis subunits. These data further support the notion that the {alpha} and ß families were derived from a common ancestral gene and that the multiple subunits in each family subsequently evolved prior to the divergence of the plant, fungal, and animal kingdoms. ESTs were available for most of the 20S subunit genes (the exceptions were PAB2 and PAC2, which were derived from genomic sequences), indicating that a majority of the genes represent functional loci. The expression patterns of six subunits were coincident, suggesting that the genes are coordinately regulated. Enhanced expression was observed in flower and silique tissue, implicating the 20S proteasome in flower development and seed maturation.

During the preparation of this manuscript, PARMENTIER et al. 1997 Down reported the sequence of a subset of the Arabidopsis 20S proteasome subunit genes described here. In general, their study agrees well with ours except that: (i) several of the cDNAs they reported encoded only part of the full amino-acid sequence (PAA1, PAE1, PAD2, PAF2, PBB1, PBG1); (ii) one gene in their collection likely contains a 2-bp deletion which truncates the reading frame (PBC1); (iii) two genes they characterized as unique may be duplicates of each other (homologs of PAD2); and (iv) we identified additional family members not found in their collection (PAA2, PAB2, PAC2, PAE2, PBC2). In contrast to their designations, our nomenclature matches more closely those recommended for Arabidopsis gene assignments (MEINKE and KOORNNEEF 1997 Down) and thus were retained here (see Table 1). PARMENTIER et al. 1997 Down and the various Arabidopsis genome sequencing projects have mapped many of the subunit genes on the A. thaliana chromosomes and found them to be randomly distributed.

Based on overall amino acid sequence conservation, we expect that the families of Arabidopsis {alpha} and ß subunits assume three dimensional structures similar to their T. acidophilum and yeast orthologs (LOWE et al. 1995 Down; GROLL et al. 1997 Down). This is especially true for the ß subunits, ß1, ß2 and ß5, where almost all of the residues essential for catalysis, substrate specificity, and inhibitor binding are contextually conserved (Figure 3). Moreover, N-terminal extensions similar to the prosequences found in their yeast and mammalian counterparts likely indicate that the Arabidopsis ß1, ß2, ß5, ß6, and ß7 subunits are synthesized as longer precursors which then require proteolytic processing to release the mature subunits that assemble into the 20S particle (HOCHSTRASSER 1996 Down; SCHMIDT and KLOETZEL 1997 Down). For ß1, ß2, and ß5, the exposed Thr residue at the N- terminus then would become part of the catalytic site of the subunit.

For three of the seven Arabidopsis subunits tested, this primary sequence similarity could be translated into functional homology. Genes encoding the {alpha} subunits, {alpha}3/PAC1 and {alpha}5/PAE1, and the ß subunit, ß3/PBC2, were able to specifically complement deletions of their yeast counterparts. PAE1 ({alpha}5) and PBC2 (ß3) rescued yeast deletions of the essential subunits DOA5 ({alpha}5) and PUP3 (ß3), respectively. PAC1 ({alpha}3) restored amino acid-analog resistance to a yeast strain missing PRE9 ({alpha}3). Previously, SEELIG et al. 1993 Down showed by stable transfection of mouse cultured cells that a Drosophila {alpha}2 subunit could structurally replace its ortholog in the mouse 20S proteasome. Here we extend this conservation by showing for the first time that proteasome subunits from one species can functionally replace their orthologs in another species. This cross-species complementation is remarkable given the extent of sequence divergence between the yeast and Arabidopsis {alpha}3, {alpha}5, and ß3 subunits (Figure 1) and the multiple structural constraints that must exist for each subunit to allow its proper assembly into and function within the 20S complex (GROLL et al. 1997 Down).

Those Arabidopsis subunit genes that failed to complement their yeast orthologs were PBA1 (ß1), PBB1 (ß2), PBD1 (ß4), PBE1 (ß5), PBF1 (ß6), and PBG1 (ß7). Whether this failure represents an inability of the Arabidopsis subunits to exchange with the yeast subunits in the complex or simply inadequate expression of functional Arabidopsis proteins is unknown. Interestingly, both the Arabidopsis {alpha} subunits tested successfully replaced their yeast counterparts but only one of the seven ß subunits tested did. This success rate could be related to the higher sequence similarity of the {alpha} subunits between yeast and Arabidopsis (Figure 1). Alternatively, it could imply that the assembly and/or function of the {alpha} ring is more tolerant of sequence variations or that the position of the {alpha} subunits on the periphery of the 20S cylinder confers greater structural freedom. These possibilities are supported by the observation that the only non-essential polypeptide in yeast is an {alpha} subunit [{alpha}3/PRE9 (EMORI et al. 1991