Genetics, Vol. 162, 1617-1630, December 2002, Copyright © 2002

kangaroo, a Mobile Element From Volvox carteri, Is a Member of a Newly Recognized Third Class of Retrotransposons

Leonard Duncana, Kristine Bouckaerta, Fay Yeh2,a, and David L. Kirka
a Department of Biology, Washington University, Saint Louis, Missouri 63130

Corresponding author: Leonard Duncan, Inc., 1502 Viceroy Dr., Dallas, TX 75235., len.duncan{at}cumbre.net (E-mail)

Communicating editor: D. J. GRUNWALD


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Retrotransposons play an important role in the evolution of genomic structure and function. Here we report on the characterization of a novel retrotransposon called kangaroo from the multicellular green alga, Volvox carteri. kangaroo elements are highly mobile and their expression is developmentally regulated. They probably integrate via double-stranded, closed-circle DNA intermediates through the action of an encoded recombinase related to the {lambda}-site-specific integrase. Phylogenetic analysis indicates that kangaroo elements are closely related to other unorthodox retrotransposons including PAT (from a nematode), DIRS-1 (from Dictyostelium), and DrDIRS1 (from zebrafish). PAT and kangaroo both contain split direct repeat (SDR) termini, and here we show that DIRS-1 and DrDIRS1 elements contain terminal features structurally related to SDRs. Thus, these mobile elements appear to define a third class of retrotransposons (the DIRS1 group) that are unified by common structural features, genes, and integration mechanisms, all of which differ from those of LTR and conventional non-LTR retrotransposons.


RETROTRANSPOSONS are mobile genetic elements that are found in a wide range of eukaryotes (XIONG and EICKBUSH 1990 Down; GABRIEL and BOEKE 1993 Down). They use reverse transcriptase (RT) to convert RNA intermediates into DNA copies that can then be integrated in new locations. Such replicative transposition means that retrotransposons can greatly influence genome size: it is estimated that ~40% of mammalian genomes and ~50% of the maize genome is composed of retroelements (SANMIGUEL et al. 1996 Down; SMIT 1999 Down). In addition to increasing genome size, retroelements can affect genome structure and function in other ways. Although they often inactivate the gene into which they insert, there are now many examples in which novel cis-regulatory sequences or protein domains are believed to have been acquired from retroelements (KUMAR and BENNETZEN 1999 Down; SMIT 1999 Down). For example, telomerase may have obtained its RT function this way (EICKBUSH 1997 Down). Retroelements can also transduce flanking sequences, alter splicing of chimeric pre-mRNAs, create pseudogenes, and promote unequal crossing over and other genomic rearrangements (FINNEGAN 1989 Down; COFFIN 1993 Down; KUMAR and BENNETZEN 1999 Down).

Most retrotransposons can readily be placed in either the long terminal repeat (LTR) or the non-LTR classes (Fig 1A; XIONG and EICKBUSH 1990 Down; GABRIEL and BOEKE 1993 Down). LTR retrotransposons are closely related to retroviruses and are bounded by direct repeats that contain transcription-initiation and polyadenylation signals. They typically contain one open reading frame (ORF) that encodes a nucleic acid binding protein (Gag) and a second ORF that encodes protease, RT, RNAse H, and integrase domains (Fig 1A). They sometimes contain a third ORF encoding an envelope protein. Non-LTR elements are simpler than (and probably ancestral to) the LTR class (XIONG and EICKBUSH 1990 Down): they lack terminal repeats, typically contain a poly(A)-rich sequence near their 3' ends, and usually contain one ORF encoding a Gag protein and a second ORF encoding endonuclease, RT, and RNAse H functions (Fig 1A).



View larger version (33K):
In this window
In a new window
Download PPT slide
 
Figure 1. Three major groups of retrotransposons. (A) The generic structures of LTR and conventional non-LTR retroelements. The figures are not to scale and are not intended to represent specific retrotransposons. Large black arrows, long terminal repeats; PR, protease domain; RT, reverse transcriptase domain; H, RNAse H domain; INT, integrase domain; UTR, untranslated region; EN, endonuclease; An, poly(A)-rich sequence. Major ORFs are indicated by shading. (B) Diagrams of four members of the DIRS1 group of retroelements. PAT and TOC1 contain SDR termini (solid and open triangles designated A and B), while the termini of DIRS-1 and DrDIRS1 contain inverted repeats (dotted triangles). DIRS-1 and DrDIRS1 also contain short terminal sequences (solid and open rectangles labeled A and B) that overlap with part of and, in some cases, extend beyond the inverted repeats and that are repeated in an internal complementary region (ICR), where they are present in a juxtaposed and inverted arrangement (A'B'). Inversion of the ICR would create a structure very similar to that found in the SDR elements. When comparing any two DIRS1-group elements, the A and B repeat units are similar in structure but unrelated in sequence. REC, recombinase. Accession numbers for PAT, DIRS-1, and TOC1 are X60774, M11339, and X56231, respectively. The copy of DrDIRS1 shown here is found in accession no. AL590134. The specific PAT and DrDIRS1 elements shown here contain nonsense codons (small solid triangles) in their rec genes. A second copy of DrDIRS1 (not shown) and, presumably, other copies of PAT contain an uninterrupted ORF-C.

LTR and non-LTR elements use distinct transposition mechanisms (CRAIG 1997 Down). LTR elements use RT to generate a free, linear cDNA copy of the element that is inserted into a target site by the action of an integrase related to DNA transposases. In contrast, non-LTR elements transpose by "target-primed reverse transcription," in which an RNA copy of the element is reverse transcribed only after an encoded endonuclease has cleaved the target DNA to generate a primer for reverse transcription.

RT-based molecular phylogenies generally identify retroelement clades whose individual members share other important features (DOOLITTLE et al. 1989 Down; XIONG and EICKBUSH 1990 Down; MCCLURE 1993 Down). For example, the elements placed in the gypsy, copia, BEL, and retrovirus clades on the basis of their RT sequence all contain LTRs. Furthermore, elements in the copia clade all have an integrase gene upstream of the RT gene, whereas in other LTR lineages (such as gypsy) the integrase gene is downstream of the RT gene.

A notable exception to this rule is the basis of this report. An RT-based phylogeny places two unorthodox retrotransposons—PAT, from the nematode Panagrellus redivivus (DE CHASTONAY et al. 1992 Down) and DIRS-1 from Dictyostelium discoideum (CAPPELLO et al. 1985 Down)—as nearest neighbors on a branch located near the copia and gypsy clades (MALIK and EICKBUSH 2001 Down). However, neither PAT nor DIRS-1 encode either an LTR type of integrase or a non-LTR type of endonuclease. Moreover, PAT and DIRS-1 possess termini that appear to be very different from one another and from the termini of any other retroelement group. PAT contains split direct repeat (SDR) termini, in which one copy of an ~300-bp direct repeat is found in the interior of the element (juxtaposed open and solid triangles, Fig 1B), while the second copy is bifurcated, with about one-half of it present at each terminus (solo solid and open triangles, Fig 1B), such that the half-repeats alternate in the order A, BA, B. DIRS-1, on the other hand, contains inverted terminal repeats (Fig 1B).

Our efforts to develop transposon-tagging tools for use in studying the developmental genetics of the multicellular green alga, Volvox carteri (KIRK 1998 Down), led us to the discovery of a highly mobile retroelement called kangaroo that contains SDR termini like those of PAT and that is closely related to PAT and DIRS-1 in terms of its RT sequence. Here we report on the characterization of kangaroo, its developmentally regulated expression, and its probable method of integration.

The unusual features shared by PAT, DIRS-1, and kangaroo-1 suggest that these elements must transpose by a mechanism distinct from that used by either LTR or non-LTR retroelements, despite the similarity of their RT proteins to those of well-known LTR elements. Because similar unorthodox elements are also found in the genomes of zebrafish and other metazoans, we conclude that this DIRS1 group represents a widespread third class of retrotransposons.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Volvox strains and cultivation conditions:
Strains HK 9 (male) and HK 10 (female) of V. carteri f. nagariensis were isolated and described by STARR 1969 Down and later provided to us by the University of Texas Culture Collection of Algae. EVE is our standard subclone of HK 10 (HARPER et al. 1987 Down) and CRH7 (MILLER et al. 1993 Down) and LDV45 (L. DUNCAN, unpublished data) are first- and second-generation subclones of EVE. The NIES male and female strains were isolated in 1983 from a slightly different area of Japan than HK 9 and HK 10 had been and were provided to us by the National Institute for Environmental Studies in Ibaraki, Japan. Volvox cultures were maintained in standard Volvox medium under standardized culture conditions (KIRK and KIRK 1983 Down) on a 16-hr-light/8-hr-dark cycle.

Nucleic acid purification and sequencing:
Volvox genomic DNA was purified as described (MILLER and KIRK 1999 Down). Plasmid DNA, {lambda}DNA, and hybridization probes were isolated using purification kits from QIAGEN (Valencia, CA). Volvox RNA was purified by minor modifications of a previously described method (KIRK and KIRK 1985 Down). DNA was sequenced with Applied Biosystem's (Foster City, CA) BigDye v. 2.0 premix using standard methods. Sequence data were collected on a departmental MJ Research (Waltham, MA) Basestation and analyzed using PHRED/PHRAP/CONSED (http://www.phrap.org).

Cloning kangaroo-1:
Two novel HindIII fragments that resulted from the insertion of kangaroo-1 into the nitA (nitrate reductase-encoding) gene of CRH7 were separately cloned from subgenomic libraries generated from size-selected HindIII fragments of CRH7 genomic DNA ligated to HindIII-digested pBluescript II KS. The resulting 3.8-kb (pLD41) and 7.5-kb (pLD40) inserts containing two segments of kangaroo-1 were sequenced using a combination of primer walking and the Genome Priming System (NEB) to a final estimated error rate of <0.01/10,000 bp. Next, we used OLV32 (5'-TTGTTGGGCTGCTTTCCTC-3') and OLV46 (5'-GGAAGCACACGAAGTTGG-3'; Fig 2A) as PCR primers to amplify from CRH7 genomic DNA a 2.7-kb DNA fragment that spans the internal HindIII site within kangaroo-1. The sequence of this fragment confirmed that these two HindIII fragments that contain kangaroo-1 sequences are juxtaposed as shown in Fig 2A.




View larger version (40K):
In this window
In a new window
Download PPT slide
 
Figure 2. kangaroo-1 is an unorthodox retrotransposon containing SDR termini. (A) Diagram of kangaroo-1 (rectangle) inserted in nitA (solid line). The solid and open triangles (labeled A and B) at opposite ends of the element represent the two halves of the SDR. The juxtaposed open and solid triangles (BA) represent the full-length interior direct repeat. The arrows above the diagram indicate the locations of PCR primers that were used to demonstrate that the two HindIII fragments containing kangaroo-1 are juxtaposed as shown here. The dashed vertical lines within the 89-bp repeat region represent the last two copies of the repeat, which are less well conserved. ORFs A, B, and C are shaded. Abbreviations for RT, H, and REC are as in Fig 1. The positions of relevant restriction enzyme sites are shown: H3, HindIII; S, SacI; R, RsaI; P, PstI; and Ap, ApaI. (B) Alignment of the first 10 89-bp repeats. Solid squares indicate positions that differ from the consensus. A dash indicates a gap. The numbers linked to the first and last nucleotides in the alignment correspond to the sequence of kangaroo-1.

Nucleic acid hybridization:
Southern and Northern blotting experiments followed standard procedures (SAMBROOK et al. 1989 Down). [{alpha}-32P]dCTP-labeled probes were prepared using the oligolabeling kit (Pharmacia, Piscataway, NJ) and purified using NucTrap columns (Stratagene, La Jolla, CA).

DNA fragments used as hybridization probes in the figures were as follows: Probe 2 was a 1.3-kb SmaI fragment isolated from pLD41. Probe 3 was an ~600-bp fragment generated by PCR amplification from pLD40 using oligonucleotides OLV25 (5'-GCACTTACGACCGTGAAACC-3') and OLV67 (5'-AAAACGGACGCTCCACGA-3'). Probe 4 was an ~4-kb XmnI-HindIII fragment isolated from pLD40. Probe 5 was an ~1-kb fragment generated by PCR amplification from pLD40 using oligonucleotides OLV22 (5'-ATCCATCTTCGTATTTGCTG-3') and OLV23 (5'-ACGAACGGGAGCACACTTAT-3'). Probe 6 was an ~1.4-kb HindIII-DraI fragment isolated from pLD40. Probe 7 was a 325-bp fragment generated by PCR amplification from pLD41 using oligonucleotides OLV50 (5'-AATAGCGGGAAAGGGATG-3') and OLV63 (5'-GAAGTGTGAAGCCGACGA-3'). The C38 probe has been described previously (TAM and KIRK 1991 Down).

Isolation of kangaroo-2 termini and preinsertion site:
kangaroo-2 was identified by Southern blot analysis (using probe 2, Fig 2A) as a 3.5-kb BamHI restriction fragment length polymorphism (RFLP) present in LDV45 but absent from EVE. The DNA fragment corresponding to this RFLP was cloned, generating pLD35. An ~400-bp fragment of DNA (probe 8) derived from the nonretrotransposon sequence that flanks the left side of kangaroo-2 was amplified from pLD35 by PCR using oligonucleotides OLV9 (5'-ATGGATGGGACTTGCTGCTGAC-3') and OLV10 (5'-CACCAATTTACCCGCCAGGATG-3'). Southern blotting experiments demonstrated that probe 8 hybridized with a single copy sequence in LDV45 and EVE genomic DNA and hybridized to the same 3.5-kb BamHI RFLP present in LDV45 that is recognized by probe 2.

To isolate the kangaroo-2 preinsertion site, probe 8 was used to screen an EVE genomic library constructed in {lambda}DASH II (KIRK et al. 1999 Down). The preinsertion site was sequenced directly from bacteriophage DNA isolated from three independent probe 8-hybridizing clones.

Next, the sequence of the preinsertion site was used to design an oligonucleotide, OLV58 (5'-CACAGGGCGGGCAGTTAT-3'), whose sequence was expected to be present within the nonretrotransposon DNA that flanked the right side of kangaroo-2. Two kangaroo specific oligonucleotides, OLV42 (5'-AGATTTGAGGCAGAGTAGG-3') and OLV43 (5'-AGAAGACACAGTCGGATGAG-3'), were separately used in conjunction with OLV58 to PCR amplify an ~1-kb fragment from LDV45 genomic DNA containing the junction between the right side of kangaroo-2 and flanking DNA. Both independent PCR products were sequenced.

Sequencing of retrotransposon:flanking DNA junctions from kangaroo-3 through kangaroo-13:
Plasmids pLD33 and pLD34 were obtained by screening a partial LDV45 genomic library with kangaroo probe 2, and they contain distinct inserts that include the left termini of kangaroo-5 and kangaroo-4, respectively. kangaroo-3 and kangaroo-6 through kangaroo-13 were isolated by screening an EVE {lambda} genomic library (KIRK et al. 1999 Down) with a labeled 2.8-kb XmnI DNA fragment of kangaroo-1 derived from pLD40. The kangaroo:flanking DNA boundary sequences for several of these clones were determined by directly sequencing the corresponding purified bacteriophage DNA with oligonucleotides that hybridize to the left or right terminus of kangaroo elements. For kangaroo-3 and kangaroo-6, however, SalI DNA fragments containing the right side of the retroelement plus associated flanking DNA were first subcloned into pBluescript II SK to generate pLD43 and pLD42.

PCR amplification of a portion of the putative circular form of kangaroo:
PCR reactions containing 0.5 µM OLV2 (5'-AAGACACAGTCGGATGAGGAG-3'), 0.5 µM OLV93 (5'-CATTCTGGTGTCCTCCTT-3'; Fig 7B), and 0.5 µg of EVE DNA were carried out using standard methods (SAMBROOK et al. 1989 Down). The predominant PCR product, which was the expected size (850 bp) to have been generated from a circular form of kangaroo, was cloned into pGEM-T Easy (Promega, Madison, WI) to generate pLD53, which was then sequenced.



View larger version (99K):
In this window
In a new window
Download PPT slide
 
Figure 3. The RT from kangaroo-1 is closely related to the RT present in other members of the DIRS1 group. A portion of the ORF-B predicted amino acid sequence from kangaroo-1 (kan) is aligned with the deduced RT proteins of DIRS-1, DrDIRS1, and PAT. The RT and RNAse H domains are outlined. Conserved regions 2–7 within the RT superfamily (XIONG and EICKBUSH 1990 Down) are indicated by bars above the alignment. The solid triangle in region 5 indicates the first residue of the (Y/F)XDD box (see text). The alignment begins slightly upstream of region 2 and continues to the termination codon of each ORF (asterisk). The solid circles within the RNAse H domain indicate residues known to be present in the enzyme's active site and believed to be important for catalysis. The alignment was created using CLUSTALX (with manual refinement of the RNAse H domain) and MacBoxshade. Identical and similar amino acids are indicated by black and gray shading, respectively, using a 75% consensus threshold. A dot represents a gap.



View larger version (117K):
In this window
In a new window
Download PPT slide
 
Figure 4. kangaroo-1 is a member of a dispersed repetitive family. Autoradiograms of Southern blots containing 2 µg of restricted genomic DNA from five closely related V. carteri f. nagariensis strains: CRH7, EVE, HK9, NIES female, and NIES male (see MATERIALS AND METHODS). (A) HindIII-restricted DNA hybridized with probe 2 (see Fig 2A). (B) SacI-restricted DNA hybridized with probe 3 (see Fig 2A).



View larger version (39K):
In this window
In a new window
Download PPT slide
 
Figure 5. Most kangaroo elements have the same general structure as kangaroo-1. (A) Autoradiogram of a Southern blot hybridized with probe 2 (see Fig 2A). Lane 1, 6 ng of RsaI-digested pLD41 (which contains the left half of kangaroo-1); lane 2, 1.5 µg of EVE genomic DNA digested with RsaI. The 1.2-kb hybridizing band flanked by dots in lane 1 is a fragment of pLD41 that contains only a short segment of kangaroo-1 and whose size is unrelated to the true structure of the retrotransposon. (B) Autoradiogram of a Southern blot hybridized with probe 3 (see Fig 2A). Lane 1, 6 ng of ApaI-digested pLD40 (which contains the right half of kangaroo-1); lane 2, 1.5 µg of ApaI-digested EVE genomic DNA; lane 3, 6 ng of pLD40 digested with PstI; lane 4, 1.5 µg of PstI-digested EVE genomic DNA. (C) Aligned sequences of the retrotransposon:flanking DNA junctions of several independent kangaroo clones (kangaroo-1 through kangaroo-13). The kangaroo sequences are enclosed by a rectangle and shown in uppercase. Only a short segment of each terminus of the kangaroo elements is shown. Flanking DNA is in lowercase and delineated by arrows. Note that (with one exception) the kangaroo elements are flanked by a dT residue on both sides (solid circles), and it is unclear which residue is derived from the retroelement and which is derived from the target DNA; possibly, the rectangle delineating the boundaries of kangaroo should be shifted 1 bp to the left. HDR, half direct repeat; CON, conserved target DNA sequences; Y, pyrimidine; R, purine.




View larger version (63K):
In this window
In a new window
Download PPT slide
 
Figure 6. Multiple developmentally regulated transcripts from kangaroo are observed. (A) A diagram of kangaroo-1 showing the extent of five kangaroo cDNA clones (2, 4, 5, 11, and 13) and the locations of sequences used as probes for the Northern blots shown in C–E. Also shown are the deduced structures of the four major kangaroo transcripts (a–d) shown in C–E. TGTAA, volvocalean polyadenylation signal (KIRK 1998 Down); An, poly(A) tail. (B) The asexual life cycle of V. carteri (KIRK 1998 Down). An individual V. carteri spheroid (time point 1) contains only two cell types: large asexual reproductive cells called gonidia in the interior of the sphere and small, terminally differentiated somatic cells at its surface. A 24-hr light-dark cycle (inner circle) can be used to synchronize development. Under these conditions, gonidia become mature and begin to divide near the end of one light period, and embryogenesis is completed in the dark. Following the completion of mitotic divisions, the embryo turns inside out in a process called inversion to produce a juvenile, which is a miniature version of an adult spheroid. During the second light period, the two cell types of the juvenile differentiate and both the juveniles and parental spheroids expand by deposition of extracellular matrix (ECM). Near the end of the second dark period, the juveniles digest holes in the parental ECM and swim away. The cycle is completed when the gonidia of the juvenile initiate a new round of embryogenesis. The 10 time points at which total RNA was isolated from synchronized gonidia, embryos, or juveniles are indicated and correspond to the lane numbers shown in C–E. Somatic cells from the parental spheroids were removed from all samples prior to RNA isolation. (C and D) Autoradiograms of Northern blots containing 10 µg of total RNA isolated at time points 1–7 as indicated in B and hybridized with the indicated probes. (E) Autoradiogram of a Northern blot containing 8 µg of total RNA isolated at time points 3, 4, 6, and 8 and hybridized with probe 7. As an RNA-loading control, the same blots were stripped and rehybridized with a probe for a transcript of constant abundance, C38 (bottom).



View larger version (22K):
In this window
In a new window
Download PPT slide
 
Figure 7. A model for kangaroo insertion. (A) Comparison of the observed pre- and postinsertion target-site sequences for kangaroo-1. The postinsertion product could be generated if integration occurred by a single DNA crossover event between the target site (boxed 5'-ctg-3') and the circle junction that is created by ligation of the ends of the linear form of kangaroo (boxed 5'-ATG-3'). Here, the crossover (black X) is shown as being 3' to the dT residues present in the recombining DNAs. Alternatively, the same postinsertion product could be generated if the DNA crossover occurred 5' to the dT residues (not shown). This recombination event could be catalyzed by the kangaroo Rec protein (see text). (B) PCR amplification of the kangaroo circle junction region. PCR amplification of linear, integrated forms of kangaroo with OLV93 and OLV2 would not be expected to produce a product (left), but amplification from a closed-circle DNA form of kangaroo would produce a product with a precisely defined size and junction (boxed 5'-ATG-3') between the two newly juxtaposed half-repeats (right). A PCR product whose size and sequence is fully consistent with the existence of such a circular form was obtained (see text).

Isolation of kangaroo-hybridizing cDNA clones:
We purified several clones that hybridized with kangaroo probe 4 (Fig 6A) from EVE cDNA libraries constructed in {lambda}gt10 (TAM and KIRK 1991 Down; clones {lambda}5-11 and {lambda}5-13) or {lambda}-Uni-ZAP XR (B. TAILLON and D. KIRK, unpublished data; clones {lambda}5-2, {lambda}5-4, and {lambda}5-5). The inserts in {lambda}5-2, {lambda}5-4, and {lambda}5-5 were converted to phagemids using the Rapid Excision kit (Stratagene) to generate pLD48, pLD49, and pLD50, respectively. The inserts from {lambda}5-11 and {lambda}5-13 were PCR amplified from purified bacteriophage DNA using {lambda}gt10 forward and reverse oligonucleotides and cloned into pGEM-T Easy to generate pLD56 and pLD57, respectively.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Isolation of kangaroo-1:
MILLER et al. 1993 Down previously used a selection for chlorate-resistant individuals to enrich for V. carteri mutants carrying transposon insertions in the nitrate-reductase-encoding gene, nitA (GRUBER et al. 1992 Down). One such mutant, CRH7, was found to contain a large insertion within nitA that was unrelated to the transposon that became the major focus of that study (MILLER et al. 1993 Down). We confirmed the existence of such an insertion element by probing a Southern blot containing HindIII-restricted genomic DNAs with a probe derived from the 5' end of the nitA coding region: The ~2.5-kb HindIII fragment derived from the wild-type nitA gene was replaced in CRH7 by 3.8- and 7.5-kb restriction fragments (data not shown), a result consistent with the insertion of an ~9-kb DNA element. We cloned and sequenced both novel HindIII fragments, thereby establishing the structure of the inserted element (Fig 2A), which we call kangaroo-1. We then confirmed (as described in MATERIALS AND METHODS) that the two HindIII fragments are juxtaposed as shown.

kangaroo-1 is an unorthodox retrotransposon:
kangaroo-1 possesses termini (solo solid and open triangles, Fig 2A) that are distinct from those of DNA transposons and most retrotransposons and that are similar in structure (but not in sequence) to the SDRs found in the PAT retroelement (Fig 1B). The terminal half-repeats of kangaroo-1 are identical in sequence (data not shown) to their counterparts within the full-length internal repeat (juxtaposed open and solid triangles, Fig 2A). Near its left end, kangaroo-1 also contains ~12 contiguous copies of an 89-bp sequence (Fig 2A). The first 10 copies of the 89-bp repeat are extremely similar, although many of them can be distinguished by a small number of nucleotide deletions and/or polymorphisms (Fig 2B). The last two repeats (not shown) are less well conserved and more difficult to align.

The two largest uninterrupted ORFs predicted by the nucleotide sequence of kangaroo-1 are shown in Fig 2A. ORF-A and ORF-B potentially encode proteins 417 and 829 amino acids long, respectively. Importantly, a reverse position-specific BLAST search revealed that a portion of the deduced amino acid sequence of ORF-B is strikingly similar (E = 6e-13) to the RT family of proteins in the Pfam database (BATEMAN et al. 2002 Down). This similarity extends over and includes the conserved RT regions 2–7 as defined by XIONG and EICKBUSH 1990 Down and the key conserved residues of RNAse H (DOOLITTLE et al. 1989 Down; MCCLURE 1993 Down; Fig 3). Region 5 of RT includes a highly conserved (Y/F)XDD box that is thought to be essential for binding divalent metal ions (KOHLSTAEDT et al. 1993 Down). Although region 5 of RT from kangaroo-1 and two other independent kangaroo clones (data not shown) encodes a less common LIDD (solid triangle, Fig 3), such divergence has been observed previously in other RT and related proteins (DOOLITTLE et al. 1989 Down; XIONG and EICKBUSH 1990 Down). We therefore conclude that kangaroo-1 encodes RT/RNAse H and is likely to be a retrotransposon. We note that in terms of nucleotide sequence, predicted amino acid sequences, overall organization, and phylogenetic position, kangaroo-1 is clearly distinct from members of the copia class of retrotransposons (e.g., Osser) that are found in V. carteri (LINDAUER et al. 1993 Down).

Interestingly, the RT protein from kangaroo-1 is most closely related to the RT proteins encoded by PAT (BLASTP score: E = 4e-24) and DIRS-1 (BLASTP score: E = 6e-16). Furthermore, as we discovered through TBLASTN searches of the databases and as was reported by GOODWIN and POULTER 2001 Down, the Danio rerio (zebrafish) genome contains several copies of a 6.1-kb retrotransposon (DrDIRS1) that is strikingly similar in structure to DIRS-1 (Fig 1B) and encodes a deduced RT protein very similar to that of kangaroo-1 (BLASTP score: E = e-16). An alignment of a portion of the RT/RNAse H domains from these four related retroelements is shown in Fig 3.

When we used PAUP* 4.0 to perform a neighbor-joining phylogenetic analysis of the RT domains of kangaroo-1 and 22 other RT proteins, a tree (not shown) was produced that was very similar in overall topology to those previously published (XIONG and EICKBUSH 1990 Down; MCCLURE 1993 Down; MALIK and EICKBUSH 2001 Down). This phylogram indicated (with a bootstrap value of 77%) that the RT protein sequences from kangaroo-1, PAT, DIRS-1, and DrDIRS1 constitute a clade—the DIRS1 group—that apparently diverged after the copia group and at about the same time as the retrovirus and gypsy groups. This finding suggests that although the DIRS1 group of retrotransposons includes members with two different types of unusual termini, these subfamilies may nevertheless share a common mode of replication. In this regard, it is particularly noteworthy that the similarity between the RT proteins from the four DIRS1-group members extends >100 amino acids beyond the RNAse H domain (Fig 3). This conserved C-terminal extension is apparently not found in other RT proteins and may indicate that DIRS1 members share some unknown, conserved function, possibly related to their unusual mode of transposition (see below).

The kangaroo-1 sequence includes nothing suggestive of protease or envelope functions but, like most other retrotransposons, it does contain a large ORF upstream of the RT gene (ORF-A, Fig 2A). An ORF in this location often encodes a Gag protein with one or more C2HC "zinc-finger" motifs thought to bind nucleic acids (REIN et al. 1998 Down). However, ORF-A from kangaroo-1 lacks any discernible zinc-finger motif and BLASTP searches fail to identify any other protein with significant sequence similarity. However, our observation that ORF-A-specific transcripts are developmentally regulated (see below) suggests that ORF-A may play a role in retrotransposition. Finally, as discussed below, kangaroo-1 also encodes a recombinase that is distinct from the integrases and endonucleases normally associated with LTR and non-LTR elements, respectively.

kangaroo-1 is a member of a dispersed, repetitive family of mobile elements:
We compared the kangaroo elements of five closely related V. carteri f. nagariensis strains on DNA blots (Fig 4). Both probe 2 from the left side and probe 3 from the right side of kangaroo-1 (see Fig 2A) recognized numerous, discrete bands in all strains (Fig 4A and Fig B). Many bands appeared to be present in all strains, but others were present in only one or a few of the strains examined. Most strikingly, many polymorphisms were visible between CRH7 and its clonal progenitor, EVE, which have been separated in culture for only a few years. Strains that have been isolated from one another for longer periods—such as EVE and HK9, which have been separate for at least 35 years—showed a correspondingly greater number of RFLPs. Because the restriction enzyme/probe combinations used were chosen to produce and reveal fragments with one end derived from kangaroo-1 and the other end derived from flanking DNA, these results are consistent with kangaroo-1 being a member of a large family of dispersed mobile elements.

Most members of the kangaroo family have similar structures:
To determine whether other kangaroo elements in the genome possessed the same general structure as kangaroo-1, we probed a blot of RsaI-digested EVE DNA with probe 2, which covers the region of kangaroo-1 containing the 89-bp repeats (see Fig 2A). The predominant hybridizing band that was detected (Fig 5A, lane 2) was identical in size (~2 kb) to the hybridizing fragment produced by RsaI digestion of cloned kangaroo-1 (Fig 5A, lane 1), although numerous other bands of lower intensity were also seen. Similar results were obtained using other restriction enzymes and probe 3, which is derived from the opposite side of kangaroo-1 (Fig 5B). We interpret these results to mean that a large fraction of the kangaroo elements within V. carteri possess the same general structure as kangaroo-1.

To determine whether other kangaroo elements have the same SDR termini as kangaroo-1, we cloned several distinct kangaroo-hybridizing DNA fragments that contain one or both ends of the retrotransposon and sequenced portions of these clones using oligonucleotides designed to prime just inside each terminus and read into the DNA flanking the insertion site. One of these clones, kangaroo-2, corresponds to a recent insertion that is present in strain LDV45, but absent in its progenitor, EVE (data not shown), providing additional evidence of kangaroo mobility. The other clones contained randomly selected kangaroo-hybridizing fragments from EVE (kangaroo-3 and kangaroo-6 through kangaroo-13) or from LDV45 (kangaroo-4 and kangaroo-5) genomic libraries. Fig 5C shows an alignment of these flanking DNA:kangaroo junctions, with the left and right ends of kangaroo oriented as in Fig 2A. We found that these kangaroo clones are identical in sequence on the left side of the alignment beginning with the sequence 5'-TGCATGTTGATTAA-3' and, with one exception, are also identical on the right side of the alignment until they all diverge after the sequence 5'-GACGTTTAAGCAAT-3' (Fig 5C). We conclude that the majority of kangaroo elements contain SDR termini similar to those of kangaroo-1.

Analysis of kangaroo insertion sites:
As shown above the alignment in Fig 5C, we observed some conservation in the DNA sequences that flank the various kangaroo elements that we have characterized, which suggests that kangaroo integration may exhibit some degree of target-site specificity. Most notably, all but one of the kangaroo insertions shown are bordered by the nucleotide dT on both their 5' and 3' ends (solid circles, Fig 5C). By comparing pre- and postintegration sites, we found that kangaroo-1 had inserted into the sequence 5'-CTG-3', and kangaroo-2 had inserted into the sequence 5'-CTT-3' (Fig 5C and data not shown). These findings lead us to conclude that the dT residue at one kangaroo–flanking DNA junction is derived from the target site, while the other is derived from the retrotransposon. However, there is presently no way to be certain which dT is derived from which source. Thus (as noted in the caption to Fig 5C) there is a one-nucleotide uncertainty regarding the boundaries of the retrotransposon and its target site.

kangaroo expression is developmentally regulated:
We isolated from V. carteri cDNA libraries several clones that hybridized with kangaroo probe 4 (Fig 6A). These clones fall into two classes. Members of the first class (clones 2, 4, 5, and 13) encode all or part of ORF-A (Fig 6A) and terminate 13–16 bp downstream of a volvocalean polyadenylation signal sequence (5'-TGTAA-3'; KIRK 1998 Down) that is located just upstream of ORF-B. cDNA 4 contains the longest insert of this class (~2.1 kb) and corresponds to a transcript containing two exons with canonical splice sites. Interestingly, the intron includes the region of 89-bp repeats. Because cDNA 4 begins near the left end of kangaroo-1, it probably represents a full-length or nearly full-length cDNA clone. This suggests that a promoter may reside within the left half-repeat. The second cDNA class has only a single member, cDNA 11 (Fig 6A). This apparently partial cDNA clone corresponds to a transcript that encodes a portion of ORF-B and whose processed 3' end terminates midway through the internal full-length repeat.

We next analyzed the accumulation of kangaroo transcripts by Northern blot analysis using developmentally staged RNAs harvested at various points during the asexual life cycle of V. carteri, which is outlined in Fig 6B. Using probe 4 (Fig 6A), we found that four major kangaroo-hybridizing transcripts (2.0, 3.8, 7.1, and 9.0 kb; a–d in Fig 6, C–E) are produced during development. Such transcripts were virtually undetectable in precleavage gonidia, began to accumulate during cleavage, reached a maximum level shortly after inversion, and then declined dramatically by 6 hr later (Fig 6C, lanes 1–7). These transcripts then remained at low levels throughout the rest of the asexual life cycle (time points 8–10; Fig 6E, lane 8 and data not shown). Because the transcripts reached maximum abundance during the dark period at the end of embryogenesis, but are present at much lower levels during the dark period 24 hr later (time points 9 and 10, Fig 6B; data not shown), we conclude that transcript accumulation is controlled by developmental rather than circadian factors.

Transcript a probably corresponds to cDNA 4 (Fig 6A), because it is of the appropriate size (~2.1 kb), and it hybridizes with ORF-A-specific probe 6 (data not shown) but not with intron-specific probe 7 (Fig 6E, lane 6), or with ORF-B-specific probe 5 (Fig 6D, lane 6), or with probe 3, which is derived from the right side of kangaroo (data not shown).

Transcript d is about the right size to be a full-length, unspliced kangaroo RNA species (Fig 6A). Consistent with this view, transcript d hybridizes with all of the kangaroo probes that we have used in Northern blotting experiments (Fig 6, C–E and data not shown). Presumably, transcript d is the template used during reverse transcription.

The discovery of an intron within kangaroo-1 prompted us to also examine the nature of transcripts b and c. We found that the hybridization patterns for transcript b are identical to those of transcript a (Fig 6C and Fig D and data not shown) except that b also hybridizes with the intron-specific probe 7 (Fig 6E, lane 6); we also found that the hybridization patterns for transcript c are identical to those of transcript d (Fig 6C and Fig D, and data not shown) except that c fails to hybridize to the intron-specific probe 7 (Fig 6E, lane 6). These results lead us to propose that transcripts b and c have the structures shown in Fig 6A. The sizes of transcripts b and c are consistent with this interpretation.

Finally, because cDNA 11 (Fig 6A) apparently does not correspond to any of the major kangaroo-hybridizing transcripts visualized by Northern blot analysis, it seems likely that it represents a relatively low-abundance message.

kangaroo may integrate as a closed-circle, double-stranded DNA copy:
A common feature of LTR and non-LTR retrotransposon integration is the generation of element-specific target-site duplications (GABRIEL and BOEKE 1993 Down). Thus, one interpretation of the finding that kangaroo elements are bounded by single dT residues (Fig 5C) is that kangaroo may integrate using a conventional retrotransposon mechanism that duplicates the conserved dT target-site residue. However, to our knowledge, no LTR or non-LTR retrotransposon is known to create a single-base-pair target-site duplication. More importantly, kangaroo and other members of the DIRS1 group do not encode a conventional DDE-integrase or a non-LTR-like endonuclease. These results and the asymmetric nature of kangaroo's termini lead us to propose the insertional model presented in Fig 7A, which involves an extrachromosomal DNA circle very similar to the one that was proposed as an intermediate in the replication of DIRS-1 (CAPPELLO et al. 1985 Down) and a second, related element called TOC1 (DAY et al. 1988 Down; see below). Specifically, it is postulated that a critical intermediate formed during retrotransposition of kangaroo is an extrachromosomal, closed-circle, double-stranded DNA copy in which the ends of the retroelement are fused to generate an identical second copy of the full, interior direct repeat (Fig 7A). As shown in Fig 7A, the integration product that is observed for kangaroo-1 (Fig 5C) could be produced by a DNA crossover event between the newly formed circle junction of kangaroo and the target site. The integration product produced by insertion of kangaroo-2 (Fig 5C) can be explained in a similar manner (not shown).

In accord with this model, we were able to use the PCR primers shown in Fig 7B to amplify a fragment from EVE DNA whose size and sequence is fully consistent with the product that would be produced from the postulated closed-circle DNA intermediate––but not from a linear—DNA form of kangaroo. It is formally possible that such a product could have been produced by amplification from two copies of kangaroo integrated in the genome in a head-to-tail manner. However, we believe that even if such a tandem juxtaposition of linear kangaroo elements were to exist in the genome, it would be unlikely to have coincidentally generated a sequence identical to that which would be created by ligation of the half-repeats in a circular kangaroo form.

kangaroo encodes a protein related to {lambda}-site-specific integrase:
GOODWIN and POULTER 2001 Down reported that members of the DIRS1 group of retrotransposons potentially encode a protein (ORF-C, Fig 1B) related to the Int family of recombinases (NUNES-DUBY et al. 1998 Down). The quintessential member of this protein family is the integrase protein from bacteriophage {lambda}, which catalyzes the integration and excision of the {lambda}-genome at a specific site within the Escherichia coli chromosome (LANDY 1989 Down). Int family members also catalyze the non-site-specific integration of circular forms of conjugative (DNA) transposons (SCOTT and CHURCHWARD 1995 Down). It is important to note that the Int recombinases are distinct in structure and catalytic mechanism from the DDE-type integrases normally associated with retroviruses and LTR retrotransposons (HAREN et al. 1999 Down) and from the endonucleases associated with non-LTR elements (FURANO 2000 Down). Members of the Int family are quite divergent in sequence. Indeed, only four amino acids, which comprise the "RHRY" tetrad essential for catalysis, are present in all members of this group (NUNES-DUBY et al. 1998 Down). In support of the results of GOODWIN and POULTER 2001 Down we have found that kangaroo also potentially encodes a 225-amino-acid protein related to {lambda}-integrase (ORF-C, Fig 2A). Fig 8 shows part of the deduced ORF-C protein from kangaroo-1 aligned with portions of {lambda}-integrase and several other members of the Int family, including the deduced recombinase proteins from PAT, DIRS-1, and DrDIRS1. The alignment identifies the conserved box I and box II regions containing the catalytic RHRY tetrad (NUNES-DUBY et al. 1998 Down) and, in extension of the results of GOODWIN and POULTER 2001 Down, we have found that the recombinase proteins from the DIRS1 family also contain the "Patch II" and "Patch III" regions identified by NUNES-DUBY et al. 1998 Down(Fig 8). Although the ORF-C protein from kangaroo-1 is predicted to contain several insertions and deletions of amino acids relative to other proteins shown in Fig 8 (e.g., immediately following box I), these insertions/deletions are likely to fall within loop regions of the folded protein (NUNES-DUBY et al. 1998 Down) and presumably do not prevent its adopting a structure similar to {lambda}-integrase. These results lead us to conclude that ORF-C encodes a member of the Int family and to speculate that this enzyme catalyzes the integration of kangaroo via the mechanism we have proposed in Fig 7A.



View larger version (77K):
In this window
In a new window
Download PPT slide
 
Figure 8. DIRS1-group members encode a recombinase related to {lambda}-site-specific integrase. The predicted ORF-C proteins from kangaroo-1, DIRS-1, DrDIRS1, and PAT are aligned with a number of recombinase proteins from the Int family. The conserved residues forming the RHRY catalytic tetrad are indicated by asterisks. For clarity, the sequences FGPDYSHVI and RGGGPFRGFPLPLPDPFGA were removed from the DrDIRS1 and kangaroo proteins, respectively, at the positions indicated just to the right of box I. The other aligned proteins are: Cb (Coxiella burnetii integrase, CAA75853), Psp (Pseudomonas sp. integrase, CAA67462), Ll (Lactobacillus leichmannii XerC, CAA59018), Asp (Anabaena sp. integrase, BAB77331), and {lambda} (integrase, P03700). The alignment was created using CLUSTALX (with manual refinement) and MacBoxshade. Identical and similar residues are shaded black and gray, respectively, using a 50% consensus threshold. A dot indicates a gap.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

kangaroo is a member of a newly recognized but rapidly growing class of retrotransposons—the DIRS1 group—whose members share structures, genes, and (probably) integration mechanisms that differ from those of the LTR and the conventional non-LTR retroelements. The first members of this group to be recognized were DIRS-1, from Dictyostelium (CAPPELLO et al. 1985 Down), and PAT, from P. redivivus (DE CHASTONAY et al. 1992 Down), which were united solely by the similarity of their RT proteins and appeared to differ greatly in the structure of their termini. A less-well-characterized, but clearly related, retroelement called Prt1 was also found in the fungus Phycomyces blakesleeanus (RUIZ-PEREZ et al. 1996 Down). Here we have characterized the first member of the group to be identified in a photoautotroph, namely kangaroo-1 of V. carteri, a green alga, and we have shown that kangaroo not only is closely related to DIRS-1 and PAT in terms of RT amino acid sequence, but also has split direct repeat termini very similar in structure to the termini of PAT. Meanwhile, GOODWIN and POULTER 2001 Down have reported discovering several additional metazoan members of the DIRS1 group by screening public databases. These included DrDIRS1 of the zebrafish Danio rerio, TnDIRS1 of the pufferfish Tetraodon nigroviridis, and CbPAT1 of Caenorhabditis briggsae, as well as several fragmentary DIRS1-like sequences from two species of Xenopus and the sea urchin, Strongylocentrotus purpuratus (GOODWIN and POULTER 2001 Down; L. DUNCAN, unpublished data). DIRS1-group retroelements have not yet been found in insects or vascular plants.

However, earlier studies (DAY et al. 1988 Down; DAY and ROCHAIX 1991 Down) had revealed that Chlamydomonas reinhardtii, the closest unicellular relative of V. carteri, contains a mobile element called TOC1 with SDRs similar in structure (but not sequence) to those of kangaroo-1 and PAT (Fig 1B). TOC1 also contains a stretch of 76-bp repeats that is strikingly similar in position and length to the stretch of 89-bp repeats in kangaroo-1 (compare Fig 1B and Fig 2A), although the nucleotide sequences of their repetitive units are wholly unrelated. Because TOC1 does not encode RT or other proteins that play a role in retrotransposition, it appears to be a nonautonomous element and could not be included in the phylogenetic analysis of retroelement RT sequences. Nevertheless, the studies reported here lead us to predict that TOC1 probably replicates and transposes by a mechanism similar to the one that we have proposed for kangaroo in Volvox, using functions that are encoded by an autonomous kangaroo-like element elsewhere in the Chlamydomonas genome. Consistent with this hypothesis, we have found that several Chlamydomonas expressed sequence tags are present in the public databases that encode peptides with significant similarity to regions of the kangaroo RT protein, including the C-terminal extension that we believe is diagnostic for the DIRS1 family of transposons (L. DUNCAN, unpublished data). This, in turn, leads us to suspect that kangaroo-like elements may be widely distributed within the order Volvocales. Indeed, with representatives now known to be present in slime molds, fungi, green algae, and a variety of different metazoans, it seems reasonable to postulate that DIRS1-group elements may be almost as widely distributed among the eukaryotes as the LTR and conventional non-LTR retroelements are now known to be.

The two DIRS1-group subfamilies contain related terminal structures:
As discussed in the Introduction, most RT-based phylogenies identify clades of retroelements that share other important structural and genetic features. Thus, we were initially surprised to find that the DIRS1 clade grouped elements that apparently contain dissimilar termini: namely, those containing inverted repeats and those containing SDRs. However, we subsequently realized that the elements with inverted-repeat termini have other features that are structurally similar to the SDRs found in kangaroo, PAT, and TOC1. Specifically, both DIRS-1 and DrDIRS1 have short sequences (labeled A and B in Fig 1B) that extend past the end of, or are part of, the terminal inverted repeats, and these same sequences are also found juxtaposed internally in an inverted orientation (A'B') in a structure called the internal complementary region (ICR; CAPPELLO et al. 1985 Down). TnDIRS1 also has a similar structure (GOODWIN and POULTER 2001 Down). This arrangement can be symbolized as: A > ... < A < B ... B >, and we propose that it may represent an altered form of the type of SDR seen in PAT, TOC1, and kangaroo, which can be symbolized as: A > ... B > A > ... B >. It is tempting to speculate that one of the DIRS1-group subfamilies was derived from the other subfamily by inversion of the full interior repeat (Fig 1B).

DIRS1 elements appear to integrate by a novel mechanism:
Here we have proposed that members of the DIRS1 group may transpose by a mechanism similar to the one proposed initially by CAPPELLO et al. 1985 Down and DAY et al. 1988 Down, in which RT generates a closed-circle, double-stranded DNA intermediate that is then inserted into a target site by a single crossover event. If this model is correct, then the integration mechanism used by the DIRS1-group members would be clearly distinct from that used during insertion of known LTR and conventional non-LTR retrotransposons.

This model was initially based on: (1) our observation that members of the DIRS1 group contain unorthodox, but related, terminal structures and do not encode a conventional retrotransposon integrase or endonuclease; (2) our comparisons of kangaroo pre- and postintegration sites; and (3) our demonstration that we could amplify a PCR product from Volvox DNA whose structure is consistent with the existence of the postulated closed-circular DNA intermediate. These latter two results constitute the first pieces of experimental evidence in support of this kind of model.

Meanwhile, GOODWIN and POULTER 2001 Down proposed a very similar model, after discovering that members of the DIRS1 group encode proteins of the Int family of recombinases, some of which (such as {lambda}-site-specific integrase) are known to mediate this type of integration process. We have now shown that kangaroo also encodes such a recombinase. Thus, our study and that of GOODWIN and POULTER 2001 Down are mutually reinforcing.

However, whereas GOODWIN and POULTER 2001 Down consider the DIRS1 elements as a "group of LTR retrotransposons," we believe that because of their very substantial differences from LTR elements in termini, gene content, and probable integration mechanisms, the DIRS1 group should be considered to be a third class of retrotransposons, distinct from both the LTR and the traditional non-LTR classes.

Developmental regulation of kangaroo expression:
We have shown that the accumulation of four discrete transcripts produced by kangaroo is developmentally regulated, which is a property shared with numerous other retrotransposons. The expression of at least 19 different Drosophila retroelements is controlled both temporally and spatially during development (DING and LIPSHITZ 1994 Down; FROMMER et al. 1994 Down; MOZER and BENZER 1994 Down; BRONNER et al. 1995 Down; AWASAKI et al. 1996 Down; KERBER et al. 1996 Down; MARSANO et al. 2000 Down), as is expression of the zebrafish retroelement bhikhara (VOGEL and GERSTER 1999 Down) and the Xenopus 1A11 element (GREENE et al. 1993 Down). Similarly, transcription of Ty elements in Saccharomyces cerevisiae (ERREDE et al. 1987 Down), LINE-1 in mammals (OSTERTAG and KAZAZIAN 2001 Down), and DIRS-1 in D. discoideum (COHEN et al. 1984 Down) all appear to be under cell-type or stage-specific controls. As with expression of more conventional kinds of genes, developmentally regulated expression of retrotransposons has been shown to involve both cis-regulatory elements located within the transposons themselves and trans-acting factors encoded elsewhere (ERREDE et al. 1987 Down; DING and LIPSHITZ 1994 Down; MOZER and BENZER 1994 Down; BRONNER et al. 1995 Down; AWASAKI et al. 1996 Down; KERBER et al. 1996 Down; VOGEL and GERSTER 1999 Down). It has been argued that the developmentally regulated expression of such retroelements may have been selected for by virtue of its benefit to the host (DING and LIPSHITZ 1994 Down; BRONNER et al. 1995 Down), but convincing evidence in favor of this hypothesis remains lacking.

Interestingly, observations made in the related unicell C. reinhardtii raise the possibility that developmental accumulation of kangaroo transcripts could be regulated epigenetically. JEONG et al. 2002 Down have shown that TOC1 expression in C. reinhardtii is repressed by two genes (Mut9 and Mut11) that are involved in transcriptional gene silencing (TGS). Furthermore, the levels of TOC1 RNA are also controlled, at least in part, by degradation that is dependent on the Mut6 RNA helicase (WU-SCHARF et al. 2000 Down), which is a component of the post-transcriptional gene silencing (PTGS) machinery of C. reinhardtii. It is conceivable that kangaroo expression in V. carteri is controlled in an analogous manner by uncharacterized, developmentally regulated TGS or PTGS mechanisms.

kangaroo as a molecular genetic tool:
At present, the only method available for cloning genes by forward genetics in V. carteri has involved tagging with the DNA transposon, Jordan (MILLER et al. 1993 Down). Although this approach has been used successfully to clone several developmentally important genes (KIRK et al. 1999 Down; MILLER and KIRK 1999 Down; I. NISHII, personal communication), we have encountered several cases in which interesting mutations that have the earmarks of transposon-induced mutations cannot be correlated with Jordan RFLPs (our unpublished observations). Evidence presented here identifies kangaroo as a second, highly mobile element within the V. carteri genome that is capable of causing gene disruptions. Whether kangaroo will turn out to have other properties required to make it a second useful transposon-tagging tool for V. carteri developmental biologists, only time will tell.


*  FOOTNOTES

Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession no. AY137241. Back
2 Present address: University of Connecticut School of Medicine, 263 Farmington Ave., Farmington, CT 06030. Back


*  ACKNOWLEDGMENTS

We thank I. Nishii for staged RNAs and the Northern blot (with associated C38 loading control data) used in Fig 6C; A. Horton and D. Berg for helpful advice; and J. Umen, S. Miller, I. Nishii, and C. Shaffer for comments on the manuscript. L.D. was a postdoctoral fellow of, and this investigation has been aided by, a grant from the Jane Coffin Childs Memorial Fund for Medical Research. This work was also supported by a National Science Foundation grant (IBN-9904739) to D.K.

Manuscript received February 28, 2002; Accepted for publication August 30, 2002.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AWASAKI, T., N. JUNI, and K. M. YOSHIDA, 1996  An eye imaginal disc-specific transcriptional enhancer in the long terminal repeat of the tom retrotransposon is responsible for eye morphology mutations of Drosophila ananassae.. Mol. Gen. Genet. 251:161-166.[Medline]

BATEMAN, A., E. BIRNEY, L. CERRUTI, R. DURBIN, and L. ETWILLER et al., 2002  The Pfam protein families database. Nucleic Acids Res. 30:276-280.[Abstract/Free Full Text]

BRONNER, G., H. TAUBERT, and H. JACKLE, 1995  Mesoderm-specific B104 expression in the Drosophila embryo is mediated by internal cis-acting elements of the transposon. Chromosoma 103:669-675.[Medline]

CAPPELLO, J., K. HANDELSMAN, and H. F. LODISH, 1985  Sequence of Dictyostelium DIRS-1: an apparent retrotransposon with inverted terminal repeats and an internal circle junction sequence. Cell 43:105-115.[Medline]

COFFIN, J., 1993 Reverse transcriptase and evolution, pp. 445–479 in Reverse Transcriptase, edited by A. M. SKALKA and S. P. GOFF. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.

COHEN, S. M., J. CAPPELLO, and H. F. LODISH, 1984  Transcription of Dictyostelium discoideum transposable element DIRS-1. Mol. Cell. Biol. 4:2332-2340.[Abstract/Free Full Text]

CRAIG, N. L., 1997  Target site selection in transposition. Annu. Rev. Biochem. 66:437-474.[Medline]

DAY, A. and J. D. ROCHAIX, 1991  A transposon with an unusual LTR arrangement from Chlamydomonas reinhardtii contains an internal tandem array of 76 bp repeats. Nucleic Acids Res. 19:1259-1266.[Abstract/Free Full Text]

DAY, A., M. SCHIRMER-RAHIRE, M. R. KUCHKA, S. P. MAYFIELD, and J. D. ROCHAIX, 1988  A transposon with an unusual arrangement of long terminal repeats in the green alga Chlamydomonas reinhardtii.. EMBO J. 7:1917-1927.[Medline]

DE CHASTONAY, Y., H. FELDER, C. LINK, P. AEBY, and H. TOBLER et al., 1992  Unusual features of the retroid element PAT from the nematode Panagrellus redivivus.. Nucleic Acids Res. 20:1623-1628.[Abstract/Free Full Text]

DING, D. and H. D. LIPSHITZ, 1994  Spatially regulated expression of retrovirus-like transposons during Drosophila melanogaster embryogenesis. Genet. Res. 64:167-181.[Medline]

DOOLITTLE, R. F., D. F. FENG, M. S. JOHNSON, and M. A. MCCLURE, 1989  Origins and evolutionary relationships of retroviruses. Q. Rev. Biol. 64:1-30.[Medline]

EICKBUSH, T. H., 1997  Telomerase and retrotransposons: Which came first? Science 277:911-912.[Free Full Text]

ERREDE, B., M. COMPANY, and C. A. HUTCHISON, 1987  Ty1 sequence with enhancer and mating-type-dependent regulatory activities. Mol. Cell. Biol. 7:258-265.[Abstract/Free Full Text]

FINNEGAN, D. J., 1989  Eukaryotic transposable elements and genome evolution. Trends Genet. 5:103-107.[Medline]

FROMMER, G., R. SCHUH, and H. JACKLE, 1994  Localized expression of a novel micropia-like element in the blastoderm of Drosophila melanogaster is dependent on the anterior morphogen bicoid. Chromosoma 103:82-89.[Medline]

FURANO, A. V., 2000  The biological properties and evolutionary dynamics of mammalian LINE-1 retrotransposons. Prog. Nucleic Acid Res. Mol. Biol. 64:255-294.[Medline]

GABRIEL, A., and J. D. BOEKE, 1993 Retrotransposon reverse transcription, pp. 275–328 in Reverse Transcriptase, edited by A. M. SKALKA and S. P. GOFF. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.

GOODWIN, T. J. and R. T. POULTER, 2001  The DIRS1 group of retrotransposons. Mol. Biol. Evol. 18:2067-2082.[Abstract/Free Full Text]

GREENE, J. M., H. OTANI, P. J. GOOD, and I. B. DAWID, 1993  A novel family of retrotransposon-like elements in Xenopus laevis with a transcript inducible by two growth factors. Nucleic Acids Res. 21:2375-2381.[Abstract/Free Full Text]

GRUBER, H., S. D. GOETINCK, D. L. KIRK, and R. SCHMITT, 1992  The nitrate reductase-encoding gene of Volvox carteri: map location, sequence and induction kinetics. Gene 120:75-83.[Medline]

HAREN, L., N. TON-HOANG, and M. CHANDLER, 1999  Integrating DNA: transposases and retroviral integrases. Annu. Rev. Microbiol. 53:245-281.[Medline]

HARPER, J. F., K. S. HUSON, and D. L. KIRK, 1987  Use of repetitive sequences to identify DNA polymorphisms linked to regA, a developmentally important locus in Volvox. Genes Dev. 1:573-584.[Abstract/Free Full Text]

JEONG, B., D. WU-SCHARF, C. ZHANG, and H. CERUTTI, 2002  Suppressors of transcriptional transgenic silencing in Chlamydomonas are sensitive to DNA-damaging agents and reactivate transposable elements. Proc. Natl. Acad. Sci. USA 99:1076-1081.[Abstract/Free Full Text]

KERBER, B., S. FELLERT, H. TAUBERT, and M. HOCH, 1996  Germ line and embryonic expression of Fex, a member of the Drosophila F-element retrotransposon family, is mediated by an internal cis-regulatory control region. Mol. Cell. Biol. 16:2998-3007.[Abstract]

KIRK, D., 1998 Volvox: Molecular-Genetic Origins of Multicellularity and Cellular Differentiation. Cambridge University Press, Cambridge, UK.

KIRK, D. L. and M. M. KIRK, 1983  Protein synthetic patterns during the asexual life cycle of Volvox carteri.. Dev. Biol. 96:493-506.[Medline]

<