Genetics, Vol. 151, 1027-1039, March 1999, Copyright © 1999

Evolution of the RECQ Family of Helicases: A Drosophila Homolog, Dmblm, Is Similar to the Human Bloom Syndrome Gene

Kohji Kusanoa, Mark E. Berresb, and William R. Engelsa
a Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
b Laboratory of Zoology, University of Wisconsin, Madison, Wisconsin 53706

Corresponding author: William R. Engels, Laboratory of Genetics, University of Wisconsin, 445 Henry Mall, Madison, WI 53706., wrengels{at}facstaff.wisc.edu (E-mail)

Communicating editor: R. S. HAWLEY


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Several eukaryotic homologs of the Escherichia coli RecQ DNA helicase have been found. These include the human BLM gene, whose mutation results in Bloom syndrome, and the human WRN gene, whose mutation leads to Werner syndrome resembling premature aging. We cloned a Drosophila melanogaster homolog of the RECQ helicase family, Dmblm (Drosophila melanogaster Bloom), which encodes a putative 1487-amino-acid protein. Phylogenetic and dot plot analyses for the RECQ family, including 10 eukaryotic and 3 prokaryotic genes, indicate Dmblm is most closely related to the Homo sapiens BLM gene, suggesting functional similarity. Also, we found that Dmblm cDNA partially rescued the sensitivity to methyl methanesulfonate of Saccharomyces cerevisiae sgs1 mutant, demonstrating the presence of a functional similarity between Dmblm and SGS1. Our analyses identify four possible subfamilies in the RECQ family: (1) the BLM subgroup (H. sapiens Bloom, D. melanogaster Dmblm, and Caenorhabditis elegans T04A11.6); (2) the yeast RECQ subgroup (S. cerevisiae SGS1 and Schizosaccharomyces pombe rqh1/rad12); (3) the RECQL/Q1 subgroup (H. sapiens RECQL/Q1 and C. elegans K02F3.1); and (4) the WRN subgroup (H. sapiens Werner and C. elegans F18C5.2). This result may indicate that metazoans hold at least three RECQ genes, each of which may have a different function, and that multiple RECQ genes diverged with the generation of multicellular organisms. We propose that invertebrates such as nematodes and insects are useful as model systems of human genetic diseases.


DNA/RNA helicase protein families have been found with seven consensus motifs (GORBALENYA et al. 1989 Down). These enzymes play important roles in cellular processes such as genome replication, recombination and repair, transcription, and mRNA translation. Members within each helicase family also share consensus sequences between motifs. The RECQ helicase family includes proteins with extensive amino acid sequence similarity to the Escherichia coli helicase, RecQ.

The E. coli RecQ gene encodes a DNA helicase (UMEZU et al. 1990 Down) involved in homologous recombination (NAKAYAMA et al. 1984 Down). The RecQ helicase appears to interact with RecJ exonuclease because recQ null mutations suppress recJ defects such as decreased crossing over (KUSANO et al. 1994 Down), increased sensitivity to DNA-damaging agents (KUSANO et al. 1994 Down; LOVETT and SUTERA 1995 Down), and decreased conjugational recombination (LOVETT and SUTERA 1995 Down) in a recB-C- sbcA- background of E. coli. A recQ null mutation increases illegitimate recombination in a wild-type background of E. coli (HANADA et al. 1997 Down). The Saccharomyces cerevisiae SGS1 gene encodes similar helicase motifs (GANGLOFF et al. 1994 Down; WATT et al. 1995 Down) and possesses a 3'-5' DNA helicase activity (LU et al. 1996 Down; BENNETT et al. 1998 Down). An sgs1 mutation suppresses slow growth in a top3 mutant (GANGLOFF et al. 1994 Down; LU et al. 1996 Down) and causes missegregation of chromosomes in meiosis and mitosis (WATT et al. 1995 Down). The SGS1 protein physically interacts with topoisomerase III (GANGLOFF et al. 1994 Down) and topoisomerase II (WATT et al. 1995 Down). The mutations of another RECQ family member, Schizosaccharomyces pombe rqh1/rad12, cause increased mitotic recombination with hydroxyurea treatment (STEWART et al. 1997 Down). S. pombe rqh1/rad12 appears to function in a checkpoint dependent DNA damage response during S phase (MURRAY et al. 1997 Down; STEWART et al. 1997 Down; DAVEY et al. 1998 Down). Three homologs of RECQ have been found in humans. One is the BLM gene whose defects cause immunodeficiencies and a highly increased rate of cancer, called Bloom syndrome (ELLIS et al. 1995 Down). Somatic cells from Bloom patients are susceptible to several DNA-damaging agents (KREPINSKY et al. 1979 Down; ISHIZAKI et al. 1981 Down; KURIHARA et al. 1987 Down) and exhibit increased interchanges between homologous chromosomes (GERMAN 1993 Down) and increased sister chromatid exchanges (HEARTLEIN et al. 1987 Down; KURIHARA et al. 1987 Down). The Bloom gene product has a 3'-5' DNA helicase activity (KAROW et al. 1997 Down). The second homolog is the WRN gene whose defects result in clinical symptoms resembling premature aging, called Werner syndrome (YU et al. 1996 Down). Cells from Werner patients show various chromosome rearrangements (SALK et al. 1981 Down; SCAPPATICCI et al. 1982 Down). More recently, WRN cells were shown to be hypersensitive to 4-nitro-quinoline-1-oxide (OGBURN et al. 1997 Down). The Werner protein has DNA helicase activity (GRAY et al. 1997 Down; SUZUKI et al. 1997 Down) with 3'-5' directionality (SHEN et al. 1998 Down). The third human RecQ homolog is RECQL/Q1, which has also been demonstrated to possess a 3'-5' DNA helicase activity (PURANAM and BLACKSHEAR 1994 Down; SEKI et al. 1994 Down). The symptoms of RECQL/Q1 deficiency have not been determined.

Phylogenetic analysis of a protein family often reveals the presence of subfamilies that are likely to have functional similarities. For example, an analysis of MutS-related proteins shows that there are several distinct subgroups (FISHEL et al. 1993 Down), each corresponding to a different function: mitochondrial mismatch repair, DNA sequence stability, and meiotic crossing over. The family of SNF2-related proteins provides another example. Phylogenetic analysis of this family suggests that there are eight distinct subgroups (EISEN et al. 1995 Down), each corresponding to different functions, including transcription activation, transcription-coupled repair, recombinational repair, and removal of TATA binding protein from DNA. These two phylogenetic analyses indicate that the gene families functionally diverged with the generation of unicellular eukaryotes such as yeast.

In this work, we cloned a cDNA of the Drosophila melanogaster RECQ homolog, Dmblm (Drosophila melanogaster Bloom). It encodes a putative 1487 amino acid protein (Figure 1) and exhibits significant similarity to other RECQ family members. Our phylogenetic analysis of the RECQ family including Dmblm leads us to propose that the currently recognized RECQ family contains four subfamilies and that they functionally diverged with the generation of multicellular organisms. We suggest that Dmblm is included in the same group with Human BLM and Caenorhabditis elegans T04A11.6, each of which appears to be related to S. pombe rqh1/rad12 and S. cerevisiae SGS1. Also, we found that Dmblm partially rescued the sensitivity to methyl methanesulfonate (MMS) of a S. cerevisiae sgs1 mutant.



View larger version (58K):
In this window
In a new window
Download PPT slide
 
Figure 1. Predicted protein sequence of the D. melanogaster homolog of RECQ gene, Dmblm (Accession number U92536). Helicase motifs I–VI are underlined by dashes, and their locations are based on previously described helicase domain alignment (GORBALENYA et al. 1989 Down). The repeat near the N-terminal end is underlined. The positions of amino acid substitution in the repeat are double-underlined.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Yeast strains and media:
Yeast strain MRQ966 (from T. Enomoto) is an sgs1::URA3 derivative of strain MR966 (MATa ura3-52 leu2-3, 112 trp-289 his1-7). The following media were used for yeast growth: YEPD, which contains 1% yeast extract, 2% peptone, and 2% glucose, and SD, which is the selective medium for strains with a plasmid carrying a TRP1 marker and contains 2% glucose, 0.67% Bacto-yeast nitrogen base, and supplements of uracil and all amino acids except tryptophan.

Plasmids:
pYCp1305 (from T. Enomoto) contains the entire SGS1 gene of S. cerevisiae, a derivative of pRS314 that includes a centromere element, and an ARS and TRP1 marker (SIKORSKI and HIETER 1989 Down). Two plasmids were used for the expression of Dmblm cDNA in yeast: (1) The plasmid pAS2-1 (CLONTECH, Palo Alto, CA) contains a GAL4-DNA-binding domain under an ADH1 promoter, 2-µm origin, and a TRP1 marker. (2) The plasmid pGBT9-bd (from A. Shimamoto), a derivative of pGBT9 (CLONTECH), lacks a GAL4-DNA-binding domain but contains an ADH1 promoter, 2-µm origin, and a TRP1 marker. The deletion of the domain was made by replacing the region from the HindIII site (nucleotide 409) to the EcoRI site (nucleotide 878) of pGBT9 (GenBank accession number U07646) with EcoRI oligonucleotides, GGAATTCC (New England Biolabs, Beverly, MA). The plasmid pBSDMBLM1 contains a PCR product that spans the entire region of Dmblm cDNA, nucleotides 1–4725 (GenBank accession number U92536), at the EcoRV site of Bluescript SK(+). The next two plasmids contain the fragment that spans the region from the DraI site (nucleotide 111), at 15 bp before the putative start codon, to nucleotide 4716 on Dmblm cDNA. The fragment was prepared by partially cutting pBSDMBLM DNA with DraI restriction enzyme, ligating BamHI oligonucleotides CCGGATCCGG (New England Biolabs) to the resultant DraI end and completely cutting with BamHI restriction enzyme. The plasmids pYEDMBLM1 and pYEDMBLM2 possess this fragment at the BamHI site downstream of the GAL4-DNA-binding domain under the ADH1 promoter on pAS2-1, and at the BamHI site downstream of the ADH1 promoter on pGBT9-bd, respectively.

PCR cloning:
Genomic DNA for PCR was prepared from the Canton-S strain of D. melanogaster. Two degenerate oligonucleotides, sense, ATGCCNACYGGHGGHGGHAA, and antisense, GGRCGGAARTCRTGDCCCCA, were used for the PCR and cloning of a fragment of the D. melanogaster RECQ homolog. Two gene specific primers based on the nucleotide sequence of the obtained fragment were used for 5' RACE and 3' RACE: GGTGTCACGTAGAGCAATTTGACCATAGG and GCCTGCCATTCTGACCGAGGGAGTG, respectively. A cDNA library was constructed using total RNA extracted by the RNAzol B method (CHOMCZYNSKI and SACCHI 1987 Down) from Schneider II cells for the degenerate PCR, 5' RACE and 3' RACE. A CLONTECH Marathon cDNA amplification kit was used for constructing the cDNA library and for 5' and 3' RACE. The 5' and 3' end primers used for amplifying an entire Dmblm cDNA were ATTGTTTCCATTGGCATATTGCG and CTGAGCACATTGCTCATACAG, respectively. A mixture of Stratagene (La Jolla, CA) Taq DNA polymerase and Stratagene Extender was used for the amplification of the 5' RACE, 3' RACEs, and the entire cDNA. PCR products were always cloned into Bluescript SK(+).

Nucleotide sequencing:
The M13-20 primer (GTAAAACGACGGCCAGT) or the T3 primer (AATTAACCCTCACTAAAGGG) was used for two types of sequencing reactions: the dideoxy method and the Dye deoxy termination method (Applied Biosystems International, Foster City, CA). The Promega (Madison, WI) Erase-A-base kit was used for preparing plasmid DNA with nested deletions in the insert region.

Additional sequences:
Sequences other than D. melanogaster Dmblm (D.m.BLM), the abbreviations for their helicase domain, and their accession numbers are as follows: Homo sapiens Bloom (H.s.BLM), U39817 (ELLIS et al. 1995 Down); H. sapiens RECQL/Q1 (H.s.RECQL/Q1), L36140 (PURANAM and BLACKSHEAR 1994 Down); H. sapiens Werner (H.s.WRN), L76937 (YU et al. 1996 Down); C. elegans E03A3.2 (C.e.E03A3.2), Z38112; C. elegans K02F3.1 (C.e.K02F3.1), U00052 (WILSON et al. 1994 Down); C. elegans F18C5.2 (C.e.F18C5.2), U29097 (WILSON et al. 1994 Down); C. elegans T04A11.6 (C.e.T04A11.6), Z83123; S. pombe rqh1/rad12 (S.p.rqh1/rad12), Z54354 (MURRAY et al. 1997 Down; STEWART et al. 1997 Down); S. cerevisiae SGS1 (S.c.SGS1), L07870; E. coli RecQ (E.c.RecQ), M30198 (IRINO et al. 1986 Down); Haemophilus influenzae RECQ (H.i.RECQ), U32756 (FLEISCHMANN et al. 1995 Down); Bacillus subtilus RECQ (B.s.RECQ), L47648 (ROELS et al. 1992 Down; SOROKIN et al. 1993 Down, SOROKIN et al. 1996 Down).

Intron boundaries:
We noted that the putative amino acid sequence in U00052 (C.e.K02F3.1) did not contain the region around helicase motif VI and that the putative amino acid sequence in U29097 (C.e.F18C5.2) did not contain the region around helicase motifs V and VI. We checked each amino acid sequence of the two published nucleotide sequences in three different reading frames and found that motif V and VI of C.e.F18C5.2 were present behind putative introns in U29097 and that motif VI of C.e.K02F3.1 was present in another reading frame in U00052. Therefore, we adjusted the exon-intron boundaries in C.e.K02F3.1 and C.e.F18C5.2 to include motifs V and VI. The putative introns start at GT and stop at AG. The nucleotide sequence of C.e.F18C5.2 from U29097 contains the region, nucleotide numbers 10192–10497, which was regarded as an intron, and the region, 10498–10580, which was regarded as an exon. The region 10192–10580 was modified as follows: 10192–10239 (intron), 10240–10427 (exon), 10428–10475 (intron), and 10476–10580 (exon). According to these modifications, amino acid positions 302–398 in C.e.F18C5.2, with reference to the coordinates of Figure 2, were used instead of QKEEVVENLT, which was part of the amino acid sequence in U29097. Also, the sequence of C.e.K02F3.1, derived from U00052, contains the regions 34759–35328 (exon), 35329–35532 (intron), 35533–35701 (exon), and 35702 through the end of this clone as part of an intron. These regions were modified as follows: 34759–35221 (exon), 35222–35532 (intron), 35533–35779 (exon), and 35780 through the end (intron). According to these modifications, amino acid positions 339 through the last site of C.e.K02F3.1 in Figure 2 were used instead of the last 33 amino acid residues in U00052.




View larger version (211K):
In this window
In a new window
Download PPT slide
 
Figure 2. Aligned amino acid residues of the 13 helicase domains. The total consensus length was 419 sites of which 349 were retained for analysis after deletion of ambiguously aligned residues. Boxed and bold regions indicate invariant sites. Shaded residue positions indicate 80% or higher identity for that particular site. Boxed regions without shading indicate areas that were not unambiguously aligned and excluded from the phylogenetic analysis. Gaps introduced by the alignment algorithm are indicated by a dash. For a key to helicase domain abbreviations, see MATERIALS AND METHODS. For the initial and final residue numbers of each helicase domain, see Figure 3.

Sequence alignment:
An alignment of the amino acid sequence data in the helicase domains of the above proteins was constructed using clustering algorithms incorporated into the program CLUSTALW version 1.73 (HIGGINS et al. 1992 Down). Parameters were set as follows: gap penalty is 10.0; gap extension penalty is 0.05; and protein weight matrix is BLOSUM30 (HENIKOFF and HENIKOFF 1994 Down). Further visual editing of the algorithmically aligned amino acid sequences was not attempted. We used the protein alignment as a guide to exclude highly variable regions that exhibited many insertion and deletion events from the final alignment to eliminate ambiguous residue positioning (Figure 2). Corresponding codon positions in the nucleotide sequence data were deleted and the resulting in-frame nucleotide alignment was used for all subsequent analyses.

Phylogenetic analysis:
The skewness of a random sample of all tree-length distributions based on the g1 statistic (SOKAL and ROHLF 1981 Down) as described by HILLIS 1991 Down, HILLIS and HUELSENBECK 1992 Down, and HUELSENBECK 1991 Down was estimated using PAUP version 3.1.1 (SWOFFORD 1993 Down). Initial tests for significant variance in nucleotide and codon substitution rates based on maximum likelihood estimates were performed according to MUSE and GAUT 1994 Down and MUSE and WEIR 1992 Down. PAUP version 3.1.1 (SWOFFORD 1993 Down) was used for an initial heuristic tree search applying the TBR option for branch swapping. The F84 substitution model (THORNE et al. 1992 Down) incorporated into PAML version 1.3 (YANG 1996B Down) was used to estimate substitution parameters and the maximum likelihood of all tree topologies.

Other analyses:
Dot plot analysis for protein was performed using MegAlign software (DNASTAR, Madison, WI). Cytological mapping of the Dmblm locus was carried out by in situ hybridization to polytene chromosomes (ENGELS et al. 1986 Down).


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Cloning a Drosophila homolog of RECQ:
To obtain a fragment of a Drosophila RECQ homolog, we carried out PCR on genomic DNA of D. melanogaster using degenerate primers. The primers (see MATERIALS AND METHODS) correspond to the amino acid residues MPTGGGK, located within the conserved helicase motif I, and WGHDFRP, located between motifs II and III (Figure 1), which is conserved within the RECQ helicase family (Figure 2). Resulting amplicons were cloned and sequenced. One clone was 426 bp in length and included a putative 63-bp intron whose 5'- and 3'-splice sites were ATgtgagtt and ttactttttaacagAT, respectively. These are in general agreement with the splice donor and acceptor consensus sequences of Drosophila short introns, AGgt(a/g)agt(a/t) and ttttt(c/t)(c/t)(c/t)(c/t)tncag(A/G)T (MOUNT et al. 1992 Down). The predicted amino acid sequence from the 363-bp region without the intron sequence indicated good homology with the corresponding region of other RECQ genes. Also, PCR on cDNA isolated from Schneider II cells using the same primer pair produced an ~360-bp fragment. This was judged as one related to the clone from genomic DNA based on digestion patterns by HaeIII and MboI restriction enzymes.

Using the RACE cloning method with the two internal primers (MATERIALS AND METHODS), designed on the basis of the nucleotide sequence of the 363-bp region, a 2.6-kb 5' RACE clone and a 2.4-kb 3' RACE clone (the largest fragments) were cloned.

Cytological localization:
Each of the DNA fragments hybridized within the cytological map region 86F1-8. Also, PCR of a P1 clone carrying the region 86F1-4 (BDGP, HARTL et al. 1994 Down) using the above internal primer pair produced the expected PCR fragment size.

Predicted 1487-amino-acid protein is included in RECQ helicase family:
Sequencing of the two overlapping RACE fragments revealed an open reading frame of 1487 amino acids in length. The putative 5'- and 3'-untranslated regions contain two and six stop codons in the same frame, respectively. The predicted protein sequence is shown in Figure 1. The central region contains seven motifs commonly found in DNA/RNA helicases, including a putative ATP-binding site (motif I) and a DExH box (motif II; GORBALENYA et al. 1989 Down). An alignment of this protein to other members of the RECQ family shows significant similarity among them, especially in the residues of the seven helicase motifs (Figure 2). Therefore, we conclude that this Drosophila gene is a member of the RECQ family.

Sequence alignment:
The schematic diagram in Figure 3 represents the position of each helicase domain in the 13 RECQ family proteins. The algorithmically aligned protein sequences of the 13 helicase domains formed a consensus length of 419 sites of which 70 sites could not be unambiguously aligned (Figure 2). Thorne et al. have shown that the true alignment between a set of sequences tends to exhibit fewer gaps than an algorithmically generated alignment (THORNE et al. 1991 Down, THORNE et al. 1992 Down; THORNE and KISHINO 1992 Down). As a result, assignment of any residue to a particular position in an area of introduced gaps would be arbitrary because of a lack of knowledge concerning prior insertion and deletion events. Therefore, areas with multiple floating gaps were omitted following a rule that usable sites must be within anchored regions of two unambiguously aligned sites. Using these criteria, the final amino acid alignment consisted of 349 aligned residues, of which 61 were constant and 288 were variable. Deletion of the corresponding ambiguously aligned codon positions in the nucleotide sequence data produced an alignment containing a total of 1047 nucleotide sites, of which 164 were constant and 883 were variable.



View larger version (32K):
In this window
In a new window
Download PPT slide
 
Figure 3. (A–F) Schematic representation of proteins of RECQ family members. Each entire protein is exhibited by a box with its total amino acid residue number on the right end. Shaded regions indicate the helicase domains whose sequences are shown in Figure 2 with the initial and final residue numbers. The C-terminal region of C. elegans K02F3.1 is not yet sequenced completely and is shown by a dashed line. Each protein is aligned along the position of the initial residue of the helicase domain to compare the length of N-terminal domains between each protein. Each of the short dotted lines in (C) shows the region sharing sequence similarity to the N-terminal domains of C. elegans K02F3.1 and H. sapiens RECQL/Q1 (see Figure 5J). A pair of triangles indicates a duplication with the indicated amino acid sequence. Substitutions within the duplications are underlined.

Skewness:
On the basis of a statistical measure of skewness called the g1 statistic, HILLIS 1991 Down devised a method for testing whether or not a particular data set contains phylogenetic information. One prediction of the model is that for data sets containing phylogenetic information, the set of all possible topologies (or a subsample thereof) arranged in order of increasing length will have a definite skew to the right. This skewing is attributed to character correlation among sequences with a common phylogenetic history (HILLIS 1991 Down). The tree-length distribution analysis of 1 x 108 random topologies from a possible 8.64 x 108 unrooted topologies of our nucleotide sequence data was highly skewed (g1 = -0.5598, P {Lt} 0.01).

Evolutionary rates:
In most molecular sequences, substitution rates across sites are not equally distributed. UZZELL and CORBIN 1971 Down have shown that rate differences may be best fitted to a gamma distribution. The shape parameter, {alpha}, of the gamma distribution, is now used as a measure of substitution heterogeneity across sites (YANG 1996A Down). The importance of considering substitution heterogeneity across sites in phylogenetic reconstructions is established (NEI 1987 Down, NEI 1991 Down; YANG 1994B Down). To test the nucleotide sequence data for significant substitution heterogeneity under a particular evolutionary model, we employed a likelihood ratio test (HUELSENBECK and RANNALA 1997 Down). Our null hypothesis was that substitution homogeneity existed across each codon position. Our alternative hypothesis was that substitution heterogeneity existed across each codon position. The likelihood ratio statistic is two times the difference in log likelihood of the model and is approximately {chi}2 distributed with one degree of freedom. Substitution heterogeneity was accommodated under the F84 model (THORNE et al. 1992 Down) by applying the discrete gamma model (F84-d{Gamma} ; YANG 1994B Down), which approximates the gamma distribution by allowing different rate categories for each nucleotide position, each with equal probability of occurrence. Our analysis was performed with four rate categories, which provides a good approximation of the gamma distribution (YANG 1994B Down). It is known that substitution heterogeneity is overestimated if a grossly incorrect tree topology (e.g., a completely multifurcating star tree) is used (SULLIVAN et al. 1996 Down). Therefore, we used the single most parsimonious tree derived from the nucleotide sequence data (MP1, Figure 4A). When the F84-d{Gamma} model was applied to our data, substitution rates were gamma distributed across the three codon positions with {alpha}-values of 0.73, 0.61, and 0.65, corresponding to the first, second, and third codon positions, respectively. The likelihood ratio test indicates clearly that accounting for substitution heterogeneity across codon sites improves significantly the likelihood of the maximum parsimony topology (Table 1).



View larger version (21K):
In this window
In a new window
Download PPT slide
 
Figure 4. Topology of the 13 helicase domains depicted with branch lengths proportional to the expected substitutions. The four groups were inferred from the helicase domains as well as from the flanking regions (see Figure 5). (A) MP1. Maximum parsimony tree. (B) Tree 2. Maximum likelihood topology of the best tree found by global branch swapping of the initial maximum parsimony tree (MP1) using the F84-d{Gamma} substitution model.


 
View this table:
In this window
In a new window

 
Table 1. Likelihood ratio tests for rate heterogeneity across nucleotide sites and among the RECQ homologs

Significant differences in substitution rates across lineages are known to impair tree estimation methods (FELSENSTEIN 1978 Down; HUELSENBECK and HILLIS 1993 Down). We initially applied relative rate tests (WU and LI 1985 Down; MUSE and WEIR 1992 Down; MUSE and GAUT 1994 Down) to determine whether both nucleotides (transition and transversion substitutions) and codons (synonymous and nonsynonymous substitutions) evolve in a clocklike fashion among the helicase sequences examined. This test consisted of a pairwise comparison of all 13 helicase domains using the RECQ sequence from E. coli as an outgroup to obtain an overall estimation of rate variances. Of 78 individual tests, 32 nucleotide and 24 codon sequence comparisons exhibited a significant deviation (P < 0.05) from the expectation that 2 test sequences should show an equal amount of divergence compared to the reference sequence. To test for this effect under the F84 substitution model, we again used a likelihood ratio test. Our null hypothesis was that rate homogeneity existed across all lineages. The alternative was that rate heterogeneity existed across one or more lineages. In this case, the likelihood ratio statistic is two times the difference in log likelihood of the model and is approximately {chi}2 distributed with two degrees of freedom. This test, when applied to our data, easily rejects the presence of a molecular clock with high significance (Table 1).

Phylogenetic analysis:
In maximum likelihood methods, probabilities for possible topologies are computed using all nucleotide sites, each of which is considered separately. The evaluation of the likelihood that a particular topology exhibits is conditional on the given evolutionary model and how well the model can explain the observed data. The topology with the highest likelihood is thus considered the best representation of relationship. Our estimation of the best topology was guided by the objective of accounting for parameters that have a significant bearing on tree reconstruction efficiency, such as rate heterogeneity and transition and transversion biases. We used the maximum parsimony method to construct an initial topology for the purposes of relevant parameter estimation. It is known that maximum parsimony often fails when significant rate variation exists across lineages (FELSENSTEIN 1978 Down; NEI 1991 Down). However, maximum parsimony has the advantage of requiring no explicit evolutionary model. Under these assumptions, maximum parsimony recovers a single most parsimonious tree (MP1, Table 1; Figure 4A), with length 4157 and a consistency index of 0.475. MP1 was then used as the starting topology for a maximum likelihood tree search assuming the F84-d{Gamma} model of sequence evolution, which allows for rate heterogeneity across sites, unequal base composition, and different transition and transversion rates (YANG 1994A Down). This tree search resulted in a best maximum likelihood tree of -14311.24 with transition/transversion rate ratio parameters of 0.58, 0.22, and 10.52 and {alpha}-parameters of 0.71, 0.61, and 0.63 for each codon position respectively (Figure 4B).

Similarities outside the helicase domain:
A different kind of analysis was needed to determine whether the gene groupings indicated above are also apparent in sequences flanking the helicase domain. In these areas, the similarities were too weak to use the phylogenetic methods employed for the helicase domain. Therefore, we examined dot plots of all 78 pairwise comparisons in the C- and N-terminal regions and ranked each pair according to total length of similarity. The highest-ranking comparisons are shown in Figure 5. For the C-terminal region, 5 of the 6 within-group comparisons had clear similarity (Figure 5, B–E and I) as opposed to only 4 of the 72 between-group comparisons (Figure 5A and F–H). In the N-terminal regions, only 1 of the 78 comparisons had notable similarity (Figure 5J). This comparison was also a within-group pair. We conclude that sequence similarities extend beyond the helicase domain for all 6 pairwise comparisons within the proposed groups and that there is relatively little such similarity between groups.



View larger version (41K):
In this window
In a new window
Download PPT slide
 
Figure 5. Dot plot analysis between proteins of RECQ family members. The entire amino acid sequences were compared by a PAM250 scoring matrix, a window size of 30 residues, and a minimum score of 35% similarity. Shading indicates the helicase domains aligned in Figure 2. The total of length of the dots (excluding helicase domain) that lie on the diagonal line are shown. Dots with a closed triangle include the sequences similar to the nuclear translocation signal of H. sapiens BLM (KANEKO et al. 1997 Down).

Domain structure comparison of the RECQ family:
The 13 RECQ family proteins were arranged in Figure 3 on the basis of the subgrouping postulated above. Recently, the possibility was suggested that RECQ family members can be separated into distinct functional subgroups on the basis of the presence or absence of extensive N-terminal domains (WATT and HICKSON 1996 Down). Our analyses provide evidence against this possibility. For example, the N-terminal segment of H. sapiens RECQL/Q1 is short as are the bacterial RECQs. However, in the helicase domain, H. sapiens RECQL/Q1 is more closely related to C. elegans K02F3.1, which has a long N-terminal segment (Figure 3C). Furthermore, the short N-terminal segment of H. sapiens RECQL/Q1 shares limited similarity with the region adjacent to the helicase domain of C. elegans K02F3.1 (Figure 3C and Figure 5J), suggesting that these short segments are important for a common function between H. sapiens RECQL/Q1 and C. elegans K02F3.1. The N-terminal domain of H. sapiens WRN is as long as the N-terminal domains of Dmblm, H. sapiens BLM, C. elegans T04A11.6, S. cerevisiae SGS1, S. pombe rqh1/rad12, and C. elegans K02F3.1, all of which are members of other subgroups. Overall, it is clear from Figure 3 and Figure 4 that the length of the N-terminal regions is not well correlated with the relatedness of the helicase domain sequences.

Presence of direct sequence repeats in the RECQ family:
By examining dot plots, we detected several direct sequence repeats within the genes (Figure 3). The H. sapiens WRN gene was found to have a perfect repeat of 27 amino acids near its helicase domain as shown previously (YU et al. 1996 Down; Figure 3E). There is also a repeat of 20/24 amino acids near the N-terminal end of Dmblm (Figure 1 and Figure 3A), a repeat of 7/9 amino acids in the N-terminal domain of the C. elegans T04-A11.6 gene (Figure 3A), and a repeat of 9/10 amino acids near the N-terminal end of the C. elegans F18C5.2 gene (Figure 3E). These repeat structures might be correlated with the apparent evolutionary flexibility of the length of the N-terminal sequences, as discussed below.

Rescue of the S. cerevisiae sgs1 mutant by the Dmblm cDNA:
The structural homology between the BLM and yeast RECQ subgroups (Figure 4 and Figure 5) suggests the possibility that Dmblm functions in a manner similar to SGS1. First we tested sensitivity of the sgs1 mutant to MMS. The sgs1::URA3 mutant showed hypersensitivity to MMS (Figure 6A). The SGS1 gene cloned in the yeast centromere plasmid complemented this MMS hypersensitivity (Figure 6A). We examined whether Dmblm can functionally substitute for SGS1 in S. cerevisiae. The Dmblm cDNA placed downstream of the ADH1 promoter in the yeast 2-µm plasmids conferred a 12-fold increase in the survival fraction of the sgs1 mutant cells against 0.02% MMS (Figure 6B), although it did not reach that of the wild-type strain carrying the vector.



View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 6. Rescue of sensitivity in S. cerevisiae to MMS of an sgs1 mutant by Dmblm. MMS sensitivity of the sgs1 mutant and its wild-type strain carrying a centromere plasmid (A) or a 2-µm plasmid (B) with a TRP marker. Symbols in (A) are as follows: {blacksquare}, MRQ966 (sgs1::URA3) with pRS314 (vector); {diamondsuit}, MR966 (SGS1) with pRS314 (vector); and {square}, MRQ966 (sgs1::URA3) with YCp1305 (SGS1 plasmid). Symbols in (B) are as follows: {bullet}, MRQ966 (sgs1::URA3) with pAS2-1 (vector); {blacktriangledown}, MR966 (SGS1) with pAS2-1 (vector); {circ}, MRQ966 (sgs1:: URA3) with pYEDMBLM1 (Gal4-DNA-binding domain::Dmblm cDNA); and {triangledown}, MRQ966 (sgs1::URA3) with pYEDMBLM2 (Dmblm cDNA). Cells were grown in selective medium without tryptophan to maintain selection of plasmids in late-exponential phase, diluted, and plated on YEPD plates containing 0.01 and 0.02% MMS. The plates were incubated for 4 days at 30° before scoring viable colonies. Measurements were done for two independent transformants from each of the strains.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

D. melanogaster Bloom:
We have identified a new D. melanogaster gene and include it in the RECQ helicase family. On the basis of our phylogenetic analysis, dot plot analysis, and comparative domain structure analysis, this new gene, designated Dmblm, is most similar to the BLM gene of H. sapiens, strongly suggesting functional similarity.

The subgroups in the RECQ family:
Maximum likelihood estimations, assuming the F84-d{Gamma} model (YANG 1994B Down), were applied to the helicase domain present in all 13 RECQs. Dot plot analyses were executed for all domains of all 13 RECQs. On the basis of these results, we suggest that there are four possible subgroups in the RECQ family: (i) the BLM subgroup (H. sapiens Bloom, D. melanogaster Dmblm, and C. elegans T04A11.6); (ii) the yeast RECQ subgroup (S. cerevisiae SGS1 and S. pombe rqh1/rad12); (iii) the RECQL/Q1 subgroup (H. sapiens RECQL/Q1 and C. elegans K02F3.1); and (iv) the WRN subgroup (H. sapiens Werner and C. elegans F18C5.2). Our designation of subgroups provides reasonable insights for functional analysis of RECQ family genes.

N-terminal length variation in the RECQ family:
The presence of four direct repeats in the N-terminal domains allows us to suggest a correlation with the apparent evolutionary variability of the length of the N-terminal sequences. Gene elongation during evolution seems to have depended largely on domain duplications (BARKER et al. 1978 Down). The multiple duplication of small units would partially contribute to elongation of N-terminal ends of an ancestor gene that might be similar to bacterial RECQs. The small duplications shown in Figure 3 in Dmblm, C. elegans T04A11.6, C. elegans F18C5.2, and H. sapiens WRN would likely have occurred recently in evolutionary history. The WRN homologues of two vertebrates, Mus musculus (IMAMURA et al. 1997 Down) and Xenopus laevis (YAN et al. 1998 Down), have only one such sequence. The length difference of their N-terminal domains might depend on the length of the original repeat unit and the frequency of duplication events. However, the first half of the N-terminal segment of the Werner protein includes a nuclease domain homologous to bacterial RNaseD and the 3'-5' exonuclease domain of DNA polymerase I (MIAN 1997 Down; MUSHEGIAN et al. 1997 Down). This indicates that other mechanisms such as exon shuffling contributed to the N-terminal length variation.

Features and perspectives in the functional subgroups:
The relationships among members of the RECQ family do not follow the distinction between vertebrates and invertebrates, as seen in the three subgroups that contain both human and invertebrate RECQ genes (Figure 4). These features may indicate that gene duplication of ancestral RECQ sequences occurred several times. Over evolutionary time each RECQ gene would diverge and acquire a new function stabilized by natural selection or would be lost, as proposed recently by FRYXELL 1996 Down.

Mutations of S. cerevisiae SGS1 increased mitotic recombination in ribosomal DNA (rDNA) repeats (GANGLOFF et al. 1994 Down) and between direct repeated genes (WATT et al. 1996 Down). We found that an sgs1 mutant is hypersensitive to MMS. Mutations of S. pombe rqh1/rad12 caused the hypersensitivity to UV, hydroxyurea or {gamma}-ray (MURRAY et al. 1997 Down; STEWART et al. 1997 Down; DAVEY et al. 1998 Down) and increased mitotic recombination with hydroxyurea treatment (STEWART et al. 1997 Down). S. pombe rqh1/rad12 appear to function on a checkpoint protein dependent DNA damage response during S phase (MURRAY et al. 1997 Down; STEWART et al. 1997 Down; DAVEY et al. 1998 Down). Yeast RECQ proteins might inhibit inappropriate homologous recombination associated with postreplicational repair against DNA damage. Also, the SGS1 gene was shown to be involved in cellular aging (SINCLAIR et al. 1997 Down), which was associated with accumulation of extrachromosomal rDNA circles (SINCLAIR and GUARENTE 1997 Down).

Somatic cells from patients with Bloom syndrome show increased sister chromatid exchanges (HEARTLEIN et al. 1987 Down; KURIHARA et al. 1987 Down) and increased interchanges between homologous chromosomes (GERMAN 1993 Down). BLM cells are shown to be hypersensitive to ethyl methane-sulfonate (KREPINSKY et al. 1979 Down). Dmblm cDNA partially rescued the MMS sensitivity of an sgs1 mutant. H. sapiens BLM has a similar rescue activity (YAMAGATA et al. 1998 Down). These results provide evidence of the functional similarity between the BLM subgroup and the yeast RECQ subgroup. The BLM subfamily members might inhibit extra homologous recombination at/near the site of DNA damage. In D. melanogaster, P-element-induced double strand gaps are repaired by gene conversion accompanied very infrequently with crossing over (NASSIF and ENGELS 1993 Down). Interchromosomal recombination in mitotic cells of mammals is very low compared with that in yeast (SHULMAN et al. 1995 Down). These reports indicate that mitotic crossing over is strictly inhibited in metazoans. H. sapiens BLM and Dmblm might be involved in preventing the loss of heterozygosity induced by DNA damage.

Cells from patients with Werner syndrome exhibit various chromosome rearrangements (SALK et al. 1981 Down; SCAPPATICCI et al. 1982 Down), but do not show increased sister chromatid exchange (GEBHART et al. 1988 Down), in contrast to BLM mutants. WRN cells exhibited a slower rate of DNA replication (FUJIWARA et al. 1977 Down). More recently WRN cells were shown to be hypersensitive to 4-nitro-quinoline-1-oxide (OGBURN et al. 1997 Down). The Werner protein is likely to work with replication protein A (SHEN et al. 1998 Down). YAN et al. 1998 Down suggested the role of Werner protein and replication protein A in DNA replication. The WRN subgroup members might be involved in resumption of DNA replication following postreplicational repair. The human WRN gene and the C. elegans F18C5.2 share siginificant similarity in their helicase and C-terminal domains. This indicates that C. elegans F18C5.2 may be a useful model of Werner syndrome.

In the RECQL/Q1 subgroup, there is no genetical data. We suggest that a genetical approach using C. elegans K02F3.1 may be useful to address the role of RECQL/Q1 in humans. A biochemical finding suggests that the human RECQL/Q1 protein possesses DNA helicase activity and translocates in a 3' to 5' direction on the DNA to which it binds (SEKI et al. 1994 Down). The manner in which the short N-terminal segment shared by the members of this subgroup affects their DNA helicase activities may be a key for the clarification of their specific function.

In E. coli, homologous recombination is initiated in several ways (CLARK and LOW 1988 Down; LLOYD and LOW 1996 Down). The E. coli RecQ helicase appears to initiate a type of homologous recombination with RecJ exonuclease (KUSANO et al. 1994 Down). In a recB-C- sbcA- background, recQ-J- strains are more resistant to UV-irradiation than recJ- strains (KUSANO et al. 1994 Down; LOVETT and SUTERA 1995 Down). In a recB-C- sbcB-C- background, however, a recQ-J- strain is more sensitive to UV-irradiation than a recJ- strain (data not shown). These two results indicate that E. coli RecQ contains an additional function(s) as well as one epistatic to RecJ function. Also, the E. coli RecQ gene appears to control nonhomologous recombination (HANADA et al. 1997 Down). The functional relationships between bacterial RECQs and eukaryotic RECQs are important in order to understand how RECQ family members have diverged.

Meaning of suppression by mutations in the RECQ family members:
Consider two genes A and B. A- suppresses a defect due to B-. This genetic interaction is not allele-specific, because a null mutation in A, such as a deletion or insertion, causes suppression. The following two phenomena appear to correspond to this genetic interaction: the suppression of the defect of a top3 mutation (B-) by an sgs1 null mutation (A-) in S. cerevisiae (GANGLOFF et al. 1994 Down; LU et al. 1996 Down) and the suppression of the defects of recJ mutations (B-) by recQ null mutations (A-) in E. coli. (KUSANO et al. 1994 Down; LOVETT and SUTERA 1995 Down). The two phenomena could be explained by the hypothesis that A- activates a pathway substituted for the AB pathway so as to alleviate any severe phenotype caused by B-. In this way, the sgs1- mutation might enable type I topoisomerase to substitute for Top3 in S. cerevisiae cells (LU et al. 1996 Down). The recQ- mutation might open other homologous recombination pathways for E. coli cells (KUSANO et al. 1994 Down). The RECQ family members might therefore operate on the early steps in the intracellular events that involve them.


*  ACKNOWLEDGMENTS

We thank Christine Preston and Dena Johnson-Schlitz for helping in the examination of polytene chromosomes and Carlos Flores for helpful comments on the manuscript. We are grateful to Takemi Enomoto and Akira Shimamoto for providing yeast strains and plasmids. This work was supported by National Institutes of Health grant GM-30948 and Uehara Memorial Foundation. This is paper 3526 from the University of Wisconsin Laboratory of Genetics.

Manuscript received June 17, 1998; Accepted for publication November 10, 1998.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

BARKER, W. C., L. K. KETCHAM and M. O. DAYHOFF, 1978 Duplications in protein sequences, pp. 359–362 in Atlas of Protein Sequence and Structure, edited by M. O. DAYHOFF. National Biomedical Research Foundation, Silver Spring, MD.

BENNETT, R. J., J. A. SHARP, and J. C. WANG, 1998  Purification and characterization of the Sgs1 DNA helicase activity of Saccharomyces cerevisiae. J. Biol. Chem. 273:9644-9650[Abstract/Free Full Text].

CHOMCZYNSKI, P. and N. SACCHI, 1987  Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal. Biochem. 162:156-159[Medline].

CLARK, A. J., and K. B. LOW, 1988 Pathways and systems of homologous recombination in Escherichia coli, pp. 155–215 in The Recombination of Genetic Material, edited by K. B. LOW. Academic Press, Inc., NY.

DAVEY, S., C. S. HAN, S. A. RAMER, J. C. KLASSEN, and A. JACOBSON et al., 1998  Fission yeast rad12+ regulates cell cycle checkpoint control and is homologous to the Bloom's syndrome disease gene. Mol. Cell. Biol. 18:2721-2728[Abstract/Free Full Text].

EISEN, J. A., K. S. SWEDER, and P. C. HANAWALT, 1995  Evolution of the SNF2 family of proteins: subfamilies with distinct sequences and functions. Nucleic Acids Res. 23:2715-2723[Abstract/Free Full Text].

ELLIS, N. A., J. GRODEN, T.-Z. YE, J. STRAUGHEN, and D. J. LENNON et al., 1995  The Bloom's syndrome gene product is homologous to RecQ helicases. Cell 83:655-666[Medline].

ENGELS, W. R., C. R. PRESTON, P. THOMPSON, and W. B. EGGLESTON, 1986  In situ hybridization to Drosophila salivary chromosomes with biotinylated DNA probes and alkaline phosphatase. Focus 8:6-8.

FELSENSTEIN, J., 1978  Cases in which parsimony and compatibility methods will be positively misleading. Syst. Zool. 27:401-410.

FISHEL, R., M. K. LESCOE, M. R. RAO, N. G. COPELAND, and N. A. JENKINS et al., 1993  The human mutator gene homolog MSH2 and its association with hereditary nonpolyposis colon cancer. Cell 75:1027-1038[Medline].

FLEISCHMANN, R. D., M. D. ADAMS, O. WHITE, R. A. CLAYTON, and E. F. KIRKNESS et al., 1995  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd.. Science 269:496-512[Abstract/Free Full Text].

FRYXELL, K. J., 1996  The coevolution of gene family trees. Trends Genet. 12:364-369[Medline].

FUJIWARA, Y., T. HIGASHIKAWA, and M. TATSUMI, 1977  A retarded rate of DNA replication and normal level of DNA repair in Werner's syndrome fibroblasts in culture. J. Cell. Physiol. 92:365-374[Medline].

GANGLOFF, S., J. P. MCDONALD, C. BENDIXEN, L. ARTHUR, and R. ROTHSTEIN, 1994  The yeast type I topoisomerase Top3 interacts with Sgs1, a DNA helicase homolog: a potential eukaryotic reverse gyrase. Mol. Cell. Biol. 14:8391-8398[Abstract/Free Full Text].

GEBHART, E., R. BAUER, U. RAUB, M. SCHINZEL, and K. W. RUPRECHT et al., 1988  Spontaneous and induced chromosomal instability in Werner syndrome. Hum. Genet. 80:135-139[Medline].

GERMAN, J., 1993  Bloom syndrome: a mendelian prototype of somatic mutational disease. Medicine 72:393-406[Medline].

GORBALENYA, A. E., E. V. KOONIN, A. P. DONCHENKO, and V. M. BLINOV, 1989  Two related superfamilies of putative helicases involved in replication, recombination, repair and expression of DNA and RNA genomes. Nucleic Acids Res. 17:4713-4730[Abstract/Free Full Text].

GRAY, M. D., J.-C. SHEN, A. S. KAMATH-LOEB, A. BLANK, and B. L. SOPHER et al., 1997  The Werner syndrome protein is a DNA helicase. Nat. Genet. 17:100-103[Medline].

HANADA, K., T. UKITA, Y. KOHNO, K. SAITO, and J. KATO et al., 1997  RecQ DNA helicase is a suppressor of illegitimate recombination in Escherichia coli.. Proc. Natl. Acad. Sci. USA 94:3860-3865[Abstract/Free Full Text].

HARTL, D. L., D. I. NURMINSKY, R. W. JONES, and E. R. LOZOVSKAYA, 1994  Genome structure and evolution in Drosophila: applications of the framework P1 map. Proc. Natl. Acad. Sci. USA 91:6824-6829[Abstract/Free Full Text].

HEARTLEIN, M. W., H. TSUJI, and S. A. LATT, 1987  5-Bromodeoxyuridine-dependent increase in sister chromatid exchange formation in Bloom's syndrome is associated with reduction in topoisomerase II activity. Exp. Cell Res. 169:245-254[Medline].

HENIKOFF, S. and J. G. HENIKOFF, 1994  Position-based sequence weights. J. Mol. Biol. 243:574-578[Medline].

HIGGINS, D. G., A. J. BLEASBY, and R. FUCHS, 1992  CLUSTAL V: improved software for multiple sequence alignment. Comput. Appl. Biosci. 8:189-191[Abstract/Free Full Text].

HILLIS, D. M., 1991 Discrimination between phylogenetic signal and random noise in DNA sequences, pp. 278–294 in Phylogenetic Analysis of DNA Sequences, edited by M. M. MIYAMOTO and J. CRACRAFT. Oxford University Press, Oxford, UK.

HILLIS, D. M. and J. P. HUELSENBECK, 1992  Signal, noise, and reliability in molecular phylogenetic analyses. J. Hered. 83:189-195[Abstract/Free Full Text].

HUELSENBECK, J. P., 1991  Tree-length distribution skewness: an indicator of phylogenetic information. Syst. Zool. 40:257-270.

HUELSENBECK, J. P. and D. M. HILLIS, 1993  Success of phylogenetic methods in the four-taxon case. Syst. Biol. 42:247-264.

HUELSENBECK, J. P. and B. RANNALA, 1997  Phylogenetic methods come of age: testing hypotheses in an evolutionary context. Science 276:227-232[Abstract/Free Full Text].

IMAMURA, O., K. ICHIKAWA, Y. YAMABE, M. GOTO, and M. SUGAWARA et al., 1997  Cloning of a mouse homologue of the human Werner syndrome gene and assignment to 8A4 by fluorescence in situ hybridization. Genomics 41:298-300[Medline].

IRINO, N., K. NAKAYAMA, and H. NAKAYAMA, 1986  The recQ gene of Escherichia coli K12: primary structure and evidence for SOS regulation. Mol. Gen. Genet. 205:298-304[Medline].

ISHIZAKI, K., T. YAGI, M. INOUE, O. NIKAIDO, and H. TAKEBE, 1981  DNA repair in Bloom's syndrome fibroblasts after UV irradiation or treatment with Mitomycin C. Mutat. Res. 80:213-219[Medline].

KANEKO, H., K. ORII, E. MATSUI, N. SHIMOZAWA, and T. FUKAO et al., 1997  BLM (the causative gene of Bloom syndrome) protein translocation into the nucleus by a nuclear localization signal. Biochem. Biophys. Res. Commun. 240:348-353[Medline].

KAROW, J. K., R. K. CHAKRAVERTY, and I. D. HICKSON, 1997  The Bloom's syndrome gene product is a 3'-5' DNA helicase. J. Biol. Chem. 272:30611-30614[Abstract/Free Full Text].

KREPINSKY, A. B., J. A. HEDDLE, and J. GERMAN, 1979  Sensitivity of Bloom's syndrome lymphocytes to ethyl methanesulfonate. Hum. Genet. 50:151-156[Medline].

KURIHARA, T., M. INOUE, and K. TATSUMI, 1987  Hypersensitivity of Bloom's syndrome fibroblasts to N-ethyl-N-nitrosourea. Mutat. Res. 184:147-151[Medline].

KUSANO, K., Y. SUNOHARA, N. TAKAHASHI, H. YOSHIKURA, and I. KOBAYASHI, 1994  DNA double-strand break repair: genetic determinants of flanking crossing-over. Proc. Natl. Acad. Sci. USA 91:1173-1177[Abstract/Free Full Text].

LLOYD, R. G., and K. B. LOW, 1996 Homologous recombination, pp. 2236–2255 in Escherichia coli and Salmonella: Cellular and Molecular Biology, edited by F. C. NEIDHARDT. ASM Press, Washington, DC.

LOVETT, S. T. and V. A. SUTERA, JR., 1995  Suppression of RecJ exonuclease mutants of Escherichia coli by alterations in DNA Helicases II (uvrD) and IV (helD). Genetics 140:27-45[Abstract].

LU, J., J. R. MULLEN, S. J. BRILL, S. KLEFF, and A. M. ROMEO et al., 1996  Human homologues of yeast helicase. Nature 383:678-679[Medline].

MIAN, I. S., 1997  Comparative sequence analysis of ribonucleases HII, III, II, PH and D. Nucleic Acids Res. 25:3187-3195[Abstract/Free Full Text].

MOUNT, S. M., C. BURKS, G. HERTZ, G. D. STORMO, and O. WHITE et al., 1992  Splicing signals in Drosophila: intron size, information content, and consensus sequences. Nucleic Acids Res. 20:4255-4262[Abstract/Free Full Text].

MURRAY, J. M., H. D. LINDSAY, C. A. MUNDAY, and A. M. CARR, 1997  Role of Schizosaccharomyces pombe RecQ homolog, Recombination, and checkpoint genes in UV damage tolerance. Mol. Cell. Biol. 17:6868-6875[Abstract].

MUSE, S. V. and B. S. GAUT, 1994  A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. Mol. Biol. Evol. 11:715-724[Abstract].

MUSE, S. V. and B. S. WEIR, 1992  Testing for equality of evolutionary rates. Genetics 132:269-276[Abstract].

MUSHEGIAN, A. R., D. E. BASSETT, JR., M. S. BOGUSKI, P. BORK, and E. V. KOONIN, 1997  Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs. Proc. Natl. Acad. Sci. USA 94:5831-5836[Abstract/Free Full Text].

NAKAYAMA, H., K. NAKAYAMA, R. NAKAYAMA, N. IRINO, and Y. NAKAYAMA et al., 1984  Isolation and genetic characterization of a thymineless death-resistant mutant of Escherichia coli K12: identification of a new mutation (recQ1) that blocks the RecF recombination pathway. Mol. Gen. Genet. 195:474-480[Medline].

NASSIF, N. and W. R. ENGELS, 1993  DNA homology requirements for mitotic gap repair in Drosophila. Proc. Natl. Acad. Sci. USA 90:1262-1266[Abstract/Free Full Text].

NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, NY.

NEI, M., 1991 Relative efficiencies of different tree-making methods for molecular data, pp. 90–128 in Phylogenetic Analysis of DNA Sequences, edited by M. M. MIYAMOTO and J. L. CRACRAFT. Oxford University Press, Oxford, UK.

OGBURN, C. E., J. OSHIMA, M. POOT, R. CHEN, and K. E. HUNT et al., 1997  An apoptosis-inducing genotoxin differentiates heterozygotic carriers for Werner helicase mutations from wild-type and homozygous mutants. Hum. Genet. 101:121-125[Medline].

PURANAM, K. L. and P. J. BLACKSHEAR, 1994  Cloning and characterization of RECQL, a potential human homologue of the Escherichia coli DNA helicase RecQ. J. Biol. Chem. 269:29838-29845[Abstract/Free Full Text].

ROELS, S., A. DRIKS, and R. LOSICK, 1992  Characterization of spoIVA, a sporulation gene involved in coat morphogenesis in Bacillus subtilis.. J. Bacteriol. 174:575-585[Abstract/Free Full Text].

SALK, D., K. AU, H. HOEHN, M. R. STENCHEVER, and G. M. MARTIN, 1981  Evidence of clonal attenuation, clonal succession, and clonal expansion in mass cultures of aging Werner's syndrome skin fibroblasts. Cytogenet. Cell Genet. 30:108-117[Medline].

SCAPPATICCI, S., D. CERIMELE, and M. FRACCARO, 1982  Clonal structural chromosomal rearrangements in primary fibroblast cultures and in lymphocytes of patients with Werner's syndrome. Hum. Genet. 62:16-24[Medline].

SEKI, M., H. MIYAZAWA, S. TADA, J. YANAGISAWA, and T. YAMAOKA et al., 1994  Molecular cloning of cDNA encoding human DNA helicase Q1 which has homology to Escherichia coli Rec Q helicase and localization of the gene at chromosome 12p12. Nucleic Acids Res. 22:4566-4573[Abstract/Free Full Text].

SEKI, M., J. YANAGISAWA, T. KOHDA, T. SONOYAMA, and M. UI et al., 1994  Purification of two DNA-dependent A