Gene, 111 (1992) 119-124 © 1992 Elsevier Science Publishers B.V. All fights reserved. 0378-1119/92/$05.00 119 GENE 06287 Analysis of a ribosomal RNA operon in the actinomycete F r a n k i a (Actinomycetales; 16S, 23S, 5S rRNA; nitrogen-fixing; Casuarina; recombinant DNA) Philippe Normand, Benoit Cournoyer, Pascal Simonet and Syhie Nazaret Laboratoire d'Ecologie Microbienne du Sol, U.R.A. C.N.R.S. 1450, Universitd Lyon I, F-69622, Villeurbanne Cedex (France) Received by K.F. Chater: 29 August 1991 Revised/Accepted: 10 October/ll October 1991 Received at publishers: 22 November 1991 SUMMARY The organisation of ribosomal RNA-encoding (rrn) genes has been studied in Frankia sp. strain ORS020606. The two rrn clusters present in Frankia strain ORS020606 were isolated from genomic banks in phage ~EMBL3 by hybridization with oligodeoxyribonucleotide probes. The 5'-3' gene order is the usual one for bacteria: 16S-23S-5S. The two clusters are not distinguishableby restriction enzyme mapping inside the coding section, but vary considerably outside it. Sequencing showed that the 16S-rRNA-encoding gene of ORS020606 is very closely related to that of another Alnus-infective Frankia strain (Ag45/Mut 15) and highly homologous to corresponding genes of Streptomyces spp. Two possible promoter sequences were detected upstream from the 16S gene, while no tRNA-encoding gene was detected in the whole operon. Regions with a high proportion of divergence for the study of phylogenetic relationships within the genus were looked for and found in the first intergenic spacer, in the 23S and in the 16S gene. INTRODUCTION The nitrogen-fixing symbiosis between the actinomycete Frankia sp. and a number of woody dicotyledonous plant genera play a major role in agroforestry. A better utilization of the Frankia-symbiosis will necessitate consideration of such properties as persistence in the soil and competitiveness of released strains, which requires precise strain identification, such as that afforded by oligo hybridization (Simonet et al., 1990). The rRNA-encoding genes, with Correspondence to: Dr. P. Normand, Laboratoire d'Ecologie Microbienne du Sol, U.R.A.C.N.R.S. 1450, B[R. 741, Universit6 Lyon I, F-69622, Villeurbanne Cedex (France) Tel. (33)72448289; Fax (33)72448466; e-mail NORMAND @CISM.UNIV-LYON I.FR. Abbreviations: bp, base pair(s); IGS, intergenic spacer; kb, kilobase(s) or 1000 bp; nt, nucleotide(s); oligo, oligodeoxyribonucleotide; ORF, open reading frame(s); PCR, polymerase chain reaction; rrn, rRNA-encoding genes; rRNA, ribosomal RNA; tRNA, transfer RNA; Y, C or T (U). their conserved areas that alternate with highly variable zones, provide targets for oligo identification (Haun and G0bel, 1987; Wilson et al., 1988). Attempts at sequencing directly the 16S rRNA from Frankia strains were successful in the case of two Alnusinfective strains (Hahn et al., 1989) but could not be achieved in the case of Casuarina-infective strains due to problems in the isolation of suitable rRNA (S.N. and E. Stackebrandt, unpublished). For these reasons, the Casuarina-infective Frankia strain ORS020606 from which a gene bank had already been constructed (Normand et ai., 1988), was deemed an appropriate candidate for the study of rRNA-encoding genes in Frankia. EXPERIMENTAL AND DISCUSSION (a) Characterizationof rRNA-encoding gene clusters There were clear signals when probes LSUll03 and LSU2536' (Table I) labeled according to Zeffand Geliebter 120 TABLE I Listofoligosused Designation ~ 5' ~ 3 ' sequence b SSU413 SSU901 SSUllg0' SSU1508 LSU20' LSUII03 LSU2536' AGC GCC GGG TCT AGA GAA GAA GAC TTG GCA AGA TCT AGA CAG GCC GGA TGA GGT GCC GTG CCC GCG GTA TGA GAA AAG CGT AAC TGA CGG CTT GTC GCA AAT CCT GGG CCG GAC GTA TCC AGC TGG ATG CA GTC ACA ACC TCA AAC AGG GTG TAG C " The oligo designations include three letters describing whether they correspond to 16S (SSU) or to 23S (LSU) genes, a number that corresponds to the E. coli sequence coordinates (Brosius et al., 1981), and a prime or no prime when the sequence is homologous or complementary, respectively, to the RNA sequence. b Oligo probes were end-labe!ed by phosphorylation using T4 polynucleotide kinase and [~,-32P]ATP (185 TBq/mmol) according to Zeff and Geliebter (1997). (1987) were hybridized with genomic DNA prepared as described before (Normand et al., 1988)from Frankia strain CeD isolated from Casuarina equisetifolia (ORS020606; Diem et ai., 1983). One or two fragments, depending on the restriction enzyme used, gave a positive signal (Fig. 1). There are thus two rRNA-encoding gene clusters called rrnA and n'nB, present as BamHI fragments of 6 + 6 and 3.5 + 2.2 kb, respectively (Fig. 1). It has been suggested that in bacteria, the number of clusters could be directly related to the growth rate. Frankia with a doubling rate of a few days is certainly slow and the low number of clusters found would thus be consistent with this explanation. 24kb- ~'~,~,, 9.5 - tjjj~ (b) Cloning of the two rRNA-encoding gene clusters Hybridization of the ORS020606 gene bank (Normand et al., 1988) with oligo probe LSU1103 yielded five positive signals, two of which were chosen at random for further studies. These turned out to contain the two different rRNA-encoding gene clusters, i.e., they had BamHI fragments of 6 and 3.5 kb that hybridized both with oligo probe SSU413 (5' part ofl6Sgene) and LSU1103 (middle of the 23S gene). Two representative clones were named Olrnal and Olrna2 containing clusters rrnA (containing the two 6-kb BamHI fragments) and rrnB (containing the 3.5 and 2.2-kb BamHI fragments), respectively. The two phages ~i, O 6.",, 4.5 2,3 m 2.0 " 0.6 . @ A B Fig. I. Genomic blots of Frankia total DNA. Frankia strain ORS020606 total DNA was cut with the indicated restriction enzymes. Electrophoresis was carried out at 1 V/cm for 16 h in TAE buffer (Maniatis et ai., 1982), and the resulting gel was transfered onto a nylon filter and hybridized with ~2P-labeled LSU1103 that corresponds to the first half of the 23S gene (panel A) or with LSU2536' that corresponds to the second half of the 23S gen0 (panel B). Size markers were derived from phage ~. digested by HindIlI (sizes in kb marked on the left margin). Hybridizations were at 45 °C in 2 x S SC (Maniatis et al. (1982)/1 u Denhardt's (1966)/1% (w/v) SDS/10~ (w/v) dextran/l mg per ml of denatured salmon sperm DNA. 121 were hybridized with purified, labeled 16S, 23S or 5S rRNA p:epared from Frankia strain Tx30SAb (Normand and Lalonde, 1986) and partially hydrolyzed in 50 mM Tris-Cl pH 9.5 (pH at 25°C)/0.1 mM spermidine at 95°C for 15 min, and with different oligos. The resulting genetic and restriction maps of the two clones are given in Fig. 2. This map is totally consistent with the genomic blots (not shown). The three rrn genes in the two operons were found to be arranged in the order 16S-23S-5S usual for bacteria. Restriction sites were identical within the rrn genes but extensive sequence divergence started from around 700 bp upstream from the EcoRI site present at the start of the 16S genes (e.g., the Bglll site of O l r n a l is replaced by BamHl 6 3 ? 4 5 cOR 8 9 6 indl 7 I0 aral II 8 Cpnl 12 9 13 I0 II pslt -Kpnl - 14 S alI 01rnal 01rna2 Fig. 2. Comparison of the two operons. Relevant parts of the inserts in phages O l r n a l and Olrna2 are shown side by side with restriction maps. Those sites mapping at similar points on the two dusters are connected. The zones corresponding to the genes are shaded. The scales are in kb and thek coordinates start at the BamHI junction site with the phage 2 leR arm. and SalI sites in Olrna2) and about 2 kb downstream from the HindllI site where a Xhol site present in Olrnal is replaced by Sail and BamHI sites (Fig. 2). The unique EcoRI site present in the first half of several bacterial 16S genes is present in the two clusters. (c) Sequencing of the rrnA gene cluster The nt sequence ofgene cluster rrnA is shown in Fig. 3. It has been deposited in GenBank and given accession No. M55343. The whole sequence from the first of the two Bgill sites up to theXhoI site is 6481 nt long. The restriction sites found by computer search are completely consistent with the restriction analysis of the O l r n a l phage (Fig. 2). The allocation of 16S, 235 and 5S segments was determined using the high sequence similarity of that region with Streptomyces ambofaciens (Pernodet et al., 1989) with the Bestfit comparison algorithm (Smith and Waterman, 1981). The length of the 16S gene (1512 nt) is somewhat shorter (by 16 nt) than the corresponding one in S. ambofaciens, and more markedly so (by 30 nt) relative to Bacillus subtilis (Green and Void, 1983) and E. coliRI (Brosius et al., 1981). The G+C ofthe gene is 60~o, which is higher than most 16S sequences with which it was compared but lower than total DNA (71%) from Frankia according to Fernandez et al. (1989). The 16S gene is the most conserved gene of the operon, especially at its 3' end. The highest sequence similarity is with Frankia strain Ag45/Mut15 (95%) and with S. ambofaciens (90 %). The highest sequence divergence is not to be found in the first hypervariable region of Embley et al. (1988) where only two mismatches with S. ambofaciens are present. However, the hypervariable zone of Pernodet et al. (1989) at the beginning of the gene, and to a lesser extent the second hypervariable zone of Emblcy et al. (1988) are potential sites for species specific probes. This last zone has been exploited by Nazaret et al. (1991) for phylogonetic purposes in Frankia. The 23S gene was found to extend from nt 2451 to 5549 for a length of 3099 nt, a length comparable to that of S. ambofaciens (3120 nt) and thus amongst the longest bacterial 23S sequences determined so far (Pernodet et al., 1989). The percentage G+C is lower (57%) than that of the 16S gene, which appears also to be the case in other bacteria studied. The highest sequence similarity with the S. ambofaciens sequence is to be found at both extremities, especially at the 3' end of the gene (Fig. 3). Conversely, the highest divergence was found in three regions (numbered from the start of the Frankia 23S gene): at nt 310-450 (64 mismatches/140nt), at nt 1245-1350 (49 mismatches/ 105 nt) and especially at nt 1510-1715 (81 mismatches/ 210 nt). The 5S gene extends from nt 5618 to 5736 for a length of l l 9 n t . The last nt, a C, may not belong to the gene itself since it is part of the perfect inverted repeat extend- GTTGTGAGGG A T T ~ beginning of 16S qene Itpal GGTACGTGGG TAGGTTAACT ACACGGTGGT GGGCCTGTGA G G ~ T G TCGATGTCGG G G ~ C G A GCC3~TT~IT ~fACCCCG~ CATGGCCTGG CTGGGTGGTG GGrTC~TCGC TI~GAACGCG AACGTCCGGA GTGTGCAAGA " -10" ~CCGAG T~/%CTTAAAG AAGCACCAC4% ACG~AGCCCT GTACGAGAGT G G ~ C A C C TGGTGGTGTG GAAAGATTTA TCGGCTCGGG A ~ G G C C C G C GGCCTATCAG C~WGTTGGTG GGGTGATGGC ACCGGTGCCG AAGGCGGGTC TCTGGGCCGG AACTGACGCT AA~AGCGAA AGCGTGGGGA GCGAA~AG~q TTAGATACCC TGGTAGT~A Eagl G~£GGGCGCT AG~TGT~GGG ~-ACCI~CCAC GGCCTCCGTG CCGCAGCT~A CGCATTAAGC GCCCCGCCTG GGGAGTACGG C C G C ~ G C T TCGGCTGTGA EcoRV GCGCAGAT~T /~GCACCGGC AAC~CGACCC CGTGAAGTCG GAGTCGCTAG TAATC~AGA TCAGCJ~TGC TGCGG~G~T A~GTTCCCGG GCCTTGT/~A CACCGCCCGT GCACGCTGTT G G G T C C ~ G GGAGTGAGGC TCT~TCGGCT GGTGGGTTCC TGTGCTGTGG GGG/~GTGAA ACATCTCATr ACCCGCAGGA GGAGRAAACA ACCGTGA~rC CGCG~GTAGT GARTGGGCCG CCATAGA~GG TRATAGCCCT GTAGCCGARA GCAGTGTTCC TCCCGGACGT C ~ GAATGAGL'CT GCGAGIT]'GC GGTGTGTGC.C GAGGrTAACC CGTGTGGGGT AGCCGTAGCG A~GcC_~AGTC CC~AGKGGGC ~ G C Bali CCA'IGGCCAG G~I'GAAGCGC GGGTAAGACC GTUTGGAC,GA CCGA~CCAU CAGGG'I'I'GAA ,~CCTGGGGG C GTCCGCCGGA TGACC~AGGG TrCC~GGGC AG~CTMTCC C-CCCAGGGTG T A G ~ CAGGGCCGCC CGTACCCT~A N : C ~ GAGGTCAC.,GT AGAGAATACC GAGGCGTTCG #ETG~ACTGT GGTTAAGC~A CTCGC-C~AAT GCCCCGTRAE TTCGGAC~ARG GGGGUCCGTT CTCCGTGTAG GGG'ITI'ACCT CCGRACEC~G G A ~ C G C xcaI AGAGACCAGG GGA~CGAC T~TTTAUrAA AAAUACAGCT CCGTGCT~G "23CGT~AGACG&~-T&TACG6 AG'DGREGCCT GCCCGGTC~T ~ G T T A A GGffrGAT~CT AT~CTO~C~ C ~ q A A ~ T ~ CCAGC-CGGTG G'FrGTCCTGG GGCAI~GGTG TN;GACGAGG CGTN;GTAAA TCCGCGTCTr GTGTGTCTGA GkCCTGATGC CGAGCCGATT GCGGCGARGT AGTCGGGACC TAAGGCGAGG CCGACAGGCG T ~ T C G A T G G ATAACGG~'I~ C~T~TrCCCG TACCGGCGTT GACGCGGCCA ~ C C T G G ~ T BamHI 4001 AACCATCTGA TCGGATGTGT CTCTTCGGAG GTG~A-A~GG C ~ T ~ ~CCCGGCTGG TAGTAGGCAA GCGATUGGGT GACGAGGAAG GTA~CCAGC CATACGAGTG AGAATGCAGG CATGAGTAGC GAATGACGGG ~ 3501 TTCGTAGTCG AGAGGGA2~C AGCCCAGATC GCCAGCTAAG GCCCCTAI~GC G T G C U C T ~ TGTGGRGTCG CATAGACAAC CAGGTGGTGC LSUl103 TrAGAAGCAG C C A C C ~ A~C,AGTC~q' AATAGC'IEAC T C ~ ' ~ G T G A T T U ~ G ~ C GA~A~TGTAG C G G G G C T C ~ GCGCACCGCC GAAGCTGCGG SphI CATGCACAAA ATrTCCCGGC ~ C ~ T CCAGGTTTGT GTGTGGGTAG GGGAGCGTCG TGTGGCGTGT GAAGCGGCGG GGTGACCCAG CCGTGGATGC ATGAGCTGTG GGTA~GGGTG A P ~ G C C A A T C ~ ' T C G G T GATRGCTGGT TCTCCCCG~A ATGC,qTTTA~ GTGCAGCGTC GCA'AUI'A~f TGCCGGAGGT Hindl II HlndlII AGAGCACTGG ATGGCCT~J3G GGGCCCAC~d% GCTTACTGAA GTCAGCCRRA CTCCGAATGC CGGTA~GTGA AGTC~GGC~G TGAGACTC~G GGGGATAAGC ATGTCCAAGA CCCGAAGCCG AGTGATCTAC C GTTCCCAAGT AC,CACGGI~C CCGTGTAA~T CCGTGTC~AT C'IV,GCGGGAC CACCCGCTRA GCCTAAATAC TCCCTGGTGA CCGATA~CGG ACTAGTACCG Sinai 3001 TGAGC~A~%G GTGAAAAGTA CCCCGGGAGG GGAGI~AAT A G T A C ~ ACCGTGT~CC TACA~TCCGT GGGAGCTGGA CT~TGGTCTG GTG~CCGCGT TCGGGGAGTC AGAAAGAGCA TrGTTAGGCG AAGGTCATGC GGTGAGCGAA AGCGGAACAG C~TA~ACCAG TGTCGTGTGA GAGCCGGCAG GTGTTGCGAT GCTGGGG]WG TGGGATCGTC CAGGTGGAGC TGCCG'FFCUA CACCCGTGCC TGAACACATA GGGCATGTGG AGGGAACGCG TCC~GGT(3GG GATCGTGGTG GGGGGTCTGC TGGTGGAACG GACCGGTCGT GGTGGAGGAC TGCCTTCCCG T~GGGTGGGG GTGGGTTTCG CCGGGTCGGC Ball/beglnnlng o f 235 qene TGGGC,CCGTC CGTACGTTGA ~ T G C A C A GTGGACGCGA GCATCTTTGT GG~CAAGTTA T T A ~ G G C ~ ACGGTGGATG CL"TTG~CAC~ ~ G ~ C C ~ T DraI 25~I GAAGGACGT~ GGAGGCTGCG ATATGCCTCG GGGAGCTGTC ~ACCGAGCTG TGATCCGAGG ATrTCCGAAT GGGGARACCC GGCAGGGCTT TAAATCCTGT TGC~GGGTGG AACC~TCCGG GTGGGTGGCT GGGGTCCGTG CACGTCACGA A,~3TCGGTAA CACCCGAAGC CGGTGGCCTA ACCCIT~TGG GGGG~GCCGT CGAAGGTGGG ACCGGCGA~I GGGACGAAGT CGT~d~CAA~G e n d o f 16s gene SD 2001 T~C,CCGTACC GCJ%~GC:TC,CC, C~CTGGATCA~ GGCTGG'~'~ TCCTGTAGTG GGGTGGGCTG GTGCAGAGCC AGGGCCGGCT Seal GTGGATGCCG GTCTGGTTGC TCGTGGGTGG AACGCTCACG AT~GGGTT~C GGCTGGTTGT CCGGGGTCTA GTACTCCTCT GT~CTTATCT TCGGGI~GGG CGGGGTCTGC CCCTTACGTC C~V~3GCTGCA CACATGCT~J~ AATGGCCGGT ACAAAGGGCT GCGATGCCGT GAGGTGGAGC GAATCCCAPA gJ~CCGGTCT CAGT~CGGAT CAACCCTCGT CCTATGTTGC CAGCGAG~CG TGTCGGGG~ TCATAGGAGA CTGCCGGGGT CAACTCGGAG C~AGGTC~GG AT~ACGTCAA GTCATCATGC 1501 ATCTCGTAGA GATACGGGGT CCGTAAGGGT CCTGCACAGG TGGTGCATGG CTGTCGTCAG CTCGTGTCGT GAGATG~rGG G ~ A A G T C C C CCP~CGAGCG A ~ A C T C A A A GGAATTGACG GGGGCCCGCA CAAGCGGCGG AC~ATGTGGC T~AATTCGAT GCAACGCCAA C~u~CCTTACC AGGGCTTGAC ATCCAGGGAA CGCCGTAAAC CAGGAGGAAC AGCCGGCCTG C~GC~G~GCCT CTACCAAGGC GACGACGGGT AGAGGCEGAC CGGCCACACT GGGACTGACA CAEGGCCCAG A~TCCTACGG GAGGCAGCAG TGGGGAATAT TGCGCAATGG 5SU413 KpnI/Pst I GACGCAGCGA CCCCGCGTGG CC,C~I~,~EGG CCfTCGGGTr GTAAF~CTCT TTCAGCAGGG P~GAAGCGAG AGTGACGGTA CCTGCAC~AG Sstl 1001 CAACTACGTG CCAGCAGCCG CGGTAATACG TAGGGTGC~A GCG~TGTCCG GAATTA~33G GCGT~AAC~tG CTCGTAGGCG G C C T G ~ C G Smal Pstl ECORI A~ACCCGGGG CTCA%CTCCG GGCCTGCRGT CGATA~GGGC AGGCTAGAGT CCGGCAGGGG AGACTGG~AT TCCTGGTGTA GCGGTGRAAT GCCGGGCATC ATGGAGAGTT T ~ A T C ~ TCAGGACGJ~A CGCTGGCGGC GTGC~ITAACA CA~CCAAGTC GAGCCGGGAG Sstl CTTCGGCTCT CAGCGGCGAA CGGGT~AGTA ACACGTGGC-C A~CCTGCCCC GAGCTCTGGA ATAR~CTCGG GAARCCGGGG CTAATGCCGG ATATGAERTI• 501 TGZ'FL'A-J.I~-A TCGGCC'CTCT C ~ A T C - ~ TCCCACGTGG TGGGGTGGCT ~ T T G G I]qlll 1 AGATCI~CTC CGAGGC,C,GCG TAC~C~CCCC GAGGAAA~GG GCCGCGGACG GTC~CATAAG "-35" "-lO" ACCGGGCCGG AGTGATTTGA CACAATCA/~ CCPJ%AGCTCA T~ATCTAGGA / ~ A C G T T C C Bgl I I "-35" AGGTAAGATC "IY2ITCGG~TG T C ~ T T D C GC~CA~G J%AGCGATI~G J%CACCGCCGG RNA p r o c a s s l n q s l " a GAC.nGTGC,~n GTCTGGTAGC G ~ C T ~ c " r ~ ~;~:;CGTGCC GATTGTCAGT ~AGTAA AUGG~GTUG TAALWATAAC CATCCTAAGG TA~£UAAATT cuTrffI~GGG A ~ C TGGGTCCGTG A~TCA GGGACAGTGT CAC~TGGGrA GTITAACTGG AGGGCT~£CC GA~GGGATAA C C ~ G AC~A~ CGCCACGCTG CKC.~_N:CAS CTCWI"CCGGT Fig. 3. Nucleotide sequence ofoperon rrnA. Sequencing was by the chain termination method of Sanger et al. (1977) using the deazaTaqTrack TM kit of Promega (Madison, WI) and [35S]dATP on overlapping clones of both strands. The rRNA cluster was subcloned from phage OIRNAI after digestion with Sail into pBR328 (Soberon et ai., 1980) yielding pFQ180, from which further subclonings were then made. The sequencing strategy used was based on single-stranded DNA clones in Bluescript (Stratagene, San Diego, CA) or in M13mpl8 and 19 derivatives (Yanisch-Pel"ron et al., 1985) propagated in DH5otF'IQ cells (BRL), using either universal primers on clones obtained with restriction enzymes or tailor-made primers (Table l) so as to completely determine the sequence of both strands. The sequence coordinates start at the first of the two Bg/ll sites, upstream from the 16S gene. Some relevant restriction sites are shown above the sequence line. Putative promoter sites are highfighted by double underlining. The putative RNA processing site is underlined. The beginning and end of all three r r , genes are underlined. Relevant oligo sites are overlined. The messenger attachment site at the 3' end of the 16S gene (Shine and Dalgarno, 1974) is shown by a wavy line (nt 2030-2034). The putative terminator downstream from the $S gene is highlighted by a wavy line (nt 5737-5789). TCAGCTACCT CATCGAGCGC CTCCGTGCG~ GCGGI~CCGG C~ItCGTCCGG GTCTACGGt~ GCGC.~GGCGGGGTGATCCTG CCGGCGGAC~ TCGACCI~CT BclI Bcll StuI GC&CGCGCGC GGGGTCGCGC GCATCI'PC'IE CCCGCA~G~ GGGCAC.,CGGC TCGGTCTCGC A ~ A T C ~ T C AATA.~ATCA TCGN3G~CTG 1V.,ACGTCGAT Smal XhoI CTGACGGAGG ACGGGCCGAA U~TCGATGCG ~ G C T G ~CACGG CGCGCTCGC~ CGGGCGATCA CCGTGCTCGA G GCC~CGRAGT GATCCAICIC S¢a1 A ~ G C C C T G T CGTCCTATCA GGGCGGGCAT ~ A C T Eagt GCT~CGA~TA ACATCATG~ TCGCCT~TT~ CAGGOT~AGG C.~CM,CCC,GC A C ~ GTCTCATI"CC C.~CCCCC,~A GCT~%GCTCI TCAGCGCCC~ TGGTACTGCA Xcal CTAG~GAGAC AGCCCCACA~ TCGTATACGC GTATTTrCCG "-10" TACATGGAT~ TC~?ACTAT~ ~ G ~ C ~ q A G Tb'I'~CTAGGT ATC'I~GC:'CG ~f~TCC CCGACTAGAT GATCGGGTT~ ATAGGCCGGA GGTGGAAGTG EagI AAGGTGCT&C GCGTCCACTG TGCGGTIt'TC GC,G'~TACGG CCC,GTrCGGC GGTAAGGCCC GGGCATGACC GCAOCGTCGA GGAGGTCGTC ACCGCt~CGA TCCAGGAGGA CGCCCAGGGG 6001 CCGGATCGTG ACG~CGGCET C~CTCTTCC~ TG6GCATGAC GGTCTCCCAC AGGGTAGCCT e n d o f 23S gene AGGCCC~.GGG CTI'GTCT~G gene TGGCGA~SG ~ G t ' % ' C G qene XcaI ~ AD3AGTGTGG "- 35" ACTA~3GCT GGAGCACCCG CT1T~GCACC TTGGC~U~GG ~ A T ~ T A T SmaI/SD f-met CCACATCGTC C ~ C C ~ T CCTC~T,ACGC G~TGTCCIEC CC~GCGCAGG TGGCTGGTrG GCTAPT[TCG GTCGTGAUAC N3TI'CGGTCC CT&TCCGUCG AGGACCGGGA CGGACC~ACC TCTGGTC/TC~ CAGTrGTTCT GCCAAGGGCA CATCT~GCU GGAAGCCIGC ~TCG~EATGA • 5501 CGGTGACGCA TC,~J~CTGAC ~ b e g l n n t n g o f 55 TGGTATATCT G C _ ~ T A ~ I~:C-,G'I~GTrl' end o f 5S T ~ r ~ ; ~ C ~ TGTGGC~GAG ~ CTAGTACGAG I'GTTCGCCCA TTAAAGCGGT ~CGCGAGCTG GGTI~A~AAC AGTUCAASTU CACA~TC.~AG CT'IV-,)ETGTG Iq3nI/srraI 5001 AGACAGACAT GTCGAGCAGG TGCC~qAAGCA GGGACTAGTG ATCCGGCGGT G U ~ AGCGCCG~EG CTC~qUGG&T ~ G T R E C CCGGGGATAA IJJ2536 * CAGGCTGATC TTGCCCAAGA GTCCATATCG ACGGCRAGGT TTGG~ACCTC ~ATGTCGGCT C TCGCATCC TGGGGC'IGGA GTaOTrC~CA .%~..C.n'I'TC~.r-.r- GTTGTTC~AA TACCACTCIG GTCGTACTGG GGCGGTTGCC TCCTAAN~G TAACGGAC,nC GOCCA~GGT TCCCT~QCC TGG'I'IV,CCM, T C A ~ TGGTGGAGTC TAAGTI'CCGA CCTGCACGAA TC.,GCGTAAt'G ACIT~CCAC Tb'IT'Iv_A~C N~GACTCGG C ~ T I ~ : A TTACC,AGTA~ AGAIT,CTCGT TACGOSCGGC SmaI &GGACGGAAA C4qCCCt'GGC~ CCITTACTAT AGCTIT~TAT TC,GI~TI'~]G ~ GTAG~ATKC,G ~ ~ A G G ACC,CCRGI'I'C 4 5 0 | GGGGACGGGT TAGCTL'T~G GGGCGAA~ ~ A A 123 ing slightly beyond the second XcaI site 52 nt downstream. The first 411-nt IGS between the 16S and the 23S genes is markedly longer than spacers in S. ambofaciens (303 nt) and in B. subtilis (169 nt) where no tRNA-encoding genes have been detected. It is of a length comparable to that in E. coli (440 nt) which must accommodate a 75-nt tRNA °lu. In Frankia, a search using the Staden (1980) program revealed no tRNA. An 83-nt tRNA-like structure was detected in the last part of the IGS at nt 2337-2419 but the lack of a GTT(or ~)C in the side arm and the presence of mismatches in all arms make it a poor candidate for tRNA. The search for tRNAs reve',ded no other such structure in the zone upstream from the 16S gene. The second IGS is 68 nt long, comparable to the 82 nt in S. ambofaciens, 55 nt in B. subtilis and to the 92 nt in E. coll. The similarity found with S. ambofaciens between the 16S and the 23S genes is low: less than 50% similarity. Together with the three regions in the 23S gene, the first IGS is potentially the most interesting region to probe with oligos for discrimination between closely related strains. There were two potential promoters upstream from the 16S gene, one situated at nt 114-147 and the other at nt 245-276 (Fig. 3). The consensus sequence of these two was TTGACA for the --35 motif and TAAYYT for the -10 motif, separated by 18 and 17 nt respectively. The presence of two or more promoters is a common feature in E. coii (Brosius et al., 1981), B. subtilis (Green and Void, 1983) and Streptomyces spp. (Baylis and Bibb, 1988; Pernodet et al., 1989). Furthermore, a comparison with the corresponding sequence of S. ambofaciens yielded a 75% similar 99 nt sequence (with one gap) in common between the two organisms (nt 280-378), containing the motif 5'-GCTCCTTGAGAACTCAACA. The motif, which corresponds to the RNA processing site of the primary transcript postulated by Baylis and Bibb (1988), is situated 175 nt upstream from the S. ambofaciens 16S gene and 185 nt upstream from the Frankia corresponding sequence. Downstream from the 5S gene, a 53-nt sequence (5'-CGTATACGAGTGTGGGGCTGTCTCCCTAGGGAGACAGCCCCACACTCGTATAC_GCGTATTIT) of inverted symmetry (underlined) was found bordered by two Xcal sites (doubly underlined), which are otherwise rare (only one other site in the whole rrn operon). The free energy of the stem would be low (AG = -45.6 kcal/mol) as calculated using the model of Zucker and Stiegler (1981). The repeat is perfect suggesting a strong terminator, followed by a block of four Ts, characteristic of eubacterial Rho-independent terminators (Platt, 1986). Further downstream we found a putative ORF of more than 555 nt with a putative promoter (-35:TGGCAT, [N]t7,-10:TACTAT), a ribosome-binding site and an ATG. The ORF has a > 95 ~o probability of coding for a protein according to Fickett's (1982) statistic. No homologous region was found in other published sequences. (d) Conclusions (1) There are two rRNA operons in Frankia strain ORS020606. (2) Both were cloned and found to be very conserved in the coding regions and to differ outside it. The gene order is the usual one for bacteria: 16S-23S-5S. (3) The nt sequence of one operon was determined. Sequence conservation is high with S. ambofaciens. No tRNA could be detected in the spacer. ACKNOWLEDGEMENTS Thanks are expressed to Claude Lemieux, Jean Boulanger and Christian Otis (D6partement de Biochimie, Universit6 Laval) for help with cloning into Bluescript phagemids and synthesis of oligos, to P.H. Roy (D6partemerit de Biochimie, Universit6 Laval) and Guy Perri6re (Biom6trie, Universit6 Lyon I) for help with the sequence analysis programs, to Luc Simon (CRBF, Universit6 Laval) for helpful discussions about primer design and to Laurence Vergnaud for technical help with sequencing (CRBF, Universit6 Laval). REFERENCES Baylis, H.A. and Bibb, M.J.: Organization of the ribosomal RNA genes in Streptomyces coelicolor A3(2). Mol. Gen. Genet. 211 (1988) 191196. Brosius, J., Dull, TJ., Sleeter, D.D. and Noller, H,F.: Gene organization and primary structure of a ribosomal RNA operon from Escherichia coll. J. Mol. Biol. 148 (1981) 107-127. Denhardt, D.T.: A membrane filter technique for the detection of complementary DNA. Biochem. Biophys. Res. Commun. 23 (1966) 641646. Diem, H.G., Gauthier, D. and Dommergues, Y.: Isolation of Frankia from nodules of Casuarina equisetifolia. Can. J. Microbiol. 28 0983) 526530. Embley, T.M., Smida, J. and Stackebrandt, E.: Reverse transcriptase sequencing of 16S ribosomal RNA from Faenia rectivirgula, Pseudonocardia thermophila and Saccharopolyspora hirsuta, three wall type IV actinomycetes which lack mycolic acids. J. Gen. Microbiol. 134 (1988) 961-966. Fernandez, M.P., Meugnier, R.H., Grimont, P.A.D. and Bardin, R.: Deoxyribonucleic-acid relatedness amongst members of the genus Frankia. Int. J. Syst. Bacteriol. 39 (1989) 424-429. Fickett, J.W.: Recognition of protein coding regions in DNA sequences. Nucleic Acids Res. 10 (1982) 5303-5318. Green, C.J. and Void, B.S.: Sequence analysis of a cluster of twenty-one tRNA genes in Bacilf:~ subtilis. Nucleic Acids Res. 11 (1983) 57635774. Hahn, D., Lechevalier, M.P., Fisher, A. and Stackebrandt, E.: Evidence for a close phylogenetic relationship between members of the genera Frankfa, Geodermatophilus and "Blastococcus' and emendation of the family Frankiaceae. Syst. Appl. Microbiol. 11 (1989) 236-242. 124 Haun, G. and G0bel, U.: Oligonucleotide probes for genus-, species- and subspecies-specific identification of representatives of the genus Proteus. FEMS Microbiol. Lett. 43 (1987) 187-193. Maniatis, T., Fritsch, E.F. and Sambrook, J.: Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1982. Nazaret, S., Co,meyer, B., Normand, P. and Simonet, P.: Phylogenetic relationships among Frankia genomic species determined by use of amplified 16S rDNA sequences. J. Bacteriol. 173 (1991) 4072-4078. Normand, P. and Lalonde, M.: The genetics of actinorhizal Frankia: a review. Plant Soil 90 (1986) 429-453. Normand, P., Simonct, P. and Bardin, R.: Conservation ofn/f sequences in Frankia. Mol. Gen. Genct. 213 (1988) 238-246. Pomodet, J.-L., Boccard, F., Alegre, M.-T., Gagnat, J. and Gu~rineau, M.: Organization and nucleotide sequence analysis of a ribosomal RNA gene cluster from Streptomyces ambofactens. Gene 79 (1989) 33-46. Platt, T.: Transcription termination and the regulation ofgene expression. Ann. Roy. Biochem. 55 (1986) 339-372. Sanger, F., Nicklen, S. and Coulson, A.R.: DNA sequencing with chainterminating inhibitors. Proc. Natl. Acad. Sci. USA 74 (1977) 54635467. Shine, J. and Dalgarno, L.: The 3'-terminai sequence of Escherichia coil 16S ribosomal RNA: complementary to nonsense triplets and ribo- some binding sites. Prec. Natl. Acad. Sci. USA 71 (1974) 13421346. Simonct, P., Normand, P., Moiroud, A. and Bardin, R.: Identification of Frankia strains in nodules by hybridization of polymerase chain reaction products with strain-specific oligonucleotide probes. Arch. MicrobioL 153 (1990) 235-240. Smith, T.F. and Waterman, M.S.: Comparison of bio-sequences. Adv. Appl. Math. 2 (1981) 482-489. Soberon, X., Covarmbias, L. and Bolivar, F.: Construction and characterization ofnew cloning vehicles, IV. Deletion derivatives ofpBR322 and pBR325. Gene 9 (1980) 287-305. Staden, R.: A computer program to search for tRNA genes. Nucleic Acids Res. 8 (1980) 817-825. Wilson, K.H., Blitchington, R., Hindenach, B. and Greene, R.C.: Species-specific oligonucleotide probes for rRNA of Clostridium di~icile and related species. J. Clin. Microbiol. 26 (1988) 2484-2488. Yanisch-Perron, C., Vieira, J. and Messing, J.: Improved M13 phage cloning vectors and host strains: nucleotide sequences of the Ml3mpl8 and pUCI9 vectors. Gene 33 (1985) 103-119. Zeff, R.A. and Geliebter, J.: Oligonucleotidc probes for genomic DNA blots, Focus 9 (1987) 1-2. Zucker, M. and Stiegler, P.: Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Res. 9 (1981) 133-148.
© Copyright 2024 ExpyDoc