FEMS MicrobiologyLetters96 (1992) 225-230 © 1992Federation of European MicrobiologicalSocieties0378-1097/92/$05.00 Published by Elsevier

225

FEMSLE 05048

Sequence of the gene encoding type F neurotoxin of Clostridium botulinum Alison K. East, P.T. Richardson, D. Allaway, M.D. Collins, T.A. Roberts and D a p h n e E. T h o m p s o n Department of Microbiology,AFRC Institute of Food Resea:ch. Reading Laborat~.'y, Readi;;g, UK

Received5 June 1992 Accepted 24 June 1992

Key words: Clostridium botulinum; Type F neurotoxin; Nucleotide sequence; Polymerase chain reaction

1. SUMMARY

2. INTRODUCTION

Primers designed to conserved regions of botulinum and tetanus clostridial toxins were used to amplify DNA fragments from non-proteolytic Clostridiura bomlinum type F (202F) DNA using polymerase chain reaction technology. The fragments were cloned and the complete nucleotide sequence of the gene encoding type F toxin determined. Analysis of the nucleotide sequence demonstrated the presence of an open frame encoding a protein of 1274 amino acids, similar to other botulinum neurotoxins. Upstream of the toxin gene is the end of an open reading frame which encodes the C-terminus of a protein with homology to non-toxic-non-hemagglutinin component of type C progenitor toxin.

Within the genus Clostridium, organisms producing protein toxins able to cause paralysis symptomatic of botulism are deemed to be Clostridium botulinum. Outbreaks of botulism in humans and animals have resulted in the classification of the neurotoxin into seven serologically different forms identified as type A ( B o N T / A ) to type G (BoNT/G). B o N T / A , B o N T / B and B o N T / E are the main causative agents of human botulism, with rar,e incidents attributed to B o N T / F and B o N T / G [1]. The disease is most often the result of ingestion of toxin formed in foods that have been inadequately processed a n d / o r stored at inappropriate temperatures. Another source of the disease is after colonization of the gut by clostridia in infants of less than approx. 6 months, consequently known as infant botulism [1,2]. Occasionally botulism is the result of infection of a wound. There have been reports of BoNT being produced by clostridia that

Correspondence to: D.E. Thompson, Department of Health, Eileen House, 80-94NewingtonCauseway, London, SEI 6EF, UK.

226 phenotypicaily are not C botulinum viz. C butyticum and C barati [1,2]. Type F neurotoxin is produced by C. botulinum Group I (proteolytic strains), Group II (non-proteolytic strains) and by C. barati [3]. Comparison of the complete amino acid sequences from BoNT genes (BoNT/A [4,5], BoNT/C [6], B o N T / D [7], BoNT/E [8] and BoNT/E of C. butyricum [9]), and partial sequence (BoNT/B [10]), with tetanus toxin (req'x) [11] has revealed conserved regions thought to be implicated in toxin function [10]. Primers based on data from these conserved regions were used to clone and sequence the gene encoding B o N T / F from non-proteolytic C. botulinum using polymerase chain reaction (PCR). We report here the nucleotide sequence of a gene, which when translated, gives an amino acid sequence similar to those of BoNTs already reported [5-10].

3. MATERIALS AND METHODS 3.1. Cultures and preparation of DNA C botulinum strain 202F (ATCC 23387) was grown anaerobically in TPYCG (trypticase 50 g / l , bactopeptone 5 g / l , yeast extract 3 g/I, cysteine HCI 0.5 g / I and glucose 5 g / i ) broth and DNA prepared as described by Farrow et al. [12]. All manipulations of C. botulinum and cloned fragments were carried out under GMP II conditions. E. coli SURE (Stratagene) was used as the recipient for transformation. Plasmid DNA was prepared from E. coli by the alkaline lysis method of Birnboim and Doly (see ref. 13) and for sequencing, by the method of Treisman as described by Sambrook et al. [13]. 3.2. PCR amplification PCR was performed with a Biometra thermocycler using the following conditions: 95°C for 5 rain, followed by 25 cycles of: 94°C for 45 s, between 37 and 45°C for 45 s and 58°C for 2 min, followed by 62°C for 10 rain and then 4°C. Primers were made on an Applied Biosystems model 391 DNA synthesiser and used at a final concentration of 4 ng//~l. Deoxynucleotides (BCL) were present at a final concentration of 200 mM and 1

unit Taq polymerase (Amersham International) was used according to the manufacturers instructions. Template DNA was at a final concentration of 20 ng//xi except for 'inverse PCR' when the concentration was 50 n g / # ! of restriction enzyme-digested and re-ligated DNA. For 'inverse PCR' [14], DNA was prepared using restriction enzymes (BCL) according to the manufacturer's instructions and ligations performed using a 'Ligation System' kit (Amersham International).

3.3. Cloning and transformation Products of PCR were cloned into the EcoRV site of pBlueseript KS + (Stratagene) following modification and purification of the fragment by Klenow (BCL), 'Geneclean' (Stratateeh), kinase (Gibeo-BRL), phenol extraction and ethanol precipitation [13]. An alternative vector system was used to construct pCBF2 and pCBF8; DNA fragments generated by PCR were cloned directly, without enzymic modification, into 'T-vector' (D. Evans, University of Reading, personal communication) digested with Xcml (New England Biolabs). This digestion generates a single 3' overhanging T on the vector which base pairs with the A, often added to the 3' end of PCR fragments during the reaction [15]. Transformation into E. coil competent cells, prepared as described by Inoue et al. [16], was carried out as described by Sambrook et al. [13]. Plasmids containing inserts of the expected size were sequenced following testing of the recombinant-ceU lysates by the mouse bioassay [17], to ensure that no toxic polypeptides were produced.

3.4. Sequence determination Sequencing of plasmid DNA was carried out using dideoxy-nucleotidc chain termination. Primers complementary to both the plasmid and toxin were used to sequence both strands of each insert completely. As the inserts were generated using PCR, the sequence of two fragments synthesised in different PCR reactions was determined. Where two sequences differed, as they did in three positions, a third clone was sequenced and in each case confirmed one of the sequences.

227 4. RESULTS AND DISCUSSION To ensure that no toxic polypeptides were produced only fragments encoding part of BoNT/F were cloned in E. coli. Figure 1 shows the position of the primers used to amplify regions of DNA which were subsequently cloned. No recombinant plasmid was toxic in the mouse bioassay [17]. The open reading frame codes for a protein of 1274 amino acids with a molecular mass of 146 708. This calculated value compares well with the measured value of 150000 for BoNT/F purified from proteolytic C. botulinum [18]. The protein encoded by BoNT/F shows high homology with TeTx and other BoNTs. Since these toxins are usually cleaved in cico and as the two chains have different functions in toxic activity [10], the proteins were divided into light and heavy chains for comparative purposes. Comparing BoNT/F with light and heavy chains for toxins where the sequence is known (data not shown), the light chain of B o N T / F is more like that of BoNT/E, showing 58.3% and 57.6% identity with the toxins of C. botulinum and C. butyricum respectively, than TeTx (45.1% identity) and BoNT/A (35.1% identity). The heavy chain also showed greater homology with the BoNT/E from C. botulinum (64.9% identity) and C. butyricum (64.4% identity), with BoNT/A and TeTx having identities 44.6% and 35.0% respectively. The high sequence identity of BoNT/F and BoNT/E may explain SaI/3AI

I

the cross-reactivity of antiscra raised against either type toxin with both BoNT/F and BoNT/E. Within the toxin sequence there are regions of highly conserved amino acids thought to be important for structure and function (see ref. 10). In the light chain, the region noted by Niemann, DPhhnLhHELnHnnHxLYG, where h is a hydrophobic residue, n is uncharged and x is any amino acid [10], is also present in BoNT/F (Fig. 2, residues 220-239). The cysteine residues involved in disulphide linkage of the heavy and light chains are found in all the toxins (Fig. 2, residues 429 and 445). The number of residues between the cysteines varies between eight (BoNT/B) and 27 (TeTx); BoNT/F has 15 amino acids. As with BoNT/E, this BoNT/F is encoded by a non-proteolytic C botulinum, it is therefore unlikely to undergo proteolytic cleavage by an endogenous viotease, but a possible cleavage site for exogenous proteases is between residues 439 and 440 (Fig. 2). The C-terminus of the heavy chain, a region noted for its lack of homology in BoNTs [10], is very well conserved between BoNT/F and BoNT/E. It has been suggested that this region is involved in the binding of BoNT to a ganglioside receptor [10]. The high homology between BoNT/E and BoNT/F may indicate that they share a common receptor. Analysis of sequences upstream of BoNT/F revealed a possible Shine-Dalgarno sequence 5'AGGGGG-3' (Fig. 2, nucleotides 132-137), eight nucleotides upstream of the proposed start of

Sa~J3AI EcoRI

[

i

I

LIGHT C H A I N

[

HEAVY CHAIN

E~V

~ 1

I

I

"4

E~V

I

I I

C~ne

Primers

I;)CBFI"

164-236

pCSF2"

164-236

pCBF3 pCBF4

1-2 1-2

pCBF5

-,e

pCBF6*

q q -'91

I~

pCBF7

1-230 229-202 3-12

pCBFH"

399-400

pCBF9"

399-400

lkb

Fig. I. Strategy for cloningfragmentsof the BoNT/F gene. Arrows indicate the positionof the primers used to generate the fragments.Clonesgeneratedby "inversePCR" are indicatedby *: in the case of pCBFI*/pCBF2* the chromosomalDNA was digested with Sau3AL for pCBF6* with EcoRI, and pCBF8*/pCBF9* with EcoRV. Only the sites of restriction enzymes involvedin the constructionof clonesare shown.

228

translation. Further upstream, nucleotides 33-38 and 47 (Fig. 2) are identical to those mapped as the - 1 0 and transcriptional start by Binz et al. for B o N T / A using primer extension [5]. The - 3 5 region suggested by the same authors for B o N T / A of 5'-TTAACCC-Y is present virtually unchanged (5'-TFAACGC-3') in the same position in BoNT/F (Fig. 2, nucleotides 5-11). However, further analysis of this upstream region of the BoNT/F gene shows that it contains the Y-end of an open reading frame (Fig. 2, nu-

TTGTTTAACGCATGCTCATAAC;,ATAAGTATATATGT= TATCTATGA~GAT 3AGAACTA

6C

TAATTGGATGATATGTAACAATGAAAGChACA':ACCT AAAAAGGCATATTTGTGGATATT ~20

180

GAAAGAAGTATAGGGGGGATTTTTATGCCAGTTGCAATAAATAGTTTTAATTATAATGAC M P V A : N S F N Y N D

t:2~

CCTGTTAATGAT,3ATACAATTTTATACATGCA.GATACCATATGAAGAAAAAAGTAAAAAA 240 P v ~ ~ z, T ~ : V ~ O : P Y 8 ~ K S ~ K (32) a~.

.,

p,,m~,IM

TATTAT

~GCTTTTGAGATTATGCGTAATGTTTC~GATAATTCCTGA3AGAAATACAATA 330 A F E I M R N v w I : P F. R N T I (52~ Sau3AI GGAACGAATCCTAGTGATTTTGATCCACCGGCTTCATTAAAGAACGGAAGCAGTGCTTAT 360 Y

Y

K

.~

.-

~

P

s

c

r

D

P

P

A

S

L

K

N

G

S

S

A

Y

TATGATC CTA ATTATT TAACCACT GATGC T GAAAAAGAT AG ATAT T TAAAAACAACG A TA Y 3 P r~ Y L T T 0 A z ~ s R Y L ~ .'r T :

(~2*

420 (92)

480 AAAT TATT TAAGAGAATTAATAGTAATCCTG CAGGGAAAG T T T T GT TACAAG AAATAT CA K n r K ~ : ~ S ,~ ~ A S K V L L ~ z z s (112) ~cuR I TATCCTAAACCATATTTAGGAAATGACCACACGCCAATTGATGAAT:CTCTCCAGTTACT 540 Y A K P Y L G N D H T P : D £ F S ¢ V T (132) 600

AGAACTACAAGTGTTAATATAAAATTATCAACTAATGTTGAAAGTTCAATGTTATTGAAT ~ T : S v ~ I K n ST ~ v z s s M L ~ ~152~ CTTCTTGTATTGGGAGCAGGACCTGATATATTTGAAAGTTGTTGTTACCCCGTTAGAAAA L

L

V

t

G

A

G

P

D

I

F

z

S

C

C

Y

P

V

8

K

66G

H72~

CTAATAGATCCAGATGTAGTTTATGATCCAAGTAATTATGGTTTTGGATCAATT;U%TATC 720 L ! D P 0 V V Y D P S .': Y G r .~ S : .': : (192)

GTGACATTTTCACCTGAGTATGAATA?ACTTTTAATGATATTAGTGGAGGGCA'~AATAGT 780 v T F S P 8 Y E Y T F N D ! 8 G G H N S (212) AGTACAGAATCATTTATTGCAGATCCTGCAATTTCACTAGCTCATGAATTGATACATGCA 840 s T z s r : A D P A I S L A H E n I ~'A {232) CTGCATGGATTATACGGGGCTAGGGGAGTTACTTATGAAGAGACTATAGAAGTA.%AGCAA L

H

G

L

Y

G

A

R

G

V

T

Y

E

E

T

I

E

v

K

e

GCACCTC~ 1'AIGA~ A G C C G A ~ C C C A T A A G G C T A G A A G A A T T T T T A A C C T T T G G A G G T A

P

:.

M

I

A

E

K

P

I

B

L'E

E

F

L

T

r

O

G

900 (252) 860 (272)

cleotides 1-130 and data not shown), translation of which into protein sequence gives the C-terminal 57 amino acids (Fig. 3). The translated sequence shows homology with the non-toxic-nonhaemagglutinin component of C. botulinum type C [19], the gene for which is found in the same position, upstream of the toxin gene. Published sequences reveal the presence of such open reading frames in type A [5], type E [8] and C. butyricum [9], although not commented upon by the authors• As these translated open reading

!2bO (a72)

GAGAGTGACTTAGCAAATA~TTTAAAGT~TGTAG~TACITATTTTATTA~TAT E ~ ~ ~ A u K F K v r c ~ ~: T Y r : K Y

G~TTTTT~GTTCC~ATTFGTTAGATGATGATATTTATACTGTATCAGAGGGGTTTI320 E

F

L

K

V

P

N

L

L

D

D

D

I

Y

T

V

S

E

G

F

(

3

9

2

(

A A T A T A G G T ~ T T T A G C A G T A A A C ~ T C ~ C 3 G A C A A A G T A T ~ A G T T ~ A ~ C C T A A A A T T 1380 N I G N L A V N N P G Q S I ~ L N P K I ( 4 1 2 ) A T T G A T T C C A T T C C A G A T A ~ G G T C T A G T A G A A A A G A T C G T T A ~ T T T T G T ~ G A G C G T T 1440 I D S I P D K G L V E K : V K F ~ S V ( 4 3 2 ) A T T C C T A G A A A A G G T A C ~ A G G C G C C A C C G C G A C T A T G C A T T A G A G T ~ A T ~ T A G T G A G i500 l P R K S = ~.A P P R L c i R v N N S E (452~

T

T T A T T T T T T G T A G C T T C A G A A A G T A G C T A T A A T G A A A A T G A T A T T ~ T A C A C C T A ~ G ~ 1560 L F F V A S E S S Y N E N n I N T P K E ( 4 7 2 ) A T T G A C G A T A C ~ C A A A T C T A A A T A A T ~ T T A T A G A A A T ~ T T T A G A T G ~ G T T A T T T T A 1620 I : ~ T T N U B] r; N Y R N N L D E v : L (492( G A T T A T A A T A G T C A G A C ~ T A C C T C A A A T A T C A A A T C G ~ C A T T A A A T A C A C T T G T A C ~ 1680 D Y N S Q T I P Q I S N R T L N T L V Q ( 5 1 2 ) GAC~TAGTTATGTGCC~GATATGATTCT~TGG~C~GTGAAATAGAGG~TATGAT D N S Y V ? ~ Y D S N G T S E I E E Y D ( 5 3 2 )

1740

GTTGTTGACTTT~TGTATTTTTCTATTrACATGCACAAAAAGTGCCAG~,GGTGAAACC 1800 V V D F N V F F Y L H A Q K V P E G E T ( 5 5 2 ) A A T A T ~ G T T T ~ C T T C T T C ~ T T G A T A C A G C A T T A T T A G ~ G ~ T C C A A A G A T A e A T T T 1860 N ; S L T S S I D T A L L E E S K D I F ( 5 7 2 ) T T T T C T T C A G A G T T T A T C G A T A C T A T C ~ T A ~ C C T G T A A A T G C A G C A C T A T T T A T A G A T 1920 F S S E F I D T I N K P V N A A L E I D ( 5 9 2 ) T G G A T ~ G C A A A G T ~ T ~ G A G A T T T T A C C A C T G ~ G C T A C A C A A A A A A G T A C T G T T G A T 1980 W : S K V I R D F T T E A T Q K S T V D ( 6 1 2 ) A A G A T T G C A G A C A T A T C T T T ~ T T G T A C C C T A T G T A G G T C Z T G C T T T G ~ T A T ~ T T A T T 2040 K I A D I S L I V P Y V G L A L N I I I ( 6 3 2 ) CAGGCAGAAAAAGGAAATTTTGAGGAGGCATTTG~TTATTAGGAGTGGGTATTTTATTA 2100 E A E K G N F E E A F E L L G V G I L L [ 6 5 2 )

CAGGATTTAKATATTATTAC TA GTGC TAT GAAGGAAAAAATATATAACAATC TT TT AGCT 1020 (292) Q D L N I i T S A M K E 8 1 Y N N L L A

G ~ T T T G T G C C A G ~ C T T A C ~ T T C C T G T ~ T T T T A G T G T T T A C G A T A A A A T C C T A T A T A 2160 E F V P E L T ! P V I L V F T I K S Y I (672)

AACTATGAAAAAATAGCTACTAGACTTA GTGAAG T TAATAGTGC TCC T CCTGAATATGAT 1080 m Y E K i A ~ ~ us ~ v .~ s A ~ P z Y (a12~

G AT T C A T A T G A G ~ T A A A A A T A A A G C ~ T A ~ A G ~ A A A T ~ T TCAT T ~ T C G A A A G A 2220 D S Y E N K N K A I K A I N N S L I E R ( 6 9 2 )

ATTAATGAATATARAGATTATTTTCAATGGAAGTATG GGC T A GATAAAAAT G C TG ATG GA I 1 4 0 z I~ z Y K D Y r O w 8 Y G I. a ~ .~ A D O (332)

G~GCAAAGTGG~GAAATATATAGTTGGATAGTATCAAATTGGCTTACTAG~TT~T2280

AGTTATACTGT.~AATG-~AAATA.~ATTT,~J%TGI~AATTTATA,%2,,%,~ATTATATAGTTTTACA 1200

ACTCAATTTAATA=~A.~IG,~AAAGAGCAAATGTATCAGGCTTTACA/~J~TC,%AGTAGATGCA

S

Y

T

V

N

E

N

K

F

N

E

:

Y

K

R

L

Y

S

F

T

(352)

~,~

E

T

A

O

K

F

W

N

K

K

E

R

I

Y

K

S

E

W

0

M

I

V

Y

2~0

S

O

N

A

N

L

L

0

T

N

R

0

I

V

N

D

(

7

A

2

2

2340 {732)

Fig. 2. Complete nuci=otide sequence of the B o N T / F gene. The translated amino acid sequence of B o N T / F is given under the second nucleotide of each codon. The position of primers used in PCR to generate the clones is indicated by underlining; * above a nuclcotidc indicates a mismatch between the primer and the D N A sequence. The possible position of cleavage between the light and heavy chains is indicated. The cystcinc residues arc underlined at amino acids 364 and 445, Nucleotides ] to 130 form part of the open reading frame translated to give the sequence shown in Fig. 3. The sequence has been given the Genbank database accession number M92906.

)

229 ATAAAAACAGCAATAGAATATAAATATAATAATTATACTT~'AGA2";A%AAAAATAGACTT 242: GAATCTGAATATAATATCAATAATATAGAAGAAGAATT";AATA~AAAAGTTTCTTTAGCA ; 4 '%c' ATGAA~@.~TATAGAAAGATTTATGACAGAAAGTT-TATA

?C Z TA £T TAAT,~AAAT:AATA

AACTATATTCTCGACCATAGATCAATCTTAG~AGAG

................................... V

T

S

T

L

N

S

:At;At'AAA?GAATTAAGT::ATTT-"

......... S

I

7"

V

252{,

;T[AAtSA',;r;G%TT.'ATTA 2:. ~,' H V ~ S D L t (812,

AATG~u%GCCA~AGTTGGT~TT~AATATGAT/~'~CUAI N E A g V G g L K ~ Y 7 ~;

E

~64,~

T,-*T . . . . . . . . . . . . . .

L

::

?

Y

N

7

,.',:'-"' b

K

A T T C T A A T T A T A T A T T T T A A T A GAT T AT A 7AAAAA;J~T TAAA';ATA ;TT ~TA7 rTTAGAT

?" g

ATGCGATATG,~.~AATAATAAATTTA'£ A G A T A T C T C T G G A T A T t;:;TTC ~ T A T , ~ % G C A T T

2S2~

AATGGAAACGTATATATTTATTCAACAAAT

2890

~GAAAT:~,b'~T T TGGM, TATATAATAt;TAC'r~

CTTAGTGAAGTTAATATAGCTCAAAATAATGATATTA~ L S E V N ; A ~ R tl D :

A T A C A A T A G T A G A 7 Ar C A A A A T

[

Y ~

S

R

2949

Y Q N

A G A A A A n A T A ~ 5TAT RTT,~C: C T G A A 7 T S A C - G C A T T T T A A A T A T T A A T C A A C A A A G A ~ G T

3540

:TTACT:;AAJC, C T ' : T S T T T T T T T C A A C L ~ T A A A T T A T A T S A A J G A G : A G A A G Y C A T T A T A

3600

[9i2) AG A AAAAT~ T 5G T -_2 T A T A C,AT AT AT C T A A T A C A G A T AAT TT TGTTAGA,%AA;b~C "~AT C T A 3660

3~C

TTTAGTATTAGTTTCTGGGTAAGGATTCCTAAACACTACAAACCTATGi~ATCAT~TCGG

;: ATA'ATTAATC, rA~T~gAT"CTGC, T ~ T A 3 A A T A T C $ , 3 T T A T A T G C T G A T A C A A A A T ~ A A ~ 1 x v v u . a v E v ~ = v a o T ~ s E

Y

I

T

I

N

C

X

G

N

S

N

S

G

W

~

5

:

L

~

ACTGTTaGAGATTGTGAAATAATTTGGACTTTACAAGATACTTCTGGAAATAAGGA,~u%AT 312~ T V R D C S I : ~ T L Q D T S G N K E N (992] eco.~V ~ , ~ , .,02 T T A A TT T TTAGGTATG/~%G AAC T T A A T A G G A T A T C T A A T T AT AT A-%ATAAA~ GGATT T TT 3180 L ! F R Y E E h N R I $ N Y I N K W I F 11012) •

GT~ACTATTACTA.%T;~ATAGATTAGGCAATTCTAGAATTTACATC~TG(;AAATTTAATA V

T

T

|

N

N

R

L

G

s

N

R

I

Y

I

N

G N p.,,~, 4oe

L

324C~ (:032)

I

GTTG/~a,%AATC AAT TTCGA.~T T TAG GTG A T A T TC AT G T T A £ T G A T A A T A T A T TATTT~d%.%

v

e

K S

I

S

N

L O D

I

H V s

D N

:

L

F

3:22 ~::~2~

()"2)

3300

3780

SASA~.%GA ,;AAAATAATAAGAACAT CT A A T C T A A A C G A T A S C T T A G G T C ~ u ~ A T T A T A S T T E }~ E It : I [~ T D N L ~: ? :~ L G O I : V A T G :]AT I C A A T A G G A A A T A A T T ~,CACAATG;~AT TTTC~.%AA~AATAATGGGAGrAATA;

~1212) A

C 5AAG,kAAT AC T A G C A G T A A T G G A T O C T T T T G G A C T T C T A T T T

,~TAAAGAGAATC.GATG 3 1960

K tlC52) ;~AG~%2~T = A T A S S C C A ~ AGT ~%TTA2~T 2 T C A I ~ C T A C A T G A G T C T G T C A A G A A T T T T - G T

ATTGTTGGTTGTGATGATGAAACGTATGTTGGTATAA~ATATTTTAAAGTTTTTAATACG

e K

L

o

N

K

Y

W

T

I

E

G

N

~

Y

T

n

L Y

L

V

s

N

s

t~

K

~

Y

K

¥

~

[~,ta~ 12 3420 L tlo~2~

~

s

:

TATTCAATTTACTA U

F

N

4220

3360

G A A T T ~ G A T A A A A C A G A A A T T G AGAC T T T~ATATAGTAAT G AGC C A G A T C : A A G T ATe T T A

AA~AqTATTGGGGAAATTATTTGCTATATAATA.a~%AAATATTATT

~a4o

GGAT: A C T A G G T T T T C A T T ' J A A A T A A T T T G G T T G C T A G T A G T T G G T A T T A T A A C ; ~ % T A T A 390.0 J L L 5 F H i N N L v A 3 S "~ Y Y N N : (1252~

L

3480

L (:.I12)

A T A A A C A T ~ C A T A A A A A A T T T T A A A A = T A A T A T A T T T , ~ G A A T A G CTAGATATI, A3TAT'2 428~ GC T A T G T T A A T A T C TA3C T A T T T 7 ; ~ 7 T~AT I C C A T A T T A T T A T A A T A A G A A A A A A T EC~.~V ATTTC TAT: G r A A G T A C A A G T T T A T T O G T A T C T C A T A A A T G A T A C A A G A T A T C

%CT 4b;9 4193

Fig. 2. (continued).

TypeC

ggy

TypeA Typee Type E(but)

ZK~tIDXS

Lar

SpZN~I

Q.V.V S S X D N A K

TypeC

TDLrRPDC

TypeF

H D X F Z S N C L T H A H H N K Y

TypeE Type E(but)

#DX NDZ

K I :vs Mxxs

Z S M a L Q L I ~ N K ~ N ~

IOLVDNK~W~,~:~XX~ x

T F X

,

,

~o

ITF

)tMDI< YFSLS

ISNC]~'~LTYMNVN Z S N 4 = Z~T L T Y N

DGDYH

I C L S M K D E N Y N L$ IKN N V N V "L' L ~ ~ g N

DYN D Y w

frames are homologous with the non-toxic-nonhaemagglutinin component of BoNT/C (Fig. 3) it is possible that there is a toxin operon encoding the components of the progenitor botulinum toxin complex in types A and E and C. butyricum, as in type C [19].

i

TypeV 'Type A

NMI(:

TypeE Type E{but)

WVXCD M v Z

Fig.

C

3. C o m p a r i s o n

terminus protein

:PgK

NES

w M x c s ~

of open encoded

non-haemagglutinin

of the reading

by type

amino frames

acid

nucleotide

of the

of BENT.

described

of the

type E (bu0

I

sequence

upstream

C has been

from published

v

L * . ~ W S ~

PKK YLMXLKN P K K S "g

component

For type A, ~,pe E and was derived

YLWILK

[~Msxv

LNHD'I L N H

sequences

ACKNOWLEDGEMENTS

as non-toxic-

progenitor

the amino

CThe

toxin

[19].

acid sequence [5,8,9]. Type

E ( b u t ) r e f e r s t o t y p e E o f C. b u t y r i c u m .

We thank Mrs L. Taylor of Terry Research Station for testing the E. coli clones in the mouse bioassay.

2~ REFERENCES [1] McCroskey, L.M., Hatheway, C.L. Woodruff, B.A., Greenberg, J.A. and Jurgenson. P. (1991) J. Clin. Microbiol. 29, 2618-2620. [2] McCroskey, L.M., Hatheway, C.L., Fenicia, L., Pasolini, B. and Aureli, P. (1986) J. Clin. Microbiol. 23, 201-202. [3] Hatheway. C.L (1989) In: Botulinum neurotoxin and tetanus toxin, (Simpson, L.L., Ed.), pp. 3-24. Academic Press, New York. [4] Thompson, D.E., Brehm, J.K., Oultram, J.D., Swinfield, T.-J.. Shone. C.C., Atkinson, T., Melling, J. and Minton, P. (1990) Eur, J. Biochem. 189, 3-81. [5] Binz, T., Kurazono. H., Wille, M., Frevert, J., Wernars, K. and Niemann, H. (1990)J. Biol. Chem. 265, 9153-9158. [6] Hauser, D., Eklund, M.W., Kurazono, H., Binz, T., Niemann, N., Gill, D.M., Boquet, P. and Popoff, M.R. (1990) Nucleic Acids Res. 18, 4924. [7] Binz, T., Kurazono, H., Popoff, M.R., Eklund, M.W., Sakaguchi, G., Kozaki, S., Krieglstein, K., Henschen, A., Gill, D.M. and Niemann, H. (1990) Nucleic Acids Res. 18, 5556. [8] Whelan, S.M., EImore, M.J., Bodsworth, N.J., Atkinson, T. and Minton, N.P. (1992) Eur. J. Biochem. 204, 657667.

[9] Poulet, S., Hauser, D., Quanz, M., Niemann, H. and Popoff, M.R. (1992) Biochem. Biophys. Res. Commun. 183. 107-113. [10] Niemann, H. (1991) In: Sourcebook of Bacterial Protein Toxins (AIouf, J.E. and Freer, J.H., Eds.), pp. 303-348. Academic Press, London. [11] Fairweather, N.F. and Lyness, V.A. 0986) Nucleic Acids Res. 14, 7809-7812. [12] Farrow, J.A.E., Jones, D., Phillips, B.A. and Collins, M.D. (1988) J. Gen. Microbiol. 129, 1423-1432. [13] Sambrook, J., Fritsch, E.F. and Maniatis, T. (1989) In: Molecular cloning: A laboratory manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N'Y. [14] Ochman, H., Gerber, A.S. and Hartl, D.L. (1988) Genetics, 120, 621-623. [15] Clark, J.M. (1988) Nucleic Acids Res, 16, 9677-9686. [16] lnoue, H., Nojima, H. and Okayama, H. (1990) Gene 96, 23-28. [17] Hauschild, A.H.W. (1989) In: Foodborne bacterial pathogens, (Doyle, M.P., Ed.), pp. 111-189. Marcel Dekker, New York. [18] Yang, K,H. and Sugiyama, H. (1975) Appl. Microbiol. 29, 598-603. [19] Tsuzuki, K., Kimura, K., Fujii, N., Yokosawa, N. and Oguma, K. (1992) Biochem. Biophys. Res. Commun. 183, 1273-1279.

Sequence of the gene encoding type F neurotoxin of Clostridium botulinum.

Primers designed to conserved regions of botulinum and tetanus clostridial toxins were used to amplify DNA fragments from non-proteolytic Clostridium ...
370KB Sizes 0 Downloads 0 Views