Journal of General Virology (1992), 73, 737-741. Printedin Great Britain

737

Sequence analysis of M2 mRNA of bovine respiratory syncytial virus obtained from an F-M2 dicistronic mRNA suggests structural homology with that of human respiratory syncytial virus Miguel Zamora and Siba K. Samal* Regional College of Veterinary Medicine, University of Maryland, College Park, Maryland 20742, U.S.A.

The nucleotide sequences of the F and M2 mRNAs of strain A51908 of bovine respiratory syncytial virus (BRSV) were determined by sequencing cDNA of an intracellular dicistronic mRNA. Comparison of the F mRNA sequence with those of other BRSV strains showed that there was extensive sequence identity at both the nucleotide (95% identity) and amino acid (94 % identity) levels. Alignment of the nucleotide and encoded amino acid sequences of M2 m R N A of BRSV with those of human respiratory syncytial virus (HRSV) M2 m R N A showed 69% identity at the nucleotide level and 80% identity at the amino acid level. The general features o f B R S V F and M2 proteins

are similar to those described previously for the H R S V proteins. The M2 m R N A of BRSV also contained a second internal, overlapping open reading frame (ORF) similar to one reported for HRSV. The predicted products of the second ORFs of BRSV and H R S V shared 43 % amino acid identity. As described for HRSV, the T-terminal end of the M2 mRNA overlaps with the 5' end of the L gene by 68 nucleotides. The identity between the N-terminal regions of the L proteins of BRSV and H R S V is 75 %. In addition, the intergenic sequence of the F - M 2 gene junction of BRSV was determined.

Bovine respiratory syncytial virus (BRSV) belongs to the genus Pneumovirus in the family Paramyxoviridae, which also includes human respiratory syncytial virus (HRSV). BRSV is an important aetiological agent of respiratory tract disease in cattle (Bohlender et al., 1982; Collins et al., 1988; Paccaud & Jacquier, 1970; Stott & Taylor, 1985). It has been shown recently that BRSV and HRSV are similar in gene and protein composition, and are antigenically related at the level of the F, M, N and P proteins (Lerch et al., 1989; Mallipeddi et al., 1990), with the major antigenic difference between BRSV and HRSV being in the G protein (Lerch et al., 1989; Orvell et al., 1987; Taylor et al., 1984). The genome of RSV, a negative-sense ssRNA molecule, encodes 10 genes: glycoproteins G (attachment protein) and F (fusion protein); phosphoprotein (P), major nucleocapsid (N) protein, matrix proteins M and M2, small hydrophobic (SH) protein, large (L) protein, and non-structural proteins 1B and 1C. The nucleotide sequence of all 10 mRNAs of HRSV has been determined (Galinski, 1991). However, the nucleotide and deduced amino acid

sequences of only the G, F, N, M and SH genes of BRSV have been reported (Lerch et al., 1990, 1991; Samal & Zamora, 1991; Samal et al., 1991; Walravens et al., 1990). The amino acid sequences of these BRSV proteins have been compared to those of their counterparts in HRSV. The G and SH proteins showed very low similarity (30% and 38 %, respectively), whereas the N, M and F proteins were highly conserved (80 to 93% identity). To understand the relationship between HRSV and BRSV at the molecular level, we have undertaken the cDNA cloning and nucleotide sequencing of different m R N A species of BRSV for comparison with the corresponding published sequences of HRSV. To date, we have obtained the nucleotide sequence of the mRNAs encoding the N, M, SH (Samal et al., 1991; Samal & Zamora, 1991), P and G proteins (S. K. Mallipeddi, M. Zamora, M. K. Pastey & S. K. Samal, unpublished results). In this communication, we have extended our sequence analysis to the F and M2 proteins of BRSV (A51908 strain), and present for the first time the nucleotide and deduced amino acid sequences of the M2 protein gene of BRSV. In addition, the nucleotide sequences of the F-M2 intergenic region and the M2-L overlapping region were determined from an intracellular dicistronic mRNA.

The nucleotide sequence data reported in this paper have been submitted to the GenBank database and assigned the accession number M82816. 0001-0573 © 1991 SGM

738

Short communication

Construction of a c D N A library derived from intracellular mRNAs isolated from cells infected with the A51908 strain of BRSV has been described (Mallipeddi et al., 1990). The c D N A library was screened with an HRSV F gene c D N A clone (kindly provided by Dr P. L. Collins). One of the clones that reacted with the HRSV F gene, A162, had an insert of approximately 3000 bp and was selected for nucleotide sequencing. Clone A162 [2907 nucleotides exclusive of a poly(A) tail] contained the entire F and M2 coding regions. In addition, it contained the 5' and 3' untranslated regions of each gene plus the F-M2 intergenic region. To determine the frequency of F-M2 dicistronic mRNAs, clone A162 was used to screen a c D N A library, resulting in the isolation of six c D N A clones which contained only the F gene, as determined by nucleotide sequencing. This result suggests that both the M2 m R N A and F-M2 polytranscripts are very rare, in contrast to the high frequency of M-SH polytranscripts observed previously (Samal & Zamora, 1991). The nucleotide sequence of the M2 gene was confirmed by sequencing a c D N A clone obtained by polymerase chain reaction (PCR) amplification of M2 m R N A using the oligonucleotide GCAAATATGTCACGAAGAAATCC (nucleotides 1947 to 1969 in Fig. 1) as the reverse primer and the oligonucleotide GGAATTCC-poly(dT) as the forward primer. The complete nucleotide sequence [excluding the poly(A) tail] of the F-M2 dicistronic mRNA, and the deduced amino acid sequences for the F and M2 proteins are shown in Fig. 1. The dicistronic m R N A contains two major open reading frames (ORFs). Nucleotides 14 to 1729 encode a 572 amino acid protein that was identified as the F protein by its high identity with other BRSV F proteins (Lerch et al., 1991; Walravens et al., 1990). Nucleotides 1955 to 2512 encode a 572 residue protein that was identified as the M2 protein by comparison with the amino acid sequences of the HRSV M2 proteins (Baybutt & Pringle, 1987; Collins & Wertz, 1985; Elango et al., 1985). The nucleotide sequence corresponding to the F m R N A is 1892 nucleotides in length from the initiation to termination consensus sequences, which were identified at positions 1 to 9 and 1882 to 1892 (Fig. 1), respectively, by comparison with other BRSV m R N A sequences (Samal & Zamora, 1991 ; Samal et al., 1991). The 5'-terminal nucleotide was determined by primer extension and PCR amplification of the G - F intergenic region (unpublished results). The 5' untranslated region is identical to the sequence at the 5' end of the F m R N A of the 391-2 strain of BRSV (Lerch et al., 1991). Alignment of the nucleotide sequences of BRSV strains A51908, 391-2 (Lerch et al., 1991) and RB94 (Walravens et al., 1990) showed 9 7 ~ identity in the coding regions. This degree of identity is similar to that

~__~ATG,

C.CGAC~C~CCATGA~ATGATCATCA~ATTATCCTCATCTCTACCTATGTGCC M A T T T M R M I I S I I L I S T Y V P

72 20F

ACATA~ACTTTAT~CAG~CAT~CAG~G~TTTTATC~TCGACAT~AGT~AGTTAGTAGA~TTA H I T L C Q N I T E E F Y Q S T C S A V S R G Y

144 44F

CCTTAGT~ATT~G~CTGGAT~TATAC~GTGT~T~C~TAGAGT~A~AAAATACAA~TGT L S A L R T G W Y T S V V T I E L S K I Q K N V

16 68

ATGT~C~TAC~ATTCAAAAGTGAAATT~TAAA~G~CTAGAAAGATAC~C~T~AGTA~A

288 92F

ATT~ACTTAT~AA~TG~CCGACCTCCTCTAGTAGA~AAAAAGA~ATACCAGAGTCGATAC~ L Q S L M Q N E P T S S S R A K R G I P E S I

360 llT

TTATAC~GAAACTCTACAAA~GTTTTAT~ACT~TGGGCA~GAGAAAAA~AGATTTTTAGGATT Y T R N S T K ~ F Y G L M G ~ K R K R R F L G F

432 140F

CTT~TA~TATT~ATCT~TATT~GT~TGTA~AGTGTCCA~GTACTACACTT~A~AGA~T L L G I G S A I A S G V A V S K V L H L E G E V

504 164

G~CAAAATTAAAAAT~ACT~TATCCACAAATAAA~AGT~TTAGTCTGTCC~T~AGTTAGTGTCCT N K I K N A L L S T N K A V V S L S N G V S V L

576 188F

TACTA~AAAGTACTTGATCTA~AG~CTATATAGACAAAGA~TTCTACCT~GTT~C~TCATGATTG T S K V L D L K N Y I D K E L L P K V N N H D C

648 212F

TA~ATATCC~CATA~CTGTGATAG~TTCC~CAAAAAAAC~TAGATTGTT~T~TAGGGA R I S N I A T V I E F Q Q K N N R L L E I A R E

720 236F

ATTTAGTGTAAAT~T~TATTACCACACCCCTCAGTACATACATGTT~CC~TAGTGAGTTACTATC~T

792 260F

C N G T D S K ~ K L I K Q E L E R Y N N A V A E

F S V N A G I T T P L S T Y M L T N S E L L S X ~TT~ATAT~CTAT~CG~TGACCAAA~T~TGTC~TATGTCA~ATAGTCACC-C~CAGA~ I N D M P I T N D Q K K L M S V C Q I V R Q Q S

864 284F

TTATTCCATCATGTCAGTGTT~GAGA~TCATA~TTATGTTGTAC~TT~CTCTTTAT~AGTTATAGA Y S I M S V L R E V I A Y V V Q L P L Y G V I D

936 308F

CACCCCCTGTT~AAACTACACACCTCTCCATTAT~ACCACTGAT~TAAAG~GTC~AACATCT~T ~ T P C W K L H T S P L C T T D N K E G S N I ~CTA~ACAGATCGTGGGT~TATTGTGAT~T~A~TCTGTGTCTTTTTTCCCAC~AGAGACGT T R T D R G W y C D N A G S V S F F p Q A E T

~

T~TAC~TCAAACAGAGTGTTCTGTGACAC~TG~CAGTTT~CTTT~CTACTGATGTT~CT~T ~ V Q S N R V ~ C D T M ~ S L T L P T D V ~

~

F

F

F

1008 332F 1080 356F 1152 3~0F

C~CACTGACATATTC~TTC~GTATGATTGTA/~T~TGACATCTAAAACTGACAT~GTA~TCTGT N T D I F N S K Y D C K I M T S K T D I S S S V

1224 404F

GAT~CTTC~TA~A~TATTGTATCAT~TAT~GACA~TGTACA~CTCT~TAA~TCGT~ ~ T S I G A I V S C y G K T K C T A S N K N R G

1296 428F

~TCATA~GACTTTTTCC~T~TGTGATTATGTATC~C~AGTTGATACTGTATCAGTT~T~ I I K T F S N G C D y V S N K G V D T V S V G N

1368 452F

CACACTATATTATGTAAATAAACTA~GGGG~ACTCTATAT~TG~CC~TTATT~TTACT~ T L Y Y V ~ K L E G K A L M I K G E P I I ~ M

1440 a$6?

T~TCCACTAGTATTTCCTTCTGATGAGTTTGAT~A~TT~TC~GTAAAC~AA~TAAACCAAAG N P L V F P S D E F D A S I A Q V N A K I N Q S

1512 500

F

CCT~T~CATACGTCGATCTGATGAGTTACTTCACAGTGTAGATGTAGGAAAATCCACCACAAATGTAGT 1584 524

F

L A F I R R S D E L L M S V D V G K S T T N V V ~TTACTACTATTATCATAGTGATAGTTGTAG~ATATT~TGTT~T~CTGTGGGATTACTGTTTTACTG I T T I I I V I V V V I L M L ~ T V G L L F Y C

1656 548F

T~GACCA~AGTACACCTATCATGTTA~ATCA~TTAGTAGTATC~C~TCTTTCCTTTAGT~ K T ~ S T P I M L G K D Q L S 5 1 N ~ L S F S K

1728 572~

ATGA~T~AT~TGTTTAC~TCTT~CCTCAG~TCATAAATGTGA~A~C~TTTACTGATACATTC

1800

AAAAGTTCCATCT~C~GACCT~ATCTTTATCA~TCT~AC~T~CCTTACATTCTATACTCA~T F g'=e ena ~---CTATGTT~T~STT~AT~GTATTATATT~TCTC~GATCACCTATTT~T~CC~TCATTCAAAAA

1872 1944

G A T-~-A-T~GgT.CnAeC G ~ G|~a~ ~TCCCT~TATGAGATTAC~ACATT~TTAAAT~TA~ M s R R N P C K Y E I R G H C L N G K K

2016 20

T~CATTTTAGTCAT~TTACTTTG~T~CTCCACAT~TTTATTAGTGA~/~TTTTAT~TAAAT C H F S S N y F E W P p H A L L V R Q N F M L N

2088 44M

~ G A TATTA-%AATCTAT~ACA~C~CGATACCCTGTCAGA'%AT~GT~T~T~AG~TTAGATAGA K I L K S M D R N N D T L S E ~ S G A A E L D R

2160 68

M2

ACAG~GAATAT~ATTGGGTGTGATA~AGTTTT~AAAGTTACTT~TCTATC~T~TAT~CA~ T E E y A L G V I G V L E S Y L S S I N N I T K

2232 92

M2

C~TCA~CTGTGTT~TATGAGTAAACTATTA~CGAGATT~C~TGATGACATC~GAGATT~GG~C Q S A C V A M S K L L A E I N N D D I K R L R N

2304 I15M2

~GT~C~CATCACCC~GAT~G~TATAT~CACAGTTATATCATATATTGATA~C~GAGA K E V p T S p K I R I y N T V I S Y I D S N K R

2376 140M

~C~C~d%AACAAACTATACATTT~TT~GAGA~CT~AGACGT~AAAAAGACCATC~G~CACA N T K ~ T I H L L K R L P A D V L K K T I K N T

2~4~ 184M2

ATAGATATTCAC~CGAAATAAAT~T~T~CC~TGACAT~TGTTGATG~CAAAATG~T~CTC I D I H N E I N G N N Q G D I NMVLDMENQKNMEN N C~CATTATTATTTTCCCAGAAAAATACCCCTGTA~ATATCCTCTTT~T~TT~ATGAAAATGATGT N I I I F P E K Y P C S I S S L L I K D E N D V TTTTGTACT~GTCATCAG~TGTTCTTGACT~TTACAGTTTC~TATCCATAT~TATGTATTCTCAAAA ~ V L S H Q N V L D C L Q F Q Y P Y N M Y S Q N TCATAT~TTGATGATATCTATT~ACATCACA~A~TGATTGA~ATGTACTT~GATTCTTCA~TTTC H M L D D I Y W T S Q E L I E D V L K I L H L S T~ATA~CATAAAT~GTATGTGATATATGTTTTAGT~TATAGTATAT~GTCACTC~CTATTGATCA G I S I N K Y V I y V L V L

S

2520 1869

M2 2

2

M2 ORF2

2592 33ORF2 2664 570RF2 2736 81

ORF2

2808 95ORF2

ACA~CACTTCTTCATA~TA~TATAT~ATACACTCATTCATGAG~CTC~CT~T M D T L I H E N S T N

2880 IIL

GTTTACTT~CAGAT~F~A~ V Y L T D S Y L K

2907 20L

Fig. l. Nucleotide sequence of dicistronic F-M2 mRNA, and deduced amino acid sequence of F and M2 proteins of BRSV (A51908). A I ~ indicated are the amino acid sequences encoded by the second ORF (ORF2) of M2 mRNA and the N-termin~ of the L protein. Initiation and termination ~nsensus sequences are hig~ighted and identified above the nucl~tide sequence. The F2/FI cleavage site is underlined.

739

Short communication

observed in the F genes of HRSV subgroup A (Lopez et al., 1988). It is interesting to note that within bovine strains the identity (97 700)in the coding region of the F m R N A sequence is similar to that in the 3' end untranslated region (84~). The BRS¥ strains used in this comparison have different origins; A51908 and 391-2 are American strains isolated about 10 years apart, and RB94 is an isolate from Europe. The same observation has been reported for the F m R N A of HRSV subgroup A. However, there was a significant decrease in identity in the coding and untranslated regions when F mRNAs from different antigenic groups were compared, e.g. BRSV and HRSV (subgroup A or B), and HRSV subgroup A and subgroup B. Interestingly, when the 3" non-coding regions of the P mRNAs of A51908 and FS- 1 strains of BRSV were aligned, they were found to be 93 identical (unpublished results). It was thought previously that the untranslated regions of RSV mRNAs were very divergent due to a lack of selective pressure. This conclusion was drawn from comparisons made between strains from different antigenic subgroups (BRSV and HRSV, HRSV subgroup A and B). Although the amount of data is limited, the high degree of identity observed in the 3' non-coding regions of the F and P genes of BRSV suggests that separation into different antigenic subgroups occurred very early in their evolution, and since then BRSV strains have undergone very little variation. It will be interesting to see whether this observation also applies to HRSV strains. The nucleotide sequence encoding the F protein of BRSV (A51908) contains a single ORF of 1716 nucleotides (nucleotides 14 to 1729) (Fig. 1). The predicted protein is 572 residues long, compared to 574 residues for the F proteins of BRSV strains RB94 (Walravens et al., 1990) and 391-2 (Lerch et al., 1991). Alignment of the amino acid sequences of the F proteins from BRSV strains A51908, RB94 and 391-2 showed an overall identity of 94.3~. The F1 region showed 96~o identity among all three proteins, whereas the F2 fragment was 91 70 conserved. These values are very similar to those observed for the F proteins within the same antigenic subgroup of HRSV (Lopez et al., 1988). The general features of the F protein of the A51908 strain of BRSV are identical to those of the F proteins of other BRSV strains (Lerch et al., 1991 ; Walravens et al., 1990). The only difference is that four (positions 27, 120, 498 and 567; Fig. 1) of the five possible N-glycosylation sites are well conserved in all three BRSV F proteins. The site located at position 70 is present only in strains A51908 and RB-94, not in strain 391-2. The C-terminal region of the F2 subunit (residues 100 to 129) has been described as a highly variable domain in HRSV strains, as well as between HRSV and BRSV (Walravens et al., 1990). However, this region is highly conserved among

BRSV HRSV A HRSV B

MSRRMPCKYEIRGHCLNGKKCHFSHNYFEWPPHALLVRQNFMLNKILKSMDRNND MSRRNPCKFEIRGHCLNGKRCHFSHNYFEWPPHALLVRQNFMLNRILKSMDKSID MSRRNPCKFEIRGHCLNGRRCHYSHMYFEWPPHALLVRQMFMLNKILKSMDKSID *****************************************************

BRSV HRSV A HRSV B

TLSEISGAAELDRTEEYALGVIGVLESYLSSINNITKQSACVAMSKLLAEINNDD TLSEISGAAELDRTEEYALGVVGVLESYIGSINNITKQSACVAMSKLLTELNSDD TLSEISGAAELDRTEEYALGIVGVLESYIGSINNITKQSACVAMSKLLIEINSDD ************************************************ ****

ii0 ii0 110

BRSV HRSV A HRSV B

IKRLRNKEVPTSPKIRIYNTVISYID~NKRNTKQTIHLLKRLPADVLKKTIKMTI IKKLRDNEELNSPK[RVYNTVI~YIESNRKNNKQTIHLLKRLPADVLKKTIKNTL IKKLRDNEEPNSPKIRVYNTVISYIESNRKNNKQTIHLLKRLPADVLKKTIKNTL ****..* *********************************************

165 165 165

BRSV HRSV A HRSV B

D I H N E I N G N N Q G D I N V D E Q N E ......... DIHKSITINNPKESTVSDTNDHAKNNDTTDIHKSITISNSKESTVNDQNDQTKNNDITG

186 194 195

*

55 55 55

Fig. 2. Alignment of amino acid sequencesof the M2 proteins from BRSVA51908,HRSV A2 (subgroupA) and HRSV 18537(subgroup B). Perfectly conserved residues are indicated by an asterisk, well conserved positions are indicated by a dot, gaps introduced are indicated by a hyphen.

strains of BRSV. This observation is in contrast to the suggestion that this area might form a strain-specific epitope for HRSV strains (Johnson & Collins, 1988). The nucleotide sequence corresponding to the BRSV M2 m R N A is 958 nucleotides long from the initiation to the termination consensus sequences. The coding region of the M2 mRNA extends between nucleotides 1955 and 2512 (Fig. 1). By comparison with the M2 m R N A sequence of HRSV (Collins & Wertz, 1985), the initiation and termination sequences were identified as nucleotides 1946 to 1954 and 2894 to 2905, respectively, and were found to be identical to those of HRSV M2 m R N A (Collins & Wertz, 1985). As in HRSV, there is no 5' untranslated region in the M2 mRNA of BRSV: The 3' untranslated region is 395 nucleotides in length, compared to 365 nucleotides in HRSV. The identity between BRSV and HRSV in the 3' non-coding region is 52~, significantly lower than the 78 ~o identity calculated for the coding region (see below). Nucleotides 1955 to 2512 (Fig. 1) of the dicistronic mRNA encode a protein that was identified as the M2 protein by comparison with the amino acid sequence of the M2 protein of HRSV (Collins et al., 1990; Collins & Wertz, 1985). The M2 protein of BRSV is 186 amino acids long, compared to 194 and 195 amino acids for the M2 proteins of HRSV subgroups A and B, respectively. The M2 proteins of BRSV and HRSV share 80 ~o identity in amino acid sequence. The C-terminal region (residue 166 to the last residue) of the M2 protein of BRSV shows extensive divergence (33 70 identity) from that of HRSV subgroups A and B, not only in the amino acid sequence, but also in length (Fig. 2). However, the general features of the M2 proteins are conserved in both BRSV and HRSV. It has been shown recently that the M2 gene encodes the major target antigen for RSV-specific murine cytotoxic T lymphocytes (Nicholas et al., 1990; Openshaw et al., 1990). Based on the close similarity observed in the M2 proteins, it is reasonable to expect

740

BRSV HRSV HRSV

BRSV HRSV HRSV

Short communication

A B

MLM~SNI I IFPEKYPCS ISSLL IKDENDVFVLSHQNVLDCLQFQYPY - - -N MT - - -M- - PK IMILPDKYPCS ITSILITSRCRVTM~fNQKNT - - -LYFNQNNpNNH MI - -KMTKPKIMILPDKYPCS ISS ILISSESMIATFNHKNI - - -LQFNHNHLDNH

A B

MYSQNHMLDDIYWTSQELIEDVLK IL-HLSGIS INKYV IYVLVL M Y S P N Q T F N E I H W T S Q E L I D T I Q N F L Q H L -G I I E D I Y T IY I L V S QCLLNH IFDE IHWTPKNLLDATQQFLQHL-NI PEDIYTV'f I L V S

••••'••••••'•••••••••••••••••••••••'••••••••••••••'••''••'•••'•••••'`•••''••••••••'•••••••••

52 47 50

(a)

95 90 93

Fig. 3. Alignmentof the amino acid sequences encoded by the second ORF (ORF2) of the M2 mRNA from BRSV A51908, HRSV A2 (subgroup A) and HRSV 18537 (subgroup B). Perfectly conserved residues are indicated by an asterisk, well conserved positions are indicated by a dot, gaps introduced are indicated by a hyphen.

that the M2 proteins of BRSV and HRSV probably have similar roles. Overlapping with the M2 coding region, a second ORF has been described in HRSV which encodes a polypeptide of 90 to 93 amino acids depending on the subgroup (Collins et al., 1990; Collins & Wertz, 1985). The M2 m R N A of BRSV also contains a second internal ORF, which overlaps with the amino acid sequence of the M2 protein by six residues and encodes a polypeptide of 95 amino acids. The polypeptide predicted to be encoded by the second O R F of BRSV shares 43 ~ overall identity with that of HRSV subgroup A, and low (35 ~ ) identity with that of HRSV subgroup B (Fig. 3), assuming the first methionine codon is the translation initiation site. The overall identity between the second ORFs of subgroups A and B of HRSV is 62~. The hydrophilicity profiles of the polypeptides encoded by the second ORFs of BRSV and HRSV suggest that the polypeptides are significantly different (Fig. 4). The intergenic sequence between the F and M2 genes extends from nucleotides 1893 to 1945 (Fig. 1), which is six nucleotides longer than the F - M 2 intergenic sequence in HRSV. Alignment of the two sequences shows a very low degree of similarity ( < 2 5 ~ ) , which is in agreement with the homology observed between other BRSV and HRSV intergenic sequences (Samal & Zamora, 1991). The 5' end of the L gene of HRSV has been reported to overlap with the 3' end of the M2 gene (Collins et al., 1987). Overlapping has been shown to be a mechanism to regulate transcription of the L gene (Collins et al., 1987). As shown in Fig. 1, the proposed initiation sequence of the L gene of BRSV was found 67 nucleotides upstream of the termination sequence of the M2 gene. The overlap between the M2 and L genes of HRSV is 68 nucleotides (Collins et al., 1987), which suggests that M 2 - L overlap is a general mechanism for transeription regulation of the L gene in RSV. At the nucleotide level the identity in the overlapping region of BRSV and HRSV is 84~. The overlapping region encodes the first 20 amino acids of the L protein of BRSV (Fig. 1), which are 75~o identical to that of HRSV. The proposed initiation sequence, 5' G G A C A A A U 3' ( m R N A sense), of the L gene of

.o ~z

(c)

30 40 50 60 70 80 90 Amino acid position Fig. 4. Hydrophilicityprofilesof the polypeptideencoded by ORF2 of M2 mRNAs of BRSV strain A51908 (a), and HRSV subgroup A (b) and subgroupB (c).The distribution of hydrophobic(abovethe dashed line) and hydrophilic(belowthe dashed line) domains along the amino acid sequence was determined using the method and parameters of Rose & Roy (1980) with a window size of seven residues. 1

10

20

BRSV has two nucleotide differences compared to the consensus initiation sequence G G G G C A A A T (Collins et al., 1986). Both differences are localized within the four consecutive G residues: (i) the Y-terminal G residue is missing in the initiation signal of the L gene of BRSV; (ii) the last G residue in the consensus sequences has been replaced by an A residue in the BRSV L gene initiation sequence, as described for HRSV (Collins et al., 1987). Since the 5' end of all the BRSV and HRSV m R N A s for which the Y-terminal nucleotide has been determined experimentally begin with a G residue, it is

Short communication

reasonable to assume that the Y-terminal nucleotide of the L gene of BRSV is the first G residue of the proposed initiation sequence. These are the only two RSV genes reported that lack the canonical four G residues at the 5' end of the mRNA. In conclusion, the work presented here describes the nucleotide sequence of the F and M2 mRNA, and the F-M2 intergenic sequence of the A51908 strain of BRSV, plus the deduced amino acid sequences of the encoded proteins. In both cases, a high degree of similarity was observed when compared with the corresponding sequence of HRSV. The F protein of the A51908 strain of BRSV also showed a high degree of similarity to the F proteins of other BRSV strains. As described for HRSV, the 5' end of the L gene of BRSV was found to overlap with the 3' end of the M2 gene, suggesting that a similar transcription attenuation mechanism is used in BRSV. We thank Dr Sashi B. Mohanty for his support and enthusiasm, Dr Peter L. Collins for providing a cDNA clone of the F gene of HRSV and Mr Daniel Rockemann for excellent technical assistance. This study was supported by grants from Maryland Agricultural Experiment Station and USDA grant no. 89-34116-4855. Approved as scientific article no. A6210, and contribution no. 8379 of the Maryland Agricultural Experiment Station.

References BAYEUTT, H. N, & PRINGLE, C. R. (1987). Molecular cloning and sequencing of the F and 22K membrane protein genes of the RSS-2 strain of respiratory syncytial virus. Journal of General Virology 68, 2789-2796. BOHLENDER, R. E., McCUNE, M. W. & EREY, M. L. 0982). Bovine respiratory syncytial virus infection. Modern Veterinary Practice 63, 613-618. COLLINS, P. L. & WERTZ, G. W. (1985). The envelope-associated 22K protein of human respiratory syncytial virus: nucleotide sequence of the mRNA and a related polytranscript. Journal of Virology .54, 65-71. COLLINS, J. K., TEEGARDEN,R. M., MACVEAN,D. W., SMITH,G. H., FRANK, G. R. & SALMAN,S. (1988). Prevalence and specificity of antibodies to bovine respiratory syncytial virus in sera from feedlot and range cattle. American Journal of Veterinary Research 49, 13161319. COLLINS,P. L., DICKENS,L. E., BUCKLER-WHITE,A., OLMSTED,R. A., SPRIGGS, M. K., CAMARGO, E. & COELINGH, K. V. W. (1986). Nucleotide sequence for the junctions of human respiratory syncytial virus reveal distinctive features of intergenic structure and gene order. Proceedings of the National Academy of Sciences, U.S.A. 83, 4594-4598. COLLINS, P. L., OLMSTED,R. A., SPRIGGS, M. K., JOHNSON,P. R. & BUCKLER-WHITE, A. J. (1987). Gene overlap and site-specific attenuation of transcription of the viral polymerase L gene of human respiratory syncial virus. Proceedings of the National Academy of Sciences, U.S.A. 84, 5134-5138. COLLINS, P. L., HILL, M. G. & JOHNSON,P. R. (1990). The two open reading frames of the 22K mRNA of human respiratory syncytial virus: sequence comparison of antigenic subgroups A and B and expression in vitro. Journal of General Virology 71, 3015-3020.

741

ELANGO, N., SATAKE,M. & VENKATESAN,S. (1985). mRNA sequence of three respiratory syncytial virus genes encoding two nonstructural proteins and a 22K structural protein. Journal of Virology 55, 101110. GALINSKI, M. S. (1991). Paramyxoviridae: transcription and replication. In Advances in VirusResearch vol. 39, pp. 129-162. Edited by K. Maramorosch & F. A. Murphy. San Diego: Academic Press. JOHNSON,P. R. & COLLINS,P. L. (1988). The fusion glycoproteins of human respiratory syncytial virus of subgroups A and B: sequence conservation provides a structural basis for antigenic relatedness. Journal of General Virology 69, 2623-2628. LEACH,R. A., STOTr, E. J. & WERTZ,G. W. (1989). Characterization of bovine respiratory syncytial virus proteins and mRNAs and generation of cDNA clones to the viral mRNAs. Journal of Virology 63, 833 840. LERCH, R. A., ANDERSON,K. & WERTZ, G. W. (1990). Nucleotide sequence analysis and expression from recombinant vectors demonstrate that the attachment protein G of bovine respiratory syncytial virus is distinct from that of human respiratory syncytial virus. Journal of Virology 64, 5559-5569. LERCH, R. A., ANDERSON,K., AMANN,V. L. & WERTZ, G. W. (1991). Nucleotide sequence analysis of the bovine respiratory syncytial virus fusion protein mRNA and expression from a recombinant vaccinia virus. Virology 181, 118-131. LOPEZ, J. A., VILLANUEVA,N., MELERO,J. A. & PORTELA,A. (1988). Nucleotide sequence of the fusion and phosphoprotein genes of human respiratory syncytial (RS) virus Long strain: evidence of subtype genetic heterogeneity. Virus Research 10, 249-262. MALLIPEDDI,S. K., SAMAL,S. K. & MOI-LkNTY,S. B. (1990). Analysis of polypeptides synthesized in bovine respiratory syncytial virus infected cells. Archives of Virology 115, 23-36. NICHOLAS, J. A., RUBINO, K. L., LEVELY, M. E., ADAMS, E. G. & COLLINS, P. L. (1990). Cytolytic T-lymphocyte responses to respiratory syncytial virus: effector cell phenotype and target proteins. Journal of Virology 64, 4232-4241. OPENSHAW,P. J. M., ANDERSON,K., WERTZ, G. W. & ASKONAS,B. A. (1990). The 22,000-kilodalton protein of respiratory syncytial virus is a major target for Kd-restricted cytotoxic T lymphocytes from mice primed by infection. Journal of Virology 64, 1683-1689. ORVELL, C., NORRBY, E. & MUFSON, M. A. (1987). Preparation and characterization of monoclonal antibodies directed against five structural components of human respiratory syncytial virus subgroup B. Journal of General Virology 68, 3125-3135. PACCAUD,i . F. & JACQUIER,C. (1970). A respiratory syncytial virus of bovine origin. Archlyfur die gesamte Virusforschung 30, 327-342. ROSE, G. D. & RoY, S. (1980). Hydrophobic basis of packing in globular proteins. Proceedings of the National Academy of Sciences, U.S.A. 77, 4643~,647. SAMAL,S. K. & ZAMORA,M. (I991). Nucleotide sequence analysis of a matrix and small hydrophobic protein dicistronic mRNA of bovine respiratory syncytial virus demonstrates extensive sequence divergence of the small hydrophobic protein from that of human respiratory syncytial virus. Journalof General Virology72, 1715-1720. SAMAL, S. K., ZAMORA, M., McPHILLIPS, T. H. & MOHANTY, S. B. (1991). Molecular cloning and sequence analysis of bovine respiratory syncytial virus mRNA encoding the major nucleocapsid protein. Virology 180, 453-456. STOT!', E. J. & TAYLOR,G. (1985). Respiratory syncytial virus: a brief review. Archives of Virology 84, 1-52. TAYLOR, G., STOTT, E. J., BEW, M., FERNIE, B. F., COTE, P. J., COLLINS, A. P., HUGHES, M. & JEBEETT, J. (1984). Monoclonal antibodies protect against respiratory syncytial virus infection in mice. Immunology 52, 137-142. WALRAVENS,K., KETTMANN,R., COLLARD,A., COPPE,P. & BURNY,A. (1990). Sequence comparison between the fusion protein of human and bovine respiratory syncytial viruses. Journalof General Virology 71, 3009 3014.

(Received 27 August 1991; Accepted 19 November 1991)

Sequence analysis of M2 mRNA of bovine respiratory syncytial virus obtained from an F-M2 dicistronic mRNA suggests structural homology with that of human respiratory syncytial virus.

The nucleotide sequences of the F and M2 mRNAs of strain A51908 of bovine respiratory syncytial virus (BRSV) were determined by sequencing cDNA of an ...
545KB Sizes 0 Downloads 0 Views