Gene, 122 (1992) 345-348 0 1992 Elsevier Science Publishers

GENE

B.V. All rights reserved.

345

0378-1119/92/$05.00

06807

Short

Communications

Sequence

and in vitro translational analysis of a 1629-nucleotide Autographa californica nuclear polyhedrosis virus strain E2 (AcMNPV-E2;

recombinant

DNA;

insect vector; proline-rich

protein;

anomalous

ORF, in

migration)

Daphne Q.-D. Pham and Natarajan Sivasubramanian Departmetzt of Entomology, Received

University of California, Riverside, CA 92521, USA

by J.A. Engler: 9 April 1992; Revised/Accepted:

25 June/8

July 1992; Received

at publishers:

20 August

1992

SUMMARY

The complete nucleotide (nt) sequence of an open reading frame (ORF) (map unit 5.1 to 3.8) from Autographa californica nuclear polyhedrosis virus strain E2 (AcMNPV-E2) has been determined. This 1629-nt ORF has a coding potential for a 61-kDa Pro-rich protein. However, in vitro translation of the 1629-nt ORF and sodium dodecyl sulfate-polyacrylamidegel electrophoresis (SDS-PAGE) revealed a 7%kDa protein product. The discrepancy between the M, predicted by the nt sequence and that obtained from the in vitro translational analysis is due to the high Pro content of this protein. The high Pro content causes anomalous migration of this protein during SDS-PAGE.

INTRODUCTION

Autographa californica multinucleocapsid nuclear polyhedrosis virus (AcMNPV) particles are rod-shaped, enveloped virions with a 12%kb, circular, double-stranded DNA genome. A unique feature of the AcMNPV is that two forms of viral progeny are produced during viral infection. These two forms of AcMNPV have separate roles in the persistence of the virus in nature. In infected lepidopteran Sf21 tissue culture cells, both morphological forms are produced. However, whether all the tissues of the infected host

Correspondence to: Dr. D.Q.-D. versity of California, Riverside,

Pham, Department of Entomology, UniCA 92521, USA. Tel. (714) 787-3629;

Fax (714) 787-3086. Abbreviations: A., Autographa, aa, amino acids; AcMNPV, A. calijbmica NPV; AcMNPV-C6 or -E2, AcMNPV strain C6 or E2, respectively; BV, budded

virus;

nuclear

polyhedrosis

GCG,

Genetic

Computer

virus; nt, nucleotide(s);

PAGE, polyacrylamide virus; PRP, proline-rich scription start point(s).

Group,

Madison,

WI; NPV,

ORF, open reading

frame;

gel electrophoresis; PDV, polyhedra-derived protein; SDS, sodium dodecyl sulfate; tsp, tran-

larva can support the development of both viral forms is not known. For most NPVs, the occluded virus is rarely observed in the midgut but is seen extensively in other tissues of the infected host larva. These results suggest that the viral genes necessary for these two forms may be expressed differently in various tissues of the infected host larvae. To test this hypothesis, we studied the expression of several key genes of AcMNPV in an insect tissue culture system as well as in different tissues of host larvae. During our study, we found that a probe specific to a 1629-nt ORF detected RNA transcripts in uninfected tissues of Trichoplusia ni lepidopteran larvae even under highstringency conditions (Pham and Sivasubramanian, 1992). This 1629-nt ORF (map unit 5.1-3.8 on the AcMNPV-E2 genome) is located in opposite direction to and downstream from the pn gene in AcMNPV-E2. Expression of host transcripts homologous to a 1629-nt ORF appears to be tissuespecific, and the homology observed appears to be specific for the 1629-m ORF. The ‘1629-nt ORF’-specific probe hybridized to transcripts of 0.7 and 2.5 kb in uninfected midgut tissues as well as transcripts of 2.5, 8.8 and 11.0 kb in uninfected hemocytes. Our preliminary data suggest that

346 at least one of these host transcripts, the 0.7-kb transcript, is a poly(A)+mRNA in the uninfected midgut tissues (un-

of these sequences is used. The virus may also use a motif different from the two consensus sequences cited above to

published data). These results prompted us to study the complete nt sequence as well as the translatability of the 1629-nt ORF in an attempt to understand its role in viral development.

initiate transcription of this gene. The nt sequence (GGCATGA) surrounding the Met codon of this ORF conforms to the Kozak rules for a favorable eukaryotic translational initiation sequence (PuNNATGPu) (Kozak, 1983). The stop codon at position 1629 is followed by two polyadenylation recognition and addition sites. Four potential N-glycosylation sites are seen at aa 122,240,421 and 442. Although viral genomes from strains E2 and C6 of AcMNPV gave different restriction enzyme digestion patterns (Possee et al., 1991), the 1629-nt ORFs in AcMNPVE2 and -C6 show no sequence differences. The conservation of this ORF between the two strains: (1) suggests that this ORF probably plays an important role in the persistence and survival of the virus and (2) is consistent with the observation that a stop codon insertion in this ORF resulted in the production of unstable viruses (Possee et al., 1991).

EXPERIMENTAL

AND DISCUSSION

(a) Nucleotide sequence of the 1629-nt ORF The cloning, subcloning and sequencing strategies of the 1629-nt ORF are shown in Fig. 1; the entire nt sequence of the 1629-nt ORF is shown in Fig. 2. Consensus eukaryotic TATA and CAT boxes are found near the tsp and are located at positions -74 and - 136, respectively. Interestingly, a CAGT sequence often seen at the tsp of most AcMNPV early genes is found at position -5 1 of the 1629nt ORF, whereas an ATAAG sequence often seen at the tsp of most AcMNPV late genes is observed very far from the Met start codon at position -497. Yet, the 1629-nt ORF is not transcribed until late during the virus life cycle (Ooi and Miller, 1990; Pham and Sivasubramanian, 1992). Since the tsp has not been identified, it is not known which

(b) Computer analysis Extensive computer analyses of the 1629-nt ORF were carried out. TESTCODE analysis (GCG) indicated that this ORF has a very strong coding potential starting from

Fig. 1. Cloning and sequencing strategies. (A) Cloning and sucloning strategies. Clones AcuB and AcuB were made by inserting the BarnHI-F fragment (2 kb, nt 4254-6193, shown in figure as ‘F’) and the BarnHI-C fragment (8 kb, nt 6193 to approx. 15000, shown in figure as ‘C) into pUs13, respectively. Clone AcgHSl was created by inserting 1-kb HindIII-Sal1 fragment (nt 5264-6264 of the AcMNPV genome) into pGem1. Clone AcuB was further subcloned to give clone AcuHAO.5, which had the Asp718-Hind111 fragment (3’ end of 1629~nt ORF, nt 4714-5264) inserted into pUcl3. Clone AcuB was further subcloned to give clone AcuHpO.5, which had the HindIII-PvuII fragment (5’ end of 1629~nt ORF, nt 6306-6841) inserting in pUcl3. Clone AcgHSl

was further

RsaI (nt 5264-5719

subcloned

of the AcMNPV

to give the SspI (nt 5264 to 6063 of the AcMNPV genome) deletion clones. (II) Sequencing

strategies.

genome),

CZaI (nt 5264-5872

The original sequencing

of the AcMNPV

was done on AcgHSl,

genome)

and

AcuB2, AcuB

clones and their subclones. The entire nucleotide sequence was double checked by resequencing the opposite strand of the three parental clones using custom-designed primers. All sequencing was done using dideoxy method (Sanger et al., 1977) with Sequenase Version 2.0 (US Biochemical, Cleveland, OH). Sequencing was performed according to manufacturer’s recommendation using [ s(-35S]dATP (1000 Ci/mmol, Amersham) as the radioactive label. All templates were double-stranded DNA. Primers used in sequencing of clones inserted in pGem1 were the SP6 or T7 promoter sequencing primers (Promega, Madison, WI). The primers used in the sequencing of clones inserted in pUcl3 were the universal pUc or reverse pUc primers (US Biochemical). The numbers

next to the restriction

sites denote their nt. The nt positions

are based on Possee et al. (1991).

TCTGGTGC~CTCCTTTA~~~~~~~e~~~~~~e~~e~~~~~~~~~cc~~~~e~~e*~~~~~~~e~~~~ec

Fig. 2. The nt sequence tial glycosylation

of the 1629-m ORF and the deduced

sites, underlined:

2301

aa sequence.

and the aa Pro, zigzag-underlined.

Eukaryotic

Motives

consensus

CAT and TATA boxes are doubly underlined;

for early and late AcMNPV

genes are italicized

poten-

and zigzag-underlined.

Lower-case letters represent noncoding regions; capital letters represent coding regions. Plain numbers denote nt numbers; italicized numbers denote aa numbers. OCH stands for ochre stop codon. Each base pair was sequenced on the average five times. All of the plus strand and 90% (the complete ORF) of the minus strand

were sequenced.

The sequence

was deposited

in EMBL database

the ‘ATG’ start codon (nt 6486 or 5.1 map unit of the AcMNPV genome) and stopping around the stop codon (nt 4857 or 3.8 map unit of the AcMNPV genome). Indirect prediction of aa sequence (StriderTM) showed that the 1629-nt ORF has the potential to code for a 61-kDa protein. This putative protein is very Pro-rich (15% overall), especially between aa 143 and 320 (Fig. 2). TFASTA search (GCG) of the 1629-nt ORF against GenBank and EMBL databases reveals that many Pro-rich proteins are either transcriptional factors or cell ‘envelope’ proteins (i.e., cellular proteins that are part of an enclosing structure; Eckert and Green, 1986; Mehrel et al., 1990; Hong et al., 1990; Selkirk et al., 1991). Since the 1629-nt ORF is expressed late in infection (Ooi and Miller, 1990; Pham and Sivasubramanian, 1992), it is unlikely that this ORF would code for a regulatory transcriptional activator. AcMNPV polymerase and transcriptional factor genes are known to express very early in the virus life cycle (Tolmaski et al., 1988; Carson et al., 1988; 1991a,b; Guarino and Summers, 1986; 1987). Furthermore, the putative protein coded by the 1629-nt ORF has the Pro-rich domain situated in the central region of the protein. Conversely, DNA-binding proteins often have a Pro-rich domain located within either or both of the termini (Santoro et al., 1988; Mermod et al., 1989; Mitchell and Tjian, 1989). Thus, this putative protein coded by the 1629~nt ORF is probably not a DNA-binding protein. It nonetheless may still have a regulatory role in RNA-protein or protein-protein interactions during viral development. This putative protein may also be a structural

under accession

No. Z 11662.

component in the ‘envelope’ of the polyhedron or the envelope of the PDV or BV. Although Kyte-Doolittle hydrophobicity analysis of this protein indicate that its N and C termini are hydrophilic, the two Pro-rich hydrophobic domains in the center of the protein potentially can span the membrane twice. In doing so, it would leave the N and C termini on one side of the lipid bilayer and therefore could fuction as a membrane protein in the envelope of PDV or BV, both of which are true bilipid membranes. Alternatively, if it is a component of the polyhedral ‘envelope’, it need not be a membrane glycoprotein because this ‘envelope’ is not a bilipid membrane, but rather an enclosing structure consisting primarily of carbohydrate and protein. Whether this protein is a structural or nonstructural protein remains to be determined. (c) In vitro translation of the 1629-nt ORF In vitro translation of the 1629-nt ORF gene revealed a protein product that migrated at 78 kDa during SDSPAGE (Fig. 3). However, computer-assisted translation of the 1629-nt ORF predicted only 61 kDa. This discrepancy is due probably to the anomalous mobility of the protein during SDS-PAGE rather than a true difference between the two data. This 1629-nt ORF protein is very Pro-rich [especially between aa residues 143 and 322) and PRPs are known to have abnormally low mobility during SDSPAGE (Robinson et al., 1989; Ziemer et al., 1982). Moreover, other ATGs upstream from the coding region do not give ORF more than 100 nt long.

348

kDa 97-

Carson,

kDa

D.D., Summers,

a baculovirus

-78

Guarino,

L.A.

and

Guarino,

L.A. and Summers,

expression 2099.

the A4, standards

of the human

mapping

M.D.: Nucleoride

in-

of a trans-

of a baculovirus

delayed-early

sequence

and temporal

gene. J. Virol. 61 (1987) 2091-

R.T. and Key, J.L.: Characterization

cell wall protein gene family of soybean.

analysis.

and the number

The numbers

on the left represent

on the right of the gel denotes

size of the in vitro translated

protein.

the es-

The symbols ( + ) and ( - )

stand for the in vitro translation of plus-strand and minus-strand RNAs, respectively. Both plus- and minus-strand RNAs were made from PvuIIdigested AcPBA2.1 clone using the Promega Riboprobe Gemini11 Core System. T7 RNA polymerase was used in the making of the plus-strand RNA; SP6 RNA polymerase

was used in the making of the minus-strand

RNA. The in vitro translation reticulocyte

lysate

was performed (Promega),

and [a5S]methionine

with the nuclease-treated

minus-methionine

ucts was performed resolving

with a 0.1%

gel (Laemmli,

aa

mixture

(1000 Ci/mmol, Trans 35S-labelTM meta-

bolic labeling from ICN). The gel electrophoresis SDS/lo:;

of the translation

PAGE

separation

prodand 4%

1970) at 6 mA for 16 h. After electro-

phoresis, the gel was stained for 30 min in 0.125% Coomassie blue/lo?; acetic acid/50:/; methanol solution. destained for 1 h in 10”; acetic acid/ 50”; methanol

Funtional

regulatory

of

of a proline-rich

J. Biol. Chem. 26$(5) (1990)

2470-2475.

Fig. 3. In vitro translational

PAGE

M.D.:

of a baculovirus

Hong, J.C., Nagao, Pvu II

29-

(Promega)

Summers,

analysis

182 (1991b) 279-286.

activating gene required for expression gene. J. Virol. 57 (1986) 563-571.

43 -

rabbit

L.A.: Molecular

gene. Virology

Eckert. R.L. and Green, H.: Structure and evolution volucrin gene. Cell 46 (1986) 583-589.

+

68-

timated

M.D. and Guarino,

regulatory

solution

and treated

with a 16”; Na.salicylate

solution for 20 min. Autoradiography room temperature for 16 11using Kodak BA2.1 clone was constructed

was performed subsequently at X-OMATTM AR film. The AcP-

by inserting

2.1-kb PvuII-Asp718

(nt -358 to 1771 of the 1629~nt ORF) of the EcoRI-I NPV genome into %a1

enhancer

+ Asp718-digested

fragment

fragment

pGemm’-4(blue)

in AcM-

(Promega).

Kozak,

M.: Comparison

eucaryotes, Laemmli,

UK:

Most

transcriptional

commonly

activator

synthesis

used discontinuous

of CTF/NF-I

in procaryotes.

Rev. 47 (1983) l-45. buffer system

U., Bisher,

Cheng,

C., Lichti,

Yuspa,

S.H. and Roop, D.R.: Identification

cell envelope protein, Mitchell,

loricrin.

P.J. and Tjian,

M.E.,

Steven,

R.: The proline-rich

A.C.,

Bundman,

D..

Steinert,

P.M.,

of a major keratinocyte

Cell 61 (1990) 1103-l

R.: Transcriptional

cells by sequence-specific

for

is distinct from the replication

and DNA binding domain. Cell 58 (1989) 741-753. Mehrel, T., Hohl, D., Rothnagel, J.A., Longley, M.A.,

112.

regulation

DNA binding proteins.

in mammalian

Science 245 (1989)

371-378. Ooi, B.G. and Miller, L.K. Transcription gene reduces

of the baculovirus

the levels of and antisense

transcript

polyhedrin

initiated

down-

stream. J. Viral. 64 (1990) 3 126-3 129. Pham, D.Q.-D. and Sivasubramanian. N.: In vivo transcriptional analysis of three baculovirus genes: evidence of homology between viral and host transcripts.

Virology (1992) In press.

Possee, R.D., Sun. T.-P., Howard, and Gearing,

K.L.: Nucleotide

polyhedrosis

gion. Virology Robinson,

R., Kauffman,

of isoproterenol-treated Sanger,

S.C.. Ayeres. M.D., Hill-Perkins,

M.

sequence of the Autographa calfimica

9.4 kbp EcoRI-I

and -R (polyhedrin

gene) re-

185 (1991) 229-241. D.L.. Waye, M.M.Y.,

and Keller, P.J.: Properties

The authors thank Dr. B.A. Federici and Mr. CA. Post for their comments on this manuscript. We also want thank Mr. R.H. Hice for his invaluable inputs throughout this work. This work was supported by a grant from the Sigma Xi Scientific Foundation and U.C. Biotechnology Research and Education Program.

of protein

Microbial.

SDS electrophoresis. Nature 227 (1970) 680. Mermod, N. O’Neil, E.A.. Kelly, T.J. and Tjian,

nuclear ACKNOWLEDGEMENTS

of initiation

and organelles.

F., Nicklen,

chain-terminating

rats. Biochem.

S. and Coulson, inhibitors.

Blum M., Bennick,

ofproline-rich proteins

A.

from parotidglands

J. 263 (1989) 497-503.

A.R.: Nucleotide

Proc. Natl. Acad.

sequence

with

Sci. USA 74 (1977)

5463-5467. Santoro, C., Mermod, N., Andrews, P.C. and Tjian, R.: A family of human CCAAT-box-binding proteins active in transcription and DNA replication:

cloning

and expression

of multiple

cDNAs.

Nature

334

(1988) 218-224. Selkirk. M.E., Yazdanbakhsh. M., Freeman, D.. Blaxter, M.L., Cookson, E., Jenkins, R.E. and Wiliams, S.A.: A proline-rich structural protein REFERENCES

of the surface

sheath

of larval Bmgia filarial nematode

parasites.

J.

Biol. Chem. 266(17) (1991) 11002-11008. Carson,

D.D., Guarino,

L.A. and Summers,

M.D.: Functional

mapping

of an AcNPV immediate early gene which augments expression of the IE:l trarrs-activated 39 K gene. Virology 162 (1988) 444-45 1. Carson, D.D., Summers, M.D. and Guarino. L.A.: Transient expression of the AcMNPV immediate early gene, IE-N, is regulated by three viral elements.

J. Virol. 65 (1991a) 945-951.

Tolmaski, M.D., Wu, J. and Miller, L.K.: The location, sequence, transcription and regulation of a baculovirus DNA polymerase gene. Virology 167 (1988) 591-600. Ziemer, M.A., Mason, A. and Carlson, R.M.: Cdl-freetran&&nssf proline-rich protein mRNAs. J. Biol. Chem. 257(18) (1982) 1117611189.

Sequence and in vitro translational analysis of a 1629-nucleotide ORF in Autographa californica nuclear polyhedrosis virus strain E2.

The complete nucleotide (nt) sequence of an open reading frame (ORF) (map unit 5.1 to 3.8) from Autographa californica nuclear polyhedrosis virus stra...
507KB Sizes 0 Downloads 0 Views