Comp. Biochem. Physiol. Vol. 103B,No. 3, pp. 651-655, 1992 Printed in Great Britain

0305-0491/92 $5.00+ 0.00 © 1992Pergamon Press Ltd

NUCLEOTIDE SEQUENCE AND PRESUMED SECONDARY STRUCTURE OF THE INTERNAL TRANSCRIBED SPACERS OF rDNA OF THE PEA APHID, ACYRTHOSIPHON PISUM O-Yu KWON and HAJIMEISHIKAWA* Zoological Institute, Faculty of Science, University of Tokyo, Japan (Tel: 81-3-3812-2111 ext. 4446; Fax: 81-3-3816-1965) (Received 3 April 1992; accepted 5 May 1992)

Abstract--1. Internal transcribed spacer (ITS) 1 and ITS 2 of rDNA of the pea aphid, Acyrthosiphonpisum consisted of 229 and 280 nucleotides, whose G + C contents were 70 and 74%, respectively. 2. Secondary structure models constructed for the ITS 1 and ITS 2 suggested that certain structural motifs have been conserved in these regions despite extensive divergence in nucleotide sequence due to species.

INTRODUCTION The formation of mature 18S rRNA, 5.8S r R N A and 28S r R N A in eukaryotes involves a series of posttranscriptional processing of a long precursor r R N A (Perry, 1976). Within the precursor rRNA, mature r R N A sequences are separated by two special segments, which are called internal transcribed spacers, ITS 1 between 18S r R N A and 5.8S rRNA, and ITS 2 between 5.8S r R N A and 28S rRNA. While the nucleotide sequence of mature rRNAs has been strongly conserved throughout t h e evolution of prokaryotes and eukaryotes, both ITS 1 and ITS 2 sequences of distant organisms show dramatic differences in size and G + C content (Michot et al., 1983). While eukaryotic ribosomal ITS 1 and ITS 2 have been relatively well characterized in yeast (Rubin, 1973; Veldman et al., 1981) and several vertebrates (Furlong and Maden, 1981; Michot et al., 1983; Subramanyam et al., 1982), no invertebrate, including insects, ribosomal ITS 1 and ITS 2 have been so far characterized. In this paper we report the complete primary sequences of ITS 1 and ITS 2 of the pea aphid rDNA. In addition, their presumed secondary structure models are constructed based on calculation of free energy. MATERIALS

AND METHODS

Isolation of the genomic DNA of pea aphids, construction of the genomic library, screening of the rDNA and mapping with restriction endonueleases were performed as described previously (Fujiwara and Ishikawa, 1986; Herrmann et al., 1987; Maniatis et al., 1982; Ogino et al., 1990). The 1.4 kb EcoRI-ECORI fragment was derived from 2 ApR8 which encodes the full length of the pea aphid rDNA (Kwon eta/., 1991), and subeloned into the same sites of the plasmid Bluescript SK +, which was named pApR815 (see Fig. 1). The plasmid subelone pApR815 contained the coding regions for the 3'-terminal of 18S *To whom correspondence should be addressed. 651

rRNA, 5.8S rRNA and the Y-terminal of 28S rRNA, as well as the internal transcribed spacers. Plasmid subclones of deletion series were constructed and sequenced by the dideoxy chain termination method (Sanger et aL, 1977) using the Sequenase version 2.0 Kit (U.S. Bioehem. Co.). The secondary structures of the ITS 1 and ITS 2 region were constructed using the DNASIS V 6.00 program (Hitachi Software Engineering Co., Japan), which found the best stable and fit secondary structures with minimum free energy for those regions. RESULTS AND DISCUSSION The sequencing strategy of the insect pApR815 is outlined in Fig. 1, and the nucleotide sequence determined shown in Fig. 2. Since the nucleotide sequences of the aphid 18S r R N A (Kwon et al., 1991) and 5.8S r R N A (Ogino et al., 1990) was known, ITS 1 was automatically identified in Fig. 2. The start point of the aphid 28S r R N A was deduced by comparison with other insect 28S rRNAs (Tautz et aL, 1988), and confirmed by the S I mapping analysis (data not shown), which facilitated to identify ITS 2. It was shown that ITS 1 and ITS 2 consist of 229 and 280 nucleotides, respectively. The two ITS sequences shared a very high G + C content, 70 and 7 4 0 for ITS 1 and ITS 2, respectively. It is generally known that in a given species the two ITS sequences are very similar to each other in both length and G + C content (Schnare et al., 1990). The G + C content of ITS sequences of higher eukaryotes is higher than that of the corresponding sequences of insects and lower eukaryotes (Fig. 3) (Schnare et al., 1990). The aphid ITS sequences were quite exceptional in that their G + C content was very high and comparable to that of the ITSs of vertebrates. While the rRNAs of aphids and vertebrates have high G + C contents, the G + C content of the ITSs of these organisms were still higher (Schnare et aL, 1990). In contrast, insects and lower eukaryotes generally share rRNAs of a low G + C content, and the G + C content of their ITSs is still lower. This

652

O-Yu KWON and HAJIMEISHIKAWA

18S./

28S

~ApR8

5.8S 1Kbp

Sat

Stoat

pApR815

~

~

~

1/5Kbp

I

Fig. 1. A unit structure of rDNA and sequencing strategy for the internal transcribed spacers. Horizontal arrows indicate the direction and extent of the sequences which were read. 10

20

30

40

50

60

70

AATTCCCAGT~.GCGCGAGTCATCAGCTCGCGTTGA'I~ACGTCCCTGCCCTTTGTACA~.CCGCCCGTCGcTACT "" E c , ~ 85 95 105 115 125 13~ 14~ ACCGATTGAACGGTCGCATTGAGGTCTTCGGAGTGGACGCGCGTTATTCGGCCCCCCTCGGGCTGGGTCGTCGCG 160 170 180 190 200 210 220 CGATCGCAAGATGACCGAAATCGGCCGTTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCGTAGTGAACCTGCGGA

235 245 255 265 275 285 295 AGGATCATTA~AGACGGGCCCGCGGTACCGGCGGCACTCGCCGTCTCTCTCTCCCCTCATCCGACCCCGTGGCCG

=---~_JTSl

31"0320 330 340 350 360 370 CGTGCGCTCGCCGACTCGCTCGACGACCGCCCGCGCGCCGGCACCCCCGACCGAGCGTCGACCGAAAGATGGTTA

385 395 405 415 425 435 445 ACACGTCGAAAGACCGTACGTGCCCAGTCGTGCGTCGCGCACGGCGCGTGGCCGTGAGAAGTTTCGTCGAACAAA 460

470

480

490

~O0

510

520

ACA.~.~.~C.~(~CGACACCGAGCGGCGGATCATTGGCTCGTGGATCGATGAAGACCGCAGCTATCTGCGCGTC 535 545 555 565 575 585 595 GTCGTGTAATCCGCAGGTTATACGAACATCGACCAGTCGAACGCACATTGCGGCCTCGGTGACCCGCGGGTCCCG ~lO

620

630

640

650

660

670

'GGCCACGCCTGTCTGAGGGTCGT/~ACCGGATACTCGGCCAGACCGATGCGCCGCCGGCAGGCTCCGACGGCACG '

~_l_,gs2 685 695 705 715 725 735 745 CCCGTCGCCGCTCCGGTGGTGCGAGATTGGCGGTTCGACCGGGGAGAGCCGCTCGCGTCGCCGCCTCCGGTCGTC

760 770 780 790 800 810 820 CGCCCAAGTTGTGACGGCAAACAGTCACGGGTTCTCGCGTCGTCAAACGGCACGTCCCCGTCCGCCCGCCGCGCG 835 845 855 865 875 885 895 AGAGGCCGGGCGGCTGAGAGTCCACGAGACCTCTCCGGCTTCCGAGAAGCGGACGCGGCGGTCACCGATCGGCGG 910 920 930 940 95Q ~O 970 AGG(~GACCGCAGCTGCCGTGTAATT'I~AAGA.~C~,~C~CCGATACGTTCCGACCTCAGATCAGGCGGACT 985 995 1005 1015 1025 1035 1045 ACCGCGATTAAGCATATTAATAAGCGGAGGAAAACAAACAACCCGTGATCCCTTAGTAGCGCGACGAACCGGGA 1060

1070

1080

1090

1100

1110

1120

AAAGCCCATCGCCGAATCTCGCGCGCACTTTTGTGTCCGCCGTATGAGAGTTGTGGCGTACGGGCCGGCCTCGTT 1135 1145 1155 1165 1175 1185 1195 GCCGGGCGCACGTGACGGTCCAAGTCTCCCTTGAACGGGCCATGGCCCGCAGATGGTGCCAGGCCAGTAGAGGCC 1210 1220 1230 1240 1250 1260 1270 GTCACAGCGTTCCGGGATCGGACCGCGCCTTCGAGTCGGGTTGCTTGAGAGTGCAGCCCGAAGCGGGTGGTAAAC 1285 1295 1305 1315 1325 1335 1345 TCCATCTAACCGTAAATACGGCCACGAGACCGATAGTTTACAAGTACCGTGAGGGAAAGTTGAAACAATTTGAA

1360 1 370 1380 1~0 1 400 1410 1 420 GAGAGAGTTCATAAGAACGTGAAACCGCTCGGTTGTAAACGGACAGAGCTCGCGAATCCCCGCCGGGC~

_

Fig. 2. Nucleotide sequence of the pea aphid internal transcribed spacers. Boxed sequences represent regions coding for the 18S, 5.8S and 28S rRNA, respectively. ITS 1 extends from position 236 to 464 (between the 18S and 5.8S rRNA sequence), ITS 2 being from 625 to 904 (between 5.8S and 28S rRNA sequence).

Nucleotide sequence of aphid r D N A

653

2

5



,0

,

70-

°

Va

60-

50-

1 V \o, 7~ ' ~

4030-

®

.....-----~ 8 .....7°

2010.

2~o

560

~Jo

~oo ~

s

Fig. 3. Size and G + C content of eukaryotic ITSs. (1) Yeast (Rubin, 1973; Veldman et al., 1981); (2) Xenopus (Furlong and Maden, 1983); (3) mouse (Michot et al., 1983); (4) aphid (this study); (5) human (Gonzalez et al., 1990); (6) Drosophila (Tautz et al., 1988); (7) C. elegans (Ellis et al., 1986); (8) Dictyostelium (Ozaki et al., 1984); (9) Trypanosoma (Campbell et al., 1987); (10) rat (Subramanyam et al., 1982). GA 0 G G-C C-G C-G G-C C-G

I

G;t¢

o

20-C-G GG-C UA-U C-G

~-40 o

0

C-G G

G C-G

G-C.

60

G-C ~G-C C-G C-G C-G C G-C

1 - C U C U - A C G C G A-U U G U U U U G U U U U U U U U G U U G - 2 2 9 G-C C-G G A O G-C C-G U = A ).

80-G-C >

A-U-200 G U CG- c A-U G-C CU_ A G-C GUC-G GC-G G-C G-C U

W

G-C C-G O G-C 00 ~C-GoooU~ooO~OOO>

zoo.

c

G A

,,, .,,,,,,,,, OOOu? °°°°=°°°=A

o

o

G_C ~

~

C-G

O

C

G c~o

,,, uu>°° uc

O © 'T O>O~c

A

U

O ,,,,,,, O 6 o=oo.oo 0 o AU

llf C-G O c

O

G Fig. 4. A proposed secondary structure for the aphid ITS 1 region. Nucleotide positions arc numbered in intewals of" 20 nucleotides. Base pairs of the Watson-Crick type are indicated by bars. Four substructures are indicated (I-IV).

654

O-Yu KwoN and HAJXMEISHIKAWA

O

G

O

6'

~o < u G - o ~

,so

GeO,! ~.

'7

~o d'o,°~p° 0 0

~.oo,

' .,o O ~

~

i I ~

cV

~° 7

no

P~,-,

.4

rj"

;,r'j

, --,CO ,,~-,..,

¢' ,..o

-

4%sJ

"'C . " ¢" )

,

0o c' j- ~d @

~bbo°

G-C

0 0 0

AC-G C-G C-G A*C C-G C-G G-C G-C ~-200 A-U G-C C U

C

Vf

60-C-C ]I[

G-C oC-G >C-G G-C G-C G-C C-G G-C

O

UG-C C-G , C*A t)

D

o4 ~ ,~nU ~,,cjG u 0 I I --,~ C ( ~ N J

OC_ G J~

C-G

11

40-U*C G-C

C-G -240 C*A U-A GC_ G C-G G-C~

C"

I c-c~ C-G G-C

G-C U C-G G-C ,¢C-G n C-G =260 G-C 20-G-C U-A C-G c

-g~'~53P O

"

GG,oC,

u c c G-280

Fig. 5. A proposed secondary structure for the ITS 2 region. Nucleotide positions are numbered in intervals of 20 nucleotides. Base pairs of the Watson-Crick type are indicated by bars, non-standard base pairs presumably intercalated in the helices being indicated by asterisks.

Nucleotide sequence of aphid rDNA may suggest that higher and lower eukaryotes are under different evolutionary pressures. In this context, it is intriguing that the aphid seems to be under pressure similar to that affecting vertebrates. Like in other ITSs so far examined, there was no common tract shared by both ITS 1 and ITS 2 of the aphid rDNA. Additionally, neither tRNA-like sequence nor open reading frame was detected in the two aphid ITS sequences. The proposed secondary structures of the ITS 1 and ITS 2 region of the pea aphid precursor rRNA are shown in Fig. 4. The model for ITS 1 contains four stem-loop structures (I-IV). The proposed stems occur at nucleotide positions 4-68 (I), 73-100 and 185-210 (II), 104-114 (III) and 115-185 (IV) (with residue 1 being the first nucleotide of the ITS 1 region). While in general these stems are very stable and rich in G/C pairs, stem I has a large loop containing 17 residues. The last part of this structure contains a U-rich pyrimidine tract. Free energy for the proposed secondary structure of the ITS 1 is - 123.3 kcal/mol. The model for ITS 2 contains six stem-loop structures (I-VI). The proposed stems occurred at nucleotide positions 3-43 and 236-276 (I), 217-234 (II), 47-77 and 187-216 (III), 78-133 (IV), 138-153 (V) and 154-186 (VI) (with residue 1 being the first nucleotide of the ITS 2 region). All of these stems, except stem V, are stable and rich in G/C pairs. Free energy for the proposed secondary structure of the ITS 2 is -207.7 kcal/mol. The models of the secondary structure for the pea aphid ITS 1 and ITS 2 are surprisingly similar to general structures (Yeh et al., 1990) that have been reported for ITSs of other organisms, even though the aphid ITSs were very different from those of otber organisms (Schnare et al., 1990) in nucleotide sequence and G + C content. Recently, Yeh and Lee (1990) reported secondary structures for ITS 1 and ITS 2 of several organisms, which also showed that certain common folding features appear to be conserved in spite of extensive sequence divergence. Whether those common helical structures play any role in the recognition mechanism of ribosomal RNA processing must await further experimentation. research was supported by Grants-in Aid for General (No. 01440004) and Developmental Research (No. 01840028)and for ScientificResearch on Priority Areas, "Symbiotic Biosphere: Ecological Interaction Network Promoting the Coexistence of Many Species" (No. 03269102) from the Ministry of Education, Science and Culture of Japan. The research was also supported by a grant from Kirin Brewery Co. Ltd. Acknowledgements--This

REFERENCES Campbell D. A., Kudo K., Clark C. G. and Boothroyd J. C. (1987) Precise identification of cleavage sites involved in the unusual processing of trypanosome ribosomal RNA. J. molec. Biol. 196, 113-124. Ellis R. E., Sulston J. E. and Coulson A. R. (1986) the rDNA of C. elegans: sequence and structure. NucL Acids Res. 14, 2345-2364.

655

Fujiwara H. and Ishikawa H. (1986) Molecular mechanism of introduction of the hidden break into the 28S rRNA of insects: implication based on structural studies. Nucl. Acids Res. 14, 6393-6401. Furlong J. C. and Maden B. E. H. (1983) Patterns of major divergence between the internal transcribed spacers of ribosomal DNA in Xenopus borealis and Xenopua laecis, and of minimal divergence within ribosomal coding regions. EMBO 3". 2, 443-448. Gonzalez I. L., Chambers C., Gorski J. L., Stambolian D., Schmickel R. D. and SylvesterJ. E. (1990) Sequence and structure correlation of human ribosomal transcribed spacers. J. molec. Biol. 212, 27-35. Herrmann B. G. and Frischuf A.-M. (1987) Isolation of genomic DNA. In Methods in Enzymology (Edited by Berger S. L. and Kimmel A. R.) Vol. 152, pp. 346-351, Academic Press, New York. Kwon O-Y., Ogino K. and Ishikawa H. (1991) The longest 18S ribosomal RNA ever known. Nucleotide sequence and presumed secondary structure of the 18S rRNA of the pea aphid, Acyrthosiphon pisum. Eur. J. Biochem. 202, 827-833. Maniatis T., Fritsch E. T. and Sumbrook J. (1982) A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor. Michot B., BachelterieJ.-P. and Raynal F. (1983) Structure of mouse rRNA precursors. Complete sequence and potential folding of the spacer regions between 18S and 28S rRNA. Nucl. Acids Res. I1, 3375-3391. Ogino K., Eda-Fujiwara H., Fujiwara H. and Ishikawa H. (1990) What causes the aphid 28S rRNA to lack the hidden break? 3". molec. Evol. 30, 509-513. Ozaki T., Hoshikawa Y., Iida Y. and Iwabuchi M. (1984) Sequence analysis of the transcribed and 5' non-transcribed region of the ribosomal RNA gene in Dictyostelium discoideum. Nucl. Acids Res. 12, 4171-4184. Perry R. P. (1976) Processing of RNA. Ann. Rev. Biochem. 45, 605-629. Rubin G. M. (1973) The nucleotide sequence of Saccharomyces cerevisiae 5.8S ribosomal ribonucleic acid. J. biol. Chem. 248, 3860-3875. Sanger F., Nicklen S. and Coulson A. R. (1977) DNA sequencing with chain-terminating inhibitors. Proc. natn. Acad. Sci. U.S.A. 74, 5463-5467. Schnare M. N., Cook J. R. and Gray M. W. (1990) Fourteen internal transcribed spacers in the circular ribosomal DNA of Euglena gracilis. J. molec. Biol. 215, 85-91. Subramanyam C. S., Cassidy B., Busch H. and Rothblum L. I. (1982) Nucleotide sequenceof the region between the 18S rRNA sequence and the 28S rRNA sequence of the rat ribosomal DNA. Nucl. Acids Res. 10, 3667-3680. Tautz D., Hancock J. M., Webb D. A., Tautz C. and Dover G. A. (1988) Complete sequences of the rRNA genes of Drosophila melanogaster. Molec. Biol. Evol. 5, 366-376. Veldman G. M., Klootwijk J., van Heerikhuizen H. and Planta R. J. (1981) The nucleotide sequence of the intergenic region between the 5.8S and 26S rRNA genes of the yeast ribosomal RNA operon. Possible implications for the interaction between 5.8S and 26S rRNA and the processing of the primary transcript. Nucl. Acids Res. 9, 4847-4862. Yeh L-C. C. and Lee J. C. (1990) Structure analysis of the internal transcribed spacer 2 of the precursor ribosomal RNA from Saccharomyces cerevisiae. J. molec. Biol. 211, 699-712. Yeh L.-C. C., Thweatt R. and Lee J. C. (1990) Internal transcribed spacer 1 of the yeast precursor ribosomal RNA. Higher order structure and common structural motifs. Biochemistry 29, 5911-5918.

Nucleotide sequence and presumed secondary structure of the internal transcribed spacers of rDNA of the pea aphid, Acyrthosiphon pisum.

1. Internal transcribed spacer (ITS) 1 and ITS 2 of rDNA of the pea aphid, Acyrthosiphon pisum consisted of 229 and 280 nucleotides, whose G+C content...
329KB Sizes 0 Downloads 0 Views