Proc. Natl. Acad. Sci. USA Vol. 89, pp. 10753-10757, November 1992 Genetics

Genomic sequences with homology to the P element of Drosophila melanogaster occur in the blowfly Lucilia cuprina (trnpoblk elements/evolutln/Lu-Pl) HARVEY D. PERKINS AND ANTONY J. HOWELLS Division of Biochemistry and Molecular Biology, School of Life Sciences, Faculty of Science, Australian National University, P.O. Box 4, Canberra, Australian Capital Territory, 2601, Australia

Communicated by M. M. Green, July 21, 1992

ABSTRACT We have cloned two DNA elements (Lu-P1 and Lu-P2) from the Australian sheep blowfly Lucioia cupruaa that are similar to the tsble P element of Drosophila meknogaster in both structure and sequence but have divg ably. Hybrizati from it and from each other c studies indicate that a third reated element probably exists In another, as yet unsequenced, done. Neither Lu-P1 nor Lu-P2 appears to be active In terms of mobilt, and It Is not known tn-competent pies of other related whether any t elements occur in the genome of the blowfly. However, the isolation of any P-like seq s from a species outside of the family Drosophilidae ailows comparisobs to be made of more widely divergent P-related elements than has been possible o previously. We are unaware of any report of the presencef multiple P-like family members within a single species. The discovery ofLu-P1 and Lu-P2 in the blowfly fuels the posibity that similar elements may be widespread in insects, and perhaps in other orders of animals.

The transposable P element of Drosophila melanogaster has become an invaluable tool for genetic manipulation, experimental mutagenesis, and germ-line transformation in this species (1, 2); however, it is essentially nonfunctional in heterologous species. As a consequence, the basic and applied opportunities afforded by such germ-line transformation systems (3) have stimulated efforts toward their development in other organisms based on endogenous elements with similar properties. The P element belongs to a class of transposable elements called transposons, which transpose at the DNA level and are characterized by the presence of a gene encoding a "transposase" enzyme and by inverted terminal repeat sequences. Other elements of this type are common in prokaryotes (4) and plants (5-7) and have been found in some invertebrates (8), but they appear to be relatively uncommon in insects or the higher orders of the animal kingdom, where most mobile DNA belongs to a class of elements that transpose by way of a reverse-transcribed intermediate. However, a number of recent observations suggest that elements of the transposon type may be more widespread than has previously been thought. First, P-like elements have now been characterized from a range of Drosophila species, and a P-like element has also been described from Scaptomyza pallida (9), a more distantly related member of the family Drosophilidae. Second, elements of the transposon type have now been isolated from a range of insects and include mariner (10) and hobo (11) elements from various Drosophila species and TECthl from Chironomus thummi (12); mariner-like elements have also been found in the silkmoth Hyalophora cecropia (13). Finally, it has been reported that there are regions of significant homology beThe publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

PL72

X E

H

PL81 E

PL82 H x

PL83 H X

E

EH

PL91

X

E H

FIG. 1. Southern blot of five genomic clones probed with P-element DNA. Multiple hybridizing bands in HindIl digests of PL81 and PL83 are due to incomplete digestion; however, genuine hybridization occurs to several bands in both digests of PL82. The filter was hybridized and washed at 54TC and then exposed to x-ray film for 24 hr. Lanes: A, HindIIl-digested A phage DNA; E, EcoRI digests; H, HindIll digests.

tween the transposase genes of hobo and two plant transposons, the Ac element of maize, and Tam3 from Antirrhinum majus (14), suggesting a distant but common evolutionary origin. In this paper we report the cloning of two P-like elements from the blowfly Lucilia cuprina, providing further evidence that transposons may indeed be widely distributed in insects. *

MATERIALS AND METHODS Fly Stocks. All fly strains used in this study were obtained from the Blowfly Genetics Group, Commonwealth Scientific and Industrial Research Organisation Division of Entomology (Canberra, Australia). They were originally collected from various geographical locations in Eastern Australia, from Queensland in the north to Flinders Island in Bass Strait in the south. Library Construction. Genomic DNA, prepared from standard wild-type (SWT) embryos as described (15), was partially digested with Sau3A1 and, after partial end filling, ligated into commercially prepared half-filled Xho I-cut AGEM-il vector (Promega). Su log and PLmid DNA. DNA was subcloned into pBluescript II KS vectors (Stratagene) by standard ligation and transformation techniques, using Escherichia coli host strain JPA101. Plasmid DNA was prepared by the alkaline lysis method (16). *The sequences reported in this paper have been deposited in the GenBank data base (accession nos. M89990 and M89991).

10753

Proc. Natl. Acad. Sci. USA 89 (1992)

Genetics: Perkins and Howells

10754

(United States Biochemical). Manipulation and analysis of sequence data were performed with MACVECTOR (IBI) software.

Southern Analysis. DNA was transferred to nitrocellulose in 20x standard saline citrate (SSC) (17) and hybridized in 50 mM Hepes/lOx Denhardt's solution/3x SSC/0.1% SDS containing salmon sperm DNA at 50 ,g/ml. Washing was in 2x SSC/0.1% SDS. DNA probes were labeled with [a-32P]dCTP (1700 Ci/mmol, Amersham; 1 Ci = 37 GBq) by nick translation. DNA Sequencing and Analysis. Exonuclease III was used to generate progressively deleted sequencing templates (18). Where a protective 3' overhang could not be generated, exonuclease digestion was blocked with a-phosphorothioate deoxynucleotides (19) according to the Promega protocol. Sequencing of single-stranded templates was by the chaintermination method (20) as modified for use with Sequenase a

1 12 1 2 41 3 61 4 81 6 01

RESULTS We have used an internal sequence of the P element [2.4kilobase (kb) HindIII-Sal I fragment of pir25.1; see ref. 21] from D. melanogaster as a hybridization probe to search for similar sequences in the genome of the Australian sheep blowfly L. cuprina. Genomic Southern blots were initially hybridized at normal and reduced stringency. Even under the least stringent conditions (52°C) no specific bands of hybridization were obtained, although some faint diffuse hybrid-

GAATTCCAACGCTATTTAGTATATTTAGCCTCTTTTATATGGAGAAAACATATAGTACTGATGTTTCCACCAAAATGAAATTTTATTCATTCAAATTTTATTTGATTTTTACGAAACCAA XNGW GCCTACAACAGTGGTATAG;AAATAAACAAGTGAAGAAGTATAGTCGGGCGTGG CCGACCATATGTAATATTATTGTGCTGAAATTCATCACGATATTAGTTTTATATGTGAATTTTGGACATTCAAGTCATATTCTGAAGGWACTTGGTATGGGAGCAGGGTCAAATGAGG TCCGATCTTTATGAAATTTGGCAAGGTCATC :AAGGCTTGTATAAA ACTTATTTGTGCCGATTTTTTCAA AATACAGGTAATTA ATAAT ATAGCTAAGCCTTTTTCGGGGGG TTCTGTTGTATGWGGCCTAGGTGAAbAATTGGGCCGATTTTAACCATTTTCGTCCTTGCAAAACAGCGCTTGTACCAAATTTCATCCAATTATATTGAAAATTGCGACCTGTATCT

CGTTTATCTCAAAACATGGCTTAATATAGTAAAATACTTATAATGTTATTTACAM

8 41

TGCGCACAAGGTTTAAGCACATGGACAGTCAGCCAGACGGACGGACAGTCGGACATAGCTCAATCGACTCAGAMAxTGTTTCAAAGGCGATTGGTATACTATAAGGTGGTTATAAGACGT ATATTTGTGTGTGTTACAAACATCAGCACAAACCCAATATACCCTTCCCACTAAAGTGGTGTAGGGTAAAAGGAATAATTCTGTAACTTCTAAACGAGAGAkTTCTGTATATGT AAGTTTCATGAGATCGCAACACAAACGAAAAAGTAGGATCGTTATAAAAsATTTTATATATCCGGTCTCCTAAGCCCTGGTGGGCCCATGAAGTCCAC TTCAGTCGACAAAA

9 61

AAATTTTCTTAGAATTTTATTTTAATTTAGAATGCCAAGTGACATCAATATGGAAACAGATGAATGTTTATTTTTAAGTCGCCCTAGCCCTTCTTGGACATCGAATGTTGATCCGTCTGT

7 21

M N

F. L

R R

S I

Q L

H

X

F

*

R F

X R

S M

R P

I S

V D

I

I

X N

I

L

Mt E

Y T

I D

R E

S

C

P

L

X

F

P L

N S

K

R

A

P

H S

E P

V S

H W

I T

Q S

S N

G

V

I

T D

K

P

T

S

V

10 81

TCCGAGTGCTGTATTAACGGAAAGTTTTCCCACAGAATATACCAAAACTTATGTAAATCAGACCAWGAZ&&WTAATTAAAATAACATCAGTTATGTTTTTACTAATGTTA

1201

TGTT, =

13 21

TATGGCTCTTCACGGAGTCAATAAAATATTTACTGAkGGGCCAAATTCGAAAGTTGGAAGGATGCAAAAATTGTATG&AAATCCCCCGATATTTCAAGTGCTATTTGCTTACACGCCGC

14 41

TGGACCAAGGGCTTACAATCACTTGTAAGAAAGGATTTCCGCTACCTCACGTCAGCACTTTACAGCGCTGGTGTCAGAAAGTTGATGTGCATGAGGCCTTCTTAAAACGTCACTAGG

15 61

ATTTATGCAACAAGCAACATATTTGTCACAAGATGAGAAAATTTGTGTGCTAGCCTTTGACGAAATGAAGGTAGCGGAAACTTTCGAGTACGATTGTACTAATGATGTCGTAAGACAGCC

1681

AGCAAAATATGTGCAGGTGGTAATGGCACGTGGTCTAAAGAAATCCTGGAAGCAGCCAGTCTTTTATGATTTCGACTGTAGAATGTCGAAACAAATTTTAGACAGCATAATTTCAGAATT

P

M G

F A

S

A

V

L

T

E

S

F

P

T

E

Y

T

X

T

Y

V

N

Q

E

T

Q

T

E

=TCCCGATATTTTTGCAGCGGAATCAGCAAAATATGTTGACAAAATCCGCGAACTCGAAAGGGAAAACAAAGAGCTAAATTTTAGACTTAAGATGCAGGTCCACAATTCAAA P

A P M

L R Q

K

Y

H A

D

G Y

Q A V

I

V

N T

F

N

A X

A

I

L

H

L

Y

*

S

E F

X Q

S

T

A

E

X

D

Q V V M A R

K

G F

G

E

G

Y

P

R

L

I

K

P

C

T

F

Q

L

G

S

A

R

E

E

V

L

N

R

L

H

V

S

K

K

P

C

I

R

D

I

Q

X

L

V

F

K

L

D

F

Q

M

F

V

Y

R

E

I

X R X

D

W V

F

N

V

R

N

C

X

V

T

E

D

S

X

Q

A

L

F

C

N P

V

E

Y

S

H

L

R

S

E

H

D

K

R

S

I

D

D

F

R

F

D

D

H

C

H

T

V

S

N

L

K

L

N

L

V

I

L

T

I

Q

A

G

C

Q

M

A

L

R

I

N

A

S

V

I

S

G

P

Q

S

L

E

GTCTAAAGCTGGTTTTCCGGTGCTGGCAATTGTCTGTGATATGGGGCCAACGAATCGTAAGCTGTGGAGTGATTTAGGTGCAACGACAGrIAAZTTATTAAACACATGCCACATGTAA a 1~~~~~~~~~~~~~~~~~S1 1801

S

192 1 2 041 21 61

K

A

G

F

P

V

L

A

I

V

C

D

G

M

P

T

R

N

L

X

K

5

D

L

G

A

T

T

GATACATCACACAGCACATTTAACACATGGCTTACGATTTTTAATTCGGhAGAAgTTAAATTTTCTCTACTCGATCGGTTCCTGTTTTTGCCTAGCAGAAGGGATTTTTACATTCGTTGT AATAATCGGTAATTAAAAATATGAAATTTTTAACAATAQATAATTGCTTTCGTTTTTAAAGATCAAATAAQATTTGATACCAATAAGAACAGCATATTTCATATTTCGGCATTAAGGCTT

TAATTTACTTTATTAAAAATGTCACCZ=WAAAGCCTTGGTTCCCTCATCCAGTGAATACAGAGG^AAATATATACTTTTGTTGATGCTCCACATTTGTTGAAGTTAATCCGAAAC E

K

V

P

F

V

P

H

P

N

E

T

E

V

X

F

T

Y

V

D

A

H

P

L

L

L

X

I

R

N

H{nd III

2281

CATTATATCGACACCGGCTTGATTTATAACGGCGAACACTTAACGTCACGTACTATCGCGGATGTTTTACAACATACMAAAAAATGTGATACATCTATAACATTTAAGCTTAGTGATGAA

24 01

CACCTCTTAGTCAAAGGCGCCGMA=3GAATATATTGTATACATATGTATTTCAAAACTTTATTTCAAAATATTIWGAGCzAAAAGGTAAAATTGGCTGCACAGCTATTTTCAAA

25S21

TAC CACAGC TAGTG CAATC CGACG TT GCTATAC TCACGGAAAGGACATCTACAAACCACTCGAGACT GC CGAAC TTATT CAAACAGTAAACAAC TGGTTTGAT GT TGTTAAC TCT TCAAT T T A S A I R R C Y T H G X D I Y X P L E T A E L I Q T V N N W F D V V N S S M

2641

GAATACGTTCGGTCTCCCTGGCAAGGTIAAGTATGTTATTTTTATTTATAAGTAATTTTAAAGCGTAGTTTTATATCZTAMGAACCATATGGAGTTGATCTGGAAAGCCAGCAAAATAA

2761

GTTGGAACAGATGAACCAACTAATGGCTGTACCAATTATACCAGGTAGGAAAAGTCTGGAACCTTTCCAAAAGGCATTCTAATGACAAACCGGGCCTTGGTAATGCTTTATGACGATGT

2 881

TAAAAAGTATGACATGACATACATTTTAACAAATCGTCTAAATCAAGACGTCTTGGAGCATTTTTTTGGTGCGATTCGCAGCAAGGGTGGACTGAACGACCATCCCTCTCCACAAGACTT

3 001

CAAGTTTAGATTAAGAAAATACATATTGGGTAAGTTAATTATATAAGCACACTGGGTAATGCTAGTATAAATAAAAATTTCTACAATAAATTTTTTTTTATTTAATAZIQGCGAAA

312 1

TACCGAATACTTAAATGGCACTGGAAATGTCGAAAACGATAACACTGAATGGCTCAATTCCACAAATTTTACAATAACAAGTTTGGCAAAAGTTGACGAAGTATCGGAAGAAATCCATCT

32 41

TGATTGCCAAGCTCCGATATCGATACATGCCCCGACGGAAGAACAGTTTTTAGACCTAGACGAAACGGATTTAGACGACATTGGTATTCTTCAAGAGGATGCTCTAGAATATATATCCGG

33 61

ATATATCATTCGAAAGCATAACCTTGAAGAATACCAATGTCGTGAAAATACGTTTACGTGGGTCGACGAAGTATCGAAAGGTTCATTAAAAAAACCATCAAACTTTTTCCTCCAAAAGAT

34 81

AAAAAGTTTAGAAGTAGTGTTCTACAATG;TCAATGGAAGAGAAATATCGCACAG.AACAAACCTCCGTCAAkCATTTACTTAATGAAAGTTCATATGTAG;ACCTACCTGAkACGCATTAAGCAk

3 601

.ATAAAACTATATTATAAATG TAA GT GTTC TTT TTTAGATGC CGAATATT TT TTCGGATACGGAAT TTAAATAAGCACATTAAGGT CCAAA.AACAAGTGC TAATGGG TAAAAAAAGAT F F F R C R I F F R I R N L N K H I K V Q K Q V L M G K K K M I K T I L

3721

ATGTACTATTCACGGATTATTCTATTTTTTGATTTTTTTCTGTCGTATACTTATAGGCTCAGTATTTTTGTAAATTACTACACATCGTATCAACAGCCTATGTCCGATTTTGATAATAGC TCTAAGGATTACG

H

H

L X X T

D Y K

T E

X F E

C I 5

T

D

V

L

L

N

3841

I

Y

F Q Y R Y Q I L

M D L L A R E

L N M R

N P X V

Y

I

G

N

E

H

T

L

S

R

I

T

D

A

L

V

T

H

Q

P

G

Q T X G

I H V

G

L Y Y T 5 N F

I I

G

I L Y

R

E

K M

C

K

N

A

G

R

G

L

G

A

L

V

T

P N

I R

I L

P N

G Q

R D

X V

S

L

L E

E H

P

F

F F

Q G

X A

G I

I R

L

S

M X

T

D

Q

P

T

G

X

Y

N

G

S

V

G

R

L

I

X

V

A N

T

L

D

L

D

K

F

A

L

V

H

A

E

M

P

L

Q

S

L

S

L

Q Y P

H E N

F

Q D

Q

A

L N

D

S

V

A

E V

E e

Y N

N

T Q G

D E C R

N E R

E

T Q E I

E F N S

N

L T H

L

D F

N

L T

R T

S

D W N

T

E V L

N

T D R

F

D E Q

T

L V H

I D S

L

T D X L

S

I G N

L

G S E

A

I L S

X L X S

V

Q X Y

D

E P

V

E

D S

D

V

A N

L

S

L

F P

E

E

F E

E

Y

L

R

I

I

Q I

E

S

N

D

D

R

H

S

X K

N

K V

F

N L

G

I

Q

3961 4081 4201

4321 4441 4561 4681

4801

TAATTTTGGTCTCCTACCA

AAATTTTTAGGACAATTTATCCCCTTTGGTTTTCTAATTCATTACTAACTATTAATACATATTATTCACAAGTATTAATTTTAGTCTTGTCTCTTATT

ATTTAATACTGATCTAAAATTTGACTTCGCTATTTGTTTGTCCAGTAATAAAGATTGAGGTGTACAGAATCTCTAATTGGTGCAGAAAt agCCtACGGCATCAGGATTTACAAATAATTA TATGAACTTAATGCTTAAAATATATTAACTTTTATTTTATGACAAAATCTTTTAAATTTAAAATTGTTACAAGATGTGACTTATAAGTGTACGTAATAAAGGCGAAGTGTAACGAA AAATCGTATTTTGGACATTTATTTCCTAGGGAAAAAACTTTCGGTTGATACAGCTGTCGCTGzGGTTGAGAATCTTTGACCAGAGCGAATATCTAATTGCCGTGGGAACGATAAGAGTTTG TTTTCATTGGTTGTCATACAAAACAAAATTACATTAGAGAAACAAGACCTATACACAATTAAATAACTCCACTTACTCCAGGTA

FIG. 2. (Figure continues on the opposite page.)

h

1

10755

Proc. Natl. Acad. Sci. USA 89 (1992)

Genetics: Perkins and Howells

GAATTCGATTTTCCCAAAATTAAGCATTAATAATOCTTTGAGCTTCATAGATATAAGTTTCATACCAAATTTAAGTTCCTTTAAAATTGTTTTTTTTTTTGGGTATTTGGTTTCGCTATT

Hind III

12 1

AAGCTTTAAAATAATTATTCGTAATTAAACTTAATTTTAAATAATTTTAATGCCAATTTTATATGTGAAATTAACCTATTGTTTC=LQLCAQGACCATGGTTTCCTAATCCCACAAATG

241

CGGCTGATAAAGTTTTCGCTTTTGCMGATGCCCCCCACTTGTTAAAGCTAATAAGGAATCACTTGTTAGATAGTGGATTAGTGGTTAATGAACAGCTTTTGACGCCGCGGACAATAGCCG

361

ATGTTATTAATCACACATCTGCCTCCGATGTTTCTATAACATTTAAGCTGAATGATGACCATCTGACTGTTAAAGGCGCA

481

ATATTGTTAATTAACATTAjAAAWGGCAAAAGGTAAAACTGGCGGCTCAGTTATTTTCGAACACAACTGCTAACGCAATCCGCCGATGCTTCTCCCTTGGTCTGGATGTCTACCAAG

601

CCACTGAGACAGCGGATTTTATACAGCTTGTAAACGATTGGTTCGACATTCTTAATTCAACGTTAAGTACTTTCGTTTATCCGGGCAAGGAACCATTTGGACATAATTTGGCAACACAAA

2

A D

A V

D I

K N

V H

F T

AF A D S

A

S

E

A

T

E

T

A

D

F

I

D

A V

P S

R QK

Q L

V

N

H I V

L T K

L F

L

D W F

K K A

D

I

L L

R

N

D

A Q L

I

L

N

TTGGACGAAATTACCAAAATCATGGCTTCTCCAATAATACCAAATAA

N

D F

S

H

H S

T

L

L N

L

L

T T

S

D

V T

T

S

K A

F

G

G N

V

L

V

V

Y

I

P

E Q

P

L

3 F L

T

P

P

N

R

P

T

T I

N

A

ZTAAATACGAACATTTATTATTAATTCGTATCC

A A

N

R

R

R

C

G K E

F

P

S

F

L

G

G

H

L

N

D L

V

A

Y Q

T Q

TGCTTGAACCATTCCAAAGGGGCGTCGTCATCACCAACAAAGCATTGATAATGCTTCATG R R V L 13 P F Q R G V V I T N R A L I M L H

721

T I

841

AATATGTTGGCGAAAGATTTAATATGCAATACATATGCACCAGCAGGCTCAATCAAGACGTTTTGGAACACTTCTTTGGTGCCCTTCGTAGCAAAGGGGGCTTAAACGATCATCCGTCGC

961

CAAAGGATTTCAAATATCGAATGAGAAAGTACATCTTAQQTA=TTCAAACTGATTTTATTTCATTGTTATATTACATTAAATTGTTTTTTTTTTTTGTTATAACAATZATWCTAGAA

1081

ACACCGAACACCTTAGTGGTACTGGTAACGTTGGCTACGACACCACCGAATGGCTCAGCACTTCAGACACCAATGTTTCTTTCATTGAAAGTGAAACAAAACTTTCCAATGAGCCGCTTG

1201

AACTGGAAGAAATTTCAGAAGAACCCCTAACAAATGTACTTGACGACCTAGATGGATCGAATTTTGATGATATGGGTACTCTCCAAGTTGATGCTTTGGAATACATTGCTGGCTACACTA E L E E I S E E P L T N V L D D L D G S N F D D M G T L Q V D A L E Y I A G Y T Hind III

1321

TTCGAAAGCTTAAACTTAGAGAATATGAATCTC

E

P

N

I

R Y

K

T

R

X V D

E

K

L ,D G

F

H

L

E

E R

K Y

L

K

S

L

I F R

G

R

T

N M

T

E

R

I

M Q R K

G

Y

N

E

M Y Y

V

S

A

I I

G

S

C

P

T

I

S

I

R

P L

N N

Q

D

V

L

E

H

F

F

G

A

L

R

S K

G

G

L

N

D

H

A

L

Y

P

D

T

T

E W L

S

T

S

D

T

N V

S

F

I

E

S

E

T K L

S

N

E

P

S

R L

ATTCGTATACTTGGGTTGATCAAGTATCAAAGGGTATGCTGATTAAGCCATCTTCAGAGTTTCTCGAACAATTGAAGAAAC

Q R

N

S

Y

T

N

V

D Q V

S

K G

M

L

I

K P

S

S

E

F

L

E

Q L

K K

1441

TGGAAGCTGTGTTTTACACTTTAAATGGCTTGGAAATATCGCACTGCGACAACATCCATCAACGTTTAGTGGAACGAAGTAACAACATTGACCTTCCAGACGTTGTTAAATGTTTTTTCT

1561

TTAAATGTCGCATATACTTCAGAATCCGCCATTTGAATCAAAGACTCAAAAAACAAAAATGGAACAAGGAAGTAAGCAAAAAGATGCTTAAAATAAAGCTTTAATAACTGCAAGTAAGTT

1681 1801 1921 2041 2161 2281

AT TACCT TTACTTAAAAACATTTCTTAACTTAATACATATTCTTTCCTTTAATTCATAGGCTTATGCAGAAAAC CTTGCCCTTCTACACTTTACAAATTGCTTTAC GTTATCAAAACC CA

L

E

F K

A C

V R

F

I

Y

Y

T F

L R

N

I

G

R

L H

E L

I

S

H

N Q R

C

L

D

X

N

I

H

Q R

K Q X W N

L

X

V

E

E

V

R

S

S

N

N

K K M

I

D

L K

L

I

P

K

D

V

V

X

C

F

-F

L

TTGTCCCTGAGTTGTGTTTGGAIACATGTTTGGACCGAAA&ATACCTATGGCAAGTATGTAATTAACAGTTAAAAATAAAATCTTTTACATATTAATGTTTTTCTTTGAAATATTCTGATTT CAGAAACAACTTTTGTTTGGGTCTACGTGATCCTGCCGTTAATCGTCTTACTTTATAGTTCGGGGAIATGTTAAGCATTTGTTTAAGTCTCTTGCCTTCAGAACAAAACCGGGAGALCAC TACATTTGAAAATGAGTTTCATTTGTTATCACCAGTACACCCTCTGTAGCCACAGTTTTCCACCCTCCACCTGGATTGTTTACAATMAGCCAAAAGGALCCAGCCGAGAATAAGTGMATAA GTCCAGTTTCCATATTTCTTGGAAAAACACTATTTTTTTTACTAAATATATGTCAAACTTAAATAAATTGCTAACACATTTTTTATTTATTTGCATTTTTATTTATTCTTTACTCA AATATTATTTTT

FIG. 2. Nucleotide and derived amino acid sequence for Lu-P1 (a) and Lu-P2 (b). Splice consensus sequences are indicated by single underlines; inverted repeats mentioned in the text are shown by double underlines. Restriction sites used to generate internal probes (see text and Fig. 4) are shown above the sequence. Also indicated by underlining is the 495-base-pair (bp) region of repeated sequence mentioned in the text. A short stretch of ambiguous sequence from PL83 (positions 4529-4534) is indicated by lowercase letters. The sequence from PL83 (Lu-P1) was obtained by cloning a 5.6-kb EcoRI fragment as overlapping 3.6-kb EcoRI-Sal I and 2.8-kb Xho I-EcoRi subclones, which were sequenced from both strands. The sequence ends about 550 bp upstream of the second EcoRI site. The 4.0-kb EcoRI fragment from PL91 (Lu-P2) was sequenced from one strand only for the first 2.3 kb.

ization was observed that appeared to correspond to satellite bands visible in the gel (results not shown). However, work in our laboratory aimed at the cloning of eye color genes (22) and retrotransposons from L. cuprina by cross-species hybridization has shown that, due to the amplification of the particular sequences, appropriate clones can often be isolated from genomic libraries despite a lack of clear signals on genomic Southern blots. Consequently a L. cuprina genomic DNA library was screened with the P-element probe under conditions of low stringency (54°C). From a screen of 35,000 clones (approximately one genome equivalent of L. cuprina DNA), about 30 positively hybridizing clones were detected. Five of these were plaque-purified and analyzed further. Preliminary restriction mapping indicated similarities between only two of the five clones (PL72 and PL81), and Fig. 1 shows the results obtained when DNA from all five was Southern blotted and probed with P-element DNA. Distinct bands of hybridization occur in all cases and the intensity of the signal varies amongst the five clones, with clone PL83 having the strongest. DNA sequence was obtained from the P-hybridizing region of PL83 (Fig. 2a) and later, for comparison, from a smaller region of PL91 (Fig. 2b). These obviously related but different elements have been named Lu-P1 and Lu-P2, respectively. Analysis of the sequences produced a number of open reading frames and amino acid sequences with striking similarity to those of the P element of D. melanogaster. Alignments based on this similarity have enabled some structural features of Lu-P1 and Lu-P2 to be deduced (Fig. 3a) and a basis on which to assess the similarities and differences between these elements. While sequence from Lu-P1 appears to cover the corresponding full length of the Drosophila P element, amino acid homology does not begin until part way into the putative exon 1, then continues to the end of exon 3. No homology is detected between exon 0 of the P element

and the putative exon 0 of Lu-P1. The incomplete sequence data obtained for Lu-P2 begin in the second intron, whereafter homology can be traced throughout exons 2 and 3. The upstream part of Lu-P2 is presumably present in the adjacent 6-kb fragment, which in EcoPJ digests remains attached to the right arm of the vector and hybridizes to the P-element probe (see Fig. 1). Interestingly, in Lu-P1, "exon 2" is interrupted by two small introns not present in the P element, while Lu-P2 possesses only one of these additional introns. Otherwise the insertional position of intron sequences for the three elements is identical. Comparisons of the exon 2 sequences from p7r25.1, PS18 from S. pallida (9), Lu-P1, and Lu-P2 (summarized in Table 1) show that pir25.1 and PS18 are more closely related than Lu-P1 is to Lu-P2, and that despite the -34% difference between them the Lu-P elements are about equally divergent from both pir25.1 and PS18. Despite the availability of 850 bp of sequence upstream of the proposed exon 0, and nearly 1200 bp downstream of the end of exon 3 of Lu-P1, no good evidence for the presence of inverted repeat termini could be found. (The 31-bp perfect inverted terminal repeats of the P element lie within 200 bp of the start and end of the transcription unit.) The best candidates are a 10-bp sequence (TCGGACATAG) and a 12-bp sequence (AGAGATTCTGTA) indicated in Fig. 2a; interestingly the 12-bp repeat shows some similarity to the 12-bp inverted repeat (CAGAGAACTGCA) of the hobo transposable element of D. melanogaster (11). However, neither of these inverted repeats is flanked by a direct repeat (transposable elements characteristically produce a "target site" duplication as a result of insertion) and neither is present in the 3' sequences of Lu-P2. In the case of the 12-bp repeat, the situation is further obscured by the presence of a stretch of DNA between the end of exon 3 and the downstream 12-bp repeat which appears to encode a series of 24 tandemly repeated 7-amino acid motifs similar to those

10756 a

Genetics: Perkins and Howells

Proc. Natl. Acad Sci. USA 89 (1992)

pit25.1

2

*

Lu-PI

*

=

Lu-P2

b pc25.1 PS18 Lu-P1 Lu-P2

pW25.i

PS18

Lu-P1 Lu-P2

SK WFSHPADDHHLI VFSDTPHLIKLVRNHYVDSGTINGKKLTKK[K WFSHPTDDNSKIJVFSDTPHLI KLVRNHYVDSGFT

SK

EKPWFPHPVNTEEKVYTFVDAPHLLKLIRNHYIDE3G IYNGEHLTSRTIA RPFWF AA D PH LLK LI RNHL L DSG VVE Q LLTP A

PNUPTNAADKVAF

R

V (Lu-P 1 and Lu-P 2) EALHLCNKSDLSI QTLNHCAKSDVSI LFKISENIH DV LQHTNK DTS ITFKLSDEH LLVKGAGRQKVK

LFKINEN INVRSLAK|QKVKLTLFSNTTASSIRRC NIVIRSLEKIQKVKLAITQLFSNTTASSIIRRCI

DVINHTSA

D

SI

LAAQLFSNTTASAIIRRCI

FKLNDD[DjTMKGAERQKVK LA|AQLFSNTTA|NAII RjRC V

pc25.1

PS18 Lu-P Lu-P 2

P25.1 PS18 Lu-P 1 Lu-P 2 p251

PS18 Lu-P' Lu-P 2

Q

NGKK LTjKT TyQ

YSLGYDIENJT|

T ADFFK YSLGYDVEN[CETS YKPLETA

f7RGIKIDII L JV Y

BT

D| F

MNDWFDIFJ

(Lu-P 1

ody)

TF1SNCIECSQPGK QEDIO

LFKNLINDWFD|VF|NSIKLTIANF IOST Q|PYG|K Q|L|DF|Q| LIQTVINE8 WFDIVVINSISITI-FGLPGKEPYGVDIL ESIQ| AF I Q[gVIN D W F DII L"T -FV Y P G K EFH NjA T

LNRMSEIMRTGILDK4P[RL-PFQKGG IVNNASLDGLYKILQENF SMQ SLEPFQKG ILMTNR| ALVM|L|YDDVK-KY DMT QNKLEQMNQL|AVPIIPG I IK F NDI

RDVLEKMTQLMCSNILGKSQKL-PFQKG IIVNNASLDGLFIYLKDKYNIM|E

KULDEITKI

ASPUIIPNNKVLE

VTVV TKIATIMtjHEJVGERF JQ

[IYILTSRLNQDI E SMfRRGQF[DHP|TLQFKYRLRKYII YjLTSRLNQ DI E8FFG MRSRGGQFDHPTPLQFKYRLRKYLI

IYILTE8RLNQDIV LEHFFGAI|RS|K|GG|LN|DHP|S|P|QD|FKE]RLRKYIL NVLEHFFG

ALRSKGGLNDSUPKD

present at the 3' end of the largest subunit of RNA polymer-

II (23, 24); although possible, it seems unlikely that these part of the Lu-P1 structure. Hence the ends of these elements remain undefined. The presence of single stop codons in both exon 0 and exon 1 of Lu-P1 indicates that it cannot produce an active transposase. While there are no obvious defects in Lu-P2, the lack of the entire sequence prevents us from speculating on its potential activity. To gain some indication of whether there are transposition-competent copies of either of these elements in the genome of the blowfly, we probed genomic Southerns of five different wild-type strains of L. cuprina with internal sequences from each element. The results (Fig. 4a) indicate that there is only a single copy of Lu-Pl, which appears unchanged in all five strains, whereas Lu-P2 is apparently present in 5-10 copies, again with little difference in the pattern of hybridizing bands between strains. This indicates that neither of these elements is transpositionally active; similar findings were reported by Capy et al. (26) for the mariner element in Drosophila sechellia. The same probes from both Lu-P1 and Lu-P2 were also used to probe DNA from the five PL clones to determine the relationships ase are

Table 1. Matrix of the proportion of nucleotide and amino acid differences between the various P-like elements Lu-P1 Lu-P2 PS18 pw25.1 0.219 0.496 0.475 pi25.1 PS18 0.219 0.533 0.492 Lu-P1 0.410 0.426 0.354 Lu-P2 0.429 0.436 0.327 Values have been determined from the exon 2 region of each element only. The P-element sequence used was pir25.1 (21), and PS18 is the Scaptomyza pallida P-like element (9). Nucleotide differences are shown in roman type, and amino acid differences in italics.

L

FIG. 3. Structural and amino acid sequence similarities between the P element (pir25.1), Lu-P1, and Lu-P2. (a) Schematic representation of the structures of the three elements. The 31-bp inverted repeat termini of the P element are represented by arrowheads; stars mark the positions of stop codons in putative exons 0 and 1 of Lu-P1. Minor variations in exon size occur among the three elements; however, the main structural differences are the small introns interrupting exons 2 of the Lu-P elements. (b) Amino acid alignments for the exon 2 regions of the P element, PS18 (a P-like element from Scaptomyza; see ref. 12), Lu-P1, and Lu-P2. Dashes indicate gaps introduced to optimize the alignment, and the positions of introns in Lu-P1 and Lu-P2 are marked by arrowheads. Identical amino acids shared by at least three of the four sequences are boxed. Similar levels ofhomology are also found for exons 1 and 3.

between thenn. The results shown in Fig. 4b indicate that, apart from LiLu-Pl and Lu-P2 (present in PL82 and PL91), a third elementt also exists (in PL72 and PL81) that is apparently phylogeenetically intermediate between the other two but, given the: stronger signal, more closely related to Lu-P2. Notably theree is no hybridization between Lu-P1 and Lu-P2 under these bugh-stringency conditions (630C), further illustrating the dihLstinctness of these two elements.

DISCUSSION The discoverry of the Lu-P sequences in L. cuprina is of considerable interest because they represent P-like elements isolated from a species outside of the family Drosophilidae and because, ,as far as we are aware, the isolation of three related but sstrongly divergent P-like elements from one species has nlot been reported before. One of the most interesting aspects of the Lu-P elements, given their di ivergence and apparent immobility, is their high degree of inte-grity (e.g., absence of deletions or duplications and preservaltion of splice junction sequences). In Lu-P1 the helix-turn-heelix motif and two of three leucine zipper motifs of the transpsosase also appear to have been preserved. This would argueas either that the Lu-P elements have only recently cBecome immaobiie or that they have been selectively maintained in the genome as non-mobile elements. The latter is perhaps mor -e likely given the apparent lack of inverted repeat terminii (at least for Lu-P1), a situation that is reminiscent of th4e P-like elements of Drosophila guanche and Drosophila siubobscura, for which a role in the suppression of transpositiion has been suggested (27, 28). If the Lu-P elements also) have some regulatory function, they may act by a mechanism different from that of the P element. Certainly, the production of a protein product equivalent to the 66-kDa P-element relpressor (29) would require some read-through

Genetics: Perkins and Howells

Proc. Natl. Acad. Sci. USA 89 (1992)

a Lu-PI probe X

1

3

2

4

5

X

1

Lu-P2 probe X2 3 4

5

A

44 .......

am ~

~

loss&.

.6-M. Jn

AA

b Lu-P1 probe X

72 81 82 83 91

Lu-P2 probe X 72 81 82 83 91

O-

f. . _i *V,

FIG. 4. Southern blots of genomic and cloned L. cuprina DNA probed with the two Lu-P elements. (a) Genomic DNA from five wild-type strains of L. cuprina digested with EcoRI and probed at 63°C with internal fragments of either Lu-P1 or Lu-P2. The Lu-P1 probe was a 1-kb HindIII-Sal I fragment, and the Lu-P2 probe a 1.2-kb HindIII fragment, both from regions spanning exons 2 and 3 (see Fig. 2). The presence of two bands in each digest probed with Lu-P1 appears to be due to polymorphisms between individual flies within each strain [the DNA used for these blots was prepared by the method of Lifton as described by Bender et al. (25), using five adult flies per preparation]. The slight variability in banding pattern with Lu-P2 is also likely to be due to polymorphism; however, the less distinct bands may represent cross-hybridization to other related elements. Lanes: A, HindIII-digested A phage DNA; 1, LBB; 2, SWT (standard wild type); 3, FBWT; 4, Weller; 5, Llandilo. (b) DNA from the five PL clones digested with EcoRI and probed as for a.

mechanism to permit translation of the stop codons in exons 0 and 1 of Lu-P1. The extent of the sequence differences between the Lu-P elements, the Drosophila P element, and the P-like elements of Scaptomyza suggests that these sequences have been evolving independently for a substantial period of time, and comparisons of the nucleotide sequences from the various elements fit well with the phylogenetic relationships of the respective host species. It therefore seems likely that P-like elements were present in the common ancestor of Drosophila and Lucilia and have been evolving independently since their divergence, estimated to be about 110 million years ago (30). This suggests that the apparent horizontal transmission of the P element from D. willistoni to D. melanogaster (31) may be an isolated incidence of such transfer and that transmission of P-like sequences occurs primarily by normal vertical inheritance. Certainly our data provide no supportive evidence to suggest that lateral transfer has occurred between the Drosophila and Lucilia genera in the recent past.

10757

In conjunction with the currently known distribution of P-like elements, these results suggest that the P-element family is much more diverse and widespread than previously thought and indicate that further work on the distribution of such elements is warranted. The sequences of Lu-P elements, when taken together with the sequences of the P element of Drosophila and the P-like elements from Scaptomyza, provide us with a better idea of the highly conserved regions of these elements, which enables the design of oligonucleotide primers to use in searching for similar elements in other species. We thank P. East and P. Atkinson for comments on the manuscript. This work was supported by a postgraduate scholarship from the Australian Wool Corporation. 1. Cooley, L., Kelley, R. & Spradling, A. (1988) Science 239, 1121-1128. 2. Rubin, G. M. & Spradling, A. C. (1982) Science 218, 348-353. 3. Cockburn, A. F., Howells, A. J. & Whitten, M. J. (1984) Biotechnol. Genet. Eng. Rev. 2, 69-99. 4. Berg, D. E. & Howe, M. M., eds. (1989) Mobile DNA (Am. Soc. Microbiol., Washington). 5. Fedoroff, N., Wessler, S. & Shure, M. (1983) Cell 35, 235-242. 6. Sommer, H., Carpenter, R., Harrison, B. J. & Saedler, H. (1985) Mol. Gen. Genet. 199, 225-231. 7. Kikuchi, S., Liu, X. J., Frommer, W. B., Kostertopfer, M. & Willmitzer, L. (1991) Mol. Gen. Genet. 230, 494-498. 8. Moerman, D. G. & Waterston, R. H. (1989) in Mobile DNA, eds. Berg, D. E. & Howe, M. M. (Am. Soc. Microbiol., Washington), pp. 537-556. 9. Simonelig, M. & Anxolabdhere, D. (1991) Proc. Natl. Acad. Sci. USA 88, 6102-6106. 10. Jacobson, J. W., Medhora, M. M. & Hartd, D. L. (1986) Proc. Natl. Acad. Sci. USA 83, 8684-8688. 11. Streck, R. D., MacGaffey, J. E. & Beckendorf, S. K. (1986) EMBO J. 5, 3615-3623. 12. Wobus, U., Baumlein, H., Bogachev, S. S., Borisevich, I. V., Panitz, R. & Kolesnikov, N. N. (1990) Mol. Gen. Genet. 222, 311-316. 13. Lidholm, D. A., Gudmundsson, G. H. & Boman, H. G. (1991) J. Biol. Chem. 266, 11518-11521. 14. Calvi, B. R., Hong, T. J., Findley, S. D. & Gelbart, W. M.

(1991) Cell 66, 465-471. 15. Perkins, H. D., Bedo, D. G. & Howells, A. J. (1992) Chromosoma 101, 358-364. 16. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular Cloning:A Laboratory Manual (Cold Spring Harbor Lab., Cold

Spring Harbor, NY). 17. Southern, E. (1975) J. Mol. Biol. 98, 503-517. 18. Henikoff, S. (1984) Gene 28, 351-359. 19. Putney, S. D., Benkovic, S. J. & Schimmel, P. R. (1981) Proc. Nat!. Acad. Sci. USA 78, 7350-7354. 20. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Nat!. Acad. Sci. USA 74, 5463-5467. 21. O'Hare, K. & Rubin, G. M. (1983) Cell 34, 25-35. 22. Elizur, A., Vacek, A. T. & Howells, A. J. (1990) J. Mol. Evol. 30, 347-358. 23. Allison, L. A., Wong, J. K.-C., Fitzpatrick, V. D., Moyle, M. & Ingles, C. J. (1988) Mol. Cell. Biol. 8, 321-329. 24. Nawrath, C., Schell, J. & Koncz, C. (1990) Mol. Gen. Genet. 223, 65-75. 25. Bender, W., Spierer, P. & Hogness, D. S. (1983) J. Mol. Biol. 168, 17-33. 26. Capy, P., Maruyama, K., David, J. R. & Hartl, D. L. (1991) J. Mol. Evol. 33, 450-456. 27. Miller, W. J., Hagemann, S., Reiter, E. & Pinsker, W. (1992) Proc. Nat!. Acad. Sci. USA 89, 4018-4022. 28. Paricio, N., Perezalonso, M., Martinezsebastian, M. J. & Defrutos, R. (1991) Nucleic Acids Res. 19, 6713-6718. 29. Rio, D. C. (1991) Trends Genet. 7, 282-287. 30. Beverley, S. M. & Wilson, A. C. (1984) J. Mol. Evol. 21, 1-13. 31. Daniels, S. B., Peterson, K. R., Strausbaugh, L. D., Kidwell, M. G. & Chovnick, A. (1990) Genetics 124, 339-355.

Genomic sequences with homology to the P element of Drosophila melanogaster occur in the blowfly Lucilia cuprina.

We have cloned two DNA elements (Lu-P1 and Lu-P2) from the Australian sheep blowfly Lucilia cuprina that are similar to the transposable P element of ...
1MB Sizes 0 Downloads 0 Views