Gene, 121 (1992) 237-246 0 1992 Elsevier Science Publishers

GENE

B.V. All rights reserved.

237

0378-l 119/92/$05.00

06764

Sequence of rat lipoprotein (Mouse;

human;

bovine;

lipase-encoding

guinea pig; chicken;

PCR; untranslated

cDNA exon; A+T-rich

sequences;

CpG islands)

Didier Brault a, Lydie No& a, Jacqueline Etienne a, Jocelyne Hamelin b, Alain Raisonnier ‘, Aziz Souli a, Jean-Claude Chuat b, Isabelle Dugail d, Annie Quignard-Boulangi: d, Marcelle Lavau d and Francis Galibert a uLaboratoire de Biochimie et Biologic Molkulaire, Fact& de M&decine St-Antoine, 75012 Paris, France: b C.N.R.S. UPR 41, Centre Hayem, H6pital St-Louis, 75010 Paris. France. Tel. (33-l/42 02 1605; ’ Biochimie. CHU Piti&Salp&i&e, 75013 Paris, France. Tel. (33-l/40 779805; ’ INSERUM France. Tel. (33-l/4633 Received

U I 17. 75006 Paris,

7105

by G. Bernardi:

5 May 1992; Revised/Accepted:

9 July/l2

July 1992; Received

at publishers:

30 July 1992

SUMMARY

A rat lipoprotein lipase (LPL)-encoding cDNA (LPL) has been entirely sequenced and compared to the sequences of all the LPL cDNAs reported in other species. As expected, high homology was found between the coding exons. The putative catalytic triad, Ser’32, Aspire, His24’, according to human numbering, is conserved in rat. As is the case in mouse, an Asn444 present in human LPL is also missing. The major divergences between human, mouse and rat LPLs were observed in the untranslated exon 10, where (i) the rat cDNA exhibits a 157-bp insertion and an 81-bp deletion relative to human; (ii) neither the B 1 repeat nor the homopurine stretch reported in mouse can be recognized, and (iii) the rat cDNA displays several A+T-rich stretches.

INTRODUCTION

Lipoprotein lipase (LPL) is a key enzyme of lipoprotein metabolism that hydrolyzes the triglyceride moiety of chylomicrons and VLDL. It is involved in various pathologies, such as hyperlipemia, atherosclerosis and obesity. For studies on lipoproteins and LPL, rat is one of the most useful laboratory animals. Since it is resistant to this affection, rat is also a model for atherosclerosis research. Moreover, the genetical nature of its obesity makes the

Correspondence to: Dr. J. Etienne, Biochimie, Mtdecine

St-Antoine,

Tel. (33-1)40306249; Abbreviations:

27 rue Chaligny,

aa, amino acid(s); bp, base pair(s); DlT,

dithiothreitol;

or 1000 bp; h (prefix), human;

lipase; LPL, gene encoding

godeoxyribonucleotide;

507, Faculte de

Fax (33-1)40307840.

(prefix), guinea pig; kb, kilobase lipoprotein

Laboratory

75012 Paris, France.

PCR, polymerase

LPL; nt, nucleotide(s); chain reaction;

cytoplasmic RNA required for both structural of signal recognition protein.

oligo, oli-

7 SLRNA,

and functional

gp LPL, small

properties

Zucker rat a valuable model for research in this field. Lastly, cell lines derived from rat tissues lend themselves to studies of LPL gene expression. However, whereas in recent years the sequencing of LPL has been carried out, via cDNA, in five species, human (Wion et al., 1987), bovine (Senda et al., 1987) mouse (Kirchgessner et al., 1987; Semenkovich et al., 1989; Zechner et al., 1991), guinea pig (Enerback et al., 1987) and chicken (Cooper et al., 1989), it has not yet been determined in rat. The aim of this work was to clone and to sequence rat LPL cDNA as a preliminary to metabolic studies on LPL mRNA. Furthermore, it is meant as a contribution to the comparative study of LPL cDNA in different species, with a view to pointing out which aa have been conserved during evolution and are thus likely to belong to domains playing a major role in LPL activity. Four rat cDNA LPL clones were isolated. The longest, a 3.18-kb clone, lacked about 0.20 kb at the 5’ extremity with regard to the mRNA. The missing 5’ fragment was obtained by PCR. The cDNA sequence was compared to

238 of all the LPL cDNAs

the sequences

reported

in other

species. The divergences appearing between rat, mouse and human in the untranslated exon-10 are pointed out.

RESULTS

(b) Comparison of coding exons of rat LPL cDNA with LPL cDNAs from other species

AND DISCUSSION

(a) Nucleotide sequence of the rat cDNA The strategy used for sequencing the clone isolated from a rat cDNA library, as well as the 0.5-kb 5’ fragment obtained by PCR is shown in Fig. 1 and Table I. The clone comprises a total of 3.18 kb. It lacks the first 183 5’ nt coding

region has thus been achieved by sequencing the OS-kb 5’ fragment obtained by PCR. Nucleotide and aa sequences of rat, mouse, human, bovine, guinea pig and chicken have been aligned (Fig. 2).

for mature

LPL. The sequence

1) Clone RP . _. . . . . .) 5

’ 267

of this

(1) High homology

High homology is found between the coding exons of rat and human cDNA LPL, as well as with the other published An exception concerns guinea pig exon 1, which cannot be aligned with exons 1 of other species.

LPL cDNAs.

w19 El3 \“,, __________+ El . _.--. -. ) W5 -------w ..W!.3... w ______* ______+ w21 w9 Wll ______* ______+ WlS ___Y! ________) ____-_____) .--. w3_.._ t 3444 )

Wl ._..._. _)

OAC

3’

3’ 4-&---

2 ) 0.5 kb 5’ fragment

W4

WlO

4_.__.._.__ w2

I

s

RP

W16

w20

, W26

a)

RP ._.._.. __* b)

)

4 Fig. 1. Strategy

(Clontech

for sequencing

Laboratories,

.___._.._ UP

rat LPL cDNA.

The cDNA

Palo Alto, CA). It was screened

library used was prepared

from 4-week-old

with a 150-mer probe synthesized

the last 14 nt of exon-7 and to the 135 first nt of exon-8 of human the Ml3 vector). Four clones were isolated and subcloned

cDNA

LPL (plus

Sprague-Dawley

by the phosphoramidite

rat testicular

method.

1nt added to create a 5’ BumHI restriction

in the EcoRI site of pUC 18. The OS-kb 5’ fragment

was prepared

I

fat cells in lgtl

This probe corresponds

to

site for cloning into

by PCR. Briefly, total RNA

was isolated from rat frozen adipose tissue using the guanidium isothiocyanate/LiCI method as described by Cathala et al. (1983). Total RNA (6 pg) was heated at 65°C for 10 min before priming for cDNA synthesis with 125 ng random hexamer oligos (Pharmacia, Piscataway, NJ) in a 50 ~1 reaction containing

1 x reverse transcriptase

1 mM each dATP,

dGTP,

stopped by incubating This cDNA

dCTP

buffer (50 mM Tris.HCI and dTTP,

pH 8.3/75 mM KCljlO mM DTT/3 mM MgCl,),

and 200 units M-MLV

at 70°C for 10 min. The resulting cDNA

was used as a template

reverse transcriptase

was purified and concentrated

for PCR with (i) an upstream

primer (Table I: PCRl)

20 units RNasin

(BRL). The reaction on a Centricon consisting

(Promega,

was performed

Madison,

WI),

at 37°C for 60 min and

30 column (Amicon

of the first 20 nt transcribed

Corp., Danvers,

MA).

in mouse according

to Zechner et al. (1991). This was attempted in view of the high putative sequence similitude in rat and mouse; (ii) a rat LPL specific downstream primer (Table I: PCR2) complementary to rat nt 382 to 361 (Fig. 2) (plus EcoRI and Hind111 recognition sites). For PCR, samples were denaturated at 94°C (5 min), annealed at 52°C (2 min) and extended at 72°C (2 min)followed by 30cycles of 94”C(l min), 52°C (1 min) and 72°C (2 min) with a final extension of 10 min. The PCR product phosphorylation ing was carried

of approx.

580 bp was excised

and eluted from a 2% Nusieve

LMP

agarose

gel. This fragment

was subcloned

after

into M13mp89 cut by EcoRV and sequenced. The 0.5-kb fragment was inserted in the two possible orientations (a) and (b). Sequencout by the dideoxy method of Sanger et al. (1977), using [C(-35S]dATP, T7 DNA polymerase and the specific probes.

239 TABLE

I

Synthetic

oligos for PCR and for sequencing

the cat LPf. cDNA

(A) Oligos used for PCR PCR 1 = 5 ’ -TGTCAGACTCTCGATTTCTC

PCRZ = 5’-TATAGCCGGCAGACACTGGATA

382-361

(B) Oligos used for sequencing” (1) The 3.18kb

clone UP = TGTAAAACGGCCAGT

RP = AACAGCTT’AGACCATG

= ATGTCCAC~C~AGGGTAC

435-474

W2 = AATCACTGATGGAGGTGGAC

1887-1868

W3 = CGTGTAATTGCAGAGAAGGG

748-767

W4 = AGCCTAATGAAACCAGTCAC

1652-1633

WI

W5 = CGCTCTCAGATGCCCTACAA

997-1016

W6 = TTTCTGTTCCCAGCAACAG

1439-1421

W7 = GGTCAGACTGGTGGAGCAGT

1250-1269

W8 = GCTCCTCACTTTGCACGCAA

1247-1228

El3 = GGACGCTGCAGTGTTTGTGA

1371-1390

WlO = GCCAAGGCAGGATGGTTGAGA

940-920

W9 = GCAGAGAGGAGAAGCATGCC

240-259

W 12 = TGACTTGTACTTCGT-TGTGG

636-617

EI

385-366

= GGGCCT~TA~TCA~AG

349-368

W 14 = ~AAAACCTCAGAGATCTA

W Ii = AAGCAATGGACGACGTGGCT

572-591

W16 = ACT~CAAGATATAGCTGGG

W 13 = TCAGGCTTACCTTGAACTCT

903-922

W 18 = TCCCTGGCACAGAAGATGAC

1343-1324 1100-1081

148-129

W152 = GTACAAGTI-TTAGAGCAGGA

1141-1160

W20 = TACAGAGAAATCTCGAAGGC

W 17 = GCTCGTTGCCGCTCTTTTGT

1279-1298

W22 = TGTATGCCTTGCTGGGGTI’T

868-849

W 19 = CAGAAAGGAAAGAGTCAAGA

1515-1534

W24 = ACATCTACGAAATCCGCATC

624-604

W21=

1773-1792

W26 = TCAll-l-CCCACCAGCTTGGT

401-382

AGCTGTAAATAATGTGTGGG

(2) The 0.3-kb 5’ fragment RP = AACAGCTTAGACCATG

UP = TGTAAAACGGCCAGT

S( = PCRI) = TGTCAGACTCTCGATTTCTC

0 = CTGCTGTGGTTGAAGTGGCA

it The sequence Numbe~ng

of all the probes is given in the 5’ to 3’ direction.

is as in Fig. 2 (coding

sequence)

The left column corresponds

Divergences between the rat sequence and the five other sequences, from the first ATG to the stop codon (1428 nt), are shown in Table II. The ratio of transitions to transver-

TABLE

II

Divergence

between

six LPL cDNA

sequences

(1428 nt from the first

ATG 5’ nt, except for bovine, to the stop codon) Rat vs.:

Mouse

Human

Bovine

Guinea

Chicken

pig Identities Transitions

(I)

Transversions I/V Insertions

(V)

(i)

1358

1242

1151

1163

59

123

136

148

193

114

209 0.923

18 3.277

60 2.050

75 1.813

1023

1.298

0

1

1

1

1

1425

1426

1363

1426

1426

Informative positions” % Divergence’

5.40

12.90

15.55

28.26

18.44

’ For determining the number of positions, each gap was counted as one position, regardless of insertion length, as these gaps are no targets for substitution. Consequently, the insertion of an Asn codon at the end of the protein is counted for one position only rather than three, since obviously the two other positions stitution. b The percentage (1987) equation:

of divergence

to the sense sequence,

and Fig. 3 (exon 10). RP, reverse primer; UP, universal

are not informative was calculated

as regards

to nt sub-

by the Miyamoto

P % = (I + V + i) x lOO/Nb of informative

positions.

et al.

218-199 the right column

to the antisense.

primer.

sions decreases as divergence increases. This could be explained by a double substitution occurring at one position. Paradoxically, the percentage of divergence between rat and guinea pig is greater than between rat and human or human and bovine. However, results tending to place the guinea pig differently on the mammalian phylogenetic tree have been reported (Graur et al., 1991). The longest segments that present homology among the six species studied up to the present time are the end of exon 2 and the beginning of exon 3, This homology has not been previously pointed out. These regions comprisent 42-87, with only three nonconserved aa out of 46; even position 45, which has Thr in chicken, rather than Ser in the five mammals, is functional, since it belongs to the potential ~-glycosylation site, which accepts Ser or Thr (see section bl). Also conserved is the well-known nt 170-217 region (exon 5), with five nonconserved aa out of 48, or else region 234-264 (exon 6) with three nonconserved aa out of 31. But region 287-320, which Semenkovich et al. (1989) indicated as being well conserved at a time when the chicken sequence was not published appears to be less so (seven nonconserved aa out of 34). The ATG in position 1 for the five mammals, or positions 1 and 19 for chicken, which has two alternative start codons (Fig. 2) codes for the N-terminal Met of the hydrophobic signal sequence. The

240 Open Reading Frame

/

rLPt. mLPL hLPL

-174 T&TCT + GTAOCTGTTATGCCCT CcrJ TTT~~~~~TT~T~~ TACTCCTCCTCCGWEAATTCTGCGC CCTGTAACTGTTCT0XCTCXXCTlTAAAGGTTGACTT@XCTACG3ZSC TCCTCCTCCTCAAGGG~AAGCTGCCCACTTCTAGCTGZCCTGCJZATCCCCTTTAAAG~X~%CTT~XTC

rLPL mLPL hLPL

GACXXCTCCGGCTCAACCCTTTBXA TOCOCCTCCTGCTCAACC CXAGCCTCCGOCTG4GCC

rLPL mLPL hLPL bl.PL LPL cge PL

rLPL mLPL hLPL bLPL 2z.L rLPL r?tLPL hLPL bLPL c PL TPL

I

I

TrXCGDkXTGACT

CTATAGTCCTCT TCC4ccMi CTCCAGTETCT ACCGCCAAACCOCGGCTCCAGCCGTCT

’ GCCCGf&-GTTC&CA&CA GAA& CGCCGXTAGTTKZAGCAGAKAGAA~~~G CCGCCCCTTGTAGCTCCTC CAGAGGGA~CGAG

EEEEEW

GAG

ACW,TTTC!CAG4CATC&AGTAAATT 4 W.XTAAG&CCCTGAA d CACAGCTGA ki4 CACTTGT&ATCTG4TTC&TU3ATTA&KTCTGTG A~OATTTCTCAU\CATCGAAAGCAAATTTOCCCTAAOGA AG4GATTTTATCGACATCAAGTAAATTTOCCCTAAC AAAW\TTTTAOAG4CATTG4AAGTAAATTTGCTCTCAGGAG G4AC9CAOIT64GG4CACCTGTUICCTCATT~T~~~~~TCT~G AAAC?ATTATACGG4TATCAGTAAATTT‘XCO34AGG4fXCCCG4 ATGAATTTTGACYYjAATCGI\GAGCAAGTTTTCCTTAAGTTG l l e l 0 l l l l l ArgAspPheSerAsp Ile Glu Sar LysP~ AlaL~Arg Thr ProGl~ AspThr Ata~uA~pThrCysHis Lau lle Peony Lau AlaA~S~r ArgAspPheSerAsp Ile Glu Ser Lys Pha Ala LsuArg Thr Pro Glu AspThr Ala Glu AspThrCys His Lau Ile Pro Gly Leu Ala AspSer ArgAspPhs Ile Asp Ile Glu Ser Lys Pha Ala Lw Arg Thr Pro Glu AspThr Ala Glu AspThrCys His Lsu Ile Pro Gly Val Ala Glu SW LysAspPhsArgAsp lie Glu Ser Lys Phs Ala Lau Arg Thr Pro Glu AspThr Ala Glu AspThrCys His Lau Ile Pro Gly Val Thr Glu Ser L s Asp Tyr Thr Asp Ile Glu Ser Lys Phs Ala Arg Arg Thr Pro Glu Asn Thr Val Glu Asp Thr Cys His Lau lle Pro Gly Val Thr Glu Ser Myet Lys Pi-m Glu Gly Ile Glu Ser Lys Phs SW Lw Arg Thr Pro Ala Glu Pro Asp Glu Asp Val Cys Tyr Lau Val Pro Gly Gln Mat Asp Ser

Ser AsnCts Ser AsnCys

hLPL bLPL

Ala Thr Cys His AlaAsnCys His Ala AsnCys His Ala Gln CysAsn

192 l;: 126 162 102 Val Val Vat ‘2.4 Vat Leu

Phe Asn Phe Asn PheAsn PheAsn

l * l e*e His Ser SW Lys Thr Pha Val Val His Ser SW Lys Thr Pha Val Val His His His His

SW Ser Ser Thr

SW SW SW Ser

Lys Lys Lys Lys

Thr Thr Thr Thr

Phe Pha Pha Pha

Met Val Met Val

Val Val Val Val

c:: 291 225 261 291

*ee*****ee***eeeee Ile Ile

His Gly Trp Thr Val Thr Gly Mel Tyr Glu Ser Trp Val Pro Lys Lau Val Ala His Gly Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val Ala

Ile 11s Ile Ile

His His His His

Gly Gly Qly Gly

Trp Trp Trp Trp

Thr Thr Thr Thr

Val Vat Vat Val

Thr Thr Thr Thr

Gly Gly Gly Gly

Met Met Met Mat

Ttr Tyr Tyr Tyr

Glu Glu Glu 610

Ser Ser Ser Ser

Trp Trp Trp Trp

Val Val Val Val

Pro Pro Pro Pro

Lys Lys Lys Lys

Leu Leu Leu Leu

rLPL InLPL hLPL bLF3. LPL 9e c PL

Ala Ala Ala Ala

rLPL mLPL hLPL bLPL

rLPL IllLPL

GTGOGAAAT b TGT-LTCATCAA k TGfXTGWIGkAAGAATTT J CTACWCCT 1 GACAATGT &ACCTCTTAG~GTA~GTCT~G~GCCC.AT GTGGG9AATGATCiT-GATTCATCAACT~T~~~~TT~~A~TA~~~~~TCTTA~A~~TT~~~T TTATCAACTG04TGGAc33WWGTTTAACTACCCTCT~~AT~CC4TCTCTTGGGATACA~TT~~T GTGf33CAGGATOTGGCCWGT GGCGGPTGAATTTAACTATCCCCTGGGCAATGTGCATCTCTT~TACAGCCTT~T GTGWACAGGATGT GGCCAAGTTTATOAACTGGAT GTTG6909AGATGTAGCCA~TTATCAACTOOATGG4aC GIGOG4AAEOATFTTGCCATGITCATTGATIGG9TGG(\GT l eeeeeeeeeee V& $y AsnA?pV% A?a Arg P% lie Asn Tfp Leu Glu Glu Glu Pt Asn TTr Pro Leu AspAsn Val His Lau Leu Gly Tyr Val Gly A:n Asp Val Ala Arg Phe lie Asn Trp Met Glu Glu Glu Pha Lys Tyr Pro Leu AspAsn Val His Lau Lau Gly

hLPL bLPL LPL c9: PL

Val Gly Vs.1 Gly Val Gly Val Gly

(continued

on

Lau Lsu Lau Leu

p.

Tyr Tyr Tyr Tyr

&In Gln Glu Lys

Lys Arg Glu Pro Lys Arg Glu Prc Lys Arg Glu Pro Lys Arg Glu Pro

Asp Asp Asp Asp

Val Val Val Val

Ala Arg Ala Lys Ala Arg AlaMet

Asp Asp Asp Asp

Pha Phs Phs Pb

Ser Ser Ser Ser

Ile Met Ile Ile

Asn Asn Asn Asn

Vat Vat Val Val

Asn Trp Asn Trp Asn Trp AspTrp

lie Val Ile Val Ile Val Ile Val

Met Met Met Met

Glu Ala Glu Glu

Val Val Val Val

Glu Asp Asp Glu

Asp Asp Asp Asp

Glu Glu Glu Lys

Trp Tip Trp Trp

V’,l Val Val Val

l l l l l l l l e l l l Leu TyrArgAla Gln Gin His Tyr Pro Val Ser Ala Gly Tyr Thr Lys Leu Lau Tyr Arg Ala Gln Gln His Tyr Pro Val Ser Ala Gly Tyr Thr Lys Leu

SW

Lw Arg Ala Lau SW Arg Ala La! Ar Arg Ala Lau Va I Arg Ala

PhaAsn PhaAsn Phe Lys PheAsn

Tyr Tyr Tyr Tyr

Pro Pro Ser Pro

Gin Gin Gln Gln

Glu Gln His Gln

Lsu AspAsn Lau Gly Asn ValAspAsn LsuAsnAsn

His His His His

Val Val Val Val

Tyr Tyr Tyr Tyr

His His His His

Pro Pro Pro Pro

Lsu Lsu Lw Lau

Val Val Glu Val

Leu Leu Lsu Lsu

Ser Set Ser Ssr

Gly Gly Gly Gly

Ala Ala Ala Ala

Gly Gly Asp Ala

Tyr Tyr Tyr Tyr

Thr Thr Thr Thr

Lzs Lys Lys Lys

:II

Ala Ala Ala Asp

+ GTAT ~-CAT!ATC~~~T~TA!ACCG GCAACATTATC09GTGTCAOCTGGCTACACCAACCTG TACACCAAACTG GZAGCATTATCCAGTGTCTGCAGWTACACZAAGCTG GQ3XCAWXCATTACCCAGGTCTGCGGACTACACCAA~TG

COXTATA~AAAGWi4ACkTGACTCCAA 4 GTCATTGTA&TAGACTGGT GCCCTGiftCAAWIGAWUlCCT~ATGTUITTGTAG OCCCTGTACAA~AW\WU\CCAGI\CTCCAATOTCI\TTGT GCCTTGTACAAG9GGG4ACCGGACTCCAACOTCATCOTOO GCTCTGTACAAOAGGOAACCAGACTCCAATGTUITTGTOO GCTCTGTACAAOAGOGAACCTGATTCAAAT~~TTGTTTG l ee*eeeeeee*eeeee Ala Lsu Tyr LysArg Glu ProAspSerAsnVal Ile Val ValAspTrp Ala Lsu Tyr Lys Arg Glu ProAsp Ser Asn Val Ile Val ValAspT;p

OPL TPL

Fig. 2

l l His PhaAsn His PheAsn

rLPL mLPL

2% LPI. cBe PL

1; -1

Met Asn tie AspArg Lys Ire Lau Asn Lys Ala Leu Ala Lys Glu Lys Val Met Glu Arg Gly Arg Gly Met Gly Lys Tyr Ala Lw Lsu Ala Val Lsu Cys LauCys Lsu Arg Gly Ala Ala Gly

3EL

rLPL InLPL

-72

_Sienal eplide Met Glu g er Lys Ala Lsu Leu Lau Val Ala Leu Gly Val Trp Leu Gln Ser Leu Thr Ala Phs Arg Gly Gly Val Ala Ala Met Glu Ser Lys Ala Lsu Lau Lsu Val Val Leo Qly Val Trp Lau Gln SW Lau Thr Ala PI-e Arg Gly Gly Val Ala Ala Met Glu SW Lys Ala Leu Leu Val Lsu Thr Lw Ala Val Trp Lau Gln Ser Leu Thr Ala Ser Arg Gly Gly Val Ala Ala

rLPL nlLPL hLPL bLPL

%F

:p:

Leu Leu Leu Lw

390 390 390 324 360 300 103 103 103 105 103 105

469 469 489 423 459 460 136 136 136 136 136 136

241)

first aa of the mature protein is conserved in four of the six species. It is Ala in rat, human, mouse and guinea pig. In rat, human and mouse, this Ala is the third of a stretch of three Ala, while there is no such stretch in guinea pig. There are two more N-terminal aa in bovine, -Asp-Arg, followed by Ile instead of Ala and in chicken, Ser-Asp followed by Pro instead of Ala. In rat, we found that aa 444 of the

human sequence (Asn) is missing, as was the case in mouse (Zeckner et al., 1991). In chicken, the C-terminal sequence is 15 aa longer (see below for an explanation). All these particular differences result in hLPL and gpLPL being 448 aa long, while rat and mouse are 447 (448 -1) bovine 450 (448 + 2), and chicken 465 (448 + 2 + 15).

241

rLPL mLPL hLPL bLPL LPL Be c PL

GCT~T~TTGCA~OCT~~TM~~~~~TTACT~T~T~~T~~T~CTTT~~AT~~~GT GCT~T~ATTGCAGOI\AGTCT~~TM~~~~~TM~TTA~T~~T~~TMCTT~~AT~~~T~GT GCTOCTGGCGTTGCTGGGAGTCOGACCAATACGAAGGTCAGT GAAAAAGGTOI\ACAGPIATTACTOOTCT~T~T~T~~~T~T~.~~T~T~T~TATC ~T~T~TT”.‘~~TTAAeW l eeeee*e*e l l Pro Ala Gly Pro As” Phs Ala Ala Gly Val Ala Gly Se, Leu Asn Lys Lys Val As” Arg Ile Thr Gly Leu rLPL P;o Ala Gly Pro As” Pi-m Ala Ala Gly Val Ala GJy Se, Leu Thr As” Lys Lys Val As” Arg Ile Thr Gly Lsu mLPL * P;o Ala Gly Pro As” Phe hLPL Ala Ala Gly Its Ala &y Se, Lsu Thr As” Lys Lys Val As” Arg Ile Thr kly Lsu Pro Ala Gly Pro As” Phs bLPL Ala Ala Gly Ile Ala Gly Se, Lsu Thr As” L s Lys Val As” Arg Ile Thr Gly Lsu Pro Ala Gly Pro Am Phs LPL Ala Ala Gly Val Ala Gly Se, Arg Thr As” T 6 r Lys Val Se, Arg Ile Thr Gly Lsu Pro AlaGly ProThrPhsGlu cge PL Ala AlaGly Ile AlaGly Se, LsuThrLys LysLysValAsnArg Ile Thr Gly LB”

Th,

. Gl” Tyr Ala Glu Ala Glu Tyr Ala Gl” Ala

Pro Se, Pro SW

169 169

Gl” Tyr Ala Glu Ala Pro Se, Glu Tyr Ala Glu Ala Pro Se, Glu Tyr Ala Glu Ala Thr Se, Tyr AlaAspAla Pro Ile

169 171 169 171

66.7 667 607 621 657 667

rLPL

KY-

bLPL LPL ge c PL rLPL mLPL

hLPL bLPL LPL cge PL

_-__.-...__.. Pro

l eeeeee l eeeeee Arg Leu SW AspAsp Ala AspPhs Val AspVal Arg Lsu Ser ProAspAspAlaAspPhs ValAspVal * Arg Lsu Ser Pro AspAsp Ala AspPhs ValAsp Val Arg Lsu Ser Pro AspAsp Ala AspPhs Val AspVal Arg Lsu SW ProAspAsp Ala Gln PheVal AspVal Arg Lsu SW Pro AspAsp AlaAsp Phs Val AspVal

l l l l l l eeeeeeeeee Lsu His Thr Phe ThrArg Gly SW Pro Gly Arg Ser Ile Gly II8 Lsu His Thr PhsThr Arg G:y Ser Pro Gly Arg Ser Ile Gly Ile * Lsu His Thr Phs Thr Arg Gly SW Pro Gly Arg SW Ile Gly Ile Lsu His Thr Phs Thr Arg Gly SW Pro Gly Arg Ser Ile Gly Ile Lsu His Thr PheThr Arg Gly Ser Pro Gly Arg SW Ile Gly Ile Lsu His Thr Tyr Thr Arg Gly Ser ProAspArg Ser Ile Gly Ile

Gln Lys Pro Val Gly His Gin Lys Pro Val Gly HIS

202 202

Gln Gl” Gln Gln

202 204 202 204

Lys Lys Lys Lys

Pro Pro Pro Pro

Val Val Val Val

Gly Gly Gly Gly

His His His His

766 766 766 720 756 706

,LPL mLPL hLPL bLPL LPL cBe PL rLPL mLPL

235 237 235 237

hLPL bLPL LPL cgP PL

rLPL mLPL hLPL bLPL LPL cge PL ,LPL mLPL

CTAGTGAAGTGCTCCCACGAGCGCTCCATTCATCTCTTCAA CTGGTG9AGTGCTCCCAC~T~T~TCTCTCTTCA CTAGT~GTGCTCCCATGAGCOCTCCATTCATCTCTTCG CTGGTOPIAOT~TCTCATOAACGPITCCATCTCTT~TT~CT~CT~TCTAT~~e~~~T~TA~T~A~~A~G l eeeeee*eeeooeeeee Glu Arg Ser Ile His Lsu Phs Ile Asp Ser Lsu Leu Asn Glu Glu Asn PTo St, Lys A7.s TT,A?gCTsAsn Se, L:s &J Glu Arg Ser Ile His Leu Phs Ile Asp SW Lsu Leu Asn Glu Glu Asn Pro Ser Lys Ala Tyr Arg Cys Asn SW Lys Glu Glu Glu Glu Glu

hLPL bLPL ZK’

rLPL mLPL hLPL bLPL LPL cge PL rLPL mLPL hLPL bLPL LPL cge PL

rLPL mLPL hLPL bLPL LPL cge PL rLPL mLPL hLPL bLPL LPL cge PL

Fig. 2. (continued

A?g Arg Arg Arg

S*r Ser SW Ser

Ile Ile Ile Ile

His His His His

Leu Lsu Lsu Lsu

Phs Phe Phs Phs

II8 Ile Ile Ile

A:p Asp Asp Asp

Ser Ser Ser Ser

Leu Lsu Lsu Lsu

Leu Leu Leu Lsu

Asn Asn Asn Tyr

Glu Glu Glu Glu

Glu Glu Glu Glu

Asn Asn Asn Lys

Pro Pro Pro Pro

~TTTGAGAAAOGGCTCTGCTTGAGTTGTAGAAAGPJ\TG ~TTT~-OGTCTCTG~~T~~~~T~~~T-TA~~T~~~~~~~~~-~TG ~TTTGAAAAAOGGCTCT~T~~T~~M~~T~~~~~TA~~T~A~~~~~~~~-~TG G9ACCGTTGCAACAACTTGTATAAAGTCAACAGAGTGTG GXTTTG4WGajCCTCTGCCTAA~Tl l eeeeeeeeeeeee*e* l l Ala Phe Glu Lys Gly Lsu Cys Lsu Se, Cys Arg Lys As” Arg Cys As” As” Val Gly Tyr Glu Ile Asn Ala Phs Glu Lys Gly Lsu Cys Leu Se, Cys Arg Lys Asn Arg Cys AsnAs” Leu Gly Tyr Gl” Ile Asn Ala Phs Glu Lys Gly Lsu Cys Leu Se, Cys Arg Lys Asn Arg CysAsnAs” Lsu Gly Tyr Glu II8 As” Ala Phs Glu Lys Gly Lsu Cys Lsu SerCys Arg LysAs”ArgCysAsnAsnMeI Gly Tyr Glu Ile As” Ala Phs Glu Lys Gly Lsu Cys Leu Se, Cys Arg LysAs”ArgCysAs”As” Val Gly Tyr Gl” Ile As” Ala Phs Glu Lys Gly Leu Cys Leu Se, Cys Arg LysAs”ArgCysAsnAs” Lsu Gly Tyr Lys Val As”

Gt CAAGATTCA A TTTTCTGGP, TACCTGAA d CTCGCTCT d GATGXCTA &AA AGTATTCkATTACCAA TACCTGPJIGACTCGCTCTC~T~TACAAAGTGTTCC TACCTGPIP~CTCGTTCTCAGATGCCCTACAAAGTCTT~TTA-~~~TT~TTTTTCT-CT~~~~~TA-T~~C TACCTGAA~CTCGTTCTCAG4TGCCTTACAAAGTCTTCCC TACCT09AGACTCCCTCACA~T~TACAAAGTGTTWC TACTTG4AGACCCGT~TC~T~TACAAAGTCTTC l eeeoeeeeeeem T:r L’, L;s T!, A?g SW Gln Met Pro Tyr Lys Val Phs His Tyr Gln Val Lys Ile His P’;, Ser G?y Tyr Leu Lys Thr Arg Ser Gln Mel Pro Tyr Lys Val Phs His Tyr Gln Val Lys Ile His Phs SW Gly Tyr Lsu Lys Thr Arg Ser Gln Met Pro Tyr Lys Val Phe His Tyr Gln Val Lys Ile His Phs SW Gly Tyr Lsu Lys Thr Arg Ser Gln Met Pro Tyr Lys Val Phs His Tyr Gln Val Lys 11s His Phs Ser Gly Tyr Lsu Lys Thr Arg Ser Gln Met Pro Tyr Lys Val Phs His Tyr Gln Val Lys Ile Tyr Phs Ser Gly Tyr Lsu Lys Thr Arg Ala Gln Met Pro Tyr Lys Val Phs His Tyr Gln Val Lys Ile HIS Phs Phs Gly

Ser SW Ser Ser

Lys Lys Lys Mel

Ala Ala Ala Ala

Tt, Tyr Tyr Tyr

Arg Arg Arg Arg

Cys Cys Cys Cys

Ser Asn Asn Asn

Ser Ser Ser Thr

Lys Lys Lys Lys

Glu Glu Glu Glu

665 665 085 619 655 605 266 266 266 270 266 270

964 964 964 916 954 964 Lys Lys Lys Lys Lys Arg

l Val Val Val Val Val Val

l Arg Arg Arg Arg Arg Arg

Ala Ala Ala Ala Ala Thr

ACTGAGAAT d

Thr Thr Thr Thr Thr Lys

Glu Glu Glu Glu Glu Thr

Asn Asn Ser Ser Thr Asn

. Lys Lys Lys Lys Lys Lys

l Arg Arg Arg Arg Arg Arg

Se, Se, Se, Se, Se, As”

CAAGCAAAA

Asp Gly Glu Asn Thr Val

Lys L s T yhr Thr Thr Thr

Gln Gln His Tyr Tyr Lys

Se, Se, Se, SW Se, Thr

l Lys Lys Lys Lys Lys Lys

l Met Met Met Mel Met Met

kwc.acc

AsnAsn His Asn Thr Am Thr Asn Thr Asn Val Asp

C% Gln Gln Gln Gln Gln

301 301 301 z 303

1063 1063 1063 1017 1053 1063 Ala Ala Ala Ala Ala Pro

334 E 336 334 336

on p. 242)

The putative catalytic triads Ser13*, AsP’~~, His241 coded by exons 4, 5 and 6 in human is conserved in rat, with the same aa in the same positions, the codon for Ser13*, however, being AGC in human and AGT in rat. Ser13* occurs in the consensus sequence Gly-Xaa-Ser-Xaa-Gly present in serine proteinases and in human pancreatic lipase. It has

been shown that hydrolysis of tri- and monoacylglycerol by LPL stems from a common active site. The catalytic site and the heparin-binding domain reside on two separate folded domains. Although the heparin-binding domain of LPL has not yet been definitely assigned, this site was shown by peptide cleavage experiments (Olivecrona et al.,

242

1182 1162

1LPL mLPL hLPL bLPL LPL c PL ge rLPL mLPL hLPL bLPL LPL cge PL

rLPL mLPL hLPL bLPL LPL cge PL rLPL mLPL hLPL bLPL ZK’

rLPL I$?blPL LPL cBe PL rLPL mLPL hLPL bLPL splPL CLPL

1::: 1t52 1182 & Phe Phe Phe Phe Phe

Glu Glu Glu Glu Glu Leu

It, Ile Ile I le Ile lie

Sir Ser Ser Ser Ser Ser

t Leu Leu Leu Leu Leu

T-r Tyr Tyr Tyr Tyr Tyr

T:r Thr Thr Thr Thr Thr

Val Val Val Val Val Leu

l Glu Glu Glu Glu Glu Glu

Ala Ala Ala Ala Ala Asp

e**ea**ee**e Ser Giu As” Ser Glu As” Ser Glu As” Ser Glu As” Ser Glu As” Ser Glu Asn

ile Pro Phe Thr Leu Pro Glu Ile Pro Phe Thr Leu Pro Glu Ile Pro Phe Thr Lw Pro GIu Ile Pro Phe Thr Leu Pro Glu Ile Pro Phe Thr Leu Pro Glu Ile Pro Plm Thr Leu Pro Glu

Val Val Val Val Val Val

Ser Ser Ser Ser Ser Ser

Thr Thr Thr Thr Ala Ser

A:” Lys As” Lys As” Lys As” Lys AsnAsn As” Lys

l Thr Thr Thr Thr Thr Thr

Tyr Tyr Tyr Tyr Tyr Phe

l Ser Ser Ser Ser Ser Ser

l Phe Ptw Phe Phe Phe Phe

l Lw Leu Leu Leu Leu Leu

lie Ile I le Leu Ile lie

T:r Tyr Tyr Tyr Tyr Tyr

367 367 367 369 367 369

1281 1261 1281

l eeeee l ee Thr Glu Val Asp I le Gly Glu Leu Leu Met Met L;s t Lys T:p Thr Glu Val Asp 118 Gly Glu Leu Lw Met Met Lys Leu Lys Trp * Thr Glu Val Asp Ile Gly Glu Leu Lea Me( Leu Lys Leu Lys Trp Thr Glu Val Asp Ile Cly Glu Leu Leu Mel Leu Lys Leu Lys Trp Thr Glu Val Asp Ile Gly Glu Leu Leu Met Leu Lys Leu Lys Trp Thr Glu Val Asp lie Gly AspLeu Leu Met Leu Lys Leu Gl” Trp

I

I

I

ATUMCNAW\TCCG4GTG4AACD.%XA~GACTCAGAA ATCGAG9GGATCCGAOTOAAAGCCOGI\OAGACTCAGAAA ATTCAGAAGATCA~OTAAAAGCAGGAGPIGACTCAGAA ATTEYrJ\AGAT~WIG~~~~CT~~~~~TCTTCT~T~ ATCGAAAAG9TCAG9GTAAAAOCAOU\GAGR04CAG4AA ATTCAGAG4GT~GAGTG9AGTCRGOCUIAACTCAG4AA l eeeee . eee Ile Glu Lys Ile Arg Val Lys Ala Gly Glu Thr Gln I le Glu Arg Ile Arg Val Lys Ala Gly Glu Thr Gln Ile Gln Lvs Ile Am Val Lvs Ala Glv Glu Thr Gin Ile Gly c(‘s lie Ari Val L,, Ala Glj, Glu Thr Gln lfe Glu Lys II8 Arg Val Lys Ala Gly Glu Thr Gin I le Gln Arg Val Arg Val Lys Ser Gly Glu Thr Gln

I rLPL “lLPL hLPL bLPL

‘.i?y Gly Gly Gly Gly Gly

I

1::: 1281 Lys Ser Asp Ser T yr Pk S:r T:p Ser Asp Ttp T:p Ser SW Pro Ser P: Val Mei Ser Asp Ser Tyr Phe Ser Trp ProAsp Trp Trp Ser SW Pro Ser Phe Val Lys Ile Ile Glu

SerAsp Ser Ser Asp Ser Thr Glu Ser Lys AspThr

Tyr Tyr Tyr Phe

Phe Phe Phe phe

SW SW Ser Ser

Trp Trp Trp Trp

SerAsp Ser As” Ser Ser Ser Asn

Trp Trp Trp Trp

Trp Trp Trp Trp

Ser Ser Gly Thr

SW Ser Arg Pro

Pro Pro Pro Phe

Gly Gly Thr Ala

Plw Ala PheAsp Phe Thr Phe Thr

1380 1380 1380 1314 1350 1360

Ad, GGTCATCT!CTGTGCCAC~~~T~CT~~CT~~A~~~~T~A ~TGTCTTATCTOCAOA~A~~~~~~

l

Lys Lys Lvs L+ Lys Lys

Lys Lys Lvs L,, Lys Lys

Val Val Val Val lie Vat

Ile Ile Ile lie Vat Val

*

PheCys Ala PheCys PheCvs PheC;s PheCys Phe Cys

Ala Ser Ser Ser Ser

l Arg Arg Ara Ar; Arg Arg

Glu Glu Glu Glu Glu Asp

Lys Lys Lvs Ljrs Lys Gly

I

I

Val Val Val Met Vat Ser

l Ser Ser Ser Ser Ser Ser

His His His Tyr Lys Arg

e Lw Leu Leu L&w Leo Leu

I

Gln Gln Gln Gl” Gin Gly

400 402 400 402

l Lys Lys Lvs L$s Lys Lys

l Gly Gly Glv Glj, Gly Gly

LysAspArg Lys Asp Ser Lvs Ala Pro Lis Sar Pro Lys Glu Ala Glu Glu Ala

I

Ala Ala Ala Vat Pro Ala

433 433 433 435 433 435

I

GTGTTTGTWAATGCCATGKAAGTCTCTG GTGTTTGTGAAATWCAT69CAAGTCTCTG

ZKL rLPL mLPL

l l l l Val Phe Val Lys Cys His Asp Lys Ser Leu Val Phe Val Lys Cys His Asp Lys Ser Lw

hLPL bLPL gpLPL CLPL

Val Ile Val Ile

Fig. 2. Nucleotidc

Val Val Val Val

sequences

guinea pig and chicken aa (although

Pha Pha Phe Ptw

Lys Cys LysCys Lys Cys Lys Cys

Lys Lys Lys Gln

mouse (mLPL), bovine (bLPL), guinea

pig

of chicken

are translated

again in Fig. 3 in order to align all the exon-10 in the six species are indicated

and deduced

The beginning

aa of chicken

*

Ser Lw Asn Lys Lys Ser LeuAsn Arg Lys Ser Lou Asn Lys Lys Pro Val Ser Arg Lys

of rat ML cDNA

sequences.

the C-terminal

His Asp His Asp His Asp LWJ Glu

. l Lys Lys SW GtySlop Lys Lys Set GlyStop

sequences.

(gpLPL)

SW Ser SW Arg

447 447

Slop

Gly GlyStop GlySlop Gly Gly Ala

aa sequence.

448 450 448 Lys Lys Ala Ser Lys Glu Asn SW Ala

The nt sequence of rat LPL cDNA.

exon-10 sequence

from the beginning

1987) to be located in the middle part of the C-terminal region. A candidate region is Ly~~~~-Lys”“” (numbered as in human LPL), that harbours a cluster of positively charged aa. Thus, 5 aa out of 9 (italicised) are positively charged (KVRARRSSK). This sequence, located in exon 6 (human gene), is entirely conserved in terms of aa with, however, four codons differing at the third base in art, mouse, human, bovine and guinea pig (Enerbgck et al., 1987). In chicken, there is only functional homology and no identity, RVRTKRNTK. Cooper et al. (1989) proposed three other candidate regions by analogy with the heparinbinding regions of apolipoproteins E and B. Such a region is found in 279-282 (human numbering), where RKNR is conserved in all six species (in terms of aa not codons). The ten conserved Cys of the previously described species are also conserved in rat (Cys 27, 40, 216, 239, 264,

465

with human, mouse, bovine,

as predicted

mutations

all the C-terminal

and not from the end of exon-9 as for the other 5 cDNAs)

and chicken (cLPL). The aaof the catalytic

by black spots; those involved in missense

Comparison

is written twice, once in this figure, in order to gather

of exon-10

Aligned aa sequences

His Glu SW AlaStop

from LPL cDNA

clones for rat (rLPL),

human

and

(hLPL),

triad (Ser’j2, AsP’~~, His2” ) are boxed. Those aa conserved

in human

are indicated

by asterisks.

275, 278, 283,418,438). The sequences around these Cys are highly conserved. It has been shown for bovine LPL that all Cys were disulfide-bonded (Yang et al., 1989). Two N-glycosylation sites, Asn43 and Asn359, that correspond to the consensus sequence Asn-Xaa-Ser/Thr where Pro is excluded as Xaa and that have been previously shown to be conserved in all five species, are also conserved in rat. At the present time, the LPI, site of interaction with apolipoprotein CII has not yet been identified. Similarly, the site involved in the formation of the LPL homodimer, the active predominant form of LPL bound to endothelial heparan sulfate, remains to be established. Interestingly, the aa involved in the missense mutations already known to be responsible for human LPL deficiency are all conserved in all six species, with two apparent ex-

243 ceptions.

These

following.

In exon 3: Tyr61 (+ Stop), Va169 (+Leu),

aa, that are indicated

in Fig. 2, are the Trp*’

(+Arg),

Lys102 (+insertion + frameshift), Glnlo6 (- Stop); in exon 4: His136 (-+ Arg), Gly14* (4 Glu), G~Y’~~ (4 Ser); in exon 5: Asp156 (+Gly or +Asn), Pro15’ (+Arg), Ala’76 (+Thr), Gly”’ (-+Glu), Ile’94 (+Thr), Asp204 (+Glu), Ile205 (+ Ser), Pro*” (+Leu), Cys216 (+ Ser), Ala221 (+frameshift); in exon 6: Arg243 (-+His), Ser244 (+Thr), Tyr262 (+ Stop); in exon 8: Trp 382 Asp250 (+Asn), (+ Stop); in exon 9: Ser447 (+ Stop). One exception occurs in Ser447, but this exception is more apparent than real, because this mutation has been likened to a polymorphism rather than to a source of metabolic pathology. In fact, the two last aa of LPL are not necessary for LPL activity. Another exception is Glnlo6, but in this case we are dealing with premature termination of the peptide chain rather than with a missense mutation. For references concerning these mutations, see a review by Etienne et al. (1992). (2) CG content CG sequences are known to be rather unevenly distributed in the genome. CGs have been lost in the course of evolution, leaving vertebrates with a remarkable deficiency of this dinucleotide in the different genes (where CG/GC: is approx. 0.1) except in the regions surrounding the promoters of housekeeping genes, the CpG island, where CC/CC is approx. 1. LPL exons 2 to 9 are relatively rich in CG. The ratios CG/GC are 0.41,0.42,0.43,0.47,0.53,0.26 for rat, mouse, human, bovine, guinea pig and chicken, respectively. The role of CG enrichment in these exons cannot be presently explained. (c) Comparison of the sequences of the untranslated exon 10 The 3.18-kb clone alone covers the whole of rat exon 10, while several overlapping clones in mouse and in human were necessary to achieve this. The untranslated exon 10 of rat LPL cDNA is 2019 nt long, i.e., it is longer than the total of the nine other coding exons (1160 nt). This exon is 1948 nt long in human (Wion et al., 1987) and 2353 nt in mouse (Zechner et al., 1991). Gene LPL organization is known for human (Deeb and Peng, 1989; Kirchgessner et al., 1989), mouse (Zechner et al., 1991), and chicken (Cooper et al., 1992). Exon 10 starts with A, the last nt of the stop codon TGA that was interrupted by intron 9. This is the only known example so far of stop codon interruption by an intron (3.1-kb long in human and mouse) or with the last noncoding exon containing only the third nt of the stop codon and overriding in length the total of the coding exons located upstream. Remarkably, in chicken (Cooper et al., 1992) the last codon in exon 9 interrupted by intron 9 is GG/T (Gly)

rather than TG/A (stop codon), as is the case in human and the four other species.

Chicken

exon 10 thus presents

a

short ORF translated into Gly + 14 aa up to the next stop codon. This explains why the chicken aa sequence is longer, not because exon 9 is longer, but because there is no stop codon between exon 9 and exon 10 (Figs. 2 and 3). Zechner et al. (1991) have reported in mouse an insert (relative to man) consisting of (i) a Bl repetitive element (Kalb et al., 1983) (belonging to the Ah family) of 152 nt followed by (ii) a homopurine stretch of 169 nt consisting solely of A and G residues. This insert is flanked just upstream (GAAAATGAGCTTATAA) and downstream (GAAAATGAGCTTGTAA) by a 16-nt direct repeat. (differing only where italicised; see Fig. 3.) Six different mouse strains were tested by Zechner et al. (1991) and the B 1 element was found in exon 10 of all these. The mouse insert was not described by Kirchgessner et al. (1987) because their clone stopped about 250 nt ahead of it. In rat, both the Bl repetitive element and the homopurine stretch are missing. Consequently, only one of the two repeats is present. One copy of this direct repeat is found in human exon 10 and bovine exon 10, which likewise lack the mouse-specific Bl element (Fig. 3). Zechner et al. (1991) have reported in mouse a 195-nt region, located just after the 3’ direct repeat following the insert, that has no homology to human 3’ cDNA. We have also found this region in rat, where it includes a stretch of 157 nt that is highly homologous to the 3’ part of this mouse sequence. However, it is deleted in man. This clearly appears when the sequences of mouse, human and rat are aligned. The 157-nt stretch was compared to known rodent repetitive elements, such as mouse Bl (or Ah type I), also present in rat (Kalb et al., 1983); rat B2 (Bains et al., 1989) (or Ah type 2) (Kalb et al., 1989) and rat newtype (Kalb et al., 1989) both also present in mouse. There is no homology with any of these. Of course, there is no homology with 7 SLRNA either, since this element, which is assumed to be the progenitor of the Bl family, is itself homologous to the Bl element (Quentin, 1989). However, a sequence of this mouse/rat insert can be recognized in other genes: 930CCTGCCTTGGCTTCCTGAGTGCTGGGA956 (Fig. 3). Its percentage of homology (interrogation of the data bank GenBank) is 74% with the murine eosinophil differentiation factor (interleukin 5)-encoding gene, 88 % with the hamster replication-initiation locus, 88 % with mouse pim-1 proto-oncogene, 85% with mouse p-2 microglobulin-encoding gene, 8 1 y0 with human cosmid clone HDAB, 85% with human PRT, 77% with rat heme oxygenase-encoding gene, 74% with R. norvegicus toninencoding gene, 66% with human apolipoprotein E-encoding gene, 88 y0 with HSAG-1 middle repetitive element, 81 y0 with the mouse surfeit locus, 74% with the mouse

244 Exons 10

rLPL mLPL hLPL bLPL LPL cge PL



c&4 cw4AkGCATCTGA G-+TCTTTGudCCGAAGAAA A TGAAGTAAA!TTTATTT AAkAA AATAkC TTGTTT CAAGA GAAGAAA GCATCCOAGTTCTTTGUGCAGUGUAACAMGTAAATTTAATTT AAAAMATAATACCC TTGTTT CAGAACAMG9ACGGCATOT~TTCT~~~T~~~~~MCTTTTAC AAAA CATACCCAGTGTTT CGGAAGAAAGUCAGCATATWTTCTATWAGAATG AAGTAACTTTTAC MM GATGCCCAGTGCTTT CAGAAAW~TTAGTACTWGCTCTGTGAAOAACA AAATWTTTTAC AAAA GATGTTCAGTGCTTT AAAGAAAATTCTGCACACGAGTCTGCT~~AAG ACACT-GG4GGATGGTCTGTTCA

rLPL mLPL hLPL bLPL LPL cge PL

GGG TGTTTkAA GTGGA + TTTCCTGA G+ ATTAATCCCkCTATA TC + TGTTAGTTdAT GGG TGTTTCiUA GTGGGTTTTCCTOAGTATTMTcCCAf3ZTCTA TCTTGTTAGTTAAAC CGGGTGTTTCAAAAGTGGATTTTCCTGAATATTAATCCCATTCA ACi4 TGOT Ci4AATGTGWTTTTCCGGAGTATTAACCCCA~TCTAGCCTTATTAGTTATTT Go0 TGCTTAAAA GTWTTTTCCTWGTATTAATCCCAQZTATCTCCT~TTAGTTC.UC TCCCTGT G9ACTOGATGTTCA09ACCAAATATATATA~TATC

AGAAG4 L GTGTCAAA + ATTAAAA CK&TMCAC~ AOlUIOACAGTCTWAATATTAAACOOTOGCTAACCCCA

rLPL rnLPL hLPL bLPL LPL cge PL

CCTAAT G&G ATAGCA+GTCCTCCA~TCAG4AG9’CAGCAGAGA~GAAGCATCTCTTAT~T~TT~~-TCAT+ ACGTGPI G& GGGTGAGGM TCTAATOOCCC ATAWAaTTCTTCCAGCATCAGU~ CATCAGGCAGGWAAACATOGTCTTGTATCCCTTAAGAAG~XATCATT ATTTATGOGG TATAGTGGCCAAATAGCACATCCTccAAcGrT~~ CAGTGGATCAT WAAAGTOCTGTTTTG TCCTTTGAGAAAWAATAAT ATTTGAGGTG TATAGTGXCAAA TAOCACATCTTCCMCATTAAAAAAA TAACAGAT AT WAAAGCACTGCATTCTGTCTTTT WAAAAATATGAGT AT TAAGGCC TCTAGTGGGTA ACTCCA ATCCTC AGCATTA AA09 TGGTA ATCG CS,AGC CCGTGTT TGTCT TTOAGOGACAAA A TTGTAAGGAAGTCTCAGACAAAAGTTACTAACCATAATTA CATTATCTGATAGTTAAGAACAAAW,AACCcCT(3ZAACAACTTccGAAAG

rLPL mLPL hLPL bLPL LPL cge PL

TGTTCCCAA JXATA CAAGACTCC!TCATGTGA &A TTTGGTdTGGTCT d TTAGTAAGGkCTCTTATT~TCATTAGAT A TCTGAGG TGTTCCCAACAATA TAAGACTCC4TCATGTGACTTTGGTcAT~CTAAAATTAGTAAGAAC TCTC%GG TGTTT04GCOCAOAGTAAMTAAGGCTCCTTCATGTGGCGTCTT TATTTAAAATGATAAAATAATCAGATCTCTTCATGTAGTAT TTTTAATTGGGATTCTQXT TGTKCCA TAGTAATATCC GCTTGAGTGTTAGGAAOTAATAAGTAATTTT~TTAATC~~~TT~~~~TTCTT~T~~T~TCTTTA~~TTT~A~ATTTATT

rLPL rnLPL hLPL bLPL CLPL

GACCTTCTkAAGTTCTC + TGAAGTCT TT AAATTd MTATAkAACAACA T + TTTTTGTGC!GTWTCAdTCCATTTCT + TAGCAGT TT ATATTGA G4CCTTTTCAMGTTTTCTCGTCT AATATAGACAATA TTTTTTGTGGCATGAGTCAGGTCCATTTCTTTAOCGGT CTTTTCCGCGGCA CCAATCAGACTCATCTACACQZAGTATG TC GOACTGPI GGCCTTCTCAAACTTTACTCTAAGTCTCCAAGAATACAGAAATG TCTAG4CTG4TAACCTTCTCAGAGTTTTCTCCG4GTCT AAATATAOOAAGTAAGTTTTTTT@XXGTCi4GTAGGKccGTTTAccTATcAATCAA GAAAAGATGCAAAGCTCTOTACCTMjTCCTCTTTTTAGTGTTTTT

rLPL mLPL hLPL bLPL CLPL

190 192 193 177 161 195

AGGAGACAGTCTCAMTACTAAAA CTAATTCA G~GS~GVITCTCAAATACTAAAAAGTGACTMTTCA TTC





T

T+ TT TT

267 269 260 274 264 294

360 363 366 372 263 394

T& TG

470 452 464 466 494

C!TTTACCAT d GG4TATA&CCCTACCA~TAAAATA AAACAGCTGkCCA TTGTAkTAGTT J TAAATAAAG&CAGUjA~T~GACTT AAACACCTGGCCT TTWAACTAGTTTTTTTTTACCATT~TATATTCCCCCCA CCAAAAAAAAAAAAAAAAAAAAGTAACCAGGAACGTGTGACTTG TGATGTTTTAGAATGA TTCCCTCTTGCTATTGCAATGTCXTCCA~cGTC4 ACCAGGAACATGTAACTTG 2 TCCCTACTTTCTTGGAATTACTCTCCTCTTGGAA ACCAGGAACTAGTGACTTG TGTTGTTCTCCTGGCCTTGACTGAATTTTCAGGCAC;TTTCTCAT

+

567 551 556 545 594

rLPL mLPL hLPL bLPL CLPL

GfJ TTCCTGA &TT @XAAGCA~TGWGA~GGCTCGT Ac&WCAdC CWTACkTCAGTA d GGTACAAAA k TAG4 GCAAAAGCAGTTGAAG4CATGOCTCATGAAGTCCTGACCCTT GGTCCCACCACAAC AAAGTACAAGTC’MCAGAGTACAAMCCTAGA GAGAGGGACGAAGAAAG CTGATAAA CACAGAGGTTTTAAACAGTCCC TACCATTGGCCTGCATCATGC+IAAGTTACAAATTcAAGSAGATA GAG4TAGAAATG4AGAATAGAGTTGATAAAGCACTGAAccTTTAAAC CCCCTCTACGGTTGGT TGCATCATAACTAAGTTACCAATTAAAGGAGATA GGAAAAAAATACCCTTGTGTGTA04GCTATAACAGAGATGTTTAC



rLPL mLPL hLPL bLPL CLPL

CT &I G T A ATTCTTAGTTIGACTTCAA GTTTTATOOC + TAATTCCTC!GTCTTTT AAAAACGT $A: ?:::;:;;;:A CTGAG TAATTCTTAATAGACTTGAA TTTTTATGXTTMTCCTTCTATCTTTTAAATATTT TA AMTCT AGATCAATTAATTCTTAATA~TTTATCGTTT ATTGCTTMTCCCTCTCTCCCCCTTCTTTTTT GTCTCAAMTTATATTATAATA TATAMGTTGWATCAATTAMTCT TTAACAGTTT ATGGTTTAGTATTTCCCCTTCCTTTTCCT~TTTGTCTCAAGATTATATTTTAATA TmGTAGAT-ATOCTTCTTA TATAATCTGTTMGATATGTAATCCTATGCACCTTAT ~AGCAATAGC~~G~~~SU~

rLPL mLPL hLPL bLPL

TTATTCTC + AGACAGAT G-+TGAAA T&TTGTGA TTGTTCTCTOGATAGATGT ATGTTCTCTWGTAGGTGT GTTTTTCCCTAGATAGGCTTTTWACTGTTA

I

I

I

I

I

I

656 640 651 642 694

733 717 746 741 707

I

TGGZAATGGTGGCOCTCACCTTTMTcccA@ZACTTGGZAG3XGAG%AGWGGATTTC

766 616 762 778

rLPL mLPL hLPL bLPL

916

rLPL mLPL hLPL bLPL

1016

rLPL mLPL hLPL bLPL

rLPL rnLPL hLPL bLPL

I

I

I

I

AAGAAA~MGAMGAAAGAAAGAAAGAAA~M~A~AA~~-~AA~A

I

T +

Insertion G&c TGGCTGA!TTTATTTCTATGTTTGCT GAGCTCiKi&AATAAT + TCTTGAGAAAAGGAATACT TTGCTG4AAGACAAAMTOTAGGTTGATTTTTACTTCTCTTTTTT~TTTCTT~A~~~TT~~T~T~~~A~CT~TA~ GTGCAGUAAAAAAAA ACAGAGGWAAAAMT

d



T GGTGACATA TGAAT! TGAGGTGACACA TAAATT CCTCAGCTGACACATAATTTGAATG TCATCAGCTGATACAGAATTTTAACT

+ GTCCCACT &ATcTGA&TwzcAAJ

rLPL mLPL hLPL bLPL

ACTAAACTA + GTACTTCAGbCTTACCTTdACTCTCAA ACTAAACTATGTATTTCAGGCTGGCCTTGAACTCTCAACCTTT

rLPL mLPL hLPL bLPL

TAGAACA&TTCAAT TTCTTAGT d TTTTCACCAd GCC~ATCGTTAG&TTT~TT~GKTCATC!TG~CCO l&T TTCTTACTTGTTTTCATCAATTTGAAAT GCCCAATATCCAATACTTTGTATTTCATTTG~GACTCATCT~CGCCATGC~TCTGTCACACTT~~~~~~A T CCAMTGATTTTCATCAATTTAAAATCATTCAATATCTU\G T CACAATTACATTCA TTGAAGTCCTTTAATGT GATAGTTACTGTTCA TTTTAWCTTATTTCAQXATGCTTcAGTcAG4CTTTOAGATG

Fig. 3 (continued

c&A TCCTGCCT!GGCTTCCTdGT~TG&CTT~TMc&ATAATTTTA OATA G4TA

on

p.

707 1109 606 604

+ TATCAGATT

867 1209 824 620

+

967 1309 650 646

1

1066 1406 949 937

CCGTAATTTTATTATTAGATTC CTGTAATTTTATTATCAGACTT

245)

kidney androgen-regulated protein-encoding gene and 88 % with human cytochrome P450 IIEl-encoding gene. Another interesting feature of the rat sequence that was not pointed out by Zechner et al. (1991) in mouse, but again

clearly appears upon alignment of the three sequences, is a deletion of 8 1 nt (rat)/7 1 nt (mouse) relative to man. This rat deletion corresponds to a fragment spanning 15911671 in human (Fig. 3).

245



rLPL mLPL hLPL bLPL

ad TTTTTTTTTC ICAAGTTTTA!WCAGG~ T>TCWT~cTGCAcT G&MAGT TCACATTAA + TTCTAGTTT 1 GMGTOA AWGTTTTA~XAGGACCATTTTTTTTT TCAAGTTCUATTCTGCACTGT-GT TCACATTAATTTCTAGTTTAaTGTGA TCTCTTTTOTTCCTCrCTTTGAU\TOAAAAGATAGGTTTGTTTT TCTCTTTC TCCCTAACTTTGATA GAAA CACGTATTCAC09GTA CTTTGCATCAT-GTCCTACAAATTTTAGWAAA~TTGTTTTT

1173 1493 1044 1027

rLPL mLPL hLPL bLPL

ACCACGTAG TCG+d TTACTAG&AATGTGTA+AT C&A TGCTTGTk.4CTGTT&-GTGkiWCCTTC+ATTGTGATA~ GM TGCTTGTAAACTOCT GT-GMAAGWXCTCAACTGTAAT~~ ATCACATWG TTGAAA TTACTAGAAAATGTOCATAT GCAGTGCTTGTAAACCATCGCGTGCAATO ACTAAGTUAA~OOAGAGOTTCCTOGGG T(XjPTTCCTAA ACTAAGTAAAGWACA~AAAGTTACAGTCA GTCATATAGTCACTOCCAGTGCTTA AAACCGTTGTG64CXATGGGATcAATT~TATATCCA

1262 1560 1117 1124

rLPL mLPL hLPi_ bLPL

CCATAOACA &r ACCAGGCT CTATAGUAGTACCAGGATTGTTGCcGCTGTTTTGTTTTAcCTT~AA.4AcAAGWAAAAATCMTAATGAACiUT ~~TGGI\GTACCATGA~T~TATTT~T~~~~~ AACAAC ATTGTCATWGTTG

rLPL k? bLPL

CJA TATAAAAT&TAAAAAAdAAAATAAAA CAAGPITCT d TATGTT CA&TTGCTTTTkTATTCAT CGAGPlTCTCACATTTT CAGATTGCTTTTATTATTCATTAATGTAAAAAAATAAAO CTAOATCTCCTATTTTTGWAATGCTCTT CTACGTATAAATATWTGTUG CTAWTCTCATATTTT CAGUTGCTTTT CTATATATTATTATCAAATGTAGAG

rLPL mLPL hLPL bLPL

+ GAACAC ~TACACA ~-~cAAGAGc+TT~~TTC~ATGTCAAA AACAGAAAC 1,GTGUA TGk-TGWTATC CAACCCA CAT~CCCCACAAGTGTAGTCGTCATTCAATGTAA AGCAGAAACTFTGAAA TTTGTGGGTATCTGUCAC AACCCGA CTGTGAAAGTATGTG ATATCTWC4CATACTAGCZTCTGCATGTGTGTTG CA CGTATGAAECCMCATACACG9TTATTGCTCAGCATGGAAA AATCCAAACTGT04ATGTGTGTGGGTGTCTGpJ\CACATATT

rLPL mLPL hLPL bLPL

CSGTCAAGA L TATAC’XT kI TATGTCAG’ GKEAAGCGPTATACTGTATGTTAG TCAAWGATGTATTGGAACATGT~GTA~ TGATTAAGAOGTACATTAGAACACACTGA

rLPL mLPL hLPL bLPL

ATAGGA c+d AGTGKT ATAGG4GUAGOT~CCWTTTCATCA GTAKWXAATGTTGTGATTAA~T~T~~TTGGAAT GTAGAATGAATGCTGTGATTGKATGVXCA~TTGMATC+IATTCTCTCT

rLPL mLPL hLPL

ATTCA T G&I GTATATA C& ATTCA TTACAGTATATACACATCCACATGCA ATTAAATTTCTOGATTTOOGTTGTGACCCAGGGTGCATTAT

rLPL mLPL hLPL

TACATATC + CAATGATGC + TTGACTTTA & TTTTATTT TTAGCTGT TACATATGTWT~TGCTTTAOCTTTTCAATTT ATTTATTAGCTGTAAATA TTATATATATCAAGG9TGTTCTGGCTTTTA~TTTTTACATT

rLPL mLPL hLPL

G+ GTCCACCT ~ATCAGTGAT+GTCT GTGGCTATCkGGG TGTA&TTTGTGGTbCTAACTCT GTGGTTATCTGCAG TATAGATTTGTGGTCCTAACTTTGTGTCCGTCTCCATCCAGTA GTGETATCTGCATTTATAAAAATGTFTajTOCTAACTGTATGr GTCTTTATCAGTGATGGTCT

rLPL mLPL hLPL

~GGAAG4AAATAAAC+C~T~~~-T~TCT~TTT CAATOGAAT GCTTTT MAA~AGUAACTCACCTGTGTGAAGUATWTATCTGCTTT CAATCSAATAGGCTTTT ATCSAATGGGCTTTAACAAAACAAG#,AWAAcGTACTTAACTGTGTC&WAAATCXUTcA~TTT

rLPL mLPL

CGCG TAAC

bT~&TTmTTTAcTATd

window

TATTTATOA TATTTATGAA TAATCAA~GTG4GTMACAACTATTTATAAA TAGTCAAG4GTG4GTGUCATTTATTTATAAA

1194 1166

1431 1752 1279 1272

TATCTCAGA Gd TGTTGCTGd &AA GTTC AAGATC TATCTCAGA%CTATAGCTGGG ATGTCAAATATCTCAG4GATAGCTGGG AAGTCAAATATTTCAG4GATTACTGAA











Deletion



1525 1646 1374 1367

1621 1942 1470 1463

GATGTA c&A TATGTCA TATGTATMTTTGTTAC G4TGTAT~TTTGTTGTTGTCUXTTTATTAATTC GTTGTGTGKTTGTaaaaaaaaa

1712 2034 1570 1539





KGAAAAACTTTG

1731 2063 1670

,&TA ATaOC-mdmA mTTdt,t,AwCT&mTGd 1620 ATGTGTGOGTATGTMWT GCTTGTAAACACT-GTCTGTT 2159 1770

&TGAGCCAA&TC

ACTCTdTCi4A

CACAGAGCCAACTC

ACTCTT

CAGC

A 2256 1921 1652

TC&.WACGTTT~ATCAT CTGKSACATTTTATCATGATAC TTGKAACATTTTATTACCaa

L

2015 2349 1946 2019 2353

of exon-10 in six species. The polyadenylation

signals are boxed. GenBank/EMBL

and that of the five other species was made using NUCALN manually

A 1334 1669

I

TGAACiUT’

'

size = 20; gap penalty = 7. Aligned

was optimized

L

TkTTCC4GAT~TOCTGTkCTACXXcTTkCTAGG4G&TTGGTTGTk!CTATGTAA+ TAGTTCCAGPITATCCTKiMTGTTAGCCCTTCCTAGOI\ATGTAAT TTGTTCCTG4TGTGCXXGCTTCG4cCCTTTCTCTG4Ci4GKNTGkTCGECCTATAAATA ACTCCTTCATCTGAGAGATACGGTTGRXCTGTACAAA GTACiUAATGGTTCCAGT@ZTjTGCCGGI\AC

A

A

T

ACi4AG AWAG



Fig. 3. The nt sequence the rat sequence



for a minimum

sequences

were transferred

rate of divergence,

respecting

(Wilbur and Lipman,

into a WingZ

spreadsheet

the choice order: transitions>

We have also sequenced another independent 2.5kb clone, that was shorter than the 3.18-kb clone at the 5’ end but identical to it at its 3’ end. It provided confirmation of the 157-bp insertion and of the 8 1-bp deletion and was also devoid of the mouse Bl element and of the homopurine stretch insert. The consequences of the presence of these inserts and deletions in the untranslated region of mRNA remain to be elucidated. (d) Particular features of rat cDNA LPL exon 10 (I) Polyadenylation signals Mouse and human LPL cDNA have two polyadenylation signals. It is likely that only one of these is functional in rat, since a single LPL mRNA species (3.6 kb) was isolated from different rat tissues (Kirchgessner et al., 1987; Semenkovich et al., 1989). However, eight AATAAA se-

accession

No. L03294.

Alignment

1983) with the following parameters: (Informix,

USA) and alignment

between

K-tuple size = 3;

of the six sequences

transversions>gaps.

quences (four of which overlap) can be found (Fig. 3). These A+T-rich sequences might be involved in mRNA stability. (2) Richness in A+T exclusive sequences Our attention focused on elements rich in A+T, such as: 22-nt: TAAATTTTATTTAAAAAAAATA (64-85); 1 lnt: AAATATTAAAA (167- 177); 20-nt: AAATAAAATAAATAAATAAA (526-545); 9-nt: ATTTTATTT (816824); 13-nt: ATAATTTTATTAT (968-980); 12-nt: ATTTTTTTTTTA (1163-l 174); 14-nt: TATAAAAAAATAAT (1304-1317); lo-nt: AATTATTTAT (1322-1331); 30-nt: AATATAAAAT(G)TTAAAAAAAAAAAATAAAA (1373-1402); 18-nt: TTTAAATTTTATTTATTA (1756-1773). The role of these A+T-rich motifs is presently unknown. It was suggested that A+T-rich motifs such as ATTTA could play some part in the instability of mRNAs (Caput et al., 1986; Shaw et al., 1986; Wilson and Treisman, 1988;

246 Brawerman 1989) or else could be repressive elements decreasing translation efficiency (Kruys et al., 1987; 1989; Han et al., 1990). They presumably play a role in the secondary or tertiary structure of RNA and might be involved, whether directly or indirectly, in the recognition of mRNA by specific degradation endonucleases. It must be pointed out that the consensus sequence AATAAATAAATAAA,

which has been described

(Reeves

and Magnuson, 1990) in the 3’-untranslated region of a variety of genes involved in cellular stimulation (lymphokines, cytokines and proto-oncogenes), is also found in rat (nt 1692-1705). The half-life of LPL mRNA in different species is poorly documented. In view of the present results, it would be interesting to study the stability of all these mRNA and to investigate

the role of their A+T-rich

Han, J., Brown, T. and Beutler, B.: Endotoxin-responsive trol cachectin/tumor necrosis factor biosynthesis level.: J. Exp. Med. 171 (1990) 465-475. Kalb, V.F., Glasser, tive elements

S., King, D. and Lingrel, J.B.: A cluster of repeti-

within a 700 base pair region in the mouse gcnome.

Nucleic Acids Res. 11 (1983) 2177-2184. Kirchgessner,

T.G.,

sequence

Svenson,

of cDNA

K.L., Lusis, A.J. and Schotz,

encoding

lipoprotein

M.C.: The

lipase. J. Biol. Chem.

262

C., Etienne, J., Guilhot,

S.,

(1987) 8463-8466. Kirchgessncr,

T.G., Chuat, J.C., Heinzmann,

Svenson,

K., Ameis, D., Pilon, C., D’Auriol,

M.C., Galibert,

L., Andalibi,

F. and Lusis, A.: Organization

A., Schotz,

of the human

lipo-

protein lipase gene and evolution of the lipase gene family. Proc. Natl. Acad.

Sci. USA 86 (1989) 9647-9651.

Kruys, V., Wathelet,

M., Poupart,

P., Contreras,

J. and Huez, G.: The 3’ untranslated beta mRNA has an inhibitory

R., Fiers, W., Content,

region of the human interferon-

effect on translation.

Proc. Natl. Acad.

Sci. USA 84 (1987) 6030. Kruys, V., Marinx,

sequences.

sequence con-

at the translational

lational

O., Shaw, G., Deschamps,

blockade

imposed

J. and Huez, G.: Trans-

by cytokines-derived

UA-rich

sequcnccs.

Science 245 (1989) 852-855. Miyamoto,

ACKNOWLEDGEMENTS

M.M.,

Slightom,

tions of humans

We thank Dr. Michel Bouillot (Genset) for his stimulating participation in the construction of the synthetic probe. This work was supported by grants from Institut National de la Sante et de la Recherche Mtdicale (grant 900 203 to J.E.) and from the Centre National de la Recherche Scientifique through UPR41. We thank Beatrice Pelletier for typing the manuscript.

globin region. Olivecrona,

J.L. and Goodman,

and African

rela-

in the $n-

Science 238 (1987) 369-373.

T. and Bcngtsson-Olivecrona,

milk-the model enzyme in lipoprotein J. (Ed.),

M.: Phylogenetic

apes from DNA sequences

Lipoprotein

Lipase.

G.: Lipoprotein lipasc research.

Evener

lipase from

In: Borensztajn,

Publishers,

Chicago,

1987,

pp. 15-28. Quentin,

Y.: Succcssivc

history. Reeves,

waves of fixation of B 1 variants

R. and Magnuson,

pression

in rodent lineage

J. Mol. Erol. 28 (1989) 299-305. N.S.:

of mammalian

Mechanisms

cytokine

regulating

transient

genes and cellular oncogenes.

exProg.

Nucleic Acid. Res. Mol. Biol. 38 (1990) 241-282. Sanger, F., Nicklen,

REFERENCES

terminating Bains, W. and Temple-Smith, K.: Similarity and divergence among rodent repetitive DNA sequences. J. Mol. Evol. 28 (1989) 191-199. Brawerman,

G.: mRNA

decay: finding the right targets.

Caput, D., Beutler, B., Hartog, R., Thayer, Cerami, A.: Identification of a common 3’-untranslated mediators.

region of mRNA

Proc. Natl. Acad.

57 (1989) 9-10.

K., Brown-Shimer, S. and nucleotide sequence in the

molecules

specifying

inflammatory

Sci. USA 83 (1986) 1670-1674.

of mRNA

levels in adipose

and heart. Biochim.

Biophys.

A.: Avian regulation Acta 1008

(1989) 92-101. Cooper, D.A., Lu, S.C., Viswanath, R., Freiman, R.N. and Bensadoun, A.: The structure and complete nucleotide sequence of the avian lipoprotein

lipase gene. Biochim. Biophys. Acta 1129 (1992) 166-171.

Deeb, S.S. and Peng, R.: Structure of the human lipoprotein Biochemistry 28 (1989) 4131-4135.

lipase gene.

lipoprotein

C.F.,

Chen,

Chan, L.: Lipoprotein expression,

S.H.,

Wims, M., Luo, CC.,

Li, W.H.

and

lipase and hepatic lipase mRNA tissue specific

developmental

regulation,

and evolution.

J. Lipid Res. 30

(1989) 423-43 I. Senda, M., Oka, K., Brown, lipase. Proc. Natl. Acad. Shaw, G. and Kamen,

W.V., Qasba,

P.K. and Furuichi,

Y.: Mo-

radation.

lipase of guinea

AU sequence

mRNA mediates

from the 3’ un-

selective mRNA deg-

Cell 46 (1986) 659-667.

Wilbur, W.J. and Lipman,

D.J.: Rapid similarity

and protein data banks. 730. Wilson, T. and Treisman, radation

Sci. USA 84 (1987) 4369-4373.

R.: A conserved

region of GM-CSF

of

Proc. Natl. Acad. R.: Removal

searches

of nucleic acid

Sci. USA 80 (1983) 726-

of poly(A) and consequent

c-fosmRNA facilitated by 3’ AU-rich sequcnccs.

dcgNature

336 (1988) 396-399. Wion, K.L.,

Kirchgessner,

R.M.: Human

Enerback, S., Semb, H., Bengtsson-Olivecrona, G., Carlsson, P., Hermansson, M.L., Olivecrona, T. and Bjursell, G.: Molecular cloning and sequence analysis of cDNA encoding pig. Gene 58 (1987) l-12.

with chain-

Sci. USA 74 (1977) 5463-

5467. Semenkovich,

translated

acid. DNA 2 (1983) 329-335.

Cooper, D.A., Stein, J.C., Strielemen, P.J. and Bensadoun, adipose lipoprotein lipase cDNA sequence and reciprocal

A.R.: DNA sequencing

Proc. Natl. Acad.

lecular cloning and sequence of a cDNA coding for bovine lipoprotein

Cathala, G., Savouret, J.F., Mendez, B., West, B.L., Karin, M., Martial, J.A. and Baxter, J.D.: A method for isolation of intact translationally active ribonucleic

S. and Coulson,

inhibitors.

T.G.,

lipoprotein

Lusis, A.J., Schotz,

lipasc complementary

M.C. and Lawn,

DNA sequence.

Sci-

ence 235 (1987) 1638-1641. Yang,

C.Y., Gu, Z.W.,

Pownall,

H.J.:

Yang,

Structure

H.X.,

of bovine

Rohde,

M.F.,

Gotto

milk lipoprotein

Jr., A. and

lipase. J. Biol.

Chem. 264 (I 989) 16822- 16827.

Etienne, J. and Brault, D.: Les deficits en lipoproteine Clin. 50 (1992) 299-309.

lipase. Ann. Biol.

Graur, D., Hide, W.A. and Li, W.H.: Is the guinea-pig 351 (1991) 649-652.

a rodent?

Nature

Zechner, R., Newman, T.C., Steiner, E. and Breslow, J.L.: The structure of the mouse lipoprotein lipase gene: a B 1 repetitive element is inserted into the 3’ untranslated 62-76.

region of the mRNA.

Genomics

11 (1991)

Sequence of rat lipoprotein lipase-encoding cDNA.

A rat lipoprotein lipase (LPL)-encoding cDNA (LPL) has been entirely sequenced and compared to the sequences of all the LPL cDNAs reported in other sp...
1MB Sizes 0 Downloads 0 Views