Gene, 98 (1991) 209-216

209

Elsevier

GENE

03901

Primary structure of the rat gene encoding an inhibitor of the insulin receptor tyrosine kinase (Recombinant

DNA;

chromosome

mapping;

promoter)

Laurence Falquerho a, Gilles Patey a, Laurent Paquereau a, Vakie Szpirer ‘, G&an Levand and Alphonse Le Cam a

Rossi%, Olivier Lahuna b, Josiane

Szpirer ‘, Claude

” Centre CNRS-INSERM de Pharmacologic-Endocrinologie, Montpellier (France); ’ Unit6 99 de I’INSERM, H6pital Henri Mondor, CrCteil (France) Tel. (33)49813544: ’ DPpartement de Biologie Mokculaire, UniversitC Libre de Bruxelles, Rhode-St-Genkse (Belgium) Tel. (32102 358 35 30; and ‘I Department of Genetics, University of GBteborg, Giiteborg (Sweden) Tel. (46)3185 32 90 Received by J.-P. Lecocq: 19 July 1990 Revised: 21 September 1990 Accepted: 26 October 1990

SUMMARY

The gene (PP63) encoding the inhibitor (PP63) of the insulin receptor tyrosine kinase was isolated from a rat genomic library. The intron/exon organization was deduced from Southern-blot analysis and sequence data (i.e., the exons + the boundaries). The PP63 gene, which maps to chromosome 11, spans approx. 8 kb and contains seven exons separated by six introns of different sizes. All of the boundaries match the consensus GT/AG sequence for donor and acceptor splice sites. Primer extension and Sl mapping experiments were used to locate the transcription start point (tsp) 73 nt upstream from the translational initiator. Both in vitro transcription assays and transcription of a chimeric gene in intact hepatoma cells indicated that the sequence located immediately upstream from the tsp contained a promoter. Several putative &-regulatory elements, including a TATA box and a C/EBP-binding site were found within the 250 bp preceding the tsp.

INTRODLJCTION

A large body of evidence has led to the hypothesis that protein phosphorylation of Tyr residues plays a major role in the control of cell growth and proliferation (Hunter and Cooper, 1985). Many oncogene products and growth factor Correv/~or2c/~,lcp to: Dr. A. Le Cam. Centre CNRS-INSERM cologie-Endocrinologie.

rut de la Cardonille.

Tel. (33)6754428X;

Abbreviations:

aa; amino

pair(s);

CAT,

encoding

CAT;

cDNA,

acid(s);

or 1000 bp;

AHSG,

chloramphenicol DNA

CCAAT/enhancer-binding otide(s);

complementary

protein; MIS,

NF-1, nuclear

HNF-I,

Mtillerian

factor

receptor

NaCI/O.O15 M Na, tion start point(s):

TyK);

human

a2-HS-glycoprotein;

acetyltransferase; hepatocyte

inhibiting

car, gene

to mRNA;

C/EBP,

NF-1; kb, kilo-

substance;

nt, nucle-

1; oligo, oligodeoxyribonucleotide;

open reading frame; PP63, phosphoprotein insulin

(France)

Fax (33)67542432.

bp; base

base(s)

de Pharma-

34094 Montpellier

PP63,

gene

of63 kDa (the inhibitor encoding

PP63;

SSC,

ORF, ofthe 0.15 M

citrate pH 7.6; SV40, simian virus 40; fsp, transcripTyK, tyrosine

kinase.

0378-I 119,‘911$03.50 0 1991 Elsewer Science Publishers B.V. (Biomedical Division)

receptors exhibit a TyK activity which is essential for their biological function (for a review, see Heldin et al., 1987). Although many laboratories have attempted to elucidate the mechanisms that regulate this enzyme activity, little is known, more particularly with respect to the existence and the role of putative natural effecters (i.e., activators and inhibitors). Only two examples of biological molecules which are capable of inhibiting TyK activity have so far been described. The MIS was found to inhibit the epidermal growth factor receptor TyK (Coughlin et al., 1987). PP63, a phosphoprotein secreted by normal rat hepatocytes (Le Cam et al., 1985) was shown to block the insulin receptor TyK and, concomitantly, inhibit the growthstimulating activity of insulin in rat hepatoma cells (Auberger et al., 1989). Gene cloning, which has already been achieved for MIS (Cate et al., 1986) represents an important step towards both the elucidation of the mechanism underlying the regulation of the production of these molecules and the understanding of their mode of action.

210 The aim of the present study was to determine the structure and organization of PP63, to analyze in detail the 5’-flanking sequence, and determine its chromosome location.

RESULTS

AND

INTRONS'

tt’t Bg P

DISCUSSION

(a) Isolation of genomic clones: organization of PP63 In a previous Southern-blot analysis, a PP63 cDNA probe which covered more than 75”, of the cDNA was shown to hybridize to a single, large sequence, (> 20 kb), BumHI rat genomic fragment (Auberger et al., 1989). This suggested that the complete gene might be included in such a fragment. Based on this observation, a rat genomic library, containing size-selected Bun?HI fragments was constructed and screened using a PP63 cDNA probe (Auberger et al., 1989). A clone bearing an insert of approx. 18 kb was isolated, purified and analyzed by restriction enzyme mapping using standard techniques (Maniatis et al., 1982). Restriction fragments, varying from roughly 2.5-0.5 kb and covering the entire sequence of the clone, were isolated and analyzed using the Southern-blot method, with various cDNA probes. These fragments were also subcloned in M 13 vectors and either totally or partially sequenced. All of the exons and the boundaries were sequenced on both strands. Thus, the number of exons and introns in PP63, as well as the exact sizes of the exons and the approximate sizes of the introns were determined. The organization of PP63 is represented in Fig. 1A. The gene spans approx. 8 kb and comprises seven exons and six introns. Comparison of the introniexon boundaries (Fig. 1B) shows that each corresponds to a canonical splice consensus sequence (Breathnach and Chambon, 1981). The exon size displays rather uncommon variability, ranging from 84-687 nt. As found for MIS (Cate et al., 1986), the two extreme exons (1 and 7) are the largest, whereas the central exons are much smaller (Fig. 1). The first exon contains the translation initiation site preceded by a short (73 nt) untranslated sequence, whereas exon 7 contains the terminator UAA (nt 1148-1150 in the cDNA sequence). This delineates an ORF encoding a 358-aa polypeptide, a size smaller (17 aa) than that of PP63 precursor which we originally reported (Auberger et al., 1989). This difference is accounted for by the fact that the first 45 nt of the 5’-end region of the gene (Fig. 5) differ from those reported in the original PP63 cDNA (Auberger et al., 1989). This 45-nt sequence, which is present both in the genomic clone and in a new cDNA clone which we have isolated and sequenced (data not shown), contains the tsp (see section b, below) and therefore belongs to the gene. However, although we have found agenomic clone bearing the 5’-end sequence (i.e., the first 140 nt) which we originally described as part of PP63

I~:\r,ll

A

Bg P

Bg E I

I

0

2

-4

\(’

a

\(i

Fig. I. Organization

of a genomic

library constructed

used to propagate

the library

phages

were screened

cDNA

probe,

performed

(specific

(i.e., for fragments

bearing

deletions,

diagrams, putative

activity

the diagram.

Bg,BgIII;

were generated

boxes arc cxons

cxons are indicated

below the horizontal

alignments

and homology

data bases were performed (C.I.T.I.2,

Paris).

PP63 cDNA

sequence

sequencing;

rcspectivell,

was

(Amers-

et al.. 1977). Ml3 clones, by Dale et al.

system).

and 0 corresponds

In the to the

site\ are noted belo\\

(I-VII).

data

The SIXS of mtrons

brackets.

searches

Computer

sequence

library

of

analysis programs

accession

GT and AG sequences

see A), at the donor

and

analysis

in protein and nucleic acid number

IS M29758. (Lower part) Intron-cxon

of PP63. The highly conserved genomic

(Sanger

as described

using BISANCE

The GenBank

(Pharmacia,

P, Purl; S, Snc,I. Open boxeh are introns

(l-6) and hatched sequences,

PP63

Sequencing

(IBI rapid dclctions

of the main restriction

E.EcoRI;

host

et al., 1982).

>4Oll Ci:mmol)

technique

the scale (in kb) is at the bottom IS/J. The positions

(Maniatis

>0.5 kb), overlapping

(1985), using the ‘Cyclone’ system

into the

LE392. 4 x 10’ recombinant

and sequencing.

ham, UK), usmg the chain-termination various

( IS-20 kb) were

into pUC or M 13 vectors

When required

from the

from a rat genomic

WI). The bacterial

methods

for amplification

\t;

using the B,$II 878.bp

to standard

with [a-“S]dCTP

wah deduced

and inserted

Madison,

was strain

were subcloned

Sweden)

( ( ( (

DNA fragments

by hybridization

according

DNA fragments Uppsala,

I (Promega,

iGEM-

8kb

map of gene PP63. (Upper

(104J”,)

gradient

I

I (i(‘\(;

was isolated

as follows. BnmHl

f

E

E

6

restriction

clone which

t

/

shown in the diagram

on a sucrose

bacteriophage

I

\(‘....................

and partial

part) The gene organization

size-selected

\

PBg

Bg p

I

(1.... \(’

structure

t t f/K.

A P s

of the

boundaries

(determined

and acceptor

bq

splice sites.

arc underlined.

cDNA, the latter did not overlap the genomic clone described above. In addition, this 140-nt sequence does not hybridize to the mRNA species recognized by the rest of the cDNA, suggesting that it might represent a cloning artifact. Besides this discrepancy, two other differences were found between the sequence of the exons determined in this study and that ofthe previously published cDNA: GC substituted for CG at nt positions 901-902 (Arg”” + Ala) and an additional stretch of 26 nt was found at the 3’-end of the seventh exon. Again. these differences were found in the sequence

211 ofthe new cDNA clone. These modifications were therefore taken into account when the sequence was entered into the data bank. Comparison of PP63 cDNA with sequences found in data banks reveals a high degree of identity (approx. 747,) with AHSG cDNA (Chong-Chou et al., 1987). The aa sequences of precursor polypeptides are also identical by of 60”,, ; more importantly, there is a perfect conservation the Cys residues, shown to be crucial in determining the secondary structure of AHSG (Kellerman et al., 1989), throughout the sequences. In addition, PP63 (Le Cam and Le Cam, 1985) and AHSG (Lebreton et al., 1979) are both negative acute-phase reactants secreted by the liver. (b) Determination

of

the

tsp and identification

of

a

functional promoter in PP63

To identify the 5’ boundary ofPP63 mRNA, a [ “‘P]oligo complementary to the 5’ -end region of the coding strand of the cDNA (nt + 77 to + 101; Fig. 5) was hybridized to total hepatocytc mRNA. The primer was then extended with avian myeloblastosis virus reverse transcriptase (Pharmacia, Uppsala, Sweden) and the sizes of the resulting products were determined by gel electrophoresis under denaturing conditions, followed by autoradiography. The major cxtcnsion species has a length of 101 nt; a minor species, which is 2 nt shorter, is also detected (Fig. 2). This suggests that tl-~~nscriptjon may start at a T residue, 73 nt upstream from the start codon AUG. The tsp was confirmed by S 1 nuclease mapping. When a uniformly labelled, single-stranded M 13 anti-sense probe, corresponding to a P.crI-LlgllI (nt -282 to + 246; Fig. 5) fragment ofthe 5’-end region of PI>63 was hybridized to total hepatocyte mRNA and digested with S 1 nuclease, a major protected fragment of approx. 250 nt was detected (Fig. 3, lane 4). This result agrees \?~ll with that of the primer-extension experiment. To determine whether the cloned genomic DNA containing fP63 S’-flanking sequence actively initiates and supports tr~~nscription, we performed an in vitro transcription assay using liver nuclear extracts and, as a template, a plasmid bearing the BglII fragment than encompasses the t.sp (nt -692 to 246; Fig. 5). The transcription products wcrc hybridized to the same probe as the one reported above (S 1 Inapping), digested with nuclease S 1 and analyzed on a polyacrylamide gel under denaturing conditions. A major protected fragment of approx. 270 nt was detected (Fig. 3, lane 3). This fragment is slightly larger than the protected fragment obtained after mRNA hybridization. This difference is accounted for by the presence of the same vector sequence (i.e., a fragment of the polylinker containing the SUCI, f(pflI, Smal and BarnHI restriction sites) in both the M 13-derived probe and the pEMBL-derived template used for transcription. Thus, it seems likely that transcription was initiated at the correct position, in vitro.

Fig. 2. Determination

ofthe tsp. Total hepatocyte

from normal adult rats as described

by Chirgwin

ized with a 2%nt oligo (see Fig. 5). labelled nucleotide

kinase

(Boehringer,

Mannheim.

Primer-extended

products

polyacrylamide)

and sized by comparison

ladder

run alongside

indicated includes

RNA

( 10pg),prepared

et al. (1979). was hybridat the 5’-end F.R.G.)

were run on a sequencing

and

with poly[Y-~‘P]ATP.

gel (8 M urea/6%

with a dideoxy

sequencing

(the first four lanes from the left). The main band,

by an arrow, corresponds

to a lOl-nt extended

fragment.

which

the 25-nt primer.

The ability of the cloned putative PP63 promoter to support transcription in intact cells was also demonstrated using a transient transfection assay. For this purpose we transfected a plasmid bearing theBg/II-BstNI, PP63 5’-end fragment (nt --692 to + 52) inserted in a CAT vector, into a human hepatoma cell line (HepG2, Knowles et al., 1980). Based on CAT assays, a significant promoter activity which reaches about 25 ‘;, of that of the strong SV40 viral promoter, was observed for the chimeric PP63 construct (Fig. 4). (cl Analysis of 5’-flanking putative cis-regulatory

PP63 sequence; identi~cation

of

elements

The sequence of the region (approx. 800 bp) located upstream from the tsp was determined (Fig. 5) and a search for potential cis-acting regulatory elements was carried out.

212

+982

+3-ac +I -ac

~270 4-250 -OR

1

2

Fig. 4. PP63 expression

3

as measured

by the transfection

mids into intact cells. The following chromatography

on QUIAGEN

were used: pEMBLCAT, same plasmid (-692

1234 Fig. 3. In vitro transcription

ofPP63.

Assays were performed

by Gorski et al. (1986), using nuclear from 3-month-old (1 ~g per assay)

(Dente et al., 1983)containing, genomic

proteins

male rats (Dignam was a plasmid

fragment

and

(45 pg per assay) prepared

derived

from

a pEMBLCAT

at

55’C

using a nuciease for

pH 7.4)/N&J stranded Davies

16 h

the PP63 zsp and includes

(1 mM), to a uniformIy

(0.4 M);!EDTA

M 13 probe

scribed

above.

F.R.G.)

was carried

enzyme,

in a medium

products

(50000 cpm per assay) containing

Digestion

with

prepared

the PstI-EstNI Sl

nuclease

labelied

containing

fragment

(Boehringer

by de-

Mannheim, were then

to Fig. 2), and their

sizes were

determined by comparison with a dideoxy sequencing ladder run alongside [the first four lanes on the left). Lanes: 1, migration of undigested probe; 2, probe hybridized the in vitro transcription denatured protected

to 25 pg yeast tRNA; products;

total hepatocyte fragments

RNA.

3, probe hybridized

4, probe hybridized

(QUIAGEN,

Studio City, CA),

devoid of a promoter

(lane 3); the

fragment

a plasmid

of PP63 (putative

bearing

the strong

promoter)

viral

(lane l),

SV40 promoter

plastic

dish)

in Dulbecco

HepG2

serum150 119 gentamycin described

phate precipitate

by Chen and Okayama

was left overnight

[dichloroa~etyi-l-‘~Cl species,

luciferase

2pg

gene driven

(1987). The

DNA-Ca phos-

48 h later (Gorman

et al., 1982), using r>-threo-

(spec.

act.

Saclay,

(ch) and the acetylated were resolved by thin-layer

by autoradiography. efficiency,

fetal calf

B per ml, using the

with the cells, and the yield

g L’Energie Atomique,

chloramphenicol

et al., 1980)

in contact

chloramphenicol

ated; 3-ac: 3-acetylated) detected

cells (Knowles Eagle’s medlum/lO”,,

per ml/l0 pg amphotericin

ofthe CAT was measured (Commissariat

hepatoma

and Vogt-modified

To account

of a plasmid

forms

“C-1abelled

(I-ac: l-ucetyl-

chromatography

for the variations

(pRSVLUC),

by the Rous sarcoma

SO-60 mCi~mmol~

France).

bearing

virus promoter,

and

in transthe firefly were co-

transfected with the test plasmid. Luciferase activity, present in detergent lysed cells, was determined as described by Brasier et al. (1989). The amounts

of cell extracts

containing

were used for the CAT assays.

the same number

of luciferase

units

OR, origin.

to

to 20 big of heat-

The sizes of the probe

(in nt) are indicated

grown

fection

of IO units of

0.25 M NaCI. The products

gel (see legend

single-

as described

genomic

out at 45°C for 1 h, in the presence

run on a sequencing

were

of CAT plasby ion-exchange

(lane 2). used as a reference. Highly purified supercoiled plasmid DNA (6 pg) was transfected into exponentially growing (5-7 x 10’ cells/60 mm

procedure

part of

protection assay. They were first hybridized, formam~de~Tris HCl buffer (20 mM 807,

in

et al. (1986),

vector

in front ofthe CAT, the 335-bpP.TzI-BstNI

which encompasses

to + 52) genomic

diameter

et al., 1983). The DNA template

the first exon (nt -282 to i 52) (see Fig. 5). Transcription analyzed

as reported

columns

a construct

purified

in front of the cat, the 5’-end 744-bp ffglII-EstNl

bearing

pSVZCAT,

plasmids,

and of the

on the right margin.

An oligo that could represent a TATA box is found at the correct position, namely 25-30 nt before the potential tsp. However, this A + T-rich sequence, 5’-AAATAAATT (nt -3 1 to -23), does not closely match those considered as optimal for strong TATA boxes (Efstratiadis et al., 1980). The gene appears to lack a typical CCAAT box, the sequence originally found to bind the nuclear factor CjEBP which was first isolated from rat liver nuclei (Graves et al.,

1986; Johnson et al., 1987). However, the sequence 5’-TTACGCAAT, found at nt positions -59 to -5 1 in the PP63 promoter, contains the pentanucleotide 5’-GCAAT which was recently shown to be optimal for C/EBP-binding sites (Vinson et al., 1989). This sequence also matches the consensus sequence of c®ulatory elements reported to define a new class of liver-specific genes which includes the hemopexin gene (Poli et al., 1989). These genes, like PP63, do not contain a binding site for HNF-1, the nuclear factor shown to confer liver-specific expression to a large group of genes (Courtois et al., 1987). Considering the prominent

213 AGATCTTCAGGGAGGAGGGTGAGGTCTTGAAGGAGGAGGGGTGAGGTC -750

TGCATGGAGGAGGGTGAGGTCTTCAGGGAGGAAGGTGAGGTCTGCATGGA -

-700

GGAAGGTGAGATCTTCAGGGAGGAGGGTAAGTTCTTTAGGGAGGAGGGTG

-650

AGGTCTTCAGGGAATAGGGTGAGGTCTGCAGGGAGGAGGGTGAGGTCTGC

-600 -550 -500

CATTCTTCTGAAGTCTTTGGCATCTCCCATGACTCACTGTATTTTTGCTA

-450

CATGTGTGGACCTTATTCTCCTTCTAGGAGAATATCTTGAAGGAGATCAA

-400

TCAACAACATTAGAAGAGTCAGTAAGAGCATGTTATTGCCCTAGCATGCT

-350

CACATTCAGGGTGCCATGGCATGATCATATTCAGGTAGAGGACCCCCTGT

-300

CCCCACTCCCTGCTGCCACTGCAGCTACGCTCTGTGCAGCTCGTGTCCAT

-250 -200 I--Cl CGGGGATAGATGATGTCCTAACTTATTATTTGCTTTCCCAGAGCTGTTGTTTG

-150

-b-

-50

CAAGGATGATTTGGAACCAGAACAAAAATCAGCCCACGCCTTTACGCAAT -aTCCTTCAGTGGGCTCTGTCAAATMATTGAGCCCTCTGCCCCTCTATCGG

1

TCTAGCTCTCCAAGCTGATTATCTGGGCTGCCCCTGACATTTGTCCATCT

-100

CCCAGGGCCTCTCTGGAGCAGCCATGAAGTCCCTGGTCCCTGGTCCTGCTCCTTTGT __--__---------

i-51

TTTGCTCAGCTCTG~GCTGCC~TCAGCTC~C~GGTGCAGGGCTGGG

+101 +151 +201

TAGCCGTGCACTACCTCAATAAACATCTTCTTCAGGGATTCAGGCAGATC

+251

T

Fig. 5.Sequence ofthe 5’.flanking M 13mp19 and sequenced located

downstream

region ofPP63.

on both strands

(see

Two BgUI genomic

fragments

legend toFig. 1). Numbering

from the rsp are those which differ from the original

( -798 to -692 and -692 to + 246) were subcloned

of nt, noted on the left margin, PP63 cDNA

sequence

(Auberger

starts

into M13mp18

from the putative

and

tsp.The first 45 nt

et al., 1989). The following

putative

DNA

regulatory elements were tentatively identified in the sequence: a, A + T-rich region or TATA box: b, C/EBP-binding site; c, cctamer binding site; d, NF- 1 binding site. The 18-m repeated sequence is underlined with a continuous line. The sequence selected to make the oligo used in the primer extension experiment

is underlined

with a dashed

line. The GenBank

data library

role of C/EBP in regmating gene expression in differentiated cells, in particular hepatocytes and adipocytes (for a review, see McKnight et al., 1989) and the fact that PP63 is specifically expressed in liver (Auberger et al., 1989) this DNA element is likely to be involved in regulating PP63 promoter activity. Besides these elements, an oligo: 5’-TGGCAA, that perfectly matches the potential DNA-binding site for the ubiquitous transcription factor NF-1 (Jones et al., 1987) was also found at nt positions -162 to -157. Furthermore, the ohgo 5’-ATTTGCTT, found at positions -126 to

accession

number

of PP63 5’-flanking

sequence

is M36547.

-119, corresponds, with a single mismatch, to the inverted complementary sequence of the octamer motif 5’-ATGCAAAT which binds both tissue specific (Ott-2A and Ott-2B) and ubiquitous (Ott- 1) transcription factors (Kemler et al., 1989). These DNA elements, which are capable of binding ubiquitous factors, might contribute to the strength of PP63 promoter. Lastly, it is worth noting that the G-rich 18-nt oligo: S’-GGAGGAGG~TGAGGTCT~ was repeated ten times, with a conserved 3-nt spacer, between nt -787 and -581.

214

+++I

+++‘I+

i+il,++,,+

t~++l+++i

+*

+

-+

+ + + + + + /

+++t,

+ i

+

I

fi

+

++

I

+

-+

+

c

i-

i

.cf

‘r,

0

a

Primary structure of the rat gene encoding an inhibitor of the insulin receptor tyrosine kinase.

The gene (PP63) encoding the inhibitor (PP63) of the insulin receptor tyrosine kinase was isolated from a rat genomic library. The intron/exon organiz...
944KB Sizes 0 Downloads 0 Views