Gene, 101 (1991) 97-104 0

1991 Elsevier

GENE

Science

Publishers

97

B.V. 0378-l 119/91/$03.50

03993

Cloning, sequence and chromosomal NCYC396 (Recombinant

DNA;

a-galactosidase;

Hilkka Turakainena, ‘I Department

location of a MEL gene from Sacckzromyces

melibiase;

mRNA;

codon usage; homology;

carlsbevgensis

brewing lager yeast)

Matti Korhola b and Sirpa Aho b

Helsinki (Finland) Tel. (358-O)191 I and b Research Laboratories of the Finnish State (Alko Ltd.), SF-00101 Helsinki (Finland)

of Genetics, University of Helsinki, SF-00100

Alcohol Company

Received by J.K.C. Knowles: Revised: 16 November 1990 Accepted: 5 December 1990

10 August

1990

SUMMARY

Yeast strains producing cc-galactosidase (aGal) are able to use melibiose as a carbon source during growth or fermentation. We cloned a MEL gene from Saccharomyces carlsbergensis NCYC396 through hybridization to the MEL1 gene cloned earlier from Saccharomyces cerevisiae var. uvarum. The aGal encoded by the newly cloned gene was galactose-inducible as is the crGa1 encoded by MELI. A probable GALCprotein recognition sequence was found in the upstream region of the NCYC396 MEL gene, The gene was transcribed to a 1.5-kb mRNA which, according to the nucleotide sequence, encodes a protein of 47 1 amino acids (aa) with an M, of 52 006. The first 18 aa fulfilled the criteria for the signal sequence, but lacked positively charged aa residues, except the initiating methionine. The enzyme activity was found exclusively in the cellular fraction of the cultures. The deduced aa sequence was compared to the aa sequences of other clGal enzymes. It showed 83 y0 identity with the S. cerevisiae enzyme, but only 35 y0 with the plant enzyme, 30% with the human enzyme and 17% with the Escherichia coli enzyme. With pulsed-field electrophoresis, the MEL gene was located on chromosome X of S. carlsbergensis, whereas the S. cerevisiae var. uvarum MEL1 gene is located on chromosome II.

INTRODUCTION

Only some strains of the genus Saccharomyces are able to produce aGa1 and thus completely utilize raflinose as a carbon source (Yarrow, 1984). Normal baker’s yeast is unable to produce this enzyme. Introduction of aMEL gene Correspondence to: Dr. H. Turakainen Laboratories

of the Finnish

Box 350, SF-00101 Tel.(358-0)13311; Abbreviations:

Helsinki

at her present

State Alcohol

aa,

ing aGal;

amino

frame; medium.

S.,

bromide;

(Finland)

acid(s);

aGal,

kb, kilobase

nt, nucleotide(s);

electrophoresis;

Research

(Alko Ltd.), P.O.

Fax(358-0)1332781. a-galactosidase;

5-bromo-4-chloro-3-indolyl-a-D-galactopyranoside; EtdBr, ethidium

address:

Company

oligo,

OFAGE,

bp,

YPD,

aXGal, pair(s);

or 1000 bp; MEL, gene encodorthogonal

oligodeoxyribonucleotide;

Saccharomyces;

base

yeast

extract,

field alternation ORF,

open

peptone,

gel

reading dextrose

coding for crGa1 into a commercial baker’s yeast strain should result in a more complete use of substrate sugars during propagation and thus in a decreased effluent load in the baker’s_yeast-producing factory. S. carlsbergensis strains have traditionally been used to make beer; introduction of the MEL gene from a brewing lager yeast into baker’s yeast would not add anything to the human diet that had not been there before. The MEL1 gene from an S. cerevisiae strain MPY1041 (Post-Beittenmiller et al., 1984) and from a strain of S. cerevisiae var. uvarum ATCC9080 (Ruohola et al., 1986) and the MEL genes from E. coli (Liljestrom and Liljestrom, 1985), plant (Cyamopsis tetragonoloba; Overbeeke et al., 1989), and human (Bishop et al., 1986) have been cloned and sequenced. We have cloned and sequenced a MEL gene from S. carlsbergensis NCYC396, derived from the authentic

98 S. ca~~sbergensjsbottom-fermenting strain No. I described by E.C. Hansen in 1908 (Pedersen, 1986). In a preliminary report (Turakainen et al., 1988), we called the new gene MEL2 because it was the second yeast MEL gene sequenced. However, while this work was proceeding, we

I

2

E

I

E

I

1

B

IO kb

PPB

\ . .

I Bg

I Ho

XH

I

J

‘.

‘.

AAA II HB

I

Ha

&.A 1 Hc

I Ha

H

3.6 kb

3

ma1 location different from any other known MEL gene. Until allelism tests have been performed we call the new

Fig. 2. Restriction

map ofthe IO-kb EcoRI fragment

gene. The restriction

gene NCYC396 MEL according to its strain number in the National Collection of Yeast Cultures (Norwich, U.K.).

from the pBR322

E

II

\ . .

A

II BgHo

X

III

‘--. AA

and called MEL3 and MEL4 (Naumov, 1989). Here, we show that we have cloned another Saccharomyces MEL gene with divergent sequence homology and a chromoso-

I

6

Ii

HB

I I I I I

learned that the MEL gene from an S. cerevisiae strain was also called h4EL2 and another S. cerevisiae strain contained multiple MEL genes two of which were genetically isolated

RESULTS

I 4

I

0

containing

sites were derived from the digestions

clone containing

in section a or from its subclones.

the IO-kb EcoRI fragment Detailed

described

sites were con-

in Fig. 3. A, AIuI; B, BarnHI;

firmed from the nt sequence

shown

BglII; E, EcoRI;

Ha, HaeIII;

H, HindIII;

restriction

the MEL

of the DNA

Bg,

Hc, HiptcII; P, PstI; X, XbaI.

AND DISCUSSION

obtained. Its restriction map is shown in Fig. 2. No sites were observed for AvaI, BclI, BglI, PvuII or SalI. Hybridization of a Southern blot, containing fragments generated by several restriction enzymes, with the MEL1 probe showed that the MEL gene was located mainly in the 2.3-kb EcoRI-BamHI fragment. Also the adjacent 1.3-kb HindIIIBarnHI fragment showed weak hybridization. The lo-kb EcoRI fragment but not the 2.3-kb EcoRIBamHI fragment, when inserted into the E. co&yeast shuttle vector and transformed into yeast, conferred the Mel+ phenotype upon the S. cerevisiae Mel- strain (data not shown). Nutrient agar plates containing the chromogenic substrate clXGa1 (Sigma) were used to differentiate Mel+ from Mel- colonies (Tubb and Liljestrdm, 1986).

(a) Isolation of the MEL gene Genomic DNA was isolated from S. carlsbergensis NCYC396 and digested with EcoRI, Southern blotted, and hybridized using S. cerevisiue var. uva~~ MELI gene as a probe (Fig. 1). A weak signal was detected as a IO-kb band. EcoRI-generated DNA fragments about 10 kb in size were extracted from agarose gel and ligated to the EcoRI-cut pBR322. An E. cd’ transformant containing the pBR322 with a IO-kb insert hybridizing to the MEL1 probe was

1 2 *y-1Okb

(b} Nucleotide sequence of the MEL gene We determined the complete nt sequence of the 2.3-kb EcoRI-BamHI fragment and the 1.3-kb HindIII-BamHI fragment (Fig. 2). An oligo primer was used to confirm the sequence around the Barn HI site. The sequence is presented in Fig. 3. The nt sequence revealed a single ORF of 1413 bp, sufficient to encode a polypeptide of 471 aa, with a deduced M, of 52006. The direction of this ORF was consistent with the direction of the MEL transcript which was determined by using RNA-RNA hybridization and probes generated in vitro from the strands of the MEL gene as template for RNA polymerases. Only one of these clones generated a transcript which hybridized to yeast RNA in a dot-blot hybridization experiment (data not shown).

5 kb-

Fig. 1. Southern

hybridization

0.4 pg) and NCYC396

analysis

of ATCC9080

DNA (lane 2,1.0 pg). Genomic

with EcoRI endonuclease,

electrophoresed

DNA

(lane 1,

DNA was digested

on a 0.5% agarose

gel, trans-

ferred to a nitroceIlulose filter (Southern, 1975), and hybridized with MELI RNA probe (a 2%kb genomic BarnHI-Sal1 fragment in pGEM3 plasmid; Ruohola et al., 1986). The probe was prepared by the in vitro transcription system and labelled with [a-32P]UTP (PB10163, Amersham) (Melton

et al., 1984).

Fig. 3. The nt sequence of the MEL gene and the deduced aa sequence (GenBank accession number M58484). The nt numbering begins from the A in the start codon, ATG. The asterisk indicates the stop codon. The TATA-like sequence preceding the coding region and the consensus sequences for transcription termination are underlined. The putative sequence for GALB-mediated regulation is shown in brackets, DNA fragments were subcloned into pIBI76 plasmid (International [a-?S]thio-dATP

Biotechnologies,

(SJ304, Amersham).

to the first nt in each row.

Inc.) and the nt sequence

The oligo primers

were prepared

was obtained

by the chain-termination

using a DNA synthesizer

(Applied

method of Sanger et al. (1977) with

Biosystems

model 381A). Numbers

refer

AGACTGA~TTAAAGTGTTCA~CCCTATGTTCACTAACAACTAACTAkATCTATATA TTGGTCATCTTAGAGATTACAGTACCTAGAGTACTCGAEAAATATTCATGAC ACCTGAGACAACTAACCAGATCATATCGATATATCAGAAGATTTTCTCACGCCATGCCCG TTTTGATTGAGCACCTTATTTATACTTCCTGTTCACTAAAAAATACCTT~ACTCTCACA AATGAATTATTTEAAATATTGAACCACATCGC~TCAATACAATATC~AGTACACATC TCTTCACAAGGTTTCTCTAGCACGTATAATACGGCTTGGT AATTTTTCTCAAAAATGCTTTTTTCATAGTCTTTTAGTTGCGGTTTAGATCTCGGTATTG CCAAGTACTTCAACAGATCCACCACTGCCCTACTCTTTACCGATGGTGAAGG~CCTCAA TACTGAAACATTGGTTCTGCTGAGAGAATAAAATAAAAT~~ATTGATTAGTATTGGGAAAAT ACTACTCCARGACTATTACATCTGTATGCCTGTTATCGCATAGATCAAAGAGTGGAGTAA TTTGAAAAGTACAGAAAGCGGAAATTGAATTTTTCAAATCATTGAGTTATTTTTTTAGAA CTAGTTAAGAGTGCTGATGACATTGTAACAACATTTTTTCCCAC~ACCTTCCGATTCA TCATATTGCGGGGAATATTATGATATAGTAAGGCAAACX;GGACACGTTTAGATAGGTACC AGATTCTGTTTTAAACGACGCTACTAATCTCTCTTTTATTAGATGAATTACCATCAAAGGAT CGTGGGGTATCACTCAACATACACCGTCTGCTATTCATTTCATTCAATGCACCCTAGCTC CCTACTCCCAACAATGTTAAACCCCTACGAACCCTGACAAGGCGGATCGGAGACGAATAA TTCTGAAACATACTCCAGAGCTCCGAACTGACCCTGACCCTTTTTGAACTCAACACAAAAATTAAC CGCAGTTAAAGCTGATAATGATTATGTGTTTGTTAGGATATCAAAGATCTTGTGCTTAGA CATTCTCGTTAAGGTTTTAGTTGGGACCCCTTAGACCCCTCCCGA~TACGACAGTCCAT TACT~GGCCTAGTGTTGTCC~GACGGGGCAAAATTkATGCCT TCTCTATACTACATGATGATATTATGAACCATAGGATGA~GGTAGCTGATG~GGTT ATTAAAATGATAAGATAGGATGAAGCTAGCTGATGAAG(;TTATAAAAATGGAGATAAGAT TAATGATAAAATTCAAGCGGTATACAGTTGTTGTTTGTTCCTAGTGA~AAA~GTATCA 1 ATGTTTCTTTTATATCTTTTTACCTCTTTCGCCGCTGTGAGCGGTGTTCTCGGTTCGTCA MetPheLeuLeuTyrLeuPheThrSerPheAlaAlaValSerGlyValLeuGlySerSe~ 61 CCGAGTTACAATGGCCTTGGCCTCACCCCTCAAATGGGTTGGGACAATTGGAACACATTT ProSerTyrAsnGlyLeuGlyLeuThrProGlnMetGlyTrpAspAsnTrpAsnThrPhe 121 GCCTGCGACGTCAGTGAGCAATTGCTTTTGGATACCGCTGACCGGATTTCTGAAATAGGA AlaCysAspValSerGluGlnLeuLeuLeuAspThrAlaAspArgIleSerG~uIleGly 181 CTAAAGGATTTGGGTTACACTTATGTTATATTGGATGACTGCTGGTCTAGTGGCAGAACT LeuLysAspLeuGlyTyrThrTyrValIleLeuAspAspCysTrpSerS~rGlyArgThr 241 GCAAATGGTACACTCGTTGCAGACAAGGAAkftGTTTCCCATCATGTGGCT AlaAsnGlyThrLeuValAfaAspLysGluLysPheProAsnGlyMetSerH~sValAla 301 GACCACCTGCATAACAACAATTTTCTCTTTGGCATGTACTCATCTGCTGGTGAGTACACT AspHisLeuHisAsnAsnAsnPheLeuPheGlyMetTyrSsrSerAlaGlyGluTyrThr 361 TGTGCAGGATACCCTGGATCTTTAGGACACGAGGAAGAAGATGCGGAGTTTTTTGCTAGC CysAlaGlyTyrProGlySerLeuGlyHisGluGluGluAspAlaGluFhePheAlaSer 421 AATGGTGTTGATTACTTAAAGTACGACRRCTGTTACARCAhAGGTCAGTTTGGTGCACCA AsnGlyValAspTyrLeuLysTyrAspAsnCysTyrAsnLysGlyGlnPheGlyAlaPro 481 GAAACTTCCTACAAACGTTACAAGGCTATGTCTGATGCTTTGAACAAAACAGGCAGACCT GluThrSerTyrLysArgTyrLysAlaMetSerAspAlaLeuAsnLysThrGlyArgPro 541 ATATTTTATTCTTTGTGTAACTGGGGTC~GATTTGACGCATTATTGGGGGTCTGATATA IlePheTyrSerLeuCysAsnTrpGlyGlnAspLeuThrHisTyrTrpGlySerAspIle

-1390 GAATTCGAGA

TAGGTTTTTTCTTCAGATTAGTTTAAAGGGTACA~TTTATACGTTATGCAAAGCGTTT TGAAATTTTTACCAGCAAGTGACGAGACGACAGTATTTATATCTTTACAGACGCAATCAC GGGTAGCGCCGCATGGTTAAGAGTGCCCGAATTTTATTAAATCAGTCTTAGTATAATGATG GGTCTAAGTTATAATAGACTATGTTGCAAGTGCCAAAATAAATTA ATTTCCCTGCGGTTTTTAAACCATAAAGTATCGTATCGTTTG~TTGTTAGCA~TCTGGCCTAC GTGAGAGTTATCAAACACTATTGGTAGTGTCCGCAAAAAATGATG CTCCCCTAACATCATGGTTGCTTGACGTGAAAGTATGATATGATA~G~GATTCATTGTAAGC GCTAGAATAACTCAATAGCTCTTCCTTGTTATTTCAGCTTTGGGTTTTTTTTTT~GTTG TGCAACAGACCGATCATATTCAATTTTCATTTATTCATATTTAGCTCGAA~AATAAAAT TAATTTCAGGTGATATTAGTAGTTCAACTGCAATGAAGTCA TTTAGTATATATATATATATATATCTATCAATTAGAATAC ACACATGCCCTGGTATGTCCTTGGTTTCTGCTAGTTCCAACTC GATGGCGGATTTTTCGTTATGGAATCGCCTTATTTGTTTCTAGCGGTATTTGCCTT~AA GCT

601 GCAAACTCCTGGAGAATGAGTGGGGATATTTATCCTCAGTTTACTCGTCCTGACAGTAGG AlaAsnSerTrpArgMetSerG~yAspIleTyrProGlnPheThrArgProAspSerArg 661 TGTCCTTGCGATGGTGACCAGTTTGATTGCGCATATGCATATGCTGGTTTTCATTGTTCTATCATG CysProCysAspGlyAspGlnPheAspCysAlaTyrAlaGlyPheHisCysSerIleMet 721 AATATTCTARACAAAGCAGCTCCAATGGGGCAAAATGCGGCGAT AsnIleLeuAsnLysAlaAlaProMetGlyGlnAs~laGlyIleGlyGlyTrpAsnAsp 781 CTGGATAACTTAGAAGTCGGTGTCGGAAACTTGACAGATGATGAAGAGAAAGCGCATTTC LeuAspAsnLeuGluValGlyValGlyAsnLeuThrAspAspGluGluLysAlaHisPhe 841 TCTATGTGGGCAATGGTTAAGTCTCCACTAGTCATCGGCGCTGATGTGAATCACTT~~ SerMetTrpAlaMetValLysSerProLeuVa~I~eGlyAlaAspValAsnHisLeuLys 901 GCGTCTTCATATTCAATTTACAGTCAAGCTTCCGTAATAGCAATCAACCAGGATCCCAAG AlaSerSerTyrSerIleTyrSerGlnAlaSerValIleAlaIleAsnGlnAspProLys 961 GGAGTACCAGCAACAAGAGTTTGGAGACATCAAGTGCCACAAACTGACAAGTATGGTCAA GlyValProAlaThrArgValTrpArgHisGlnValProGlnThrAspLysTyrGlyGln 1021 GGTGAAATTCAATTTTGGAGCGGTCCACTTGATAACGGGTCTCTTA GlyGluIleGlnPheTrpSerGZyProLeuAspAsnGlyAspGlnVa~IleAlaLeuLeu 1081 AATGGGGGAATCAAACCAAGACCAATGAATACGAACTTGGAAGAAATTTTCTTTGACAGC AsnGlyGlyIleLysProArgProMetAsnThrAsnLeuGluGlu~lePhePheAspSer 1141 TACTTAGGTTTCGAGCAATTGTCCTCAAACTGGGATATTTATGACTTATGGGCTAACAGA TyrLeuGlyPheGluGlnLeuSerSerAsnTrpAspIleTyrAspLeuTrpAlaAsnArg 1201 GTTGACAATGCAACATCGGCTAACATTCTGAATAATAACGCTACTATT ValAspAsnAlaThrSerAlaAsnIleLeuAsnAsnAsnSerValG~yAsnAlaThrIle 1261 TACAATGCTACCGCACTATCATACAAGGATGGGATGGCAAAGAATGATACCAGATTATTC TyrAsnAlaThrAlaLeuSerTyrLysAspGlyMetAlaLysAsnAspThrArgLeuPhe 1321 GGTACCAAAATCGGCAGTATTTCTCCAGATGGCTTGCTTAACAC~CTGTTCCAGCACAC GlyThrLysIleGlySerIleSerProAspGlyLeuLeuAsnThrThrValProAlaHis 1381 GGAATTGCTTTCTATAGGTTAAGACGGTCTACTTAATAAGAAGATTTCCATCAACCGGG GlyIleAlaPheTyrArgLeuArgArgSerThr *

100

123

5-10% have extra AUG codon upstream from the known translation start site (Kozak, 1986). The promoter region of the NCYC396 MEL gene lacked the C + T-rich block and the CAAG sequence, found at the tsp in a number of yeast genes encoding abundant mRNAs (Dobson et al., 1982). The entire 5’-untranslated sequence of most S. cerevisiae mRNAs is very rich in A residues (Hamilton et al., 1987). In the MEL1 gene, the noncoding 5’ region preceding the AUG start codon is composed of 20 nt, 9 of them A residues (Liljestrem, 1985). In the NCYC396 MEL gene, A comprised 11 out of 20 nt upstream from the AUG start codon (-1 to -20). These values agree with the average occurrence of 47% A residues within 5’-untrans-

Ukb

lated regions (Cigan and Donahue, 1987). The expression of the MEL1 gene is controlled

Fig. 4. Northern

hybridization

analysis

(lane 3) conditions. same medium,

galactose

4 h. The culture

to one portion.

noninduced isolated

The cells were grown

to McAlister

cells was denatured

on 1% agarose/2%

cellulose

filter and hybridized

RNA

probe

Hind111 fragment

and Finkelstein

with formamide

phoresed MEL

and the cells were grown for

into two portions,

sample was taken before galactose

according

(2.3-kb

in YPD medium with 3 y0

glucose. The culture was diluted in the

(2%) was added,

was divided

in

(lane 2) and glucose-repressed

Cells were grown overnight

glycerol and 2 % ethanol replacing

added

of total RNA from NCYC396

(lane l), galactose-induced

noninduced

and glucose for another addition.

(9%) was 1.5 h. The

Total RNA was

(1980). RNA from 10’

and formaldehyde,

and electro-

formaldehyde gel, transferred to nitrowith a 32P-labeled anti-sense NCYC396

EcoRI-BumHI

fragment

+ 1.3-kb BarnHI-

in pIBI76). The arrows show the migration

ofribosomal

RNA-subunits.

(c) Northern hybridization analysis The size of the mRNA in vivo was estimated from the Northern blot hybridized with the NCYC396 MEL RNA probe. The mRNA was estimated to be 1.5 kb (Fig. 4), which is in good agreement with the size of the presumed protein (52 kDa). mRNA encoded by MEL1 gene has been found to be slightly larger (Post-Beittenmiller et al., 1984; Ruohola et al., 1986). Because both genes code for a protein of 471 aa, mRNA from the NCYC396 gene must contain shorter untranslated regions. Human clGa1 mRNA has been estimated to be 1.45 kb (Bishop et al., 1986). (d) The 5’4anking region The 5’-TATAAA sequence, the proposed transcription initiation signal, was located between nt -80 and -75, which is a typical distance from the start codon. The NCYC396 MEL gene had two out-of-frame ATG-triplets between this TATAAAA sequence and the suggested translation start codon. These two ATG triplets were both in the same frame and were followed by two stop codons TAG and TGA just before the third ATG. Of eukaryotic mRNAs

by GAL4

protein and thus its 5’ region contains a sequence required for the binding of this protein (Liljestrdm, 1985). The consensus GAL4-protein recognition sequence proposed by Giniger et al. (1985) is a 17-bp dyad symmetric sequence 5’-CGGA(G/C)GAC(A/T)GTC(G/C)TCCG. A related sequence, 5’-CGGCCTAGTGTTGTCCG, was found in the 5’-flanking region of the NCYC396 MEL gene (nt -239 to -223). In yeast genes, the -3 position flanking the AUG start codon strongly favours A and avoids pyrimidines (75 y0 A, 13 % G, 8 % C and 3 % U ; Cigan and Donahue, 1987). In NCYC396 MEL mRNA, the -3 position was occupied by U, as is the case in another galactose-inducible gene, GAL10 (Johnston and Davis, 1984). An A+ U transversion at -3 may decrease the efficiency of translation initiation (Zitomer et al., 1984; Werner et al., 1987). In animal cells, R- 3 --f Y transversion has been shown to decrease the mRNA translation efliciency about 20-fold (Kozak, 1986). Incidentally, the aGa1 activity in NCYC396 yeast strain was remarkably lower than the activity in the

TABLE

I

Production strains

of aGa1 in the absence

NCYC396

and presence

of galactose

by the yeast

and ATCC9080 aGa1 activity” Medium -gal

NCYC396

0

ATCC9080

0.02

Cells + gal

-gal

0

+ gal

0.0004

26.8

1.26

2.29

82.3

il The yeast cells were grown for 8 h on YPD medium,

which contained

3% glycerol and 2% ethanol instead of glucose in the absence (-gal) presence ( + gal) of 2% galactose. The enzyme activity was measured the medium and in cell samples as described

by Kew and Douglas

Enzyme activity is expressed in nmol substrate pyranoside) hydrolyzed/min/lO’ cells.

or in

(1976).

(p-nitrophenyl-a-galacto-

101 E.&i lMM-A-KITFI-AGSTIFVKSSPSYNGLGLTPQMGWDNWNTFAC S.url. 1 V___________--------____ s.cf?r.1 AE----Q--P---NS--H-Gplant 1 LD---AR--T---LH-ER-M-NLHumn 1

ILG

D _ -

-FHREA-K--HIALMDVSEQLLLDTA DRISEI __________ ____DL IN-NVVRE--AMVST I--K-FMEM-ELMV--

iDCQEEPDSC' * DPTR-EESHI-VRKLMD - A-ASGKI-CHTQQ--ALEDADFVV-E.coli42 GYTYVILDDCW SSGRTANGT LVADKE KFPNGMSHVA S.car1.42 GLKDL ___-M --K-I-__----__DSD_F ____EQ _---e-C_S.cer.42 --Q-IN-----AELN-DSE-N M-PNAA A--S-IKAL--AAPlant 39 --E-LCI----MAPQ-DSE-R -Q--PQ K--H-IRQl.Humn 49 -W--A

E.&i 86 S.car1.83 s.ccr.83 Plant 81 Hm 91

FQICCYEPCTVTD-EVCKRH-L-Q-I DHLHNNNFLFGMYSSAG -----_S_--______-YV-SKGLKL-V--D-NYV-SKGLKL-I-ADV*

-DTL-PG-IMRALRTIPH EYTC AGYPGSLGHEEEDAEFFA ___~ ________R-m-_-Q__NQ--SKRM--------Q--KT--_F_--F-yyDI_mQTm_ NK--

E.coll130 S.Carl.l22 s.cer.122 Plmt 121 Htmm 1W

LWQICEDMTEV-PDATMLNYVNPMAMNTW--yARYPHIKQVGLCH SNGVDYLKYDNCYNKGQFGAPETSYKRYKAM N-R---------__--___T--I--H----IS"K--PP- w_---------E-LDw--_L__FmG-_ CDSL-NLADG--H-

E.cdi 175 S.carl.1~ S,cer,l& Plant 163 Hmm 176

- VQGTAEE-ARDLNI-P-TLRY-CA-INHMA-YLELE-KTA--St SLCNWGQDLTHYWGSDIA NSWRMSCDIYPQFTRPDSRCPCDG ----------F----G--_~-----VTAE-----_~-----M-E--WEDPQI-AKS-G ----TT--EDNWNPYM-PFQKP YTERYCNHW-NFA-I - -E-

E.wli 2.X s.car1.m s.cer.m PLanL 193 !!.xm 21"

VNLYPELLA--E--QAPKPN-HG-TRCQNIVRYEMFKKL-YFVTE DQFDCAY AG FHCSI MNILNKAAP~GQ~AGIGGWNDLD -Ey--K__ ~~___ ___~--_~~----~-~~----~_Sy--p-----p_ MT-ADSND-W3 c r.1 !'" '! ? S!!K SI !'s I I," 1.: TSF

E.&i 265 s.car1.245 S.m.245 Plant 217 Hmm 246

ssHFAEY-PWFI-PGREDLIERY-v--DEYPKRc-EQ-ANWHK NLEVGVGNLTDDEEK AHFSNWAMVKSPL VIGADVNHL ____~__________ ____~______~~ I---N--NM----N-GM-TE-YR S---I--LA-A-I.V-C-IRAM M-VT-NFG-SWNQQV TQMAL--IMAA-FMSN-LR-J *

E.di 311) s.car1.285 s.cer.285 Plant 257 Hurm 2%

ELEE-KK--R-D-KPSREYASTIMNAIWTGEPSVI--NVRNDGLI SYSIYSQASVIAINQDPKGVPATRVWRHQVPQTDKYGQGEIQFb~S ----______~-____SN-I______-yy_SD---_--~~THELI-N-E----V---KL-QGKK-KSAKALLQDKD--------LKQGY-LR-G-NFEVW-

SDALNKTGRPIFY _____________ GK--LSS-----F -L---R---S-V*

"

i

NDL-

DNPQ-CC-EV-C-VDAN--Q-TKVG-LPSHLAALMQTNI E.&i 3% GPL DNGDQV IALL NGGIKPRPMNT NLEEIFFDSYLG s.car1.m S.cer.3x) --_ -----V___ _mmSVS___mT__----e-N-- -W - RSSS-ATVPlant 297 ---S--KVAASWS-IGfkmn 3x) - - AWAVAM -XRQ EIG-SY-AVAt c E.di 397 s.car1.369 s.cer.369 Plant 329 Hmm 362

* * * MMDPHTAAVL-IDE-SANILNNNSVGNATIYNATALSYKD --GR-KTATGIL----EQ----GE-SAEIDSHACKM-VL-PR-37C YEWTSRLR-HI-P-GTVLLQ-ENTM * *

TLLTEA-LTENRD--YH-A LSSNWDIYDLWANRVDNAT -T-T-----___----_S_A-QGTTV-AR---EHSTQSLV CNPACF-TQ-LPVKRKLGF *

*

KAS ___ DDT SPQ

V-A RPL-

NVFEQ SKK --KGVA

--

VD-

*

E.&i 437 LI-AHGDW-P-WLHR451 s.car1.413 GMAKNDTRLFGTKIGSISPDGLLNTTVPAHGIAFYRLRRST471 s.cer.413 _LS-____---Q----~--~~I-___----__--------~-~~~~ Flant

Q-SLK-

HImma

L-414 *

Fig. 5. Protein

sequence

alignment

of aGal

*

from yeasts

tetrugonoloba; Overbeeke

et al., 1989) human

(Bishop

shown

single-letter

are aligned

by their standard

N-glycosylation

sites in yeast sequences

codes,

(S. cereviriae:

Sumner-Smith

et al., 1986) and E. coli (Liljestrom to maximize

(above the sequences:

identity

et al., 1985; S. corlsbergensb:

this study),

and Liljestrbm,

show the identical

with the NCYC396

1987). Dashes

MEL sequence.

S. carlsbergensis; below: S. cerevisiae).

The asterisks

plant denote

(Cyamopsis aa. The aa, the putative

102 strain ATCC9080

(Table I), which has A _ 3 in the MEL1

mRNA (Liljestrdm, 1985). The highly expressed yeast genes have A - 3 in all 18 cases (reviewed by Hamilton et al., 1987). This biased nt usage is more pronounced in highly expressed genes than in genes in general. Yeast mRNAs

codons were used (not shown). The codon bias index (Bennetzen and Hall, 1982) of the mRNA was 0.23 which is the same as the index of MEL1 (Liljestrbm, 1985). The aa sequence contained nine putative N-glycosylation sites, six of which were conserved compared to MEL1 (Fig. 5).

show a preference for U at positions + 4 (39%) and + 6 (47%) (Cigan and Donahue, 1987). The NCYC396 MEL mRNA also had U at these positions.

To characterize the aGa1 protein as deduced from its gene, the aa sequence was analyzed for hydropathic profile.

(e) The characteristics

similar.

of MEL signal sequences

The signal peptides of yeast secretory proteins contain 17-20 aa, as reported also for ctGa1 (MEL1 product; Sumner-Smith et al., 1985; Ruohola et al., 1986). Although the signal sequence of the MEL1 (ATCC9080) protein and the presumed signal sequence of the NCYC396 MEL protein had only 7 identical aa out of 18 aa, they both fulfill the requirements of a signal sequence: they are very hydrophobic and the last residue is a small neutral aa, Gly, which allows the cleavage by signal peptidase (von Heijne, 1983). At least one positively charged residue is present in yeast signal sequences (von Heijne, 1985). In the deduced signal sequence of the MEL1 protein a Lys residue is present at aa -5. The NCYC396 MEL protein lacked any charged residues at the signal peptide region except the initiator Met which is assumed to provide one positive charge in eukaryotes (von Heijne, 1985). On the other hand, no clGa1 activity was secreted into the culture medium by the strain NCYC396 in conditions where the strain ATCC9080 secreted 25 y0 ofthe enzyme activity into the culture medium (Table I). This supports the importance of the charged residue for the functional signal sequence of yeast secretory proteins. (f) Comparison of MEL genes and their protein products The differences in the restriction maps and in the intensities of the hybridization of MEL1 probe to the two MEL genes suggested that the genes are not identical. The rule of thumb that 10% dissimilarity in the nt sequence reduces the hybridization intensity by 50% (Maniatis et al., 1982) suggested 70-80% identity between the two yeast MEL genes. The comparison of nt sequences revealed 77.1% identity in the protein coding region. Based on the alignment shown in Fig. 5, the yeast MEL proteins exhibit 83.0% sequence identity of mature proteins (as defined by Beckman’s Microgenie program). The degree of similarity reached 87.9% when accepted aa replacements were included (Be-Leu-Val, Ser-Thr, Phe-Tyr, Arg-Lys, Asp-Glu). The homology between the NCYC396 MEL protein and ctGal from other species was lower (17.3% between yeast and E. coli; 29.6% between yeast and human; 35.3% between yeast and plant). A summary of the codons used in the coding region of NCYC396 MEL showed that 59 out of the 61 possible

Comparison of the profiles of NCYC396 MEL protein and MEL1 protein indicated that the two proteins are very Only minor

differences

were observed,

the most

remarkable changes being a hydrophobic peak in NCYC396 MEL protein lacking in MEL1 protein (aa 142 in Fig. 5) and a hydrophobic peak in MEL1 protein lacking in NCYC396 MEL protein (aa 185-190 in Fig. 5). The putative N-glycosylation sites were found in the hydrophilic regions, except Asn404 and Asn436, suggesting that they are located on the surface of the protein and available for addition of sugar side chains. (g) The 3’-flanking region The tripartite consensus sequence 5’-TAG...TAGT. ..TTT has been found in many yeast genes and has been proposed to be important for transcription termination and polyadenylation (Zaret and Sherman, 1982). In the 3’-flanking region of the NCYC396 MEL gene one set

Fig. 6. Pulsed-field NCYC396

gel electrophoresis

of chromosomes

(lanes 1 and 3) and ATCC9080

show the EtdBr-stained

gel, lane 3 shows the corresponding

gram of lane 1 hybridized

with MEL probe

shows the autoradiogram

of lane 2 hybridized

reman

numerals

from the strains

(lanes 2 and 4). Lanes 1 and 2

on the left correspond

autoradio-

from NCYC396 with MELl

to the positions

and lane 4 probe.

The

of the chromo-

somes in the standard yeast strain YNN295 (BioRad). Yeast chromosomes were prepared as described by Carle and Olson (1985) and separated by OFAGE (Carle and Olson, 1984) for 17 h at 280 V and 110 mA at 16°C with a switching interval of 50 s. Gels were stained with EtdBr. DNA was denatured and transferred onto a nitrocellulose filter after a 30-min depurination

step in 0.25 N HCl. Arrowheads,

origin.

103 could be identified

between

nt 21 and 58 after the trans-

lation stop codon TAA. The NCYC396 MEL gene contained only 20 nt between the stop codon and the tripartite sequence, whereas MELI has 110 nt (Liljestrom, 1985). Poly(A) addition usually takes place lo-40 bp downstream from the TAGT-sequence of yeast mRNAs (Zaret and Sherman, 1982). This places the proposed end of the NCYC396 MEL mRNA between nt 1470 and 1500 from the start codon

(Fig. 3).

T.F.: Sequence

ciated with translational

initiator

and structural

regions

features

asso-

in yeast - a review. Gene

59 (1987) l-18. Dobson,

M.J., Tuite, M.F., Roberts,

S.M., Perkins, Conservation

R.E., Conroy,

A.J., Kingsman,

B. and Fothergill,

L.A.:

in Saccharomyces

sequences

Acids Res. 10 (1982) 2625-2637.

E., Vanum,

S.M. and Ptashne,

GAL4, a positive regulatory Hamilton,

N.A., Kingsman,

S.C., Dunbar,

ofhigh efficiency promoter

cerevisiae. Nucleic Giniger,

R., Watanabe,

parison

M.: Specific

C.K. and deBoer,

of the sequence

DNA binding

of

protein of yeast. Cell 40 (1985) 767-774.

context

H.A.: Compilation

around

Succhuromyces cerevisiue mRNAs.

location of the MEL gene

(h) Chromosomal

Cigan, A.M. and Donahue,

the AUG

Nucleic

Acids

and com-

start

codons

Res.

15 (1987)

in

3581-3593.

Pulsed-field gel electrophoresis (OFAGE) was used for the chromosomal assignment of the MEL gene. Fig. 6 displays the chromosomal pattern of strains NCYC396 and ATCC9080 (MEL1 ). The MEL1 probe hybridized to chromosome II from strain ATCC9080 as shown previously by Naumov et al. (1990). When chromosomes from strain NCYC396 were studied the homologous MEL probe hybridized to a faster migrating diffuse band, presumably chromosome X. The chromosome pattern from this strain contained several diffuse bands, presumably due to the presence of multiple homologous/homeologous chromosomes. Strain NCYC396 has been shown to have two bands of chromosome III, and another lager yeast strain contains at least two copies of chromosomes III, V, X, XII and XIII (Casey, 1986).

Johnston,

S.A. and Davis, R.W.: Sequences

GALl-GAL10

that regulate

the divergent

in Saccharomyces cerevisiae. Mol. Cell. Biol.

promoter

4 (1984) 1440-1448. Kew, O.M. and Douglas,

H.C.: Genetic

Kozak, M.: Point mutations codon

co-regulation

of galactose

in Saccharomyces. J. Bacterial.

melibiose utilization that modulates

define a sequence translation

and

125 (1976) 33-41.

flanking the AUG initiator

by eukaryotic

ribosomes.

Cell 44

(1986) 283-292. Liljestrom,

P.L.: The nucleotide

Nucleic

sequence

MELZ gene.

of the yeast

Acids Res. 13 (1985) 7257-7268.

Liljestrom,

P.L. and Liljestrbm,

P.: Nucleotide

sequence ofthe met54gene,

in Escherichia coli K-12. Nucleic Acids

coding for alpha-galactosidase Res. 15 (1987) 2213-2220. Maniatis,

T., Fritsch,

Laboratory Harbor,

E.F. and

Manual.

Sambrook,

Cold Spring

J.: Molecular

Harbor

Cloning.

Laboratory,

A

Cold Spring

NY, 1982.

McAlister,

L. and Finkelstein,

D.B.: Alterations

in translatable

ribo-

nucleic acid after heat shock of Succharomyces cerevisiae. J. Bacterial. 143 (1980) 603-612.

(i) Conclusions

Melton,

In the present study, we have cloned another yeast MEL gene with 77.1 y0 nt sequence homology to the MEL1 gene cloned earlier. These two genes encoding functionally similar enzymes with distinct nt sequence and different secretion properties will allow us to study (1) the promoter efficiencies and the regulation of transcription; (2) the structure vs. function of the gene by sequence comparison and enzyme activity and substrate specificity assays; (3) the physiological significance of the differences in the secretion of these two MEL gene products.

D.A., Krieg, P.A., Rebagliati,

tion probes from plasmids Nucleic Naumov,

G.I.: Identification

selection

in yeast. J. Biol. Chem.

Bishop, D.F., Calhoun, M. and Desnick,

D.H., Bernstein,

R.J.: Human

of a cDNA clone encoding

H.S., Hantzopoulos,

a-galactosidase

the mature

P., Quinn,

A: nucleotide

sequence

enzyme. Proc. Natl. Acad. Sci.

M.V.: Separation

cules from yeast by orthogonal

of chromosomal

field alternation

DNA mole-

gel electrophoresis.

polymeric

genes

(in Russian)

304

E., Aho, S. and Korhola,

M.:

(1989) 1475-1477. Naumov,

G.I., Turakainen,

A new family

H., Naumova,

of polymorphic

a-galactosidase

genes

genes MELI-MEL7.

in Saccharomyces cerevisiae: Mol. Gen. Genet.

224 (1990)

119-128. Overbeeke,

N., Fellinger,

A.J., Toonen,

Verrips, CT.: Cloning and nucleotide from

M.Y., Van Wassenaar, sequence

Cyumopsis retragonoloba (guar).

Proc. Natl. Acad.

M.V.: An electrophoretic

karyotype

for yeast.

D. and

ofthe a-galactosidase Plant

Mol. Biol.

13

Homoeologous

polymorphisms

in the genus Succharo-

chromosomes

III

and

of Succhuromyces Res. Commun.

51

(1986) 185-202. Post-Beittenmiller,

M.A., Hamilton,

of basal and induced

genetic

Saccharomyces carlsbergensk

analysis

Carlsberg

Ruohola,

R.W. and Hopper,

levels of the MELZ transcript

H., Liljestrbm,

Kalkkinen,

of chromosomes

Res. Commun.

Sanger,

P.L., Torkkeli,

N. and Korhola,

F., Nicklen,

terminating

Sci. USA 82 (1985) 3756-3760.

Molecular

M.B.: DNA sequence

myces, IV.

yeast MELI gene. FEMS

Acids Res. 12 (1984) 5647-5664.

G.F. and Olson,

343-362.

Nauk

J.E.: Regulation in Saccharomyces

cerevisiue. Mol. Cell. Biol. 4 (1984) 1238-1245.

USA 83 (1986) 4859-4863. Carle, G.F. and Olson,

G.P.:

SP6 promoter.

fermentation Akad.

bayanus, S. carlsbergensis and S. uvarum. Carlsberg

257 (1982) 3026-3031.

Casey,

of melibiose

in Saccharomyces cerevisiue. Dokl.

Pedersen,

J.L. and Hall, B.D.: Codon

Nucleic

a bacteriophage

(1989) 541-550.

REFERENCES

Carle,

containing

T., Zinn, K. and

of RNA and RNA hybridiza-

Acids Res. 12 (1984) 7035-7056.

cDNA

Bermetzen,

M.R., Maniatis,

Green, M.R.: Efficient in vitro synthesis

X in

5 1 (1986)

Microbial.

S. and Cot&on

inhibitors.

T., Kopu,

M.: Expression

Proc.

H., Lehtinen,

and regulation

Lett. 34 (1986) 179-185.

A.R.: DNA sequencing Natl.

P.,

of the

Acad.

Sci.

USA

with chain74 (1977)

5463-5467. Southern,

E.: Detection

separated

of specific

by gel electrophoresis.

sequences

among

DNA fragments

J. Mol. Biol. 98 (1975) 503-517.

104 Sumner-Smith, J.E.:

M., Bozzato,

Analysis

carlsbergensis (melibiase).

and

inducible

its

MEL1

secreted

gene

product,

of Saccharomyces

strains

method which differen-

of yeast.

J. inst.

Brew.

92

(1986) 588-590. Turakainen,

H.,

S.

and

Korhola,

M.:

MEL2

gene

from

Saccharomyces cerevisiae var. carlsbergensis. Yeast 4 (1988) S370. von Heijne,

G.: Patterns

of amino acids near signal-sequence

sites. Eur. J. Biochem. 133 (1983) 17-21. von Heijne, G.: Signal sequences. The limits ofvariation. (1985) 99-105.

cleavage

N.J.W.

Amsterdam,

Zitomer,

of its

termination

R.S.,

The

Yeast,

a Taxonomic

Study,

Elsevier,

1984, pp. 379-395.

Wathall,

F.: DNA sequence

required

for efficient tran-

in yeast. Cell 28 (1982) 563-573. D.A.,

Rymond,

Saccharomyces cerevisiae ribosomes codons.

J. Mol. Biol. 184

repression

22. Saccharomyces Meyen ex Rees. In: Kreger-Van

(Ed.),

KS. and Sherman,

scription

A.: The leader peptide

for the translational

Cell 49 (1987) 805-813.

D.: Genus

Rij, Zaret,

Aho,

F. and Pierard,

of yeast gene CPA1 is essential Yarrow,

P.L.: A colony-colour

a-galactosidase-positive

Werner, M., Feller, A., Messenguy, expression.

alpha-galactosidase

Gene 36 (1985) 333-340.

Tubb, R.S. and Liljestrom, tiates

R.P., Skipper, N., Davies R.W. and Hopper,

of the

B.C. and recognize

Mol. Cell. Biol. 4 (1984) 1191-l 197.

Hollenberg,

non-AUG

C.P.:

initiation

Cloning, sequence and chromosomal location of a MEL gene from Saccharomyces carlsbergensis NCYC396.

Yeast strains producing alpha-galactosidase (alpha Gal) are able to use melibiose as a carbon source during growth or fermentation. We cloned a MEL ge...
858KB Sizes 0 Downloads 0 Views