GENOMICS
6, 333-340
(1990)
cDNA Isolation, Expression Analysis, and Chromosomal Localization of Two Human Zinc Finger Genes LUIGI LANIA,’ EMILIO DONTI,* ANTONIO PANNUTI, ANNA PASCUCCI,GINA PENGUE,ISIDOROFELICIELLO, GIROLAMA LA MANTIA, LUISA LANFRANCONE,*AND PIER-GIUSEPPE PELICCI* Department
of Genetics, General and Molecular Biology, University of Naples, via Mezzocannone 8, 80124 Naples, Italy; *Institute of Medical Clinics I, University of Perugia, 06100 Perugia, Italy Received
July 14, 1989;
revised
Press,
Inc.
INTRODUCTION
Differential regulation of gene expression is achieved at least in part by the specific binding of trans-acting factors and relevant &-acting DNA sequences. The best characterized DNA-binding structure is the evolutionarily conserved helix-turn-helix motif (reviewed in Pabo and Sauer, 1984; Gehring, 1987). A second type of DNA-binding domain is the zinc finger sequence motif that has been identified in a variety of transcriptional regulatory proteins and is thought to be directly involved in DNA binding (Klug and Rhodes, 1987; Evans and Hollenberg, 1988). This DNA-binding structure was initially identified as nine tandemly repeated sequences of approximately 30 amino acid residues in the RNA polymerase III transcription factor TFIIIA (Miller et al., 1985; Brown et al., 1985). It has been postulated that two Cys and two His residues in each repeat bind a Zinc(B) ion and that the resulting metalloprotein structures are involved in the sequenceSequence data from this article have been deposited with EMBL/GenBank Data Libraries under Accession No. 504751. 1 To whom correspondence should be addressed.
9, 1989
specific binding to DNA. Similar motifs homologous to the fingers in TFIIIA have been subsequently identified in other genes, including Drosophila genes (Rosenberg et al., 1986), in yeast regulatory proteins (Hartshorne et al., 1986), in frog (Ruiz i Altaba et al., 1987), in mouse (Chowdhury et al., 1987; Chavrier et al., 1988), and in human (Kadonaga et al., 1987; Page et al., 1987). We (Pannuti et al., 1988) and others (Ruppert et al., 1988) have utilized a finger-motif-containing probe to isolate human genes encoding putative DNA-binding finger domains. Here we report the isolation, characterization, and chromosomal location of two members of this human gene family, ZNF7 and ZNF8. Transcriptional studies indicate that these two human finger genes are ubiquitously expressed in many human cell lines. Moreover, upon in vitro induced terminal differentiation of human HL-60 myeloid cell line, their expression is drastically reduced.
On the basis of sequence similarity in the repeated zinc finger domain, we have identilled and characterized two human cDNA clones (ZNF7 and ZNFS), both encoding proteins containing potential finger-like nucleic acid binding motifs. Northern blot analysis indicates that both genes are expressed as multiple transcripts and they are ubiquitously present in many human cell lines of different embryological derivation. Moreover, their expression is modulated during in vitro induced terminal differentiation of human myeloid cell line I-IL-60. By in situ hybridization experiments, we have localized the ZNF7 gene to chromosome 8 (region q24) and the ZNFS gene to the terminal band of the long arm of chromosome 20 (2Oq13). o 1990 Academic
October
and
MATERIALS
AND
METHODS
Library Screening and DNA Sequencing A hgtll human placenta cDNA library (Clontech Laboratories Inc.) was screened with a 503-bp Hi&IIBamHI DNA fragment from an HF.10 cDNA clone (Pannuti et al., 1988). The ZNF nomenclature for human zinc finger genes described in this work was suggested by the Human Gene Mapping Workshop Nomenclature Committee. Approximately 5 X lo5 phages were plated out and after the plaques were transferred to nitrocellulose filters, the phage DNA was denatured and immobilized according to standard procedures. Hybridization to the DNA probe was carried out at 65°C in 5X SSPE, 5X Denhardt’s solution, 0.2% SDS, and 100 pg/ml of denatured salmon sperm DNA for 18 h. The DNA probe was labeled with a multiprime labeling kit (Amersham) to a specific activity of 5 X 10’ cpm/pg. The filters were washed at 37°C in 4X SSPE, 0.2% SDS for 2 h and in 2X SSPE, 0.2% SDS for 1 h
the
333 All
Copyright 0 1990 rights of reproduction
OSeiS-7543/90 $3.00 by Academic Press, Inc. in my form reserved.
334
LANIA
at 37“C. The filters were dried and exposed to Kodak X-Omat films at -70°C. The positive hybridizing plaques were isolated and purified. Inserts from positive clones were subcloned in pGEM-3 plasmid (Promega Biotech) for restriction enzyme mapping, hybridization analysis, and DNA sequencing. DNA sequencing was performed on both strands, and all DNA sequences were determined independently by two different methods using the GemSeq K/RT system as recommended by Promega Biotech. Nucleotide and amino acid sequences were analyzed using the MicroGenie Sequence Analysis Program software (Beckman).
ET AL.
hardt’s, 0.066 M phosphate buffer, pH 6.5, and 200 pg/ ml salmon sperm DNA. Slides were covered with a coverslip and incubated in a humidified chamber at 42’C for 16 h. The slides were washed repeatedly in 2X SSC at 45°C and in 0.2X SSC at room temperature, coated with autoradiographic emulsion NTB-2 (Kodak, Rochester, NY) diluted 1:l in tap water, and exposed for 15 days at 4°C. After development, slides were stained with Wright-Giemsa solution and destained two or three times to induce G-bands (Chandler and Yunis, 1978). RESULTS
Cell Cultures The following human cell lines were used: HL-60 (acute promyelocytic leukemia); NTERA2/Dl (embryonal carcinoma) (Andrews et al., 1984), MOLT-4 (T-cell lymphoma), WI-38 (embryonic lung cells), HeLa (cervical carcinoma), H5 (EBV-infected B lymphocytes), and CLL220.1 (colon carcinoma). Cell lines were cultured as previously described. HL-60-induced terminal differentiation was performed as described (Pannuti et al., 1988). Briefly, HL-60 cells were induced to differentiate to granulocytes by the addition of 1.25% DMSO to the culture medium (RPM1 1640 with 10% fetal calf serum). RNA Isolation and Northern Analysis Total RNA was extracted from the various cell lines by the guanidine isothiocyanate method (Chirgwin et al., 1979). Aliquots of 15 pg were electrophoresed on 1% agarose-formaldehyde gels and transferred to nitrocellulose filters. The blots were hybridized with the appropriate finger fragment in buffer containing 50% formamide, 5X SSPE, 1X Denhardt’s, 0.1% SDS, and 100 pg/ml salmon sperm DNA. The final washing was at 60°C in 0.2X SSC and 0.2% SDS. The entire ZNF8 cDNA insert and the large EcoRI fragment of the cZNF7 were used (see Figs. 1 and 2). The filters were autoradiographed at -70°C with an intensifying screen for 1 week. In Situ Hybridization Probe hybridization to metaphase chromosomes was performed according to method of Gerhard et cd. (1981), slightly modified. Briefly, chromosome spreads were obtained from PHA-stimulated peripheral blood lymphocytes (PBL), pretreated with RNase, and denatured in 70% formamide/2X SSC for 2 min at 7O’C. The appropriate finger probes were labeled by nick-translation using [3H]dCTP. The probe was denatured by boiling for 3-4 min and used at a concentration of SO100 rig/ml in a hybridization buffer containing 50% formamide, 5% dextran sulfate, 2X SSC, 2~ Den-
Isolation and Nucleotide Sequences of Two Human Finger cDNA Clones On the basis of cross-hybridization under nonstringent conditions to a probe corresponding to the Kruppel finger region, we have recently isolated two members of the human zinc finger family (Pannuti et al., 1988). Southern blotting analysis of human genomic DNA, carried out at low stringency with a probe consisting of a 503-bp HindIII/BamHI fragment containing the finger region of the HF.10 cDNA, revealed the presence of several related DNA sequences (data not shown). We used the 503-bp HindIII/BamHI fragment from the finger region of the HF.10 cDNA clone to screen a Xgtll cDNA library derived from human placenta. Several cDNA clones containing multiple copies of the zinc finger motif were identified. Two cDNA clones (cZNF7 and cZNF8) containing the longest inserts were further analyzed. The complete nucleotide and predicted amino acid sequencesof ZNF7 and ZNF8 cDNA clones are shown in Fig. 1 and Fig. 2, respectively. Translation of the longest open reading frame in both cDNA clones yields multifingered proteins containing 15 and 7 finger repeats, respectively. The cZNF7 clone is 2356 nucleotides long and contains a long open reading frame that begins at position 51 and extends to an in-frame TAG termination codon at position 2290; a putative ATG initiation codon (Kozak, 1987) is located at nucleotide 232. The open reading frame starting at the ATG at position 232 would code for a polypeptide of 687 amino acids with a calculated relative molecular mass of 77,894. The ZNF7-encoded protein contains 15 finger domains that comprise nearly 70% of the protein, and the 15 fingers are grouped into three sets of 6, 7, and 2 finger repeats, respectively. The nucleotide and amino acid sequences of cZNF8 are shown in Fig. 2. The longest open reading frame starts at the beginning of the cDNA clone and terminates at position 1623 with the TAG termination codon. The termination codon is followed by a 3’ untranslated region of 300 nucleotides, which lacks a polyadenylation signal and poly(A) tail. Seven zinc
GCGCGGCGGCGGACCTCGGGTTGCCCTC
28
GGTCCGAGTGATCCCTGGTCGCTTCCTTAGCCCTCCCGCCTTCGGCATTGGGGTCCCCGCGTCCCCCGGGCCTCCAGGCGGGAAAGCGCGGGGGCTTTGCGGGGC ***
133
CTTGAGCGCCTGGTGTGGGAGGTGGTCGAGCCCAGCCACCCTCCCCCGCGGCGGCGCGAGGTCTCTCGGCCAGAACACGTGGATGCCCACCCACCACTGAGCCTC
238 343
ATGGAGGTGGTAACATTTGGCGATGTGGCTGTGCACTTCTCTCGGGAGGAGTGGCAGTGTCTGGACCCTGGCCAGAGGGCCCTCTACAGGGAAGTGATGCTGGAG 1 MetGluValValThrPheGlyAspValAlaValHisPheSerArgGluGluTrpGlnCysLeuAspProGlyGlnArgAlaLeuTyrArgGluValMe~LeuGlu AACCACAGCAGTGTGGCTGGACTAGCAGGATTCCTGGTTTTCAAGCCTGAGCTGATCTCTCGGCTGGAGCAGGGAGAAGAGCCATGGGTCCTCGACCTGCAGGGA 36 AsnHisSerSerValAlaGlyLeuAlaGlyPheLeuValPheLysProGluLeuIleSerArgLeuGluGlnGlyGluGluProTrpValLeuAspLeuGlnGly
448
GCAGAGGGGACAGAGGCACCAAGGACCTCCAAGACAGATTCTACGATTAGGACTGAAAATGAGCAGGCCTGTGAGGACATGGACATCCTAAAATCAGAATCCTAT 71 AlaGluGlyThrGluAlaProArgThrSerLysThrAspSerThrIleArgThrGluAsnGluGlnAlaCysGluAspMe~AsplleLeuLysSerGluSerTyr
553
GGGACAGTGGTCAGAATCTCCCCACAGGACTTTCCTCAGAATCCTGGCTTTGGAGACGTTTCTGATTCTGAGGTCTGGTTAGACAGTCATCTGGGCAGTCCCGGG GlyThrValValArgIleSerProGlnAspPheProGlnAsnProGlyPheGlyAspValSerAspSerGluValTrpLeuAspSerHisLeuGlySerProGly
658
106
CTGAAAGTGACAGGCTTTACCTTCCAAAATAACTGTTTGAATGAGGAGACTGTGGTTCCCAAGACCTTCACCAAGGACGCACCCCAGGGATGTAAGGAGCTGGGA LeuLysValThrGlyPheThrPheGlnAsnAsnCysLeuAsnGluGluThrValValProLysThrPheThrLysAspAlaProGlnGlyCysLysGluLeuGly
763
141
AGCAGCGGCCTGGATTGTCAGCCTCTTGAAAGTCAGGGAGAGAGTGCGGAAGGGATGTCCCAGAGATGCGAGGAGTGTGGCAAAGGCATCAGAGCCACTTCAGAT SerSerGlyLeuAspCysGlnProLeuGluSerGlnGlyGluSerAlaGluGlyMe~SerGlnArgCysGluGluCysGlyLysGlyIleArgAlaThrSerAsp
868
176
ATCGCTCTGCATTGGGAAATTAATACACAGAAAATTAGCAGATGTCAAGAATGCCAAAAAAAGTTATCTGACTGCTTGCAGGGGAAACATACAAATAACTGCCAT lleAlaLeuHisTrpGlulleAsnThrGlnLyslleSerArgCysGlnGluCysGlnLysLysLeuSerAspCysLeuGlnGlyLysHisThrAsnAsnCysHis (1) GGAGAGAAGCCGTACGAATGTGCAGAGTGTGGGAAAGTCTTCAGGCTCTGCTCGCAGCTTAATCAGCATCAGAGAATCCACACGGGAGAGAAACCCTTTAAATGC GlyGluLysProTyrG~uCysAlaGluCysGlyLysValPheArgLeuCysSerGlnLeuAsnGlnHisGlnArglleHisThrGlyGluLysproPheLys~
973
211
246
1078
(2)
(31 . I
316
ACTGAGTGTGGAAAAGCCTTCCGCCTGAGCTCAAAACTTATTCAGCATCAAAGAATCCACACTGGGGAGAAGCCCTACAGATGTGAGGAATGTGGAAAAGCTTTT ThrGluCysGlyLysAlaPheArgLeuSerSerLysLeuIleGlnHisGlnArgIleHisThrGlyGluLysProTyrArgCysGluGluCysGlyLysAlaPhe (4) GGTCAGAGCTCAAGCCTCATCCACCATCAGAGAATCCACACAGGAGAGAGGCCCTATGGTTGTCGTGAGTGTGGGAAAGCCTTCAGCCAGCAGTCGCAGCTGGTT GlyGlnSerSerSerLeuIleHisHisGlnArgIleHisThrGlyGluArgProTyrGlyCysArgGluCysGlyLysAlaPheSerGlnGlnSerGlnLeuVal
351
(5) AGACACCAGAGAACTCACACTGGGGAGAGGCCCTACCCTTGCAAGGAGTGTGGGAAGGCCTTCAGCCAGAGCTCCACCCTAGCCCAGCATCAAAGGATGCATACT ArgHisGlnArgThrHisThrGlyGluArgProTyrProCysLysGluCysGlyLysAlaPheSerGlnSerSerThrLeuAlaGlnHisGlnArgMe~HisThr
386
GGGGAGAAAGCTCAAATTCTAAAAGCCTCAGACAGTCCAAGCCTTGTTGCACATCAGAGAATTCACGCTGTAGAGAAACCATTTAAGTGTGATGAGTGTGGGAAA GlyGluLysAlaGlnIleLeuLysAlaSerAspSerProSerLeuValAlaHisGlnArgIleHisAlaValGluLysProPheLysCysAspGluCysGlyLys
421
171 *, I GCTTTTAGGTGGATCTCTCGCCTGAGTCAGCATCAGCTGATTCACACTGGAGAGAAGCCTTATAAATGCAACAAGTGTACAAAAGCCTTTGGTTGTAGTTCACGG ~laPheArgTrpIleSerArgLeuSerGlnHisGlnLeulleHisThrGlyGluLysProTyrLysCysAsnLysCysThrLysAlaPheGlyCysSerSerArg
281
1183
1288
1393
(6) 1498
1603
(8)
491
CTTATTCGCCATCAGAGAACTCACACTGGAGAAAAACCATTTAAATGTGATGAGTGTGGCAAAGGCTTTGTTCAGGGCTCACACCTTATTCAGCATCAGCGAATC LeuIleArgHisGlnArgThrHisThrGlyGluLysProPheLysCysAspGluCysGlyLysGlyPheValGlnGlySerHisLeuIleGlnHisGlnArgIle (9) CACACTGGAGAGAAACCCTATGTGTGTAATGACTGTGGAAAAGCCTTCAGTCAGAGTTCCAGCCTTATTTACCATCAGAGAATCCATAAAGGAGAGAAGCCCTAC ~ThrGlyG1uLysProTyrValCysAsnAspCysGlyLysAlaPheSerG1nSerSerSerLeuI1eTyrHisG1nArgI1eHisLysGlyG1uLysProTyr
1918
526
(10) GAATGCCTCCAATGCGGAAAAGCCTTCAGTATGAGCACACAGCTTACAATACATCAAAGGGTTCACACTGGAGAGAGGCCCTATAAATGTAATGAATGTGGGAAA GluCysLeuGlnCysGlyLysAlaPheSerMetSerThrGlnLeuThrIleHisGlnArgValHisThrGlyGluArgProTyrLysCysAsnGluCysGlyLys
2023
561
(11) (12) GCCTTCAGTCAAAACTCAACCCTTTTCCAACACCAGATAATTCATGCAGGGGTGAAGCCCTATGAGTGCAGTGAGTGTGGAAAAGCCTTCAGCCGGAGCTCATAT AlaPheSerGlnAsnSerThrLeuPheGlnHisGlnIleIleHisAlaGlyValLysProTyrGluCysSerGluCysGlyLysAlaPheSerArgSerSerTyr
2128
596
(13) CTTATTGAACACCAGAGAATACACACTAGGGCCCAGTGGTTTTACGAATATGGGAATGCCCTGGAAGGGTCCACCTTTGTGAGCCGTAAAAAGGTTAATACTATA LeuIleGluHisGlnArgIleHisThrArgAlaGlnTrpPheTyrGluTyrGlyAsnAlaLeuGluGlySerThrPheValSerArgLysLysValAsnThrIle
456
631
666
AAGAAACTGCATCAGTGTGAAGACTGTGAGAAGATATTTAGGTGGCGTTCACACCTAATTATACACCAGAGAATTCACACCGGGGAGAAGCCTTATAAATGCAAT LysLysLeuHisGln~ysG1uAspCysGluLysIlePheArgTrpArgSe~HisLeuIleIleHisGlnArgIleHisThrGlyGluLysProTyrLysCysAsn (14) GACTGTGGCAAAGCTTTTAATCGTAGCTCAAGGCTTACCCAGCATCAAAAAATTCACATGGGATAGACCACTTACATATAAATGTGTATATATGTG~CCT AspCysGlyLysAlaPheAsnArgSerSerArgLeuThrGlnHisGlnLysIleHisMe~Gly***
1708
1813
2233 (15) 2338
ATAGCCTTAACTTAAAAAAAAAAAA
FIG. 1. Nucleotide and deduced amino acid sequences of ZNFP cDNA. The DNA sequence is numbered on the right. The 15 finger *** indicates an in-frame termination codon. The AATAAA polyadenylation signal is underlined. At domains are underlined and numbered. the bottom is a schematic representation of the ZNF7 protein showing the relative positions of the finger domains.
336
LANIA
ET
AL.
GACTTTACCCAGGAGGAATGGGGGCAGCTGCACCCTACCCAGAGGATCCTCTACCGTGACGTGATGCTGGAGACCTTTGGCTACCTGCTCTCCATAGGT AspPheThrGlnGluGluTrpGlyGlnLeuAspProThrGlnArgIleLeuTyrArgAspValMetLeuGluThrPheGlyTyrLeuLeuSerlleGly
99
CCTGAGCTTCCGAAGCCTGAAGTCATCTCCCAGCTGGAGCAAGGGACCGAGCTATGGGTGGCTGAGAGAGGAACCACCCAGGGCTGCCATCCAGCCTGGGAGCCT 35 ProGluLeuProLysProGluVallleSerGlnLeuGluGlnGlyThrGluLeuTrpValAlaGluArgGlyThrThrGlnGlyCysHisProAlaTrpGluPro
204
CGATCTGAAAGCCAAGCATCACGCAAGGAAGAGGGCCTGCCTGAAGAGGAGCCATCCCATGTCACGGGAAGGGAAGGATTCCCGACAGATGCTCCTTATCCCACC 66 ArgSerGluSerGlnAlaSerArqLysGluGluGlyLeuProGluGluGluProSerHisValThrGlyArgGluGlyPheProThrAspAlaProTyrProThr
309
ACGTTAGGGAAAGACAGGGAGTGTCAGAGCCAGAGTCTGGCACTCAAGGAGCAGAATAACTTGAAGCAGTTGGAATTTGGCCTCAAGGAAGCACCAGTTCAAGAT 97 ThrLeuGlyLysAspArgGluCysGlnSerGlnSerLeuAlaLeuLysGluGlnAsnAsnLeuLysGlnLeuGluPheGlyLeuLysGluAlaProValGlnAsp
414
CAAGCCTACAAAACTCTCAGACTCAGGGAAAACTGCGTCCTGAGTTCAAGCCCAAATCCATTCCCAGAGATCTCTAGAGGGGAGTATTTGTATACTTACGACTCA GlnGlyTyrLysThrLeuArgLeuArgGluAsnCysValLeuSerSerSerProAsnProPheProGluIleSerArgGlyGluTyrLeuTyrThrlyrAspSe~
519
132
CAGATTACAGACTCAGAACATAACTCCAGCTTAGTCAGTCAGCAGACAGGCTCCCCAGGAAAACAGCCCGGTGAAAACAGTGACTGTCACAGAGATTCCAGTCAG GlnIleThrAspSerGluHisAsnSerSerLeuValSerGlnGlnThrGlySerProGlyLysGlnProGlyGluAsnSerAspCysHisArgAspSerSerGln
624
167
GCCATTCCAATTACGGAACTCACAAAAAGCCAGGTGCAGGACAAACCCTACAAATGTACTGACTGTGGGAAGTCGTTTAACCATAACGCACACCTCACCGTGCAC AlaIleProIleThrGluLeuThrLysSerGlnValGlnAspLysProTyrLysCysThrAspCysGlyLyrSerPheAsnHisAsnAlaHisLeuThrValHis (1) AAGAGGATTCATACGGGAGAAAGACCTTATATGTGCAAGGAGTGTGGGAAAGCCTTCAGCCAGAACTCCTCCCTCGTCCAGCATGAGCGCATCCACACTGGAGAC LysArglleHisThrGlyGluArgProTyrMetCysLysGluCysGlyLysAlaPheSerGlnAsnSerSerLeuValGlnHisGluArgIleHisThrGlyAsp
729
202
237
834
(2)
342
AAGCCCTACAAGTGTGCCGAATGTGGGAAGTCTTTCTGCCATAGTACACACCTTACCGTCCATCGGAGGATTCACACTGGGGAGAAGCCCTATGAGTGTCAGGAC LysProTyrLysCysAlaGluCysGlyLysSerPheCysHisSerThrHisLeuThrValHisArgArgIleHisThrGlyGluLysProTyrGluCysGlnAsp (3) (4) TGTGGGAGGGCCTTCAACCAGAACTCCTCCCTGGGGCGGCACAAGAGGACACACACTGGGGAGAAGCCATACACCTGCAGTGTGTGTGGGAAATCCTTCTCTCGG CysGlyArgAlaPheAsnG1nAsnSerSerLeuGlyAr~HisLysArgThrHisThrGlyGluLysProTyrThrCysSerValCysGl~LysSerPheSerArq (5) ACCACTTGCCTTTTCCTGCACCTGAGAACTCACACCGAGGAGAGGCCCTACGAGTGTAACCACTGCGGGAAGGGCTTCAGGCACAGCTCATCCCTGGCCCAGCAC ThrThrCysLeuPheLeuHisLeuArgThrHisThrGluGluArgProTyrGluCysAsnHisCysGlyLysGlyPheArgHysSerSerSerLeuAlaGlnHis
1254
377
CAGCGGAAGCACGCGGGGGAGAAGCCCTTTGAGTGCCGCCAGAGGCTGATCTTTGAGCAGACGCCAGCTCTCACAAAGCATGAATGGACAGAAGCCCTGGGCTGT GlnArgLysHisAlaGlyGluLysProPheGluCysArgGlnArgLeuIlePheGluGlnThrProAlaLeuThrLysHisGluTrpThrGluAlaLeuGlyCys
1359
447
GACCCACCTTTGAGTCAAGATGAGAGGACTCACCGAAGCGACAGACCCTTCAAATGTAATCAGTGTGGGAAGTGTTTCATTCAGAGCTCTCACCTCATCCGGCAC AspProProLeuSerGlnAspGluArgThrHisArgSerAspArgProPheLysCysAsnGlnCysGlyLysCysPheIleGlnSerSerHisLeuIleArgHis (7) CAGATAACTCACACCAGAGAGGAGCAGCCCCATGGGCGAAGCCGGCGGCGTGAACAATCCTCGAGCAGGAACTCACACCTGGTTCAGCATCAACACCCGAACTCC GlnIleThrHisThrArgGluGluGlnProHisGlyArgSerArgArgArgGluGlnSerSerSerArgAsnSerHisLeuValGlnHisGlnHisproAsnS~r AGAAAGAGCTCTGCAGGAGGACGAAAGGCAGGGCAGCCGGAAAGCAGAGCCCTGGCTTTGTTTGACATCCAAAAAATCATGC~GAGAAAAACCCTGTGCACGTT ArgLysSerSerAlaGlyGlyArgLysAlaGlyGlnproGluSerArgAlaLeuAlaLeuPheAsplleGlnLysIleMetGlnGluLysAsnProValHisVal
I569
482
ATTGGGGTGGAAGAGCCTT~TGTGGGTGCTTCCATGTTATTTGACATCAGAGAATCCACATAGGAGAGAAACTTTGCTGATGACTTTTAACCACAAGTAAAAAAT IleGlyValGluG~uProSerValGlyAlaSerMetLeuPheAspfleArgGluSerThr***
1674
537
GTGGTAAGTCCACATAGTGTACTCATGGAAGGAGGGGCTGGGGGTAGAAATGTCATGGGTGACTTCTGACTTTCTAAGGAAATGATGCTTCCCAAGCACCCGAGG
1779
TTGGTTGGTCCCAAATCTATCAAACTCAGTGCCCTCTTTAGCGACATATTTTGTGACATTCCTTCCATTACACCACAGTGAGTTCACAGGTAATATAACCTACCC
I884
272
307
939
1044
1149
(‘5)
412
1464
ACCTGTGTAATGTCGAAAAAAATCAATATGCGGCCCCTATTTGTAAAGGATCATTAAAAT
FIG. 2. termination the bottom,
DNA and amino acid sequences of ZNFB cDNA. The DNA sequence is numbered on the right. *** indicates the in-frame codon. The seven finger domains are underlined and numbered. In the schematic representation of the ZNF8 protein shown at the dotted line that extends past the cDNA clone indicates the portion of the amino terminus lacking in ZNFS.
finger domains can be identified in the ZNFB-encoded protein; however, six of them are clustered between amino acid positions 220 and 380, and the seventh finger domain is located at position 430. Therefore, the last two finger domains are not separated by the classical H/C-link consensus sequence (Schuh et al., 1986).
A comparison of the finger motifs present in both ZNF7- and ZNFB-encoded proteins indicated that each unit conforms almost exactly to the consensus sequence CX2CX3FXSLX2HX3H (Fig. 3) diagnostic of DNAbinding zinc fingers. Furthermore, in both proteins the fingers are connected by “H/C links” (TGEKPF/YX,
HUMAN 1
CQE CAE CTE CEE CRE CKE CDE C N‘K CCE CND
CLQ
ZINC
10
Y CQKKL CGKVF CGKAF CGKAF
CGKAF CGKAF
CGKAF CTKAF
CGKGF CGKAF CGKAF CGKAF CGKAF CEKIF
SQQSQ
G
NNC
ii:: IQ IH VR
SQSST RWISR GCSSR VQGSH
AQ SQ
siisss
IY TI
IR
IQ
337
GENES
20
r SDCLQ RLCSQ RLSSK GQSSS
FINGER
28
ORI ORI QRI QRT QRM QLI QRT QRI QRI QRV QII QRI ORI QKI
H H H H H H H H H H H H H H H
GEKPYE TGEKPFK TGEKPYR TGERPYG TGERPYP TGEKAQI TGEKPYK TGEKPFK TGEKPYV KGEKPYE TGERPYK AGVKPYE TRAQWFY TGEKPYK MG
Qri
H
tGekpyx
Consensus
KRI ERI RR1 KRT LRT
H H H H H H H
TGERPYM TGDKPYK TGEKPYE TGEKPYT TEERPYE AGEKPFE TREEQPH
ZNF8
ZNF7
CGKAF
SMSTQ SQNST SRSSY RWRSH NRSSR
Cxe
CgKaF
xxxsx
CTD CKE CAE
NHNAH SQNSS CHSTH
CNQ
CGKSF CGKAF CGKSF CGRAF CGKSF CGKGF CGKCF
cxx
CGkxF
H
tgekP:x
Consensus
ZNF8
cxx
CxxxF
H
TGEKPYx
Consensus
Kruppel
cxx
CgKrf
H
TGExxFx
Consensus
Spl
CNE CSE CED CND
CQD csv CNH
FQ IE I I
TQ
TV
VQ
NQNSS
TV GR
SRTTC RHSSS IQSSH
1; IR
QRK QIT
ZNF7
FIG. 3. Amino acid comparison of ZNF7, ZNF8, Spl, and Krnppel zinc finger motifs. The sequences are given in one-letter code and are aligned tc show the repeated units. Amino acids that are strictly conserved in the finger motifs are boxed. For each gene, a consensus for the repeated motif is displayed: uppercase letters correspond to strictly conserved amino acids, and lowercase letters to amino acids conserved >60%; x indicates no conservation.
Schuh et al., 1986) found in genes from mouse, Drosophila, and Xenopus that cross-hybridize to the Kruppel finger domains (Schuh et al., 1986; Chowdhury et al., 1987; Ruiz i Altaba et al., 1987). The sequence similarity among the ZNF7 fingers is 65-80% and that among the ZNF8 fingers is 60-75%, whereas the sequence similarity between any of the ZNF7 fingers and those present in ZNF8 is 50-60%. No sequence similarity was found outside the finger regions. Expression of Human Finger Genes in Various Cell Lines and during in Vitro Induced Differentiation To determine the expression and distribution of ZNF7 and ZNFS finger gene transcripts, a number of RNAs extracted from different established human cell lines were analyzed by Northern blotting; the result of this analysis is shown in Fig. 4. The ZNF7 probe detected two discrete transcripts of sizes 2.8 and 3.3 kb (Fig. 4A). Both transcripts were expressed at constant level in all the cell lines tested; however, the level of the 3.3-kb transcript was in each case three to fivefold lower than that of the 2.8-kb transcript. A similar pat-
tern of expression was also observed in poly(A)-selected RNA (data not shown). The ZNFS probe detected two transcripts 11 and 2.2 kb long. However, no expression of either ZNF8 transcript was detected in HeLa RNA, and only the 2.3-kb transcript was observed in NTERAS/Dl cells (Fig. 4B). In light of alternate processing of ZNF7 and ZNFS transcripts, we suggest that the cDNA clones reported in Figs. 1 and 2 represent the major messages from these loci. This suggestion is based on our isolation of additional cDNA clones; after restriction map analysis and partial DNA sequencing, the shorter cDNA clones were found to be identical to the ZNF7 and ZNF8 cDNA clones reported in Figs. 1 and 2 (data not shown). Next, we sought to determine the expression of ZNF7 and ZNF8 genes during in vitro induced terminal differentiation of HL-60 cells. HL-60 is a human acute leukemia cell line with phenotypic features of immature myeloid cells and is capable of undergoing terminal differentiation upon treatment with dimethyl sulfoxide (DMSO) (Collins et aZ., 1977). Figure 5 shows a Northern blot analysis of total RNA isolated from untreated and DMSO-treated HL-60 cells and hybridized to
338
LANIA 1
234567
ET
AL.
1
234567
FIG. 4. Northern blot analysis of RNA isolated from various human cell lines hybridized to ZNF7 (A) and ZNFB (B) cDNA probes. The RNA samples are HeLa (lane l), NTERAQ/Dl embryonal carcinoma (lane 2), HL-60 (lane 3), MOLT-4 (lane 4), WI-38 (lane 6), B lymphocytes (lane 6), and colon carcinoma (lane 7). Aliquots of 15 fig were electrophoresed on 1% agarose-formaldehyde gels and processed as described under Materials and Methods. Ribosomal RNAs (28 S and 18 S) are indicated by arrowheads.
ZNF7 and ZNFB probes. The relative amounts of the ZNF7 and ZNF8 transcripts decreased after DMSOinduced terminal differentiation. Rehybridization of the Northern blots to the GADPH probe (Piechaczyk et al., 1984) indicated that comparable amounts of total RNA were present in each lane. We have also monitored the expression of ZNF7 and ZNF8 finger genes during the in vitro induced differentiation of the human embryonal carcinoma cell line NTERAB/Dl (Andrews et aZ., 1984). No differences in the level of expression of either the ZNF7 or the ZNFS finger gene were found upon retinoic acid-induced differentiation of NTERA2/Dl cells (data not shown).
Chromosome Mapping of ZNF? and ZNF8 Genes by in Situ Hybridization Analysis DNA probes representative of the ZNF7 and ZNF8 genes were hybridized to human chromosome metaphases obtained from normal peripheral blood lymphocytes. Using the 1459-bp EcoRI fragment of ZNF7, of 146 metaphases analyzed, a total of 246 silver grains (1.68 grains/metaphase) were found. Thirty-nine grains (15.8% of the total) were located on chromosome 8 (P < O.OOl), 27 of which (69.2%) were present on band q24.1-qter (Fig. 6). Using the entire EcoRI insert of the ZNFS cDNA clone, a total of 62 metaphases were scored for the assignment of 110 labeled sites (1.77 grains/metaphase) to chromosome bands. Twenty-one (19% of the total) were observed on chromosome 20 (P < O.OOl), 17 of which (80.9%) were clustered on band 2Oq13 (Fig. 7). In both experiments no specific labeling was observed in any other chromosomal region. Moreover, these results were further confirmed using
GAPDHFIG. 6. Expression of ZNF7 (left) and ZNF8 (right) genes during in vitro induced differentiation of HL-60 myeloid cells. HL-60 cells were treated with DMSO for 5 days and RNAs were extracted from undifferentiated cells (lane l), at Day 1 (lane 2), Day 3 (lane 3), and Day 5 (lane 4) of culture in the presence of the inducing agent. The same blots were rehybridised to the GAPDH probe. RNA extraction and Northern blot analysis were performed as described in the text. In overexposed autoradiograms of total RNAs, some hyhridisation to both 28 S and 18 S RNA is detectable. Ribosomal RNAs (28 S and 18 S) are indicated by arrowheads.
a FIG. 6. Idiogram of chromosome 8 showing ZNF7 hybridization grains from 146 metaphases.
the distribution
of
HUMAN
FIG. 7. Idiogram of ZNFS hybridization
of chromosome 20 showing grains from 63 metaphases.
ZINC
the distribution
ZNF7 and ZNF8 subclones that did not contain finger domains as probes (data not shown).
the
DISCUSSION
Taking advantage of the relatively conserved nature of zinc finger domains (Schuh et aZ., 1986), we have isolated and characterized two human finger genes by finger-motif-containing DNA probe hybridization. The finger motifs of both ZNF7- and ZNFB-encoded proteins contain the appropriate amino acids residues to fit the consensus sequence of the Cys/His finger (Klug and Rhodes, 1987; Evans and Hollenberg, 1988). The predicted amino acid sequences of ZNF7 and ZNFB cDNA clones indicate the presence of 15 and 7 finger repeats, respectively. The 15 fingers in ZNF7 are grouped into three sets of 6, 7, and 2 finger repeats, respectively. This organization resembles that of the Xenopus Xfin gene, in which the 37 fingers are grouped into six sets (Ruiz i Altaba et al., 1987). Seven zinc finger domains can be identified in the ZNF8 protein; however, six of them are clustered, with the last being a single finger domain. A recent study on a single-finger peptide indicates that a single zinc finger can exist as an independent structure sufficient for DNA binding (Parraga et al., 1988). Transcription studies indicate that the ZNF7 and ZNF8 genes are expressed as multiple transcripts in many human cell lines of different embryological derivation. Moreover, the expression of both genesis negatively regulated during in vitro induced terminal differentiation of HL-60 myeloid cells. The low levels of expression of ZNF7 and ZNF8 transcripts in terminally differentiated HL-60 cells may be due to expression of the genesin a small subpopulation of proliferating cells. Alternatively, the expression patterns of ZNF7 and ZNFS genes may be due to cell-type specificity. Current research is aimed at addressing this hypothesis. Although we have not yet performed experiments to test whether the ZNF7 and ZNF8-encoded proteins bind DNA, we believe that this is likely becausespecific DNA binding is a common feature of other multifingered proteins (Miller et al., 1985; Kadonaga et al., 1987). Moreover, because of the precedent of TFIIIA
FINGER
339
GENES
protein (Klug and Rodes, 1987), we cannot exclude the possibility that ZNF7 and ZNF8 proteins bind RNA and/or DNA. Using the in situ hybridization technique, we found that the ZNF7 finger gene is located on band q24 of the long arm of human chromosome 8. The same chromosomal region contains the c-myc oncogene locus and is implicated in the 8,14 translocation associated with Burkitt’s lymphoma (Dalla Favera et al., 1982). In addition, it is involved in microdeletions associated with the Langer-Giedion syndrome (tricho-rhino-phalangeal, TRP syndrome) (Bowen et al, 1985), in which 8q24.11-q24.13 appears to be the shortest overlapping region (Fryns and Van den Berghe, 1986). The ZNFS gene was localized on the terminal band of the long arm of human chromosome 20. Chromosome 20 is involved in a nonrandom karyotype rearrangement, i.e., deletion of its long arm (2Oq-), in chronic myeloproliferative disorders, most frequently in polycythemia vera (27% of the cases) (Heim and Mitelman, 1987). The 2Oq- marker chromosome is the result of an interstitial deletion; molecular-cytogenetic analysis demonstrated that the C-Q-Cgene, which normally maps to the most distal band of 2Oq, still remains on the rearranged chromosome in 2Oq- (Le Beau et al., 1985). We are currently investigating whether ZNF8 sequences are lost in the 2Oq- marker. In the last year the localizations of other human multifingered protein genes have been described: TDF to chromosome Y (Page et al., 1987), ZPX to Xp21.3 (Muller and Schempp, 1989), the GLI gene to chromosome 12q13.3-q14.1 (Arheden et al., 1989), EGR.l to 5q23-31 (Sukhatme et al., 1988), the human homolog of mouse Krox-20 to chromosome lOq21.1-22.1 (Chavrier et aZ., 1989), and ZFP3 to chromosome 17p1217pter (Ashworth et al., 1989). The chromosomal assignment of members of the finger gene family will be useful in establishing the evolutionary relatedness of this gene family and also in determining their possible functions. ACKNOWLEDGMENTS This work was paid for by grants from Progetto Finalizzato CNR “Oncologia” and MPI to L. L. and by the Italian Association for Cancer Research (AIRC) to L.L. and P-G.P.
REFERENCES ANDREW& P. W., DAMJANOV, I., SIMON, D., BANTING, G., DRACOPOLI, N. C., AND FOGH, J. (1984). Pluripotent embryonal carcinoma clones derived from the human teratocarcinoma cell line Tera-2. Lab. Znuest. 60: 147-162. -DEN, K., RONNE, M., MANDAHL, N., HEIM, S., KINZEL, K., VOGELSTEIN, B., AND MITELMAN, F. (1989). In situ hybridization localizes the human putative oncogene GLI to chromosome subbands 12q13.3-14.1. Hum. Genet. 82: l-2. ASHWORTH, A., WILLIAMS, FELLOW, P. N., SOLOMON,
B. P., BUCHBERG, A. M., GOODE., POTTER, J., AND WILLISON,
340
LANIA
K. R. (1989). Chromosomal localization of zinc protein genes in man and mouse. Genomics 4: 323-327. 4. BOWEN, P., BIEDERMAN, P., AND Hoo, J. J. (1985). The critical segment for the Langer-Giedion syndrome: 8q24.11-24.12. Ann. Genet.
28:
224-227.
5. BROWN, R. S., SANDER, C., AND ARGOS, P. (1985). The primary structure of the transcription factor TFIIIA has 12 consecutive repeats. FEBS L&t. 186: 271-274. 6. CHANDLER, M. E., AND YUNIS, J. J. (1978). A high resolution in situ hybridization technique for direct visualisation of Gbanded early metaphase and prophase. Cytogenet. Cell Genet. 22:352-356. 7. CHAVRIER,P., SERIAL, M., LEMAIRE, P., ALMENDRAL, J., BRAVO, R., AND CHARNAY, P. (1988). Characterization of mouse multigene family that encodes zinc finger structures. EMBO J. 7: 29-35.
8.
9.
10.
11.
12.
13. 14.
15. 16. 17.
18. 19.
CHAVRIER, P., JANSSEN-TIMMEN, U., MALI, M-G., ZERIAL, M., BRAVO, R., AND CHARNAY, P. (1989). Structure, chromosome location, and expression of the mouse zinc finger gene Krox-2O: Multiple gene products and coregulation with the proto-oncogene c-fos. Mol. Cell. Biol. 9: 787-797. CHIRGWIN, J. M., PRZYBYLA, A. E., MACDONALD, R. J., AND RUTTER, W. J. (1979). Isolation of biologically active ribonucleic acid from sources enriched in ribonuclease. Biochemistry 18: 5294-5299. CHOWDHURY,K., DEUTSCH, U., AND GRUSS, P. (1987). A multigene family encoding several “finger” structures is present and differentially active in mammalian genomes. Cell 48: 771778. COLLINS, S. J., GALLO, R. C., AND GALLEGHER, R. E. (1977). Continuous growth and differentiation of human myeloid leukemia cells in suspension culture. Nature (London) 270: 347350. DALLA FAVERA, R., BREGNI, M., ERIKSON, K., PATTERSON, D., GALLO, R. C., AND CROCE,C. M. (1982). Human c-myc oncogene is located on the region of chromosome 8 that is translocated on Burkitt lymphoma cells. Proc. Natl. Acad. Sci. USA 79: 7824-7826. EVANS, R. M., AND HOLLENBERG, S. M. (1988). Zinc fingers: Gilt by association. Cell 62: l-3. FRYNS, J. P., AND VAN DEN BERGHE, H. (1986). 8q24.12 interstitial deletion in tricho-rhino-phaIangeaI syndrome type I. Clin. Genet. 31: 188-189. GEHRING, W. J. (1987). Homeoboxes in the study of development. Science 236: 1245-1252. GERHARD, D. S., KAWASAKY, E. S., BANCROFT, C. F., AND SZABO, P. (1981). Localization of a unique gene by direct hybridization in situ. Proc. Natl. Acad. Sci. USA 78: 3755-3759. HARTSHORNE, T. A., BLUMBERG, H., AND YOUNG, E. T. (1986). Sequence homology of the yeast regulatory protein ADRl with Xenopus transcription factor TFIIIA. Nature (London) 320: 285-287. HEIM, S., AND MITELMAN, F. (1987). “Cancer Cytogenetics,” pp. 130-132, A. R. Liss, New York. KADONAGA, J. T., CARNER, K. R., MASIARZ, F. R., AND TJIAN, R. (1987). Isolation of cDNA encoding transcription factor SPl and functional analysis of the DNA binding domain. Cell 51: 1079-1090.
ET AL. 20. KLUG, A., AND RHODES,D. (1987). ‘Zinc fingers’: A novel protein motif for nucleic acid recognition. Trends Biol. Sci. 12: 464469. 21. KOZAK, M. (1987). An analysis of 5’-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. 16: 81258148. 22. LE BEAU, M. M., WESTBROOK, C. A., DIAZ, M. O., AND ROWLEY, J. D. (1985). c-src is consistently conserved in the chromosomal deletion (2Oq) observed in myeloid disorders. Proc. Natl. Acad. Sci. USA 82: 6692-6696. 23. MILLER, J., MCLACHLAN, A. D., AND KLUG, A. (1985). Repetitive zinc-binding domains in the protein transcription factor IIIA from Xenopus oocytes. EMBO J. 4: 1609-1614. 24. MULLER, G., AND SCHEMPP, W. (1989). Mapping the human ZPX locus to Xp21.3 by in situ hybridization. Hum. Genet. 82: 82-84. 25. PABO, C. O., AND SAUER, R. T. (1984). Protein-DNA recognition. Annu. Reu. Biochem. 53: 293-321. 26. PAGE, D. C., MOSHER, R., SIMPSON, E. M., FISHER, E. M. C., MARDON, G., POLLACK, J., MCGILLIVRAY, B., DE LA CHAPELLE, A., AND BROWN, L. G. (1987). The sex-determining region of the human Y chromosome encodes a finger protein. CeU 61: 1091-1104. 27. PANNUTI, A., LANFRANCONE, L., PASCUCCI,A., LA MANTIA, G., PELICCI, P-G., AND LANIA, L. (1988). Isolation of cDNAs encoding finger proteins and measurement of the corresponding mRNA levels during myeloid terminal differentiation. Nucleic Acids Res. 16: 4227-4237. 28. PARRAGA, G., HORVATH, S. J., EISEN, A., TAYLOR, W. E., HOOD, L., YOUNG, E. T., AND KLEVIT, R. E. (1988). Zinc-dependent structure of a single-finger domain of yeast ADRl. Science 241: 1489-1492. 29. PIECHACZYK, M., BLANCHARD, J. M., MARTY, L., DANI, C., PANABIERES, F., EL SABOUTY, S., FORT, P., AND JEANTEUR, P. (1984). Post-transcriptional regulation of glyceraldehyde-3phosphate-deydrogenase gene expression in rat tissue. Nucleic Acids Res. 12: 6951-6963. 30. ROSENBERG, U. B., SCHRODER, C., PREISS, A., KIENLIN, A., COTE, S., RIEDE, I., AND JACKLE, H. (1986). Structural homology of the product of the Drosophila Kruppel gene with Xenopus transcription factor IRA. Nature (London) 319: 336-339. 31. RUIZ I ALTABA, A., PERRY-O’KEEFE, H., AND MELTON, D. A. (1987). Xfin: An embryonic gene encoding a multi-fingered protein in Xenopus. EMBO J. 6: 3065-3070. 32. RUPPERT, J. M., KINZLER, K. W., WONG, A. J., BIGNER, S. H., KAO, F-T., LAW, M. L., SEUNAEZ, H. N., O’BRIEN, S. J., AND VOGELSTEIN, B. (1988). The GLI-Kruppel family of human genes. Mol. Cell. Biol. 8: 3104-3113. 33. SCHUH, R., AICHER, W., GAUL, U., Coti, S., PREISS,A., MAIER, D., SEIFERT, E., NAUBER, U., SCHRODER,C., KEMLER, R., AND JACKLE, H. (1988). A conserved family of nuclear proteins containing structural elements of the finger protein encoded by Kruppel, a Drosophila segmentation gene. Cell 47: 1025-1032. 34. SUKHATME, V. P., CAO, X., CHANG, L. C., TSAI-MORRIS, C.-H., STAMENKOVIC, D., FERRIERA, P., COHEN, D. R., EDWARDS,S. A., SHOWS, T. B., CURRAN, T., LEBEAU, M. M., AND ADAMSON, E. D. (1988). A zinc finger-encoding gene coregulated with c-fos during growth and differentiation and after cellular depolarization. Cell 63: 37-43.