Proc. Nati. Acad. Sci. USA Vol. 87, pp. 2167-2171, March 1990

Evolution

Concerted evolution of class I genes in the major histocompatibility complex of murine rodents CRISTINA RADA*t, ROBERTO LORENZI*, SIMON J. POWIS*, JOHAN

VAN DEN

BOGAERDE*§, PETER PARHAM¶,

AND JONATHAN C. HOWARD*II *Department of Immunology and tMonoclonal Antibody Centre, Agricultural and Food Research Council, Institute of Animal Physiology and Genetics

Research, Cambridge Research Station, Babraham, Cambridge CB2 4AT, United Kingdom; and 1Department of Cell Biology, Stanford University School of Medicine, Stanford, CA 94305

Communicated by Irving Weissman, November 7, 1989 (received for review August 29, 1989)

ABSTRACT Full-length cDNA sequences of two class I major histocompatibility complex molecules from the DA strain of Rattus norvegicus are reported. One codes for the classical class I restriction element RTL.Aa, which maps to the locus in the rat major histocompatibility complex homologous to H-2K in the mouse. The other probably codes for a soluble nonclassical class I molecule present in DA rat serum; a short deletion in the fifth exon implies that the translated product will terminate in the membrane-spanning region. These sequences have been compared with mouse classical class I sequences as well as with three published rat class I cDNA partial sequences. The results show, first, that "locus-specific" substitutions from the H-2K, H-2D, and H-2L data set are scrambled in the RTl.A molecule; a majority of these substitutions have H-2D/L-specific features. Second, the data show that the four rat sequences are strikingly similar to one another regardless of locus or haplotype of origin; they share a number of apparently species-specific features that distinguish them all from mouse classical class I sequences, which likewise share distinctive features of their own. The results suggest that segmental sequence exchange plays a major role in determining the evolution of sequence in class I masjor histocompatibility complex molecules.

Some multigene families display a degree of species specificity at the level of DNA sequence, as if the whole family were evolving within a species as a unit (1-3). Such concerted evolution has been suspected in mouse major histocompatibility complex (MHC) class I genes because of the lack of clear locus-specific sequence variation distinguishing the widely separated H-2K and H-2D end classical loci (4, 5). Short segmental exchanges of sequence are indeed occurring between widely separated class I MHC loci in laboratory mice (6). Nevertheless it remains controversial whether such processes contribute significantly to the pattern of evolution of class I genes (7), since sequences of class I genes from a closely related murine species such as the rat are not yet available. Rattus norvegicus is in the same subfamily, the Murinae, as the mouse. The rat MHC, RTI, is like H-2 in structure, with a classical class I gene, RTJ.A, at the left end of the MHC in the same location as H-2K and a long array of other class I genes to the right (8). In the middle is a region specifying class II genes that closely resembles the homologous region of H-2 (9). It is most likely that the translocation of class I genes that formed the H-2K end was a unique event occurring shortly before the separation of the rat and mouse lineages (7). Rat class I gene sequences thus provide a stringent test of the concerted evolution hypothesis. With simple divergent evolution alone, the sequence of an RTI.A The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. 2167

allele should have greater similarity to H-2K on average than to other rat class I sequences, since RTI.A and H-2K share a more recent common ancestor than either does with genes at the other end of the MHC. If, on the other hand, concerted evolution is at work in this system, RTI.A should resemble other rat class I genes more closely than it does H-2K. Here we report the full-length sequence of an RT1.A classical class I rat MHC molecule, as well as that of a nonclassical class I molecule from the same haplotype.** Using these and other published rat and mouse class I sequences, we show concerted evolution operating on many class I genes from these two murine species.

MATERIALS AND METHODS Preparation and Screening of a DA Strain Rat cDNA Li-

brary. A cDNA library in AgtlO, constructed essentially as

described (10), was prepared from DA strain male rat Con A blasts and screened with the mouse 3' class I cDNA probe, pH-21Ia (11). Clones containing the classical class I sequence, RT1.Aa, were identified by a second screen with an 18-mer oligonucleotide probe, TAT TCTACACAGCNGTG (where N = G, A, T, or C), representing amino acids 7-12 (YFYTAV) of the published N-terminal sequence (12). Thirteen clones were identified as positive with both the cDNA and oligonucleotide probes. Clones 3.3/1 and 3.6 were selected for analysis. Sequencing Strategy and Sequence Analysis. Inserts were subcloned into Bluescript KS(+) (Stratagene) and sequenced on both strands from exonuclease III/exonuclease VII nested deletion sets (13). Sequences were assembled using Staden programs (14) and analysis was with the University of Wisconsin Genetics Computer Group programs (15). Published class I sequences were assembled from the following Genbank files: Kb, MUSMHKB1 and MUSMHKB2; Kd, MUSMHKDA; Kk, MUSMHKK; KW2S, MUSMHKTA; K, MUSMHKQM; Dk, MUSMHH2DK; DP, MUSMHH2DP; Dd, MUSMHDD; Ld, MUSMHLDA3 and MUSMHLDB1; Db, MUSMHH2DB; pRTI.A-1, RATMHC2; pRTI.A-2, RATMHC1. The rat class I cDNA sequence pARI.5 was taken from ref. 16. Expression of Rat Class I cDNAs in Mouse Cells. Inserts from pBS3.3/1 and pBS3.6 were subcloned into the expression vector pMSD (17). pMSD3.3/1 and pMSD3.6 were transfected into C3HAn/Boy mouse Ltk- fibroblasts and Abbreviations: MHC, major histocompatibility complex; CTL, cytotoxic T lymphocyte. tPresent address: Medical Research Council Laboratory of Molecular Biology, Hills Road, Cambridge, U.K. §Present Address: Department of Surgery, Addenbrookes Hospital, University of Cambridge, Hills Road, Cambridge CB2 2QH, U.K. I'To whom reprint requests should be addressed. **The sequences reported in this paper have been deposited in the GenBank data base (accession nos. M31018 and M31038).

Proc. Natl. Acad. Sci. USA 87 (1990)

Evolution: Rada et al.

2168

3.6 has a deletion of 13 nucleotides (931-943) in the presumed fifth exon, causing premature termination in the transmembrane domain. The second deletion of 33 nucleotides (10401072) in 3.6 corresponds to the entire sixth exon. 3.3/1 contains the 18-mer probe sequence used for library screening and is consistent with the DA rat classical class I molecule RT1.Aa (12). Clone 3.6 departs from the probe sequence by one nucleotide: histidine is substituted for tyrosine at position 9 of the first domain. The termination of the 3.6 translated product in the transmembrane domain and the histidine at position 9 of the first domain are consistent with a nonclassical class I molecule present in DA rat serum (12). 3.3/1 Specifies RT1.A1. The 3.3/1 product expressed on a BW5147 transfectant carried six epitopes typical of RT1LAa (Fig. 2A). The epitopes identified by the alloantibodies R3/ 13, YR1/100, MAC210, and JY1/232 represent an antigenic signature found only on the RT1LAa classical class I molecule (21). The 3.3/1 product is a typical class I molecule with a heavy chain of slightly slower mobility than H-2Kk and H-2Dk (Fig. 2B). Rat and mouse CTLs generated against RT1LAa in mixed lymphocyte cultures are specific for the rat class I product expressed on pMSD3.3/1 transfectant mouse cells (Fig. 2C). Furthermore, target lysis is blocked by the RT1LAa-specific alloantibody, R2/15S (Fig. 2C Inset). These experiments establish the 3.3/1 sequence as representing the RTJ.Aa gene. Comparison of RT1.A' with Mouse and Rat Class I Sequences. A comparison between RT1LAa and mouse H-2K, H-2D, and H-2L sequences was undertaken to identify locusspecific features. As frequently noted, the three long exons

AKR mouse hypoxanthine phosphoribosyltransferasedeficient BW5147 thymoma cells by using calcium phosphate and the selection plasmid pBR322/ASV-2LTR/tkA (Ltkcells) (18) or electroporation and the selection plasmid pMSD-HGPRT (BW5147). Surface expression of rat class I molecules on transfected mouse cells was sought by using the mouse anti-rat class I monoclonal antibodies MRCOX18 (19) and F16.4.4 (20). RT1.Aa was identified by using four rat alloantibodies against distinct epitopes (21). Antibody binding was detected with a fluorescent anti-immunoglobulin antibody by using a model 420 fluorescence-activated cell sorter (Becton Dickinson). Rat cytotoxic T cells specific for RT1.Aa were prepared and assayed as described (22). Mouse cytotoxic T lymphocytes (CTLs) with a similar specificity were

prepared by stimulating C3H/AnBoy

mouse

spleen

cells with PVG.R1 rat lymph node cells. Inhibition of cytotoxicity by the RT1.Aa-specific alloantibody R2/15S was as described (22). Rat RT1LAa class I molecules were precipitated from detergent lysates of surface-iodinated Ltk3.3/1 cells by using MRCOX18; mouse H-2Kk and H-2Dk class I molecules were precipitated from the same cells by using the rat anti-mouse class I antibody R1-21.2 (23). Both antibodies were conjugated directly to Sepharose CL-4B.

RESULTS Sequence and Structure of Clones 3.3/1 and 3.6. Fig. 1 shows the nucleotide sequences and translation of two DA rat class I cDNA clones. The translation product of 3.3/1 corresponds to a typical eight-exon class I gene. The translation product of pHS3 .6

RIl.Aa RTI.Aa

H V G V H PTY T A V S IAN A PR T L L L L L A A A L AP T Q T R A G S H S L R T gggtaatgtcgatcaaccg~tc120 gttaagaggtgacccccgtcgtcgcgcccggccacaac

pHS3.6

Q

S

P

G

L

G

E

P

R

F

I

A

tg

t

g

g

R

V

VS

D

ODTE

PF

V

R

D

S

D

P

A

HE

R

P

R

RVW

A

cggcccggoctoggg gagoccccggttcatcgctgtcggctacgtg gacgacaocggagttcgtgcgcttocgacagcgacgocggagaatcogaggatg gagccgocgggogcggtggatggag tt L

V

G

L

Q

S

L

S

D

G S 8KZG V T L R GT TY D L R E Q PHE EVQH Q T R I A K E VWE Q cgggaggggcoggagtattgg gagcagca gacacggatocgocaa ggaatgg gagoaga tttaccgagtggacctgag gaccctgcgcggctact acaaccagagcga gggc Yc tctcac

gs9a

tg

t ga

ag

a

A

YTOG

240

9c9

atac

a

360

taa

R

Q

T

E

CD

M

V

G

D

S

GS

L

L

R

GIY

R

QD

R

D

SA

D

L

L

TV

K

accatccaggagatgtatggotgtgacgtggggtcggacgggagcctcctccgcggatataggcag gacgcoctacgatg googcga ttacatcgccct gaacgaagacoctgaagac gtgg

T

A

Q

S

V

A

A

OF

A

Q

RHN

IT

G

EVER

A

A

R

L

S

A

R

L

ER

A

IL

C

KG

L

V

S

L

R

aoggcggoggactttgcggcagaga tcacccg gaacaa gtgggagcgggctcgttat gcagagagaoctcagggcctacctggagggcacgtgtgtggagtggct cagcaga g tt

tg

a

a

aa

g

400

a

ga

ag A

tacctggag

600

ct

t

ag

go

R L

G

L

T

K

L

R

P

S

P

A

V

L

H

P

R

P

G

ODV

T

L

R

C

A

L

G

P

F

ADSI

ctcgggaag gagacaoctgctgcgctcagatoccoccaga ggcaca tgt gaooottoaoccoagaocctgaaggtgatgt cacocctgaggtgotgggccctgggcttctaccctgctgacatc

720

A

L

G

LH

TV

E

D

L

LI

R

V

P

G

A

D

G0T17

A

S

V

V

V

P

GEK

L

aocctgacctggcagttgaatggggaggacct gaccoa ggacatggagcttgtg gagaooa ggcctgcaggggatg gaacocttcooagaa gtgggcatctgtggtggtgcctcttgggaag L

L

TC

R

0KEG

V

L

P

EKP

L

gagcagaa ttacacatgccgtgtggagcatgaggggctgcct aagccaocttt

LVw G

A

V

S

Q

R

P

ccoagaga tgg

S

P

S

S

HM

IV

EV

IS

L

L

P

L

TV

gaaooctgctccatcgactgactcooaaca tg gaaaccac tgtoatttatgtcatcctt

GA

ag tt

SQS

960

L

QF A

V

G

A

A

V

A

V

V

R

R

G

ERR

R

G

EGG

YA

P

A

R

PG

D

ggagotgtggooatgattggagctgtggccatcataggagctatggtggctgttgtgaggaggag gaagagaaaoaoa ggtgggaaaggaggagact aogooootgctccaggcagggac S

640

L

S

t

D

V

L

P

D

1000

t

t

t

t t

S

A

C

agotcoooagagoctccgatgtgtctctcoooagattgtagaotgagaoacgcgatgatataaaga tgtgttoaggtototootgtgacatcooagagoooctcagtt

1200

t

otottta gaoaooa

gtgtotgaagttccctgtgattctatggcttcagtgtg aagaaoctgtg gagoooagooctgoootgoaottoag gaoootgtooottoaot gooogotttcccttcc t

a

t

ao

tg

g

aoattcagccttgctggtcooagooctgggggtggagggtggggacatctgcatcctgtc00000tgtgotaoootgagcttcoagooooctoaottooaoooctgagaataagaatotgagtgtg aaottgattgttcacatcctt gacaoaa

gtgtagatggctttttaaattactggattgagaatacttagaggttttttgttttggg.ttttgctttgtttgattttgttttt a

taaatggcagatggagaagocttccaaa

1320

a

gaaaaaaa

1640

1560

cc 1 59 2

pBS3.3/1 (RT1LA1) and pBS3.6. Sequences are aligned to optimize colinearity; by dots. Putative exon boundaries are indicated by arrowheads. The following are highlighted in boldface letters: the oligonucleotide probe sequence tatttctacaccgccgtg at bases 100-117, putative N-linked glycan attachment signals at bases 347 and 847, conserved cysteine residues associated with intradomain disulfide bridges at bases 382 and 571 and bases 688 and 856, and a putative aataaa polyadenylylation signal starting at base 1558. The tag and tga termination codons at bases 980-982 (pBS3.6) and 1123-1125 (RT1LAR), respectively, are indicated by an asterisk. FIG. 1.

gaps

are

Nucleotide sequence and translation of cDNA clones

indicated

X~

Evolution: Rada et al.

Proc. Natl. Acad. Sci. USA 87 (1990)

A

C

B BW5147

BW3.3/1

1 2 3

CAB

PVG vs. Rl

100 0)

80 -

cU

R3/13

2169

50

60 -

4

-

u

40 -

C)

YR1/100

20

i)

-

0.

U,

0-

el

-204

\MAC 210

-1

0

-2

-1

3

4

1 2 log3E:T

3

4

0

2

3

-14 U

JY1/232

80 C1) W Co 0

60

*) C)

F16.4.4

40 U C)

20 -

4-O

\LMRC fX18

0-

Log fluorescence

1

log3E:T FIG. 2. The 3.3/1 sequence codes for the RTL.Aa classical class I glycoprotein. (A) Expression of RTL.Aa on the surface of BW5147 transfected with pMSD3.3/1 (BW3.3/1) assayed by monoclonal allo- and xenoantibodies in the fluorescence-activated cell sorter. Nontransfected BW5147 and DA strain rat Con A blasts (CAB) were negative and positive controls, respectively. (B) Immunoprecipitation of RT1.Aa by MRC OX18 from surface-iodinated Ltk- cells transfected with pMSD3.3/1 (lane 3). Immunoabsorbents conjugated with an irrelevant monoclonal antibody (lane 1) and R1-21.2 (lane 2), specific for mouse H-2Dk and H-2Kk class I molecules, were negative and positive controls, respectively. (C) Lysis of BW3.3/1 targets (e) by PVG rat anti-PVG.R1 (Upper) and C3H mouse anti-PVG.R1 (Lower) cytotoxic T-cell populations. PVG.R1 (m), PVG (o), and C3H mouse (A) Con A blast targets, as well as nontransfected BW5147 (o), were also used to indicate the specificity ofthe CTL populations. (Inset) Inhibition ofthe lysis of BW3.3/1 by C3H anti-PVG.R1 CTLs with the RT1.Aakspecific monoclonal alloantibody R2/15S at concentrations of 0-50 ,g/ml. E:T, effector to target ratio.

of H-2 classical class I alleles showed no evidence of locusspecific divergence (Fig. 3 and data not shown). Only a single position out of 822 compared showed complete locus specificity. However, in the short exons and untranslated regions up to the beginning of the short interspersed nuclear element (SINE) insertion (25), 24 positions were fully locus-specific out of 660 compared (Fig. 4). Out of the total of 25 apparently locus-specific positions, RTL.Aa showed the H-2D/ H-2L-specific nucleotide at 15 positions, the H-2K-specific nucleotide at 5 positions, and a third nucleotide at 5 positions. When all available rat class I sequences were compared with mouse classical sequences, a striking degree of speciesspecific homogenization was apparent. A matrix of compar-

isons in the three long exons (Fig. 3) showed that the rat departed significantly from the interallelic or interlocus comparisons within the mouse species group. However, despite representing the products of different loci, the three available rat sequences, including RT1.Aa, were strikingly more similar to each other than to the mouse classical class I sequences. The short exons and untranslated regions also displayed extensive species-specific homogenization (Fig. 4); this was conspicuous in a series of deletions and insertions by which the two sequence groups differed. Only completely homogenized positions are indicated in Fig. 4, but many other partially homogenized positions and motifs are apparent. sequences

MOUSE

H-2Kb

RAT

H-2Kd

H-2Kk H-2K28 H-2Dk

H-2DP

0.905

0.935 0.910 -_

0.941 0.897

0.929

0.936

0.908

0.917

0.912

0.910

0.906

0.912

0.911 0.914

0.918 0.920 0.931

0.908

0.901

0.918 0.927

0.928

.952

0.935 0.917 0.944 00.941

0 .931

H-2D

0.925

eane(Mous

M

=0.921

H-2L

H-2D

0.910

0.914

0.919

0.919

0.924

0.9se,,,

58

pARI.5 0.875 0.863 0.873 0.875 0.886 0.883 0.877 0.869

pBS3.6 0.878 0.874 0.879 0.880 0.888 0.886 0.883 0.880

RT1.Aa 0.894 0.878 0.885 0.891 0.897 0.902 0.894 0.883

H-2K0 H-2Xd H-2Kw2

H-2Dk

U)

8

11-29P

H-2Dd H-2Ld

H-2D0'

Men(mousexrat)mO88 -

mean (ratxrat)'=0

91

0.907

0.916

-

0.927

=9

FIG. 3. Similarity matrix for 2-4 (822 nucleotides compared) of mouse and rat class I sequences. By a simple randomization argument, the rat sequences differ significantly from the mouse group (P = 0.0045), since the observed arrangement of the data gives rise to the most extreme values of the intra- and interspecific means out of the 220 different possible arrangements. The sources of the sequences are given in Materials and Methods. exons

H-2Kk

PARI.5

E4

pBS3.6 RT1.AJ

g

-gc~ .

Proc. Nati. Acad. Sci. USA 87 (1990)

Evolution: Rada et al.

2170

translated region (25, 26) is truly a locus-specific marker in these sequences. Segmental exchange between HLA loci has been proposed from patterns of sequence shared between alleles at different loci (27-29). However, clear-cut divergence of sequence between loci in HLA suggests that intralocus exchanges play a more important evolutionary role than interlocus exchanges in generating polymorphism in humans and the higher primates (29, 30). Extensive recent duplication in the murine MHCs may favor nonhomologous pairing and thus a relative excess of interlocus exchanges in this taxonomic group. An explicit genetic mechanism in the MHC that generates a high frequency of short segmental sequence exchanges in the germ line has been proposed (31). The exploitation of a developmentally regulated, tissue-specific mitotic gene conversion process to generate somatic diversity in chicken immunoglobulin V genes (32) may be relevant since it is reminiscent of previous evidence that germ-line conversion in class I genes may also be a mitotic rather than a meiotic event, presumably occurring during germ cell multiplication in fetal life (24).

DISCUSSION

This paper documents species-specific sequence homogenization in the classical and nonclassical class I genes of mice and rats. This result demonstrates beyond reasonable doubt that unconventional mechanisms of sequence exchange, such as gene conversion, are significant in determining the pattern of murine class I sequence evolution. The outstanding problem now is to understand the interplay between mutation and segmental exchange, selection, and drift in originating and sustaining polymorphism in class I MHC molecules. Allelic diversity is apparently accumulating in the population faster than single-base replacements. Thus RTL.Aa differs at about 90 positions on average from any one mouse classical allele in the three long exons. Yet only 11 nucleotides in these three exons are unique to RTL.A5-that is, not found in any of the nine mouse alleles sequenced so far. We therefore conclude that segmental exchange between alleles and between loci has been the principal mode of sequence evolution in classical class I genes in the murine rodents. The scrambling in RTL.Aa of locus-specific markers for H-2K and H-2D/L suggests that the apparent locus specificity arises from stochastic properties of the homogenization process. Perhaps only the B2-SINE insertion in the 3' unAD

IRTle H-2K1 H-2K H-2K H-2K

ME

1

D

D

D

.

.

We thank George Gutman for providing DA strain rats; Dan Denney for his gift of pMSD plasmids; David Lawlor, Tony Jackson, Steve Jameson, Gos Micklem, and David Brown for advice; Nigel

D

ATCTAGU ATGGAGGO CAITGCACCGCGCACGCTGCTCCEGCTGCTGGCGGCCGCCCTGGCCCCGACCCAACCCGCGCGG -g-c- -ca-gt-g-c- -ccagt-g-c-

-cc-9t-c-gt-

-c

-g-

-g-c-

H-2Dk H-2DP H-2Dd

-c-

H-2Ld

-C-

H-2Db pARI.5 pBs3 .6 RT1A

-

-C

-

-

-g

-g-

-g........

.......

g--

-g--c ATCTCA;

A GGMGCCGATGGCACCGCGCACGCTGCTCCTcCTGCTGGcGGCCGCCCTGGCCCCGACCCAGACCCGCGCGG H 73 1

EHJ 5

D.

RTl.Ae H-2Kb H-2Kd

H-23k

H-2K2 Ht-2Kq' H-2Dk

H-2DP H-2Dd H-2L

H-2Db pRTI .A-1 pARI. 5

PBS3.6 R1.Aa

K

.

DD

D

D

..GAAGM&AAC... ......... cc.c..cA..ATG...cc.GG...TG.cATG...G..TG.c..c...G.TA...... AACCICTCCATCGACTGACTCCAACSAKGGAGA t g t-a-tga-a-g -g-ct ................... caatag---ct cg- .......-gt-ctg-ctg-tgt t-a-tga-a-q-... .................. -caatag--ct-----g-.g -t........ t-t-ctg-ctg-tg t -g-t-cC t - t-a-tga-a-g g .................... caatag--ct -g-ccc-tt--ctg-c t ...-tga .....................---....caatag---c t . ..t-t--ctg-ctg-tg -g-c ---cg -gC

-gc

cg-t c-

-g---c----g-c

...caat-tga-ag-........t-t--ctg-ctg-tg .--

c

tt

tt

.

..................

t -

.................. t-gt--ctg--ctg-gt tqt-. .................. -ct.-tg-. tg- .ttt-gt-cgg

................... .......... t-t-ctg--ccg-tg -g--c-t----c-ca-ga-t----ca-t......a tt tg- . t-gt-ctg-ctg-gtg -g--c---g-c ......a tt t-gt-ctg-ctg-gtg tg-. -g--c---g-c

-ggg-c--c-cg-ca

g

c

--c

c

g

t-c-t---g-a-t

t t-t-tg.. -g ta... ga g

..... .ga-a-g ..... .ga-a-g

t-t t-t

... -ta..c c..t-gt-a-.

~~

-g-tgtctgttattct-ga-c-gtg-c-a---c-t-a

......................

t-t-tg...

t

tt

-t. tt-t---ct-c

--C

..........

t

ag-tt-t-t-t

..... -t-c--c-tg-t

.

a-.. .-tg-t-at-b t-c-a-

-a-...

..-

t

AACCTTCTCC&TCGACTGACTCCAACATGGAAA .........C..ACrGGCGGTCC.AGN.G... C.CIAAGSOCAC 147 HH11 1 HI 6-8

RT1.A H-2Kb

H-2Kd H-2Kk

H-2K~l H-2K"s H-2Dk H-2DP H-2D H-2L

H-2Db pRTIl.A-1 pARI .5 pBS3 .6 RT1.A

D

A

.

AG.CATGA

CA:CTTGTCA CTC .G ACGCCCCTGCTCCAGGCA GGACA; G^CG G=

...........................

agtgatggttcatgaccctcattCtct-.-g-

...............

. -c - t--c t-t-tg . g -c -t---c .-a -gt-t-tg . -c . g-t-t-tg.-a t-aa -a---ta-t-tga g

...............

-a

g-ta-t-tg

-a ..

t-a t-t-tg ..........

-a ...............

...............

...............

...............

-a----------- -g -

........

agtgatggttcatgaccctcattctct-.-g-

........

agtgatggttcatgaccctcattctct-.-g........................... -

.........

t-aa

.........

.g-

agtgatggttcatgactctcattctct-.-g-

g

.........

t...........a

t

t-t-tg-

t- aa g.... g

t-aa-

.9.

.

........................... ..g

-.g

...........................

t t -C t................................ -t. ........................................t....tg ................................................ AG.CATrGA CCAG^ G~TCTCTCCCA;A=TAA ........................................................... GT.... .... _CTACGCCCCSGCTCC_- cilrj,,,~~ 1s114 N M -a-t-t ...............

tg

...........................

cttttcttctcacaaa ...............

FIG. 4. (Figure continues

on

opposite page.)

Evolution: Rada et al. 3 ' I1

_

_w _C _n ~ ~ ~ ~ ~ CA;.

Proc. Natl. Acad. Sci. USA 87 (1990)

EGIOK

A.

XR7 .A~

_

H 2Kd

H-2Kk

_

.

2171 D

.

c=.casccascaCCC

-c-t

g -----t.. -a t-ac----_te-ac t-. t- q c a-a t-ac----c----ac t-. ~ ~ tt tg.-e-ca-.t-. *c cC t _ --*c c_---gt-g . . -a-t a c---caea t-. . -t gc-aa t-ac tg ac t gct.. C-.c-a-t ac c-t c-a-t t -ct. . * ac e t t t ac

g

H-2K2 H2Dk

a

tg a---__e__t__ e tg -e--.a-a-t e

H-2DP

H-2Dd H-2Ld H-2Db pRTI .A-1 p~ll .5 pBS3.6 RT1.Aa

_ 1

31

.

A

A .

R51.A~

g

H-2Kk H-2r

H-202" H-20" E-2Dk

H-2DP

c

D

.

.

.

.

.

31

.

ta-acac- ......a -t-t-tgc a ag-t-ca t-g get--aaaacac. .g-t-ca 9-t-t-tc g t-t-tc-a -t----naaacac-. g---. t -ag-t-ca g a--a t-t-tce *aa -.a-g gag-t-ca

.

c-a t--aacac .g.ag-t-cag-a c aacac ......aag-t-ca t-gc -a aacac aag-t-ca --t-tg gc caaacat-............ -t g ag-t-ca t-gc-a-------c- t-aaacacacc-t-tgaag-t-ca t-t-tc- -

--cgt-tg e-t-tg

H-2Dd a e H-2Ld a q -a---H-2Db -c-t-tgt-gc-a pRTI .A-1 -tg .-t-tg gc--at. tt-tg-t-t----c-g------a---q-gpARI .5 pBS3.6 .-t-tga RT1.A~ CAGGACCCTGTCC._CTGTCACTG. 161 H

aaacac-

c

t

t

aag-t-ca

t- g .. . ttg -cct

c -

c--g-a-t --a-t t t g c--a-t g t t g c

-

a_.-t-a--g-ttet---g-a-t -a . -t-a---g--g-tt-a--g-a-t

.-c

.

c--g--a-t-

c-tg -c- -ga-t

......

g a * g

.

CCTGCCCTGCAI 160

a

.

D

t-t-a aagt-ta t-t-a---at-t-a a-

a a a t-t_-aat-a g a t-ta g

*

t-t-a--g

a a

t-t-a---g

~tt_ . a

g

3GGCACCCCTGTGUCC.XCL_ -

H

H

X

X

319

FIG. 4. Sequence alignments for the 5' untranslated region, short exons, and 3' untranslated region to nucleotide 319 [the H-2D/L-specific SINE insertion (25), also found in rat (26) but absent from 3.3/1, begins at nucleotide 320 of the 3' untranslated region] of nine mouse H-2K and H-2D/L locus class I alleles and four rat class I sequences. The sequences are compared against the full sequence of RT1.Aa (top and bottom lines, clone 3.3/1). Locus-specific nucleotides among the mouse classical sequences are marked by letters above the alignment blocks. These letters indicate whether RT1.Aa has the H-2D/L (D) or H-2K (K) base at these positions. Positions where RT1.Aa follows neither H-2D/L nor H-2K are indicated by the letter A. Positions exhibiting species-specific homogenization are indicated by the letter H below the alignment blocks. In the case of single nucleotides, the letter H indicates complete homogenization in both species at each position. For insertions and deletions, the letter H refers only to the presence or absence of the differential segment. The sources of the sequences are given in Materials and Methods. Gaps are indicated by dots, and identity with the sequence of RT1.Aa is indicated by a dash. Miller for fluorescence-activated cell sorter analysis; Evelyn Tredget and Cariel Taylor-Edwards for technical assistance; and Gabriel Dover and Nick Holmes for reading the manuscript. This work was supported in part by grants from the Agricultural Food Research Council Cell Signalling and Recognition Programme and the National Institutes of Health. S.J.P. was supported by a studentship from the Agricultural and Food Research Council; J.v.d.B. was supported by the Bradlow Foundation; and R.L. was supported by St. John's College, Cambridge. J.C.H. acknowledges the generous hospitality from Garry Fathman and his colleagues. 1. Hood, L., Campbell, J. H. & Elgin, S. (1974) Annu. Rev. Genet. 9, 305-353. 2. Dover, G. A. (1982) Nature (London) 299, 111-117. 3. Dover, G. A. & Strachan, T. (1987) in Evolution and Vertebrate Immunity, eds. G. Kelsoe & D. H. Schulze (Univ. of Texas Press, 4. 5. 6. 7. 8. 9. 10.

Austin), pp. 15-33. Lafay, F., Ewald, S. J., McMillan, M., Frelinger, J. A. & Hood, L. (1981) Immunogenetics 12, 21-32. Baltimore, D. (1981) Cell 24, 592-594. Melvold, R. W. & Kohn, H. I. (1989) in Transgenic Mice and Mutants in Major Histocompatibility Complex Research, eds. Egorov, I. & David, C. S. (Springer, New York), in press. Klein, J. & Figueroa, F. (1986) Crit. Rev. Immunol. 6, 295-386. Gunther, E., Wurst, W., Wonigeit, K. & Epplen, J. T. (1985) J. Immunol. 124, 1257-1261. Diamond, A. G., Hood, L., Howard, J. C., Windle, M. & Winoto, A. (1989) J. Immunol. 142, 3268-3274. Ennis, P. D., Jackson, A. J. & Parham, P. (1988) J. Immunol. 141,

642-651.

Steinmetz, M., Moore, K. W., Frelinger, J. G., Sher, B. T., Boyse, E. A. & Hood, L. (1981) Cell 25, 683-692. 12. Spencer, S. C. & Fabre, J. W. (1987) J. Exp. Med. 165, 1595-1608. 11.

13. Yanisch-Perron, C., Vieira, J. & Messing, J. (1985) Gene 33, 103-109. 14. Staden, R. (1984) Nucleic Acids Res. 12, 499-503. 15. Devereux, J., Haeberli, P. & Smithies, 0. (1984) Nucleic Acids Res. 12, 387-395. 16. Radojcic, A., Stranick, K. S., Locker, J., Kunz, H. W. & Gill, T. J. (1989) Immunogenetics 29, 134-137. 17. Holmes, N., Ennis, P., Wan, A. N., Denney, D. W. & Parham, P. (1987) J. Immunol. 139, 936-941. 18. Folger, K. R., Wong, E. A., Wahl, G. & Capecchi, M. R. (1982) Mol. Cell. Biol. 2, 1372-1387. 19. Fukumoto, T., McMaster, W. R. & Williams, A. F. (1982) Eur. J. Immunol. 12, 237-243. 20. Hart, D. N. J. & Fabre, J. W. (1981) Transplant. Proc. 13, 13291332. 21. Diamond, A. G., Larkins, A. P., Wright, B., Ellis, S., Butcher, G. W. & Howard, J. C. (1984) Eur. J. Immunol. 14, 405-412. 22. Livingstone, A. M., Powis, S. J., Diamond, A. G., Butcher, G. W. & Howard, J. C. (1989) J. Exp. Med. 170, 777-795. 23. Koch, S., Koch, N., Robinson, P. & Haemmerling, G. (1983) Transplantation 36, 177-180. 24. Geliebter, J., Zeff, R. A., Melvold, R. W. & Nathenson, S. G. (1986) Proc. Natl. Acad. Sci. USA 83, 3371-3375. 25. Kress, M., Barra, Y., Seidman, J. G., Khoury, G. & Jay, G. (1984) Science 226, 974-977. 26. Mauxion, F., Sobczak, J. & Kress, M. (1989) Immunogenetics 29, 397-401. 27. Seeman, G. H. A., Rein, R. S., Brown, C. S. & Ploegh, H. L. (1986) EMBO J. 5, 547-552. 28. Strachan, T. (1987) Br. Med. Bull. 43, 1-14. 29. Parham, P., Lawlor, D. A., Lomen, C. E. & Ennis, P. D. (1989) J. Immunol. 142, 3937-3950. 30. Hughes, A. L. & Nei, M. (1989) Genetics 122, 681-686. 31. Jaulin, C., Perrin, A., Abastado, J.-P., Dumas, B., Papamatheakis, J. & Kourilsky, P. (1985) Immunogenetics 22, 453-470. 32. Reynaud, C. A., Anquez, V., Grimal, H. & Weill, J.-C. (1987) Cell 48, 379-388.

Concerted evolution of class I genes in the major histocompatibility complex of murine rodents.

Full-length cDNA sequences of two class I major histocompatibility complex molecules from the DA strain of Rattus norvegicus are reported. One codes f...
969KB Sizes 0 Downloads 0 Views