Accepted Manuscript Insect midgut α-mannosidases from family 38 and 47 with emphasis on those of Tenebrio molitor Nathalia R. Moreira, Christiane Cardoso, Alberto F. Ribeiro, Clelia Ferreira, Walter R. Terra PII:

S0965-1748(15)30027-8

DOI:

10.1016/j.ibmb.2015.07.012

Reference:

IB 2743

To appear in:

Insect Biochemistry and Molecular Biology

Received Date: 2 April 2015 Revised Date:

8 June 2015

Accepted Date: 10 July 2015

Please cite this article as: Moreira, N.R., Cardoso, C., Ribeiro, A.F., Ferreira, C., Terra, W.R., Insect midgut α-mannosidases from family 38 and 47 with emphasis on those of Tenebrio molitor, Insect Biochemistry and Molecular Biology (2015), doi: 10.1016/j.ibmb.2015.07.012. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Man1 Man2

EP

TE D

(TmMan2 codes for Man2)

AC C

Man1 and Man2 hydrolase α1,2 α1,3 mannobiose α1,6

M AN U

SC

RI PT

cDNA Library Tm Man2

Peptide synthesis

1

ACCEPTED MANUSCRIPT

Insect midgut α-mannosidases from family 38 and 47 with emphasis on those of

RI PT

Tenebrio molitor

Nathalia R. Moreiraa, Christiane Cardosoa, Alberto F. Ribeirob, Clelia Ferreiraa

M AN U

a

SC

and Walter R. Terraa,*

Departamento de Bioquímica, Instituto de Química, Universidade de São

Paulo, C.P. 26077, 05513-970 São Paulo, Brasil b

Departamento de Genética e Biologia Evolutiva, Instituto de Biociências,

TE D

Universidade de São Paulo, C.P. 11461, 05513-970 São Paulo, Brasil

* Corresponding author. Fax.: +55 11 3091 2186.

AC C

EP

E-mail address: [email protected] (W.R. Terra)

2

ACCEPTED MANUSCRIPT Abstract

α-Mannosidases are enzymes which remove non-reducing terminal residues from glycoconjugates. Data on both GH47 and GH38 (Golgi and lysosomal) enzymes are available. Data on insect midgut α-mannosidases acting in digestion are preliminary and do not include enzyme sequences. T. molitor

RI PT

midgut α-mannosidases were separated by chromatography into two activity peaks: a major (Man1) and a minor (Man2). An antibody generated against a synthetic peptide corresponding to a sequence of α-mannosidase fragment recognizes Man2 but not Man1. That fragment was later found to correspond to

SC

TmMan2 (GenBank access KP892646), showing that the cDNA coding for Man2 is actually TmMan2. TmMan2 codes for a mature α-mannosidase with 107.5 kDa. Purified Man2 originates after SDS-PAGE one band of about 72 kDa

M AN U

and another of 51 kDa, which sums 123 kDa, in agreement with gel filtration (123 kDa) data. These results suggest that Man2 is processed into peptides that remain noncovalently linked within the functional enzyme. The physical and kinectical properties of purified Man1 and Man2 are similar. They have a molecular mass of 123 kDa (gel filtration), pH optimum (5.6) and response to

TE D

inhibitors like swainsonine (Man1 Ki, 68 nM; Man2 Ki, 63 nM) and deoxymannojirimycin (Man1 Ki, 0.12 mM; Man2 Ki, 0.15 mM). Their substrate specificities are a little different as Man2 hydrolyzes α-1,3 and α-1,6 bonds better than α-1,2, whereas the contrary is true for Man1. Thus, they pertain to

EP

Class II (GH38 α-mannosidases), that are catabolic α-mannosidases similar to lysosomal α-mannosidase. However, Man2, in contrast to true lysosomal α-

AC C

mannosidase, is secreted (immunocytolocalization data) into the midgut contents. There, Man2 may participate in digestion of fungal cell walls, known to have α-mannosides in their outermost layer. The amount of family 38 αmannosidase sequences found in the transcriptome (454 pyrosequencing) of the midgut of 9 insects pertaining to 5 orders is perhaps related to the diet of these organisms, as suggested by a large number of lysosomal α-mannosidase in the T. molitor midgut.

Key words: alpha-mannosidase, substrate specifity, imunocytolocalization, posttranslational processing.

3

ACCEPTED MANUSCRIPT 1. Introduction

α-Mannosidases (EC 3.2.1.24) are enzymes which remove non-reducing terminal residues from glycoconjugates. Mammalian α-mannosidases are classified under two classes (I and II) on the base of functional characteristics

RI PT

and sequence homology (Gonzalez and Jordan, 2000). Class I α-mannosidases are of family 47 of glycosyl hydrolases (GH47), only hydrolyse α-1,2 mannose bonds and are localized either in endoplasmic reticulum or Golgi complex. Class II α-mannosidases are of family 38 of glycosyl hydrolases (GH38). They

SC

hydrolyze α-1,2, α-1,3, and α-1,6 mannose bonds from the nonreducing termini of N-glycans during biosynthesis in the endoplasmic reticulum and Golgi

2000).

M AN U

complex and catabolism in the lysosomes and cytosol (Gonzales and Jordan, GH38 α-mannosidases are activated by Zn++ and require two carboxylate groups to catalyze hydrolysis: one serving as a nucleophile and the other as a proton donor (Davies and Henrissat, 1995). The Golgi enzyme has a dual specificity

for

α-1,6

and

α-1,3-linked

mannoses,

whereas

lysosomal

mannosidases (LAM) have broad specificity, cleaving α-1,2, α-1,3, and α-1,6

TE D

linked mannoses, or narrow specificity, hydrolyzing only α-1,6 linked mannoses (Rose, 2012). The LAM is post-translationally modified during maturation and cellular transport by proteolytic processing and by N-linked glycosylation

EP

(Tollersrud et al., 1997).

Insect α-mannosidases have been studied in detail with recombinant enzymes. Data on both GH47 (Ren et al., 1995; Kawar and Jarvis, 2001) and

AC C

GH38 (van den Elsen et al., 2001; Rose, 2012; Nemcovicova et al., 2012) Golgi enzymes are available and lysosomal α-mannosidases (Nemcovicova et al., 2012), as well. There have been also reports on insect midgut α-mannosidases acting in digestion (Chipoulet and Chararas, 1985; Terra et al., 1988; Silva and Terra, 1994; Foster and Roberts, 1997; Bedikou et al., 2009). Those papers, however, are preliminary characterizations of the enzymes and none of them describe the enzyme sequence. In this paper we describe the purification, characterization, sequencing and processing of a T. molitor α-mannosidase and show that it is similar to

4

ACCEPTED MANUSCRIPT

lysosomal α-mannosidase, but it is secreted into the midgut contents, probably acting in fungi digestion.

2. Materials and Methods

RI PT

2.1. Animals

Stock cultures of T. molitor were maintained under natural photoregime conditions on wheat bran at 24–26 ºC and 70–75% r.h. Fully grown larvae of both sexes (each weighing about 0.12 g), having midguts full of food, were

SC

used.

M AN U

2.2. Preparation of T. molitor midgut and wheat bran samples

Larvae were immobilized by placing them on ice after which they were rinsed in water and blotted with filter paper. The larvae were dissected in cold iso-osmotic saline (342 mM NaCl solution) and the midguts removed. The isolated midguts were further divided into two sections: one corresponding to the anterior two-thirds and another to the posterior third and the tissue and

TE D

contents corresponding to each section were recovered. The midgut tissue sections were then homogenized in cold double-distilled water using a Potter– Elvehjem homogenizer, filtered through glass wool (to remove fat) and centrifuged at 25,000 xg for 30 min at 4 °C. The supernatant (“soluble fract ion”)

EP

was recovered, whereas the sediment (“membrane fraction”) was dispersed in double distilled water. Peritrophic membranes and contents were homogenized

AC C

like tissue preparations and centrifuged at 10,000 xg for 10 min at 40C. Centrifugation is necessary to remove undigested food fragments from the homogenates of peritrophic membrane and contents. Wheat bran was treated like the peritrophic membrane and contents. All

samples were stored at -200 C until used. No enzyme inactivation was detected on storage for as long as 6 months.

2.3. Protein determination, enzyme assays, and kinetic parameters

5

ACCEPTED MANUSCRIPT

Protein was determined according to Smith et al. (1985), as modified by Morton and Evans (1992) using ovalbumin as a standard. The activity of αmannosidase was routinely determined with 2.5 mM p-nitrophenyl-α-Dmannopiranoside (NPαMan) as substrate in 20 mM citrate-phosphate buffer pH 6.0 at 30 0C. The activity was determined by measuring the release of p-

RI PT

nitrophenolate at 420 nm as described previously (Terra et al., 1979). The activity of α-mannosidase in 2.5 mM 2α-mannobiose, 3α-mannobiose and 6αmannobiose was determined by measuring the increase in reducing groups (Noelting and Bernfeld, 1948). All assays were carried out in triplicate under

SC

conditions such the activity was proportional to protein concentration and to time. Incubations were carried out for at least four different time periods and the initial rates of hydrolysis were calculated. Controls without enzyme or without hydrolyzes 1 µmol substrate/min.

M AN U

substrate were included. One unit of enzyme (U) is defined as the amount that

The effect of substrate concentration on the activity of T. molitor purified digestive α-mannosidase was determined by using 15 different substrate concentrations (range 0.25-23 mM). Km values (mean ± SEM) were determined

TE D

by a weighted linear regression using the software SigmaPlot (Jardel Scientific, USA). For Ki value determinations, purified α-mannosidase was incubated with at least 5 different concentrations of the substrate (0.2-2 times the Km value) at each of 5 different concentrations of the inhibitor (1-5 times the Ki value). Ki

EP

values were determined from the replots of Lineweaver-Burk plots against inhibitor concentration (Segel, 1975).

AC C

The effect of pH on enzyme activity was measured in the presence of the following buffers (0.2 M): citrate-sodium phosphate (pH 2.6-7.0) and Tris-HCl (pH 7-8). pH stability was evaluated by maintaining enzyme samples at the same buffers as before at 30 ºC for 2 h before assaying in standard conditions.

2.4. Purification of the digestive α-mannosidase from T. molitor larvae

The soluble fraction of T. molitor midgut homogenates (1000 mg protein corresponding to 500 insects) was loaded onto a Hitrap Q XL column (5.0 mL), (FPLC system, Amersham Biosciences) equilibrated with 20 mM Tris-HCl buffer, pH 7.0. Elution was accomplished with a gradient of NaCl from 0 to 1M

6

ACCEPTED MANUSCRIPT

in the same buffer. At all performed purification steps, the flow was set at 0.5 mL/min and fractions of 0.4 mL were collected. Fractions active on NPαMan were pooled, passed through a Superdex 75 HR 10/30 column (Pharmacia Biotech) equilibrated with 20 mM Tris-HCl buffer, pH 7.0, containing 1M NaCl. The active fractions were pooled, the pool buffer was changed with the aid of a

RI PT

High Trap desalting column and then it was passed through a Phenyl Superose column HR 5/5 (AKTA System) equilibrated with 50 mM phosphate buffer, pH 7.0, containing 2M (NH4)2 SO4. Elution was accomplished with a (NH4)2SO4 gradient from 2 to 0 M in the same buffer. The fractions corresponding to each

SC

of the two activity peaks were used as a source of α-mannosidases (peak one named Man1; peak two, Man2).

M AN U

2.5. Gel Filtration for molecular weight determination

Gel filtration was performed in a FPLC System (Amersham Biosciences) using a Superdex 200 column equilibrated and eluted with 20 mM Tris-HCl buffer pH 7.0 containing 1 M NaCl. Fractions of 0.3 mL were collected. Molecular weights were calculated by using the following proteins as standards: cytochrome C (12.4 kDa), ovalbumin (43 kDa), bovine serum albumin (67 kDa),

Dextran (2000 kDa).

TE D

β-amylase (200 kDa). The void volume (VO) was calculated with the aid of Blue

EP

2.6. Sodium dodecyl sulfate (SDS)-Polyacrylamide Gel Electrophoresis (PAGE), Western blotting, and in gel assays Samples containing approx. 2 mg protein were combined with sample

AC C

buffer containing 60 mM Tris–HCl buffer, pH 6.8, 2.5% (w/v) SDS, 0.36mM βmercaptoethanol, 0.5 mM EDTA, 10% (v/v) glycerol and 0.005% (w/v) bromophenol blue. The samples, heated for 5 min at 95 ºC, were loaded onto polyacrylamide gel slabs. The gel slabs and the running buffer contained 0.1% SDS (Laemmli, 1970), and electrophoresis was run at constant voltage of 200 V. Non-denaturating electrophoresis was conducted as described above, but without heating and neither SDS nor β-mercaptoethanol were added in gels. Protein was silver stained (Blum et al., 1987). Molecular-weight markers used: hen egg white ovoalbumin (45 kDa), bovine serum albumin (66.2 kDa),

7

ACCEPTED MANUSCRIPT

rabbit muscle phosphorylase b (97 kDa), Escherichia coli β-galactosidase (113.5 kDa) and rabbit skeletal muscle myosin (200 kDa). Western blotting was performed essentially as previously described (Cristofoletti

et

al.,

2001).

After

SDS-PAGE,

the

proteins

were

electrophoretically transferred onto a nitrocellulose filter, which after a blocking

RI PT

step reacted with the anti-α-mannosidase serum and then with anti-rabbit IgG coupled with peroxidase. After washings, the strips were incubated with chloronaphtol as described by Cristofoletti et al. (2001). The transfer efficiency was evaluated using pre-stained molecular weight markers (BioRad or Sigma,

SC

USA). Pre-immune serum was used in control experiments to show that antiserum was specific.

Prior to in gel detection of enzymatic activity, the gels (prepared without

M AN U

SDS and β-mercaptoethanol) were washed three times for 15 min with 20 mM citrate-sodium phosphate buffer pH 6. α-Mannosidase activity was assayed in gel at 30°C as described in Manchenko (1994), using

0.25 mM 4-

methylumbellipheryl α-D-mannopyranoside in the washing buffer.

2.7. Molecular cloning of a fragment (Tm000) of a cDNA coding α-mannosidase

TE D

Tm000 was obtained by polymerase chain reaction (PCR) amplification of a midgut cDNA library (Cristofoletti et al., 2005) as template. For this we used the primers designed basing in a partial sequence obtained in our laboratory

EP

(Contig 5, Ferreira et al., 2007): Man reverse, 5’ ATTCTTGCCTCCTCGCGAAC 3’; Man forward, 5’ ACCATCAGCTCCACCTCTCC 3’ and of the T3 promoter, ATTAACCCTCACTAAAGGGA;

T7

universal

primer,

AC C

TAATACGACTCACTATAGGG. The PCR was performed using TAQ DNA polymerase (5 units) in 20 mM Tris-HCl buffer, pH 8.4, with 50 mM KCl, 0.2 mM dNTP, and 40 mM MgCl2. The amplification was reached using 30 cycles at the following conditions: 30 s at 94 0C, 45 s at 55 0C and 1.5 min at 72 0C. At the end, the reaction mixture remained a further 10 min at 72 0C. The product of PCR was cloned in pGEM-T Easy Vector (Promega, USA) and sequenced by the Sanger Procedure, resulting in a fragment similar to α-mannosidase sequences deposited in GenBank. The sequence was named Tm000 and is shown in Supplementary Table 1.

8

ACCEPTED MANUSCRIPT 2.8 Preparation of midgut α-mannosidase antiserum

One section of the translated sequence of Tm000 was chosen to be synthesized to generate antibodies in a rabbit. The ideal sequences should be hydrophilic, which are expected to be exposed to the medium. With the aid of the Hopp and Woods scale of mean hydrophilicity (www.expasy.org/proscale/)

RI PT

the following sequence was selected: TDSNGREVLQRTRNSRPDYDY. The peptide was synthesized (90% pure) and linked to KLH (Keyhole Limpet Hemocyanin) by Invitrogen (Life Technologies, Brazil). The synthesized peptide was used to raise antibodies in a rabbit as detailed elsewhere (Jordão et al.,

SC

1996). For each injection, one mg of protein was mixed with Freund’s adjuvant and subcutaneously applied into rabbits. Pre-immune serum was collected as

M AN U

control.

2.9. Ultrastructural immunocytolocalization

T. molitor larvae were dissected in their own hemolymph and tissue sections were fixed (paraformaldehyde/ glutaraldehyde), embedded in L.R. White acrylic resin, incubated with the primary (α-mannosidase antiserum) and

TE D

secondary (goat anti-rabbit IgG coupled to 15nm gold particles) antibodies and examined in a Zeiss EM 900 electron microscope as detailed elsewhere (Silva et al., 1995). As controls, sections were incubated with pre-immune serum

EP

under the same conditions.

2.10. Cloning and sequencing the cDNA coding Man 2 and expression analysis

AC C

of selected T. molitor midgut α-mannosidases

Larvae immobilized on ice were dissected in cold 342 mM NaCl with

gloves,

sterile

forceps

and

glassware

previously

treated

with

diethylpyrocarbonate. The dissected tissues were maintained in an ethanol-ice bath and eventually were stored at -80 ºC. RNA were extracted from the tissues with Trizol following the instructions of the manufacturer, Invitrogen, which are based on Chomczynski and Sacchi (1987), and used to synthesize the corresponding cDNA with the aid of a reverse transcriptase present in the kit

9

ACCEPTED MANUSCRIPT

Superscript (Invitrogen). The resulting cDNA was used as a template for amplifying sequences by PCR with specific primers. The following procedure was employed to clone the cDNA coding for Man 2. Tm000 is contained in contig Tm408 (see next item). This contig lacks the 3’-terminal end of the gene. A search among all contigs which are similar to

RI PT

α-mannosidase, even if did not meet the criteria to be accepted as a true αmannosidase (see next item), revealed contig TmXXX (see Supplementary Table 1) as a probable 3’-end of the α-mannosidase 2 gene. With the aid of primers designed from contig TmXXX (forward 5’ ATGAGCTGCTGGTGTTT 3’)

SC

and contig Tm408 ( reverse 5’ GTTATAGCTCACGGTCGC 3’) and with a midgut cDNA library prepared from midgut mRNA by reverse transcription, a cDNA clone coding for the Man 2 full length sequence was obtained (Fig. 5) by

M AN U

PCR. This was carried out using TAQ DNA polymerase (Invitrogen) in reaction buffer containing 1.5 mM MgCl2. PCR conditions were: 35 cycles of 30 s at 94 ºC (denaturation), 45 s at 55 ºC (annealing) and 2.5 min at 72 ºC (synthesis). At the end, the reaction mixture remained a further 10 min at 72 0C for completing elongation. The full length cDNA coding for Man2 was named TmMan2 and was deposited in GenBank with the accession number KP892646.

TE D

For tissue expression analysis, cDNA prepared from the different tissues were used with the following primers: Contig Tm000 (forward 5´ GAT GTG GCT GCT TGT TGG TCA 3´; reverse 5´ GAC ATT CTC TTC CAA GAG CAT 3´); 1774

(forward



EP

Contig

GTTGTAGTCTGGACTGTA

AGATTCGTCCAAGTGGAA3´,

3´),

Contig

602

reverse

(forward

5´ 5´

TGGCAGATAGATCCTTTC 3´, reverse 5´ GCTGACGATTTCTGTTCC 3´).

AC C

PCR reaction was realized using TAQ DNA polymerase (Invitrogen) in

reaction buffer containing 1.5 mM MgCl2. PCR conditions were: 25 cycles of 1 min at 94 ºC (denaturation), 1 min at 52 ºC (annealing) and 1 min at 72 ºC (synthesis). The number of cycles was chosen after several trials so that the amplification was in log phase and resulting in a clearly visible amplified band. The sizes of the products of RT-PCR were those predicted, thus discounting a possible contamination with genomic DNA.

2.11. Annotation of contigs obtained by cDNA pyrosequencing of midgut samples and computational analysis of sequences

10

ACCEPTED MANUSCRIPT

Total messenger RNA for cDNA transcription was extracted from the midgut

of

(Hemiptera:

Periplaneta

americana

Heteroptera)

and

(Dyctioptera),

Bucephalonia

Dysdercus xanthophis

peruvianus (Hemiptera:

Auchenorrhyncha) adults and from the larval midgut of T. molitor (Coleoptera:

RI PT

Tenebrionidae), Zabrotes subfasciatus (Coleoptera: Bruchidae), Rhynchosciara americana (Diptera:Nematocera), Musca domestica (Diptera:Cyclorrhapha), Diatraea saccharalis (Lepidoptera:Crambidae) and Spodoptera frugiperda (Lepidoptera:Noctuidae). cDNA pyrosequencing of all samples, except those

SC

from Z. subfasciatus was performed using a platform 454 Genome Sequencer FLX (454 Life Sciences/Roche), and the contigs were assembled as described in Ferreira et al.(2014). cDNA sequencing of samples from Z. subfasciatus was

M AN U

performed in the HiScan sequencer of Illumina, as described in Gomez et al. (2013). The contigs were annotated with the aid of the dCAS software (http://exon.niaid.nih.gov), which performs BLASTx in databanks (nr, pfam, GO, KOG and Swiss-Prot). The hit with higher score was annotated. The sequences were searched in our transcriptomes using the words αmannosidase or α-mannopyranosidase and the annotation of selected

TE D

sequences was manually curated by multiple sequence alignments (Bioedit version 7.1.11, Hall, 1999) with reference sequences. The prediction of sequence features was done as follows: signalP

EP

(Emanuelsson et al., 2007), big-PI Predictor (Eisenhaber et al., 2000), NetNGlyc (Gupta et al., 2004), NetOGlyc (Steentoft et al., 2013). The presence of transmembrane regions was analysed by TMHMM Server v. 2.0

AC C

(http://www.cbs.dtu.dk/services/TMHMM-2.0); HMMTOP (Tusnády and Simon, 2001). Selected amino acid sequences of α-mannosidases from different families were aligned using the ClustalW Multiple Alignment tool in BioEdit Sequence Aligment Editor (http://www.mbio.ncsu.edu/BioEdit/BioEdit.html). Phylogenetic analysis was performed with the program MEGA 6 (Tamura et al., 2013). The cladograms of chosen sequences were inferred using the maximum likelihood algorithm and confidence estimated with bootstrap (1,000 replicates) (Felsenstein, 1985; Hillis and Bull, 1993). The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history

11

ACCEPTED MANUSCRIPT

of the taxa analyzed (Felsenstein, 1985). Branches corresponding to partitions reproduced in less than 50% bootstrap replicates were collapsed.

3. Results

RI PT

3.1. Distribution of α-mannosidase activity along the midgut Most α-mannosidase activity is soluble and is found in luminal contents of the first two-thirds of the midgut, with a specific activity of the same order as that in cells of the same region (Table 1). This means that α-mannosidase

SC

activity is secreted into the midgut contents; otherwise specific activities in contents would be significantly lower. The small amounts of α-mannosidase activity recovered in pellets of the centrifuged samples of midgut homogenates

M AN U

(membrane fraction) may be the soluble enzyme adsorbed onto the cell surface instead of truly membrane proteins, as shown before (see Terra and Ferreira, 2012, for references). This was not further studied. The possibility that part of the luminal α-mannosidase activity originated from the ingested food was discounted. The activity of α-mannosidase recovered from 20 mg wheat bran

mU/animal).

TE D

(which corresponds to the fresh weight of T. molitor midgut) is negligible (0.18

3.2. Purification of midgut soluble α-mannosidase

EP

The soluble fraction of midgut contents was applied onto a Hitrap Q column and a single major activity peak of α-mannosidase was found (Fig. 1A). The eluted fractions active fractions were pooled and passed through a

AC C

Superdex 75 column and again a single activity peak was recovered (Fig. 1B). Finally, the pooled active fractions were applied onto a Phenyl Superose column and a major (Man 1) and a minor (Man 2) peak of activity were eluted (Fig. 1C).

SDS-PAGE of the fractions corresponding to Man 1 (Fig. 1D, lane 2) and Man 2 (Fig. 1D, lane 4) showed several bands. Man1 originates major bands of about 75, 54, and 43 kDa; whereas Man2 forms bands with 72 and 51 kDa. Nevertheless, Man1 and Man2 result in a single activity peak in gel filtration with 123 kDa (Fig.2) and correspond to a single α-mannosidase activity in gel assays (not shown). The fact that Man1 and Man2 originate single active bands

12

ACCEPTED MANUSCRIPT in native conditions (gel filtration and

assays) and smaller peptides in SDS-

PAGE suggests that the detected fragments remain noncovalently linked in the active form.

3.3. Kinectical characterization of purified α-mannosidases

RI PT

Purified Man 1 and Man 2 were separately assayed at different conditions with NPαMan as substrate for determining kinetic parameters. Both enzymes were stable between pH 4 and 7 and have a pH optimum of 5.6 (Table 2). Their Km values towards NPαMan are close (Table 2).

SC

Man 1 is competitively inhibited by deoxymannojirimycin (Fig.3) and by Swainsonine (Table 2). The same is true for Man 2 (Table 2). Ki values for the two inhibitors do not significantly differ between Man 1 and Man 2. As expected,

M AN U

inhibition by Swainsonine occurs in the nanomolar range, whereas for deoxymannojirimycin, in the milimolar range.

Substrate specificity was evaluated for both Man 1 and Man 2 regarding 3 substrates (Table 3). The results showed that Man 2 hydrolyzes α-1,3 and α1,6 bonds better than α-1,2, whereas the contrary is true for Man 1 (Table 3).

TE D

3.4. Immunocytochemical localization of Man 2 in T. molitor midgut cells. Purified Man 1 and Man 2 were not good enough to generate antibodies to be used in immunocytochemical studies. For this, an alternate strategy was

EP

employed.

A fragment of a cDNA-coding α-mannosidase from T. molitor midgut was cloned by PCR amplification of a midgut cDNA expression library with

AC C

appropriate primers. The sequence obtained (Tm000, Supplementary Table 1) was used in selecting a peptide (TDSNGREVLQRTRNSRPDYDY, see details in Material and Methods) that once synthesized was used to raise antibodies in a rabbit.

The α-mannosidase antiserum recognizes Man 2 (Fig. 1D, lane 5), but not Man 1 (Fig.1D, lane 3). Furthermore, the antiserum recognizes a single protein band both in midgut contents (Fig. 1D, lane 6) and in midgut tissue (Fig. 1D, lane 7) with the same migration as Man2. Thus, the antibody is specific for Man 2 and can be used to study its midgut localization.

13

ACCEPTED MANUSCRIPT

Man 2 labeling in T. molitor midgut is observed in large vesicles and outside the anterior midgut cells (Fig. 4A, arrow), whereas in posterior midgut, labeling is seen outside and rarely inside the cells (Fig.4B, arrow). The data suggest that Man 2 is secreted by an apocrine route in the anterior midgut, as previously described for amylase (Cristofoletti et al., 2005). The posterior

RI PT

midgut does not show significant labeling inside the cells, discounting a major Man 2 secretion by these cells. These results agree with the midgut distribution of α-mannosidase activity (Table 1).

SC

3.5. Cloning and sequencing the cDNA coding Man 2

Tm000 is contained in contig Tm408 (see next item). This contig lacks the 3’-terminal end of the gene. A search among all contigs which are similar to

M AN U

α-mannosidase (as compared to XP_968517.2 from Tribolium castaneum), even if did not meet the criteria to be accepted as a true α-mannosidase (see next item), revealed contig TmXXX (see Supplementary Table 1) as a probable 3’-end of the α-mannosidase 2 gene. With the aid of primers designed from contig TmXXX and contig Tm408 and with a midgut cDNA library prepared from midgut mRNA by reverse transcription, a cDNA clone coding for the Man 2 full

TE D

length sequence was obtained (Fig. 5). This sequence (TmMan2) was deposited in GenBank with the accession number KP892646. The putative TmMan2 precursor sequence has a signal peptide

EP

(corresponding to 18 amino acids) and codes for a mature protein with 954 amino acids, with an estimated molecular mass of 107.511 kDa and an isoelectric point of 4.33. TmMan2 amino acid sequence has the catalytical

AC C

residues (D166 and D288, precursor numbering), the residues responsible for Zn++ coordination (H50, H52, D166, H416) and the Arg residue (R190) that modulates the pKa of D166 (Suits et al., 2010). TmMan2 sequence also has 6 Cys residues homologous to Bos taurus α-mannosidase (AAC48763.1) that form disulphide bonds (C33-C329, C237-C242, C466-C474), but lacks Cys in positions 387 and 442 that form the fourth disulphide bond in the bovine αmannosidase (Heikinheimo et al., 2003). There are three putative Nglycosylation sites (N470, N577 and N895) and 8 O-glycosylation sites (S328, S634, T722, S725, S934, S938, S939, and T940). From the putative N-

14

ACCEPTED MANUSCRIPT

glycosylation sites, B. taurus is supposed to be glycosylated in the residues homologous to TmMan2 N470, N895 (Tollersrud et al., 1997).

3.6 Identification of α-mannosidase sequences in the transcriptome of different

insects

RI PT

Transcriptomes annotated as detailed in Material and Methods were searched for α-mannosidases. At first, 54 coding sequences were recovered, translated and the predicted sequences were manually curated.

Forty one sequences identified as pertaining to family GH38 (Henrissat

SC

and Davies, 1997) were checked for the presence of catalytic and zinc-binding residues by alignment with Streptococcus pyrogens α-mannosidase (GenBank AAK34381.1). All accepted sequences had the catalytic and zinc-binding

M AN U

residues, except for few sequences that lacked one of the zinc binding residues because they ended before attaining the site of one of those residues. α-Mannosidases sequences from GH47 family have 3 acid residues important for enzyme catalysis (E330, D463 and E599 in human αmannosidase AAF03215.1) and a conserved Arg residue (R334 in the human enzyme). The 13 GH47 sequences found in our transcriptomes were aligned

TE D

with the human sequence. Seven sequences having at least 2 of the 4 residues mentioned above were accepted for further analysis. The features of the curated sequences of α-mannosidases are listed in

EP

Tables 4 and 5 and the corresponding FASTA sequences are presented in Supplementary Table 1.

Following the criteria described above, 13 GH38 and 6 GH47 α-

AC C

mannosidase sequences were found in the transcriptome of the 9 insects. In B. xanthophis, D. peruvianus and R. americana only GH47 enzymes were found, whereas in P. americana, only GH38 mannosidases are present. Unexpectedly, no α-mannosidases were found in the intestine of Z. subfasciatus, D. saccharalis and S. frugiperda. On the other hand, T.molitor has a high number of α-mannosidases (Tables 4 and 5). As the enzymes have about 1000 amino acid residues, it is likely that true mannosidase sequences have been discarded because they do not have the catalytical and/or zinc binding residues. The enzymes are not expressed in

15

ACCEPTED MANUSCRIPT

large quantities, especially those belonging to family 47, which can make it difficult to obtain a large extension of their sequences. Family 47 α-mannosidase sequences from our transcriptomes plus sequences from characterized animal α-mannosidases were used to make a cladogram (not shown). The sequences form 2 monophyletic groups. One

RI PT

(bootstrap 100) contains all the sequences retrieved from data bases plus the contigs Md1922 and Tm2371. The other contigs from our transcriptomes branched together in the other group. More studies are needed to disclose the reason why the sequences form these 2 branches.

SC

Among the sequences of family 47, only Bx2085 has a transmembrane helix predicted and no potential GPI-modification site was found in the sequences. The position of the transmembrane helix present in Bx2085 does

M AN U

not coincide with that presented by H. sapiens enzyme. The N-terminal portion of all the insect mannosidases from our transcriptomes initiated after the transmembrane helix from the H. sapiens enzyme.

Fig. 6 shows a cladogram of family 38 α-mannosidases found in our transcriptomes plus sequences of α-mannosidases present in lysosomes, Golgi and cytosol from different organisms. The cytosol enzymes (CAM) form one

TE D

monophyletic grouping with bootstrap 100. The AMII α-mannosidases from D. melanogaster, M. musculus and H. sapiens are Golgi enzymes and branch together with the S. frugiperda enzyme, indicating that the later enzyme is

EP

probably also present in Golgi apparatus. The sequences found in our transcriptomes are all in the same group of the lysosomal enzymes (LAM) (bootstrap 75), indicating that the sequences found by us are similar to

AC C

lysosomal enzymes.

3.7. Expression analysis by RT-PCR of α-mannosidases genes in T. molitor midgut

RT-PCR data (Fig. 7) shows that the mRNA coding for Tm602 is expressed along the whole midgut and Malpighian tubules (here very faintly) whereas Tm000 (which corresponds to TmMan2) and Tm1774 are expressed only along the midgut. The data suggest that TmMan2, Tm602 and Tm1774 have a midgut function.

16

ACCEPTED MANUSCRIPT 4. Discussion

4.1. Distribution of α-mannosidase activity in T. molitor midguts Most α-mannosidase activity in larval midguts of T. molitor is found in luminal contents of the anterior two thirds of the midgut, which midgut luminal

RI PT

pH is 5.6. The luminal pH of posterior midgut is 7.9 (Terra et al., 1985). The contribution of α-mannosidase activity coming with ingested bran is negligible, meaning that all activity is actually secreted by the midgut cells.

Digestive enzymes are secreted by midgut cells according to 3

SC

mechanisms. One is apocrine secretion, where the secretory vesicles undergo fusions, leading to larger vesicles (aposomes) that after release, involving the loss of at least 10% of the apical cytoplasm, eventually free their contents by

M AN U

solubilization. If the loss of cytoplasm is very small, the secretory mechanism is called microapocrine. The other mechanism is the classical exocytic route, where secretory vesicles fuse with the midgut cell apical membrane, emptying their contents without any loss of cytoplasm (Terra and Ferreira, 2012). Coleopterans usually have apocrine secretion in anterior and exocytic secretion in posterior midgut (Cristofoletti et al., 2001).

TE D

Immunocytochemical data showed that Man2, one of the two αmannosidases chromatographically purified from T. molitor midguts, was found associated with an aposome in anterior midgut cells, as described previously for

EP

amylase (Cristofoletti et al., 2001), meaning that it is secreted by an apocrine mechanism. α-Mannosidase labeling observed at the microvilli of posterior midgut cells mostly correspond to enzyme molecules synthesized in anterior

AC C

midgut but displaced toward the posterior midgut with the food bolus. Occasionally Man2 labeling is observed in posterior midgut cells. This suggests that Man2 is secreted by a minor exocytic secretory route at this region. It should be noted, however, that the high expression of TmMan2 (RT-PCR data) in posterior midgut is not accompanied by a remarkable Man2 labeling and enzyme activity there. It is not clear the meaning of this.

4.2. Purification, properties and functions of T. molitor α-mannosidases T.

molitor

midgut

α-mannosidase

activity

was

separated

by

chromatography into two activity peaks: a major (Man1) and a minor (Man2). An

17

ACCEPTED MANUSCRIPT

antibody generated against a synthetic peptide corresponding to a sequence of a α-mannosidase fragment (later shown to correspond to TmMan2, GenBank access KP892646) recognizes Man2 but not Man1. Thus, the cDNA coding for Man2 is actually TmMan2. TmMan2 codes for a mature α-mannosidase with 107.5 kDa, whereas

RI PT

purified Man2 has 123 kDa, as determined by gel filtration. Bovine lysosomal αmannosidase has 104 kDa, as deduced from the cDNA sequence, whereas the purified enzyme has 119 kDa, which is reduced to 106 kDa on deglycosilation (Tollersrud et al., 1997). Likewise, the difference found between the mass of the

SC

protein coded by TmMan2 and purified Man2 may be accounted by glycosylation. There are several putative glycosylation sites along the Man2

1997, Heikinheimo et al., 2003).

M AN U

sequence and α-mannosidases are known to be glycosylated (Tollersrud et al.,

Man2 originates after SDS-PAGE one band of 72 kDa and another of 51 kDa, which sums 123 kDa, in agreement with gel filtration data. Only the band of 72 kDa is recognized by the α-mannosidase antiserum. These results suggest that Man2 is processed into peptides that remain noncovalently linked within the functional enzyme. Since the α-mannosidase antiserum was prepared

TE D

using a peptide corresponding to the C-terminal end of the molecule (see Fig. 6), the band with 72 kDa should correspond to the C-terminal peptide and that with 51 kDa, to the N-terminal end.

EP

α-Mannosidase processing into two peptides that remain noncovalently linked in the functional enzyme has been described before for lysosomal αmannosidases (Heikinheimo et al., 2003, Nemcovicova et al., 2012).

AC C

The physical and kinectical properties of purified Man1 and Man2 are

similar. They have a molecular mass of 123 kDa (gel filtration), pH optimum (5.6) and response to inhibitors like swainsonine (Man1 Ki, 68 nM; Man2 Ki, 63 nM) and deoxymannojirimycin (Man1 Ki, 0.12 mM; Man2 Ki, 0.15 mM). Their substrate specificities are a little different as Man2 hydrolyzes α-1,3 and α-1,6 bonds better than α-1,2, whereas the contrary is true for Man1. These characteristics permit to classify Man1 and Man2 as α-mannosidases pertaining to Class II (GH38 α-mannosidases), that are catabolic α-mannosidases similar to the lysosomal α-mannosidases (Rose, 2012).

18

ACCEPTED MANUSCRIPT

The identification of Man2 as a lysosomal enzyme on the basis of its kinectical properties agrees with that resulting from its post-translational processing which is typical from lysosomal enzymes. Furthermore, most residues in the relevant positions of the holding and anchor sites described for D. melanogaster Golgi α-mannosidase (V61, Q64, Y267, H273, P298 and

RI PT

W299) (Shah 2008; Zhong et al, 2008) differ in TmMan2. This was based in an alignment of the sequences from lysosomal enzymes from B. taurus (AAC48763), Capra hircus (AEZ02307.1), Cavia porcellus (AAL58982.1), Felis catus (AAB97672.1), H. sapiens (AAC51362.1), Mus musculus (NP_034894.2)

musculus

(CAA43480.1),

H.

sapiens

SC

and D. melanogaster (AAD38576.1) plus the Golgi enzymes from Mus (AAC50302.1),

D.

melanogaster

(CAA54732.1) and TmMan2. The branching of TmMan2 together with

M AN U

lysosomal α-mannosidases in a cladogram of GH38 α-mannosidases also lends support to a lysosomal origin of the protein coded by TmMan2, which is Man2. It is important to stress, however, that Man2, in contrast to true lysosomal αmannosidases, is secreted into the midgut contents. There, Man2 may participate in digestion. Similarly, cathepsin L2 and cathepsin L3 are proteinases that have counterparts in the lysosomes but are also secreted into

TE D

luminal contents (Beton et al., 2012).

It is known that T. molitor food is rich in fungal cells that are absent from the midgut (Genta et al., 2006a). T. molitor midgut has a laminarinase with

EP

high lytic power against fungal cell walls (Genta et al., 2007) and the specific activity of its chitinase is higher than in other insects (Genta et al., 2006b), indicating that T. molitor is adapted to digest fungal cell walls. α-Mannosides are

AC C

present in the outermost layer of fungi cell walls (Martinez-Esparza et al., 2006). Thus, it is possible that TmMan2 is used to remove mannose units during fungal digestion.

Man1 is a lysosomal α-mannosidase regarding its kinetic properties

and may have the same digestive function as Man2. T. molitor midgut expresses other major GH38 α-mannosidases, in addition to TmMan2, all of them branching together with lysosomal α-mannosidases in a cladogram. One of these is contig Tm602 that is expressed along the whole midgut and Malpighian tubules and may code Man1, thus participating in luminal fungal digestion.

19

ACCEPTED MANUSCRIPT

More work is necessary to propose a function for Tm2371 (GH47 αmannosidase).

4.3. Comparison of insect midgut α-mannosidase sequences The amount of family 38 α-mannosidase sequences found in the

RI PT

transcriptome of midgut of insects is perhaps related to the diet of these organisms. This is suggested by the fact that there is a larger number of lysosomal-like α-mannosidases in the T. molitor midgut than in any other. As shown above, T. molitor midgut α-mannosidases are supposed to be involved in

SC

fungal digestion.

In the active site of family 47 α-mannosidase there are 3 acid residues (E330, D463 and E599, human endoplasmic reticulum mannosidase,

M AN U

AAF03215.1), plus an Arg (R334) that could interact with E330 and an His (H524) (Karaveg et al., 2005). Mutagenesis of H334 to an Ala residue reduced the kcat value 4 times and slightly increased Km value, indicating that H524 has no significant role in enzyme catalysis (Karaveg et al., 2005). It is interesting to note that all sequences obtained in our transcriptomes have one Ala residue

Acknowledgments

TE D

instead of His in this position. It is not clear the meaning of this.

EP

This work was supported by the Brazilian research agencies FAPESP (Tematico) and CNPq. We are indebted to W. Caldeira, and M.V. Cruz for technical assistance and to Dr. Ariel M. Silber for his help in selecting peptides

AC C

to be synthesized to raise antibodies against α-manosidase. Nathalia R. Ramalho was a graduate fellow of FAPESP. C. Ferreira and W.R. Terra are staff members of their respective departments, research fellows of the CNPq, and members of the Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular (INCT-EM).

References Bedikou, E.M., Ahi, A.P., Kone, F.M., Gonnety, J.T., Faulet, B.M., Kouame L. P., Niamke, S.L., 2009. Biochemical properties of extracellular α-

20

ACCEPTED MANUSCRIPT

mannosidases from the digestive fluid of Rhynchophorus palmarum larvae. Bul. Insect. 62, 75-84. Beton, D., Guzzo, C.R., Ribeiro, A.F., Farah, C.S., Terra, W.R., 2012. The 3D structure and function of digestive cathepsin L-like proteinases from Tenebrio molitor larval midgut. Insect Biochem. Molec. Biol. 42, 655-664.

Proteolytic enzymes. Academic Press, London.

RI PT

Blum, H., Beier, H., Gross, H.J., 1987. Improved silver staining of plant proteins,

Chipoulet, J.M., Chararas, C., 1985. Survey and electrophorectical separation of the glycosidases of Rhagium inquisitor (Coleoptera: Cerambycidae) larvae.

SC

Comp. Biochem. Physiol. 80B, 241-246.

Chomczynski, P., Sacchi, N., 1987. Single-step method of RNA isolation by acid guanidinium triocyanate–phenol–chloroform extraction. Anal. Biochem. 162,

M AN U

156–159.

Cristofoletti, P.T., Ribeiro, A.F., Terra, W.R., 2001. Apocrine secretion of amylase and exocytosis of trypsin along the midgut of Tenebrio molitor larvae. J. Insect Physiol. 47, 143-155.

Cristofoletti, P.T., Ribeiro, A.F., Terra, W.R., 2005. The cathepsin L-like proteinases from the midgut of Tenebrio molitor larvae: sequence,

TE D

properties, immunocytochemical localization and function. Insect Biochem. Mol. Biol.35, 883-901.

Davies, G., Henrissat, B., 1995. Structures and mechanisms of glycosyl

EP

hydrolases. Struct. 3, 853-859.

Eisenhaber, B., Bork, P., Yuan, Y., Loeffler, G., Eisenhaber, F., 2000. Automated annotation of GPI anchor sites: case study C.elegans. TIBS 25,

AC C

340-341.

Emanuelsson, O., Brunak, S., von Heijne, G., Nielsen, H., 2007. Locating proteins in the cell using TargetP, SignalP, and related tools. Nat. Protoc. 2, 953-971.

Felsenstein, J., 1985. Confidence limits on phylogenies: an approach using the bootstrap. Evolution, 39, 783-791. Ferreira, A.H.P., Cristofoletti, P.T., Lorenzini D.M., Guerra, L.O., Paiva, P.B. , Briones, M.R.S., Terra, W.R., Ferreira, C., 2007. Identification of midgut microvillar proteins from Tenebrio molitor and Spodoptera frugiperda by cDNA library screenings with antibodies. J Insect Physiol., 53, 1112-1124.

21

ACCEPTED MANUSCRIPT

Ferreira, C., Rebola, K.G.O., Cardoso C., Bragatto I., Ribeiro+, A.F., Terra, W.R., 2014. Insect midgut carboxypeptidases with emphasis on S10 hemipteran and

M14

lepidopteran

carboxypeptidases.

Insect

Mol.

Biol.

doi:

10.1111/imb.12151. Foster, J.M., Roberts, D.B., 1997. The soluble α-mannosidases of Drosophila

RI PT

melanogaster. Insect Biochem. Mol. Biol. 27, 657-661. Genta, F.A., Dillon, R.J., Terra, W.R., Ferreira, C., 2006a. Potential role for gut microbiota in cell wall digestion and glucoside detoxification in Tenebrio molitor larvae. J. Insect Physyol., 52, 593-601.

SC

Genta, F. A.; Blanes L.; Cristofoletti P.T.; Lago, C.L.; Terra W.R.; Ferreira C. 2006b. Purification, characterization and molecular cloning of the major

789-800

M AN U

chitinase from Tenebrio molitor larval midgut. Insect Biochem. Mol. Biol. 36,

Genta, F. A.; Dumont A.F.; Marana S.R.; Terra W.R.; Ferreira C. 2007. The interplay of processivity, substrate inhibition and a secondary substrate binding site of an insect exo-beta-1,3-glucanase. BBA. Proteins and Proteomics, 1774, 1079-1091

Gomez, A., Cardoso, C., Genta, F.A., Terra, W.R., Ferreira, C., 2013. Active

TE D

site characterization and molecular cloning of Tenebrio molitor midgut trehalase and comments on their insect homologs. Insect Biochem. Molec. Biol. 43, 733-741.

EP

Gonzalez, D.S., Jordan, I.K., 2000. The α-mannosidases: phylogeny and adaptive diversification. Mol. Biol. Evol. 17, 292-300.

AC C

Gupta, R., Jung, E., Brunak, S., 2004. Prediction of N-glycosylation sites in human proteins. http://www.cbs.dtu.dk/services/NetNGlyc/.

Hall, T.A., 1999. Bioedit: userfriendly alignment editor and analysis program for windows 95/98/NT. Nucl.Acids Symp. Ser. 41, 95-98.

Heikinheimo, P., Helland, R., Leiros, H.K., Leiros, I., Karlsen, S., Evjen, G., Ravelli, R., Schoehn, G., Ruigrok, R., Tollersrud, O.K., McSweeney, S., Hough, E., 2003.The structure of bovine lysosomal alpha-mannosidase suggests a novel mechanism for low-pH activation. J. Molce. Biol., 28, 631644. Henrissat, B., Davies, G.J., 1997. Structural and sequence-based classification of glycoside hydrolases. Curr. Op. Struct. Biol. 7, 637-644

22

ACCEPTED MANUSCRIPT

Hillis, D.B., Bull, J.J., 1993. An empirical test of bootstrapping as a method for assessing confidence in phylogenetic trees. Systematic Biology, 42, 182192. Jordão, B.P., Terra, W.R., Ribeiro, A.F., Lehane, M.J. and Ferreira, C., 1996. Trypsin secretion in Musca domestica midguts: a biochemical and

RI PT

immunocytochemical study. Insect Biochem. Mol. Biol., 26: 337-346. Karaveg, K., Siriwardena. A., Tempel W., Liu Z-J., Glushka J., Wang B-C., Moremen K. W., 2005. Mechanism of Class 1 (Glycosylhydrolase Family 47) α-Mannosidases involved in N-Glycan processing and endoplasmic

SC

reticulum quality control. J. Biol. Chem., 280, 16197-16207.

Kawar, Z., Jarvis, D.L., 2001. Biosynthesis and subcellular localization of a

289-297.

M AN U

lepidopteran insect alpha 1,2-mannosidase. Insect Biochem. Mol. Biol. 31,

Laemmli, U.K., 1970. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227, 680–685.

Manchenko, G. P. 1994. Handbook of detection of enzymes on electrophoretic gels. Boca Raton: CRC Press

Martinez-Esparza, M., Sarazin, A., Jouy, N., Poulain, D., Jouault, T. 2006.

TE D

Comparative analysis of cell wall surface glycan expression in Candida albicans and Saccharomyces cerevisae yeasts by flow cytometry.J. Imm. Meth. 314, 90-102.

EP

Morton, R.E., Evans, T.A., 1992. Modifications of the bicinchoninic acid protein assay to eliminate lipid interference in determining lipoprotein protein content. Analyt Biochem. 204, 332-334.

AC C

Nemcovicova, I., Nemcovic, M., Sestak, S., Plskova, M., Wilson, I.B.H., Mucha, J., 2012. Expression, purification and preliminary crystallographic analysis of Drosophila melanogaster lysosomal α-mannosidase. Acta Crystallogrf.,

68, 965-970.

Noelting G. & Bernfeld P., 1948. Sur les enzymes amylolitiques. III. La betaamilase: dosage d'activité et contrôle de l'absence d'alfa-amilase. Helv. Chim. Acta 31, 286-290. Ren, J., Bretthauer R.k., Castellino, F.J., 1995. Purification and properties of a Golgi-derived (α1,2)-mannosidase-I from Baculovirus-infected lepidopteran

23

ACCEPTED MANUSCRIPT

insect cells (IPLB-SF21AE) with preferential ativity toward mannose6-Nacetylglucosamine2. Biochemistry 34, 2489-2495. Rose, D.R., 2012. Structure, mechanism and inhibition of Golgi α-mannosidase II. Curr. Op. Struct. Biol. 22, 558-562. Segel, I. H., 1975 Enzyme Kinetics. Wiley, New York. Shapiro, A.L., Vinuela, E.,

RI PT

Maizel, J.V., 1967. Molecular weight estimation of polypeptide chains by electrophoresis in SDS-polyacrylamide gels. Biochem. Biophys. Res. Commun. 28, 815-820.

Shah, N., Kuntz, D.A., Rose, D.R., 2008. Golgi α-mannosidase II cleaves two

SC

sugars sequentially in the same catalytic site. Proc. Nat. Acad. Sci., 105, 9570-9575.

Silva, C.P., Ribeiro, A.F., Gulbekian, S., Terra, W.R., 1995. Organization, origin

M AN U

and function of the outer microvillar (perimicrovillar) membranes of Dysdercus peruvianus (Hemiptera) midgut cells. J. Insect Physiol., 41, 1093-1103.

Silva, C.P., Terra, W.R., 1994. Digestive and absortive sites along the midgut of the

cotton

seed

sucker

bug

Dysdercus

peruvianus

(Hemiptera:

Pyrrhocoridae). Insect Biochem. Molec. Biol. 24, 493-505. P.K.,

Krohn,R.I.,

Hermanson

TE D

Smith,

G.T.,

Mallia,

A.K.,

Gatner,F.H.,

Provenzano, M.D.,Fujimoto, E.K., Goeke, N.M., Olson, B.J., Klenk D.C.,. 1985. Measurements of protein using bicinchoninic acid. Analyt Biochem

EP

150, 76-85.

Suits, M.D.L., Zhu, Y., Taylor, E.J., Walton, J., Zechel D.L., Gilbert H.J., Davies, G.J., 2010. Structure and kinetic investigation of Streptococcus pyogenes

AC C

family GH38 α-mannosidase. PLoS ONE 5, e9006.

Steentoft, C., Vakhrushev, S.Y., Joshi, H.J., Kong, Y., Vester-Christensen, M.B., Schjoldager, K.T., Lavrsen, K., Dabelsteen, S., Pedersen, N.B., Marcos-Silva, L., Gupta, R., Bennett, E.P., Mandel, U., Brunak, S., Wandall, H.H., Levery, S.B., Clausen, H., 2013. Precision mapping of the human OGalNAc glycoproteome through SimpleCell technology. EMBO J. 32,:14781488. Tamura, K., Stecher, G., Peterson, D., Filipski, A. and Kumar S.. 2013. MEGA 6: Molecular evolutionary genetics analysis version 6.0. Molec. Bio. Evol. 30, 2725-2729.

24

ACCEPTED MANUSCRIPT

Terra. W.R., Ferreira, C., Bastos, F., 1985. Phylogenetic considerations of insect digestion. Disaccharidases and the spatial organization of digestion in the Tenebrio molitor larvae. Insect Biochem. 15, 443-449. Terra, W.R., Ferreira, C., de Bianchi, A.G., 1979. Distribution of digestive enzymes among the endo and ectoperitrophic spaces and midgut cells of

RI PT

Rhynchosciara and its physiological significance. J. Insect Physiol. 25, 487494.

Terra, W.R., Ferreira, C., 2012. Biochemistry and molecular biology of digestion. In: Gilbert, L.I. (Ed.) Insect Molecular Biology and Biochemistry.

SC

First ed. Academic Press-Elsevier. London, pp. 365-418.

Terra, W.R., Ferreira, C., Garcia, E.S., 1988. Origin, distribution, properties, and

Biochem. 18, 423-434.

M AN U

functions of the major Rhodnius prolixus midgut hydrolases. Insect

Tollersrud O.K., Berg, T., Healy, P., Evjen G., Ramachandran U., Nilssen O., 1997. Purification of bovine lysosomal α-mannosidase, characterization of its gene and determination of two mutations that cause α-mannosidosis. Eur. J. Biochem., 246, 410-419.

Tusnády, G. and Simon, I., 2001. The HMMTOP transmembrane topology

TE D

prediction server. Bioinformatics. 17, 849-850. Van den Elsen, J.M.H., Kuntz, D.A., Rose, D.R., 2001. Structure of Golgi αmannosidase II: a target for inhibition of growth and metastasis of cancer

EP

cells. EMBO J. 20, 3008-3017.

Zhong, W., Kuntz, D.A., Ember, B., Singh, H., Moremen, K.W., Rose, D.R., Boons, G-J., 2008. Probing the substrate specificity of Golgi α-mannosidase

AC C

II by use of synthetic oligosaccharides and a catalytic nucleophile mutant. J. Am. Chem. Soc., 130, 8975-8983.

ACCEPTED MANUSCRIPT

25

SC

Anterior + Middle Posterior Soluble Membrane Soluble Membrane 27 ± 3 29 ± 3 7±1 15 ± 2 10.5 11.7 2.8 6.1 120 ± 20 130 ± 20 57 ± 7 46 ± 3 230 ± 20 240 ± 10 140±20 380 ± 50

M AN U

Total activity (mUnits/animal) % of total Protein (µg/animal) Specific activity (mUnits/mg)

Whole midgut Soluble Membrane 220 ± 10 50 ± 3 81.5 18.5 1100 ± 100 160 ± 10 280 ± 30 360 ± 30

RI PT

Table 1. Distribution of α-mannosidase activity along the midgut of T. molitor Midgut Tissue

Midgut contents Anterior + Middle Posterior Soluble Soluble 149 ± 9 20 ± 1 60.3 8.2 560 ± 40 145 ± 9 280 ± 10 144 ± 3

AC C

EP

TE D

Sum or total activities 270 ±20 250 ± 20 (mU/animal) Values are mean and SEM calculated from determinations carried out in 4 different preparations obtained from 7 animals each. The substrate used was p-nitrophenyl α-D-mannoside. One unit of enzyme is the amount of enzyme that cleaves one µmol of substrate per min.

26

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

Table 2. Kinetic properties of Man1 and Man2 Parameter Man1 Man2 pHo 5.6 5.6 Km (mM) 0.84 ± 0.05 0.62 ± 0.02 Ki deoxymannojirimycin (mM) 0.120 ± 0.005 0.150 ± 0.008 Ki swainsonine (nM) 68 ± 5 63 ± 4 Substrate: NPαMan. The inhibitors are of the linear competitive type for both Man1 and Man2.

27

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

Table 3. Substrate specificity of Man1 and Man2 (relative Vmax/Km) Substrate Man1 Man2 2 α-mannobiose 100 ± 1 50 ± 2 3 α-mannobiose 24.4 ± 0.5 100 ± 3 6 α-mannobiose 34.1 ± 0.5 87 ± 4 Relative Vmax/Km were calculated from Km and Vmax values determined with 15 different concentrations of substrate (range 0.25 - 23 mM).

ACCEPTED MANUSCRIPT

RI PT

28

AC C

EP

TE D

M AN U

SC

Table 4. Features of the manually curated sequences of Family 38 α-mannosidases from the transcriptome of 4 insects from 3 orders. Contig GenBank access Insect Reads/contig Length SP OBS D. maculatus Dm609 75 661 Y N-Fragment D. maculatus Dm1611 97 965 Y N-Fragment P. americana Pa172 75 966 Incomplete P. americana Pa174 7 334 Incomplete M. domestica Md33 XP_005189986.1 1021 1009 Y Complete T. molitor Tm345 67 1030 Y Complete T. molitor Tm404 12 896 Incomplete T. molitor Tm406 17 382 Incomplete T. molitor Tm408 160 963 C-Fragment T. molitor Tm602 653 854 Y N-Fragment T. molitor Tm603 289 590 Incomplete T. molitor Tm1774 433 1001 C-Fragment T. molitor Tm1776 10 343 Y N-Fragment Length is the number of amino acids in the protein putatively coded by the contig. Y indicates a predicted signal peptide in the protein, whereas a dash means that the presence of SP could not be evaluated due to the lack of the N-terminal portion. CFragment or N-Fragment indicates that the sequence misses the part coding for N- or C-terminal portion, respectively. Incomplete sequences have both ends missing. Contig Tm408 corresponds to TmMan2, GenBank access KP892646.

ACCEPTED MANUSCRIPT

29

AC C

EP

TE D

M AN U

SC

RI PT

Table 5. Features of the manually curated sequences of Family 47 α-mannosidases from the transcriptome of 6 insects from 3 orders. Contig Gen Bank Insect Reads/contig Length SP OBS TH access B. xantophis Bx2085 16 432 Incomplete Y D. peruvianus Dp1634 10 476 Incomplete N D. peruvianus Dp3988 6 160 Incomplete N D. maculatus Dm6182 6 207 Incomplete N T. molitor Tm2371 11 326 Y N- Fragment N 7 845 N Complete N Md1922 XP_005188141.1 M. domestica R. americana Ra766 12 365 Incomplete N TH, transmembrane helix. Other details as in Table 4.

30

ACCEPTED MANUSCRIPT Legend of figures

Figure 1. Purification of the digestive α-mannosidases from T. molitor larvae. The soluble fraction of T. molitor midgut homogenates was loaded onto a Hitrap Q XL column (A) equilibrated with 20 mM Tris HCl buffer, pH 7.0. Elution was

RI PT

accomplished with a gradient of NaCl from 0 to 1M in the same buffer. At all performed purification steps the flow was set at 0.5 mL/min and fractions of 0.4 ml were collected. Fractions active on the substrate NPαMan were pooled, passed through a Superdex 75 HR 10/30 column (Pharmacia Biotech) (B)

SC

equilibrated with 20 mM Tris HCl buffer, pH 7.0, containing 1M NaCl. The active fractions were pooled, the pool buffer was changed with the aid of a High Trap

M AN U

desalting column and then it was passed through a Phenyl Superose column HR 5/5 (C) equilibrated with 50 mM phosphate buffer, pH 7.0, containing 2M (NH4)2 SO4. Elution was accomplished with a (NH4)2SO4 gradient from 2 to 0 M in the same buffer. The fractions corresponding to each of the two activity peaks were used as a source of α-mannosidases (Man1 and Man2). Electrophoresis of proteins in SDS-12% polyacrylamide gel slabs (D). Lanes 1,

TE D

2 and 4, SDS-PAGE after silver staining for protein detection. 1, midgut contents; 2, Phenyl Superose column eluate corresponding to Man 1; 4, Phenyl Superose eluate corresponding to Man 2. Lanes 3, 5, 6, and 7, western blots with anti α-mannosidase serum: 3, purified Man 1; 5, purified Man 2; 6, T.

EP

molitor midgut contents; 7, T. molitor midgut tissue homogenate. The lanes were combined from different runs as follows: 1, 2 and 4, 3 and 5, 6 and 7.

AC C

Other details in Material and Methods.

Figure 2. Calibration curve for Superdex 200 column. It shows that Man 1 and Man 2 have similar molecular weight (∆) of 123 kDa. The following proteins were used as standards: cytochrome C (12.4 kDa), ovalbumin (43 kDa), bovine serum albumin (67 kDa), β-amylase (200 kDa). Other details in Material and Methods.

31

ACCEPTED MANUSCRIPT

Figure 3. Inhibition of purified Man 1. Lineweaver-Burk plots for different concentrations of deoximannojirimycin. Inset: replots of slopes of LineweaverBurk plots against the concentration of deoximannojirimycin.

Figure 4. Electron transmission immunocytochemical localization of Man 2 in

RI PT

anterior midgut (A) and posterior midgut (B). (A) Note labeling (arrow) associated with a large vesicle, which is actually an aposome and also outside the cell. (B) Note that labeling (arrow) is only seen outside the cell. Mv,

SC

microvilli; SV, secretory vesicle. Bars: 1 µm.

Figure 5. Nucleotide and deduced amino acid sequence of the cDNA (TmMan2) coding for Man2. The signal peptide is in italics. Catalytic residues are in bold.

antibodies in a rabbit are boxed.

M AN U

The amino acid residues of the peptide synthesized and used for raising

Figure 6. Cladogram (maximum likelyhood) of insect G38 family of αmannosidases. Branching corresponding to partitions in under 50% of replicates (1000) were collapsed. Abbreviations followed by numbers correspond to

domestica;

Pa,

TE D

contigs found in our transcriptomes: Dm, Dermestes maculatus; Md, Musca Periplaneta

americana;

Tm,

Tenebrio

molitor.

Other

abbreviations: AMII, α-mannosidase II (Golgi); CAM, cytosol α-mannosidase;

EP

LAM, lysosomal α-mannosidase.

Figure 7. Expression pattern of mRNAs for α-mannosidases in different tissues

AC C

and in different portions of the midgut tissue of T. molitor. A, M, P are anterior, middle, and posterior midgut; C, carcass; FB, fat body; MT, Malpighian tubules. The contig Tm000 is part of TmMan2.

ACCEPTED MANUSCRIPT

A

B A280

0.6

40

0.4

20

0.2

10

20

C

30

40

Fractions

50

60

Phenyl Man 1

60

40

20

0

0

10

20

30

40

50

Man 2

40

1.2 1.0 0.8 0.6 0.4

20

0.2 0

10

20

30

40

50

60

Fractions

70

80

90

0.0

[(NH4)2SO4] (M)

AC C

60

70

80

90

SDS-PAGE western

D 1

2

3

4

1.8

1.4

60

Fractions

1.6

80

0

0.0

A280 2.0

100

Relative activity (%)

70

TE D

0

RI PT

60

80

SC

0.8

Relative activity (%)

80

M AN U

1.0

A280

100

[NaCl] (M)

100

0

Superdex

EP

Relative activity (%)

HTQ

. . .

. .

5

6

7

RI PT

ACCEPTED MANUSCRIPT

2.5

SC

2.5

M AN U

Ve/Vo

2.0 2.0

1.5 1.5

0.5 0.5 0.0

0.0

1.2 1.2

AC C

EP

1.0 1.0

TE D

1.0 1.0

1.4 1.4

1.6 1.6

1.8 1.8

Log (MW, kDa)

2.0 2.0

2.2 2.2

2.4 2.4

RI PT

ACCEPTED MANUSCRIPT

9

SC

8

M AN U

7

1/v

6 5

TE D EP

3

1

-1

0

AC C

2

1

-0.2

Slope

3

5 4

2 1

0

0.2

0.4 0.4

0.6

[I], mM

2 3 -1 1/[S], mM

4

5

6

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

ACCEPTED MANUSCRIPT atgagcctgctggtgtttctgttttgctttagcctgtttctgggcgtgaacaccaaaccgtatgaagcggatgcgccgagctgcggctat 30 M S L L V F L F C F S L F L G V N T K P Y E A D A P S C G Y

31

gatgcgtgcccggaagcggatccgagcgcgctgaacgtgcatattgtgccgcatagccatgatgatgtgggctggctgaaaaccctggat 60 D A C P E A D P S A L N V H I V P H S H D D V G W L K T L D

61

cagtattattttcaggatgtgcagaacgtgattagcagcgtgattgtggcgctgaaactgaacccggaacgccgctttgtgcaggtggaa 90 Q Y Y F Q D V Q N V I S S V I V A L K L N P E R R F V Q V E

91

accgcgttttttaaactgtggtggagccgccagaacgatattattaaagaagcggtgcagaacctggtgaacaacggccagctggaattt 120 T A F F K L W W S R Q N D I I K E A V Q N L V N N G Q L E F

121

attaacggcgcgtggagcatgaacgatgaagcggcggtgcattatcagagcaccattgatcagtttaccctgggcctgcgctatattgaa 150 I N G A W S M N D E A A V H Y Q S T I D Q F T L G L R Y I E

151

gataacctgggccgctgcgcgcgcccgaaagtgggctggcagattgatccgtttggccatagccgcgaacaggcgagcattagcgcgcag 180 D

N

L

G

R

C

A

R

P

K

V

G

W

Q

I

D

P

F

G

H

RI PT

1

S

R

E

Q

A

S

I

S

A

Q

ctgggctttgatagcatgttttttgcgcgcctggattatcgcgataaaaaccgccgcatggatgataaaaccatggatctgctgtggcgc 210 L G F D S M F F A R L D Y R D K N R R M D D K T M D L L W R

211

ggcagcgcgaacctgggcaacaacgcggatatttttaccagcgtgctgtatcagcattatagcgcgccgggcggcttttgctttgatatt 240 G S A N L G N N A D I F T S V L Y Q H Y S A P G G F C F D I

241

gtgtgcaacgatgaagtgattattgatgatgaagaagatccggattataacctggaaaaacgcgtgggcgaatttgcggatcagatgcgc 270 V C N D E V I I D D E E D P D Y N L E K R V G E F A D Q M R

271

gatcgcgcggaacattatccgaccaacaacattctggtgaccatgggcgatgattttcgctatgaagcggcgatgaccacctatatgaac 300 D

R

A

E

H

Y

P

T

N

N

I

L

M AN U

SC

181

V

T

M

G

D

D

F

R

Y

E

A

A

M

T

T

Y

M

N

ctggatctgctgattaaaggctttgatctgtttgaacagacctataacgataaacgcattaaagtgttttatagcaccccgagctgctat 330 L D L L I K G F D L F E Q T Y N D K R I K V F Y S T P S C Y

331

accaaagcggtgaacgattatgtgaacagcaacaactataacctggaactgaaaaccgatgatttttttccgtatgcggatggcaccaac 360 T K A V N D Y V N S N N Y N L E L K T D D F F P Y A D G T N

211

ggcagcgcgaacctgggcaacaacgcggatatttttaccagcgtgctgtatcagcattatagcgcgccgggcggcttttgctttgatatt 240 G S A N L G N N A D I F T S V L Y Q H Y S A P G G F C F D I

241

gtgtgcaacgatgaagtgattattgatgatgaagaagatccggattataacctggaaaaacgcgtgggcgaatttgcggatcagatgcgc 270 V C N D E V I I D D E E D P D Y N L E K R V G E F A D Q M R

571

ctgctgaaaaaaaccccgaacctgagcgtgggctatgaagaaaccagctttgaaattagcgaacagaccggcctgctggaaagcattacc 600 L L K K T P N L S V G Y E E T S F E I S E Q T G L L E S I T

601

atgaacggcgtgaccctgcaggtgacccaggattttcagtattataccagccagaacagcagcggcgcgtatatttttgtgccggtggaa 630 M N G V T L Q V T Q D F Q Y Y T S Q N S S G A Y I F V P V E

631

accgatccgagccgcgtggcgggcggcccgattaccaccaccctggtgagcggcgatgtgagccagggcgtgctgcaggaatttggcagc 660 T D P S R V A G G P I T T T L V S G D V S Q G V L Q E F G S

661

tgggcgcgccagtttattaaagtgtataacgatgataaaagctatattgaatttgattggattgtgggcccgctggatattagcgatggc 690 W A R Q F I K V Y N D D K S Y I E F D W I V G P L D I S D G

691

gtgggcaaagaagtggtgagcaaatttagcaccccgctggaaaccaacggccagttttataccgatagcaacggccgcgaagtgctgcag 720 V G K E V V S K F S T P L E T N G Q F Y T D S N G R E V L Q

721

cgcacccgcaacagccgcccggattatgattataccgatgaacagccggtggcgggcaactattatccggtgaccagcaaaattgtgatt 750 R T R N S R P D Y D Y T D E Q P V A G N Y Y P V T S K I V I

751

gaagatgatgatgtggaatttgcggtgctgaccgatcgcagccagggcggcagcagcattaacgatggcgaagtggaactgatggtgcat 780 E D D D V E F A V L T D R S Q G G S S I N D G E V E L M V H

781

cgcgcgtgccagcatgatgatggccgcggcgtgggcgaaaacctgaacgaacaggaatttggcgatggcattcgcgtgcgcggcaaacat 810 R A C Q H D D G R G V G E N L N E Q E F G D G I R V R G K H

811

tttctggtgctgggcccgaaaggcggcaacggcgataaaagcattgcggcggtggaacgcgatgtggcgcagcgcaaactgctgagcccg 840 F L V L G P K G G N G D K S I A A V E R D V A Q R K L L S P

AC C

EP

TE D

301

ACCEPTED MANUSCRIPT 841

tggacctttgtgaccaaacaggtggataacctgaacaacctgcagtttagcggcctgaaaaacagcctgccggataacgtgcagattctg 870 W T F V T K Q V D N L N N L Q F S G L K N S L P D N V Q I L

871

accctggaaccgtggagcgaaaacaccctgctgtttcgcctggaacatgtgctggaaaacggcgaagatgataacctgagccaggaagtg 900 T L E P W S E N T L L F R L E H V L E N G E D D N L S Q E V

901 accgtggatgtgagcgatctgtttaccctgtttagcattaccgaactgaaagaaaccaccctgggcgcgaacatgctgctggaagaaaac 930 T V D V S D L F T L F S I T E L K E T T L G A N M L L E E N gtgcgcctgagctggccgggcagcagcaccaccgatgatcaggtggaaaaacgcgatgtggatgatctgaccgtgaccctgcagccgatg 960 V R L S W P G S S T T D D Q V E K R D V D D L T V T L Q P M

961

cagattcgcacctttctggcgaccgtgagctataac 972 Q I R T F L A T V S Y N

AC C

EP

TE D

M AN U

SC

RI PT

931

ACCEPTED MANUSCRIPT

99 93

Tm602

Tm345

50

Tm1774 98

Tm1776

SC

Dm609

RI PT

Tm603

Tm408 Tm404

M AN U 99

Tm406

75

88

Pa172 Pa174

LAM D.melanogaster AAD38576.1

TE D

91 64

AC C

EP

100

69

Md33 LAM M.musculus NP 034894.2 LAM B taurus AAC48763.1 LAM H.sapiens AAC51362.1 Dm1611 S.frugiperda AAB62719.1 AMII D.melanogaster CAA54732.1

65

AMII M.musculus CAA43480.1

91 99

AMII H.sapiens AAC50302.1 CAM H.sapiens AAC00190.2

100

CAM M.musculus AAH16253.1

Tm 602

EP

AC C

Control

TE D

Tm 1774 Tm 000

M

P

M AN U

A

SC

RI PT

ACCEPTED MANUSCRIPT

C

FB

MT

ACCEPTED MANUSCRIPT Highlights

- Two soluble α-mannosidases (Man1 and Man2) were purified by chromatography from the midgut of Tenebrio molitor

RI PT

- The α-mannosidases have MW 123 kDa (gel filtration) and Man 2 form two bands (72 and 51 kDa) in SDS-PAGE.

-The sequence TmMan2 (KP892646) codes for Man2 that is processed into two peptides that remain linked in the functional enzyme.

SC

- Man2 hydrolyzes α-1,2, α-1,3, and α-1,6-mannobiose and once secreted by an apocrine mechanism (immunocytolocalization) may digest fungal cells.

AC C

EP

TE D

M AN U

- The number of family 38 of α-mannosidases in the midgut of 9 different insects may be related to their diets.

ACCEPTED MANUSCRIPT Supplementary Table 1- FASTA sequences corresponding to the manuallycurated sequences coding for α-mannosidases from 6 insects pertaining to 5 orders plus TmXXX and Tm000. See text for details.

>TmXXX MSLLVFLFCFSLFLGVNTKPYEADAPSCGYDACPEADPSALNVHIVPH

SC

SHDDVGWLKTLDQYYFQDVQNVISSVIVALKLNPERRFVQVETAFFKKWW

RI PT

Contigs Family 38

EQQKDSIKQDVIDLVNNGQFEIINGAWSMNDEAAVLYQCTIDQFTLGLRY

M AN U

LEDRLGACSRPRVGWQIDPFGHSREQASISAQLGFDSIFFARLDYRDKIN

RMGKKTGDLIWRGSSNLGNSSDIFTSVLYNHYSAPPGFCFDIVCDDDPSS TTKKVRITTTKAESKILQTL

TE D

>Tm000

RLTIGRLGPTSHAPGRHGGRGNSIWTGYFTSRATSKHFERQGNNLLQVSKQLAANAQGSY DNEQINTLKEAVGVMQHHDAITGTEKQHVADNYYMRLSRGMQSAGDAAGQVLSNLITGDD

EP

TNLEFDSCLLANVSACTQTESDTFTVAVYNPLSRTQTAIVTLPVFDQQNYQIRDPDDNDV PYQLDASLTDFSYVENTRTSQTTLQFAAKDLPPLGFKVYRFSAADKQPKSNPLLKKTPNL

AC C

SVGYEETSFEISEQTGLLESITMNGVTLQVTQDFQYYTSQNSSGAYIFVPVETDPSRVAG GPITTTLVSGDVSQGVLQEFGSWARQFIKVYNDDKSYIEFGWIVGPLDISDGVGKEVVSK FSTSLETNGQFYTDSNGREVLQRTRNSRPDYDYTDEQPVAGNYYPVTSKIVIEDDDVEFA VLTDRSQGGSSINDGEVELMVHRACQHDDGRGVGENLNEQEFGDGIRVRGKHFLVLGPKG GNGDKSIAAVERDVAQRKLSSPWTFVTKQVDNLNNLQFSGLKNSLPDNVQILTLESWSEN TLLFRLEHVLENGEDDNLSQEVTVDVSDLFTLFSITELKETTLGANMLLEENVRLSWPGS STTDDLVEKRDVDDLTVTLQPMQIRTFLATVSYN

ACCEPTED MANUSCRIPT >Dm1611 NAMKLFAFILFLSFLGLISAKPIEESSKCVSCRRKADPKKLSIHLVPHSHDDVGWLKTVD QYYYGTNNSYQRASVELIIDSVYRALLENKNRTFIQVETYFLHKWWQRQNEYTRKKYREL VNNGQIEIVGGGWSMNDEAVTNYLSLIEQFEFGFRKLDEMLGECGHPKVAWQIDPFGHSK

RI PT

TMAQLFSEIGFDGLFFARLDWRDRMKRRNEAQMEMIWEGFGGNLFTSILYEHYGAPDGFC FDITCNENNPIVDDEDSPLFNQESAVKQFTSIVEKRSKSYRTNNILIPMGDDFNYMHASS YYLNMDMLIEGFNKYPQKTDDGREIEIFYSTPWCYIKAVNEEIGKNSSIQLQKKTDDFLP

LGNDPHTYWSGYYTSRPTSKRLEREGINLMQVFKQLNSFATKPYSTVPIESAMGIMQHHD

SC

AITGTEKQHVANDYHRILTSIIHETTFAVDSIISELLSVNKEKPVVLASSSCPLSNISYC

PITESDEFVVVIYNPFSESVTHHIRLPVNNTNYQVIGPTDEIIKSQIVPAISDFKVLNGT

M AN U

FEHSKYDLVFNADIYPHTITTMKVKKVEENSNTGTIKVNPETNCDEIGDIFKAYFEKDGR LKSVKYKDIDLDLKQELLHYMSAHGNNRNFSQRASGAYIFRPDPEQPDAIPLPNLNFSSK CYKGEVVDEVHQTYNEWVKQIIRVYKNEDHMEFDWLVGPINISDDRGKEIISRFYTTKIN SSDTFYTDSNARQLIKRVRDHRPTFNYTAEEKVAGNYYPVTSRIVLKDDNLEMAVLNDRP

TE D

QGGTSLKSGEVELMVHRRCLNDDAFGVGEALNETEYNVGLIARGRHYLTVNNPDDSHSEH LSQKKLFSPMVFISLIKLDEKLIKRSFSAIDPTILPDGAQILTLEPLSKNEILLRLENIY

EYKLA

AC C

>Dm609

EP

ENGEEKTVKLNKLFKNFNVIKVSELDLGGVRPKGLSKRLQWGEDKGETEKLNSVANEGAF

MLLKIALLIFTFTYCAICVPIAEEIKCGYQSCHPIEDNKLNIHLVPHSHDDVGWLKTVDE YYYGVNNKVQPVGIQYIISSVVQSLQSNPDRRFIQVEGAYFWQWWKRQTEETKSIVKKLV NSGQLEIINGAWSMNDEAATNYQSIIDQFTLGIKMYNDTLGSCSYPKIGWQIDPFGHSKE MANIFSQLGYEAIFFARIDHKDKTKRSQQKALELIWQGSQTRGKASDIFAVIFSDFYIQP LHFCWDVLCFDEPIVDDINSPEYNADKKVKLFMDYVNKHAEHYQTNNILIPMGGDFTYQA AEMYYSNIDKLIQALRKFDKNVNPIYSTPSCYVKAVNDVSRKKKISFPVLTTDFFPYSSD DVSFWSGYYTSRPNSKRLERMGNNILQVTKQLISMHRVENKDFNDTLTLLKESMGIMQHH

ACCEPTED MANUSCRIPT DAITGTEKQAVAQDYARLMKKSIDKVEQNITNILADLSKKEKSKDVSEDKIQFSSCLLAN VSYCEASKKKSFLVAVYNPLSRVVSDYIRLPVQANFSFEITGPNGIVQYDVMKTISPFNY INNVSQTKYELVFHAQNIPPLGLQYYYVKSLNLTQQKNEQIPFAEETIFGNNKTGFAINE RGLLKEITVNGETLEIEQQFYKYMAFEGNNYGQNRSSGAYIFRPSKSNGGIAEPLIPGDK

RI PT

K

>Md33

MKFGVWALSSALLVLSWSYHAEAACGYKACPASKANMINVHLVPHSHDDTGWLKTVDQYY

SC

YGSRNGIQHAGVQYILDSVITELLKDKSRRFIQVESAFFFKWYNEQTAAVKKNVKTLVEE GRLEFTGGAWSMNDEAAVHYQSVIDQFTLGLKSLADTFGSCARPRIGWQIDPFGHSREMA

M AN U

SLFAQMGYDGNFFARMDYMEKNARLNNMTGEMIWKASENLDERSEIFTGVLYNHYSAPPG FCFDNLCSDDPIIDGDSYENNVKAKVDDFLTYINKMGTYYRATDLLVPMGDDFNYENAYV NYKNMDKLIKYVNQRQLQGGKVNVFYSTPACYLNALHNDNKTWPTKTQDFFPYSTDWHTY WTGYYSSRPTQKRFERDGNHFLQVAKQLTTMANLTTTAAFANLDTLRQAMGIMQHHDAVT

TE D

GTEKQHVAQDYDRMLTKAINVAETNTRDALRKLTNLTNGEFVSCLQLNLSICEVSQWSPN NLVVTVVNPLAHPSTQYVRIPVKNGTYIVTDATGRQVKSDVVPVPKELIDLTHLRPNASQ HELVFSAYADKIANYYIKVQPQPREFDEEIQRNEIKLPKRFLRQHLTKLEVAPASETIEP

EP

MAVGDVVVQNSQIKLTFNSNGILYNVQMNGISEDITQQFFYYKGAYGNNAEFKNRSSGAY IFRPNGTEILIGTKANLTIVNGSLVKEVHQRFTDWVSQVIRIYEGINRVEFEWLVGPIPI

AC C

QDNIGKEVITRFTSNLKNSGVFYTDSNGREMLRRVRNQREYFKPNMTESVSGNYYPITSR IALESSTRRMALLNDRAQGGSSLKDGALELMLHRRLLRDDAFGVGEALNETVNGKGLVAR GKVYLMLSTVSKISTTNERLAEREIHLPFWKFFSNATKATNKVLPRIPTFTSLPIGIDVL TLEPYSTNERLLRLEYFFNKNETSSLTFNIRSIFDALGGTEIRETTLDGNLALGSMNRFK FHPDGTVPKKVEYYKSKHTPLAAVQKDAVSKFQITMDPMQIRTFIVKWK

>Pa172 AARAVCGYESCHNVDPTKLNVHLVAHTHDDVGWLKTLDQYYYGSRNDIQNAGVQYIIDSV

ACCEPTED MANUSCRIPT VDSLVANPDRRFIYVETAFFWKWWVEQDEDMQQTVKDLVNEGRLEFIGGAWSMNDEAASH YYSTIDQFTWGLRKLNDTFGACGRPHIGWQIDPFGHSREMASLFAQMGYDGVLFARLDYQ DKDNRLKTKTPEMIWEGSPNLGESADLFTSVLYNFYNAPGGFCFDILCNDQPFIDNVESP DYNVDQKVQDFVNFVKTQGSYYTSSNIILTMGGDFNYQDANTWFKNLDKLIKYVNAADST

RI PT

LNVIYSTPSCYLKAVNDAGLTYTTKQDDFFPYASDPHSYWTGYFTSRPASKYFERLGNNF LQVAKQLEAMTLLGQVGDSSIDDLREAMGVMQHHDAITGTEKQHVAGDYARLQYRAMDEA SVSMETALNQLMENPTTTPIEIHSCLLFNISECHVTERGGSQFLVTVYNPLSHSLDYSVR FPVPSGTYTVQDPTGAELPFQIIPLPDEVLALPERDSSIATHELAFRALSLPPLGFRSYY

SC

VVRVSDDFVEVEPSADTFIGDSLLQVTIDNVTGLVDSLTINGEEIALSQNFFYYDGYIGE

NDNADHRSSGAYIFRPISSVPQTIATSATWKIYKGHIVDEIHQTFSPWVSQVIRIYKQEN

M AN U

HVEFSWLVGPIPIEDGVGKEVVNKFSTDLASDGLFYTDSNGREMLERKRDFRPTWTVDIA EPVAGNYYPVTSKILIRDTTKGQEFAVLNDRAQGGSSLNDGEVELMVHRRLLHDDAFGVG EALNEVAYDQGLVVRGRHYVIAGPTSGLSPSLAAQERELEQKKMLSPWMFFAVAGPSFEY WQSTFKMEFSGLTESLPQNVKIHTLEPWRDQNLLLRLEHILEQDDDPELSQPATVVYEDI

IDVAKR

EP

>Pa174

TE D

FSTFTVTSSHETTLAANQWIEDLDRLVWKSDSAREPKEKQKMEKDPKYVTLTPMQIKTYV

VTCFAHPRRSLYRDDRADVTCGYQSCHQVDPTKLNVHLVPHTHDDVGWLKTVDQYYYGAN

AC C

TNYQRAGVQYILDSVIEALLDNPDRRFIYVETAFFWKWWVEQTPERQQEVKDLVNEGRLE FIGGAWSMNDEAASHYQSTVDQFTWGLRKLNDTFGSCGRPHIGWQIDPFGHSREMASIIX ARMGYDGLFFARLDYQDKNTRLAGKTAEMIWEGSPNLGASADMFTSALYNHYSAPSGFCF DILCNTEPFIDNQKSPDYNVDAKVQNFVNFVNGQAEHYTSNNILVTMGDDFNYQDALMYF KNMDKLVKYVNAADKGVNVIYSTPSCYLKAVNDE

>Tm404 HDDLGWLKTMDKYYFQDVQNVIGSVVGALKQNPDRRFVQVETAYFKEWWDRQNDIVRQDV

ACCEPTED MANUSCRIPT IDLVNNGQLEIINGGWCMNDEANTNYQSTIDQYTLGLRFLEDTLGPCGRPRVGWQIDTFG HSREQASISAKLGFDSLFFMRMDYRDKNKRLADKTGDLLWKGSQNLNDSYIFTSIFYGAY SFPEGFCFDIVCQDEPIIDDEESPDYNYDRRVDEFAEFVRGQASQYPTNNILVVMGDDVR YQASLTNFLNIDRLIKGFELFPKTFDDKPIKLLYSTPSCYAKAVNEFVTANDYNLDIKTD

RI PT

DFLPYATDGYGYWSGFYTSRPSKKRFERQGNNFLQIAKQLSAITNQPYESRITRLKEAVA VIQHHDAITGTEKEDVMKDYVRMLDTALEEANQAVDPILSTLIGSTSDDSFKFNTCLLAN ISSCGPSKSDKFTVTVYNPLSRPVSSPIELPVDSQTWMIEDPQGNEVIYQTDPPSXDFSY AEDVPISPYTLLFTAEDLPPLGFKVYTFTKAETVPEEQPHLTVGNDNTSFEIDDDTGLLK

SC

SITMNGVTMDVSQDLAYYRSGSYSGAYIFVPLENEKHRVTEDKVKTTPISGDIYQGVLQE FNSWAKQIIKVYNDDSNYIEIDWIIGPVDVSDGIGREVVSVFTTPLQTEGNFYTDSNGRE

M AN U

MLKRTRNYRPTFDYTNEEPIAGNYHPITSRIVLKDEEQGLELAILNDRAQGGASLEDGQV EIMIQRACTHNDRSQGNVRDSINDQEYGQGVIIRGKHYLILGPSSGNGEKSLAAIQRDVA QKKLLAPWAFVTDQDITQQXXDTREFSSLKTPLSDNVHILTLEPWNENTLLLRLEHIMEK

>Tm406

TE D

DEDENLSQETTVDLSDLFATFTITELEETTLGANIPLEDSVRLSWPGTGDTEDGKN

EADPSALNVHIVPHSHDDVGWLKTLDQYYFQDVQNVISSVIVALKLNPERRFVQVETAFF

EP

KKWWEQQKDSIKQDVIDLVNNGQFEIINGAWSMNDEAAVLYQCTIDQFTLGLRYLEDRLG ACSRPRVGWQIDPFGHSREQASISAQLGFDSIFFARLDYRDKINRMGKKTXDLIWRGSSN

AC C

LGNSSDIFTSVLYNHYSAPPGFCFDIVCDDDPXIDDEESPDYNYQSRVENFANFVKEQAS KFPTNNIIVVMGDDFRYQAALNSYINTDRLIKGFDLFPQTFQGKPSKSSTRLPRVTPKLS TITSQPTITSLKLKPTTFFLTPTELEALXXWDTFTSRPASKRFVYEGNNLLQVLPTYLXP NQSVNLFQVAKQLAVVGQESYD

>Tm408 VFIALSVDGKPVVEDDPICGYEACPEASPDTLNIHIIPHSHDDVGWLKTVDQYFFQDVQNVISSVVDALKQN PERRFVQVETAFFKLWWSRQNDIIKEAVQNLVNNGQLEFINGAWSMNDEAAVHYQSTIDQFTLGLRYIED NLGRCARPKVGWQIDPFGHSREQASISAQLGFDSMFFARLDYRDKNRRMDDKTXGPFVERKRQLGNNAD IFTSVLYQHYSAPGGFCFDIVCNDEVIIDDEEDPDYNLEKRVGEFADQMRDRAEHYPTNNILVTMGDDFRYE

ACCEPTED MANUSCRIPT

RI PT

AAMTTYMNLDLLIKGFDLFEQTYNDKRIKVFYSTPSCYTKAVNDYVNSNNYNLELKTDDFFPYADGTNTYW TGYFTSRATSKHFERQGNNLLQVSKQLAANAQGSYDNEQINTLKEAVGVMQHHDAITGTEKQHVANNYYL RLSRGMQSAADAAGQVLSNLITGDDTNLEFDSCLLANVSACTQTESDTFTVAVYNPLSRTQTAIVTLPVFDQ QNYQIRDPDDNDVPYQLDASLTDFSYVENARTSQTTLQFAAKDLPPLGFKVYRFSATDKQPKSNPLLKKTPN LSVGYEETSFEISEQTGLLESITMNGVTLQVTQDFQYYTSQNSSGAYIFVPVETDPSRVAGGPITTTLVSGDVS QGVLQEFGSWARQFIKVYNDDKSYIEFDWIVGPLDISDGVGKEVVSKFSTSLETNGQFYTDSNGREVLQRTR NSRPDYDYTDEQPVAGNYYPVTSKIVIEDDDVEFAVLTDRSQGGSSINDGEVELMVHRACQHDDGRGVGE NLNEQEFGDGIRVRGKHFLVLGPKGGNGDKSIAAVERDVAQRKLLSPWTFVTKQVDNLNNLQFSALKNSLP DNVQILTLEPWSENTLLFRLEHVLENGEDDNLSQEVTVDVSDLFTLFSITELKETTLGANMLLEENVRLSWPG SSTTDDQAEKRDVDDLTVTLQPMQIRTFLATVSYN

SC

>Tm345

MHFWIVPSVLLLNFHGIISNPIKGDIQAKDSVVCQKACHPLVDGKINVHLVPHSHDDLGW

M AN U

LKTFDQYYEGTGQFNAVENAGVRFILSSTIPALKADPNRRYIQVETGFFWKWWQEQSNET RQDVVDLVNSGQLEMINGGWSMNDEAASHYHSIIDQFTWGFRILEDTVGKCGRPKIGWQI DPFGHSKEHASILKQLGFEGLVVVRIDYRDKNKRRAEKNLDFIWKSNDNLENSEIFTTMF PDFYFEESGYCFDVMCSWDTINEGNLDSKVKGFAKILDGYKEYYKTNNIMMPMGRDFTYQ KAEQNFASMDLFLKGFKDHDKYNVIYSTPSCYIQAVQDEIQKNSIKLEEKTDDFFPYASE

TE D

AHSFWTGYFTSRPTSKRFERTGNNILQSVKQLTAFAKMKSKDYGESIADLRGAMGVMQHH DGITGTEKQAVSNDYAQMLHQAIKEAEEPVGTIIGELLRKNDEDEIDLQLSSCLLANVSI CDTTGKDRFLVVVQNPLSRVVTHYVHLPVEGDNYKVTGPDGEEVYDVFDTLHSFDYINEP

EP

TKPSSKDLVFAARNLPALGINLYYVEKMSESSNLYKPSQPLEAAEDGAFGTETNGFKIDT STGKLASVTINGATQEISQEFLYYPGYTDSTTTDDHRSSGAYIFRPAENEAKPMTTESSI

AC C

DTNCAKGNLVDQCIQIINDEVRQIIKVYKDADDAFVEFDWLVGDLQFVDNKGKEVITRFT VKDFSNSGTFYTDANGRQQVRRELNKRGDFEYDPEEEPVSSNYYPVTSKIVIKDETKKLE VAVLNDRAQGGTSLKEGTVELMIHRRLLGDDDKGVKEALNEIQYDKGLYVRGQHYLTFGS TESKVANGKSTAAFERDLAHRKLLAPWILLSEATDTLATLEKTQQILEFKFEALKKSLPD NVHILTLEPWKNSYLLRLEHALEKNEDDALSTEVTVDLEELFTLFNITEIWETTLGANQV LDESPNLKITLKPMEIRTFIIKTDQNNDDIDNDNDNSAGSHTARVKMGFKVKGIYVINNP ISFLKHVYQL

ACCEPTED MANUSCRIPT >Tm602 MECGRGLIAVFVLFNVWWVNSTPLTKDDATECVSACQGLDEKKVNVHLVPHSHDDVGW LQTFEGYYKGTGDFKEIKNAGVQYILDSTIQALQKNKNRRYIQVETAFFWKWWNNQGRRT PAAPDLVNSGQLEMVGGGWSMNDEAAAHYQSIIDQFTWGFRILDDTVGECGRPKVGWQ

RI PT

IDPFGHSREHASISKQLGFEGLVLGRIDYRDKSQRIEDKNLDFNWVTNPNFEDSTILTTM FPDFYTWPEGLCLDSTCASSQKLINDDNVEAVVANFTAILQEWIGYYKTKNILIPMGHDF TYQKAEDNFNSMDKLIEGFKDSDYNVIYSTPSCYIEAVTAAKPTLTKKEDDFFPYASNAN

EFWTGYFTSRTDSKTIRKPTAKRFERVANNILQSVKQLTTFSRIQGGDYDKNIVDLRMAMGVMQHHDA

SC

ITGTEKQDVANDYTSMLYKGIAKTQDSISKIISDLLKKNDSAVDLALSTCLLANVTICDA

SNKDKFLVAVQNPLSKTVSHHVRLPVNGTNFKITDADGEVAHDVLDAMHTFDFETKFETP

M AN U

SKEIVFLAKDLPPLGVKLYYVETVTDDTKKYKPQDDVVTFGDAKTTEFTIDATTGK

LASVTIKGVEIKVQQDFYYYHGRVDGAYVFEPEEGTSEDATLLGGDFKEKKYVKGDLVEE VHQVISDEATQVIRVYKTEDDAYVEFDWLIGNLQFDKNQSKEVITRFTIDGIANKDIFYT DANGRQQVQRTLNKRSDYEYDATEEPISSNYYPVTSKIVVKDETAKIEVAVLNDRAQGGS

AFERDLAHKKLLAP

EP

>Tm603

TE D

VLEPGVIELMVHRKEVADDHKGVDEVLNEQQFGKGLYARGQHYLTFGSSESKPESGVSTA

DHGVRQRFDSGVCAFQFWWVNSTPLTKDDATECVSACQGLDEKKVNVHLVPHSHDDVGWL

AC C

QTFEGYYKGTGDFKEIKNAGVQYILDSTIQALQKNKNRRYIQVETAFFWKWWNNQGXKDT RSAVRDLVNSGQLEMVGGGWSMNDEAAAHYQSIIDQFTWGFRILDDTVGECGRPKVGWQI DPFGHSREHASISKQLGFEGLVLGRIDYRDKSQRIEDKNLDFNWVTNPNFEDSTILTTMF PDFYTWPEGLCLDSTCASSQKLINDDNVEAVVANFTAILQEWIGYYXNKNILIPMGHDFT YQKAEDNFNSMDKLIEGFKDSDYNVIYSTPSCYIEAVTAAKPTLTKKEDDFFPYASNANE FWTGYFTSRTDSKTXFERVANNILQSVKQLTTFSRIQGGDDDKNIVDLRMAMGVMQHHDA ITGTEKQDVANDYTSMLYKGIAKTQDSISKIISDLLKKNDSAVDLALSTCLLANVTICDA SNKDKFLVAVQNPLSKTVSHHVRLPVNGTNFKITDADGEVAHDVLDAMHTFDFETKFETP

ACCEPTED MANUSCRIPT SKEIVFLAKDLPPLGVKLYYVETVTDDTKKYKPLQDITGRRGYLWRCEDN

>Tm1774 IFTRARSAFAGGFASPVKRPRAQSCQACHPVDPDKINVHLIPHSHDDVGWLKTVDQYYYG

RI PT

SHSDIQRAGVQYIISSTVEALKADPDRRFVQVETAFFWKWWQHQPDNLKQDFIDLVNNGQ LEIINAAWSMNDEAATNYQSTIDQFTYGLRTINDTVGTCGTPRIGWQIDPFGHSREQASI

FAQLGYDGVFFARIDHDDRDQRIADKTMEVIWQGSANLDNANIFTNVFPEFYYPPSGYCF DIECGDEVLNDDEASPDYNIPRKVDDFLSKVQELASYYQTNNLLIPMGGDFQYQSAEKNF

SC

VNMDKLISGFQGNDQINLIYSTPSCYIQAVNDEAASKNIEFTVKTDDFFPYASESHCYWT GYFTSRPNAKRFERTTNNILQAGKQLAAFSKVKGNDQEDSLTLLKQSVGILQHHDAITGT

M AN U

AKAAVANDYARLLAKSIKRAEPSLGNIVTDLLKKDSSSDINLNLQTCLLANVSICEPAAS

DRFVVTVYNPLERPVTDYVRIPVPDGGYTITGPDGEVESDLLDSISSFDYVDDDTGSPNP KELVFAAADLPGFGLRLYYVEKTSSKSKPIKSTPKLKFGTDEIGFEIDDGTGLLKSVTMN GQTVDITQQFFSYNGYNGENDGADNQASGAYIFRPLENSSNAVGDTVSVTSTSGNLVDEV

TE D

RQQVNDWITQIIRVYKGGNNNYIEFDWLVGPIPVDTDNGIGTEIVSRFTVGDFDNGESFY TDSNGRELIKRQLNKRYDYEYDSSLEPIASNYYPVTSKIVIKDETKNLEVALLNDRAQGG ASLQNGQVELMVHRRLLKDDAKGVGEALDDQEFGQGVVARGQIYLVVGSTDSDGGDGKST

EP

AAQERELALKKLLTPLVLVGDASSDDLSLDNIQSTLNLNFDGLAKTLPDNVHILTLEPWQ EDSYILRLEHILENNEDDSLSQAVSVDLSGLFSLFEVTEIRETTLGANQWLDEFEAKEKY

AC C

VWNVKGGGSVVNNKASAAPRADEISLNPMQIRTFVVKVAAN

>Tm1776

MIKYLLAVTVLFVGVYSTPIKLKEAPSCQACHPVDPDKINVHLIAHSHDDVGWLKTVDQY FYGSHPEIQRAGVQYTISSTIEALKGDPDRRYIQVETAFFWKWWQRQPDRIKQDVIDLTN SGQLEFTNAAWTQNDEAGTNYQSIIDQFTDGLRFINETIGQCGAPTIGWQVDPFGHSREQ ASLFAQFGFDGDFFGRIDYSDLKTKRDNKDMEVVWQGSANLDNTNIFTVVFPDGYGPLGG YCYDIQCGDTVFNDDESSPDYNVPQIVGAFQERLDQLITYYRTNNIIIPMGTDFQFQSAE

ACCEPTED MANUSCRIPT KNFINMDRMIAGFKDNDRYNVLYSTPSCYIKAVNEAAAAQDVE

RI PT

Contigs Family 47

>Bx2085

GVLRWYRGQLLHLAKDIGYRLLPAFNTTTGIPHARVNLRYGLKSQKLEWSQETCTACAGT

MVLEMAALSRLTGEPVFEEKAHRAMDELWKLRHRSSDLMGTVLNVQSGTWVRRDSGVGAG

SC

IDSYYEYCLKAYILLGEDRYLSRFNRHYSAVMKYISQGPMLLDVHMHRPHTNSRNFMDAL LAFWPGLQVLKGDLKPAVETHEMLYQVMQRHNFIPEAFTTDFQVHWGQHPLRPEFLESTY

M AN U

FLYRATKDPHYLHVGRDILRALQTFARVPCGYAAVKDVRTNHHEDRMDSFVLAETLKYLY LLFAEDTDLLLNLDEFLFTTEAHLLPLALARSPGNLSFFSLGDELSMDDSEWADQCPSHI RLFPASVRHPLRNMVKGSCPRSVDAKPRKLTASQFQTVEGPKQIHLLTIYLTFFISVALY VKIYIYCTLYRG

TE D

>Dp1634

DMFYHAYRAYMDNAYPADELMPLSCEGRYRGSHPSRGDLDDTLGNFSLSLIDSLDTLVVL GDLAEFEYAVKLVIDNVKFDSDVTVSVFETNIRVLGGLLSAHILADYVKENIGIMHWYRG

EP

ELLEMAKDIGYRLLPAFNTSTGIPQARVNLKQGTKLTKIWNTRETCTACAGTMLLEMAAL SRLTGDPIFEEKADKGMDSLWGLRHRGSGLMGSVLDVESGRWVRRESGVGAGIDSYYEYC

AC C

FKAYVLLGNPKYLSRFNKHYSAVMKYISQGPMLLDVHMHRPHTNSRNFMDALLAFWPGLQ VLSGDLKPAVETHEMLYQVMQRHNFIPEAFTTDFQVHWGEYPLRPEFLESTYFLYRATGD HHYLRVGEKILRGLQTHTKVPCGYAALSDVRSGRHDDRMDSFVLAETFKYLYLLFTDPND LIIDLDDFLFTTEAHLLPLSLSRISVNANTSSYTEGLEDEDGPQQCPSTLKLFLKL

>Dp3988 AAGPLKIMRNSTDPPDSLWAERRRHVVQMTKFAWDNYVRYAWGKNELRPLTKKAHNTGIF GSAALGATIVDGLDTLYIMGLHEEFRQGRAWIQENLTMEDLNGDFSVFETTIRYIGGLLT

ACCEPTED MANUSCRIPT CYSFTGDVMFRDKAENIAKKLLPAFQTITGIPHSLVNFKN

>Dm6182 VFEEKAHRAMDELWKMRHRSSDLMGTVLNVHSGDWVRRDSGVGAGIDSYYEYCLKAYILL

RI PT

GDDRYLNRFNRHYNAVMKYISQGPMLLDVHMHRPHTNSRNYMDALLAFWPGLQVLKGDIK PAVETHEMLYQVMQRHKFIPEAFTTDFQVHWGNHPLRPEFLESTYFLYSATNDPYYLEVG KNVLRSLQKYARVSCGYAAVNDVRTGK >Tm2371

SC

MLSVSEVVVRLLGVCVFIVFCTQCYSEESKAMTREERIRLREETRDMFYHAYRAYMENAY PADELMPLSCKGRYRGLTPNRGDIDDCLGNFSLTLIDTLDSLVVLGDLEEFEHAVKLVIK

M AN U

DVSFDNDVVVSVFETNIRVLGGLLSAHILADYLQQRDGIMTWYKGELLNMAKDVGYRLLP AFNTTTGIPHSRVNMKYGMKSDRLESARETCTACAGSMILEMAALSRLTGEPIFEEKAHK AMDELWKMRHRSSDLMGTVLNVHSGDWVRRDSGVGAGIDSYYEYCLKAYILLGDNKYLHR FNRHYNAVMKYISQGPMLLDVHMHRP

TE D

>Md1922

MPNFCLPRTSHNCRPLSSGPMTGTAEFQRNNISTINSSSYTSFNKLENMAKIVICTLLAL GLFVTLVSSDTIPTQNMSHKEREALREEARDMFYHAYHAYMDNAYPADELMPLSCKGRYR

EP

GVTPSRGDMDDILGNFSMTLVDTLDTLVILGDFAEFEHAVKLVIRDVQFDNDIIVSVFET NIRMMGGLLSAHILAEYIQKNAQVMHWYKGELLEMARDLGYRLLPAFNTSTGIPHARVNL

AC C

RYGMKDELLKKSRETCTACAGTILLEFAALSRLTGDPIFEARAHAAMDALWKLRHRGSDL MGTVLNVHSGDWVRRDSGVGAGIDSYYEYLFKSYVLLGDDKYLARFNRHYNAVMKYVSEG PMLLDVLMHRPHAKSRNFMDALLAFWPGLQVLSGDLKPAVQTHEMLYQVMQMHTFIPEAF TIDFQIHWGQHHLRPEFIESTYFLYRATGDHHYLQVGKKALKALQQHAKVPCGYAAVNDV RTGKHEDRMDSFVLSETFKYLYLLFTEPQDLILNIDEFVFTTEAHLLPLSLAQLGNSTYS FRQSEEHNVLDFMRTCPSSNKLFPEKVRKPLRNFITGSCPRTPATKRLRALDFQASNADH LRAVYDMGITMVSLGDGKVRLFHSFYNAKSPEEGEMGLQFMQEMLELTKLQSLNQLAQLQ AVAYPLDDKSNDWTALMAGPSHFSPELTEDDFVESEVILAEPLRACDERLANAEKAKGKI

ACCEPTED MANUSCRIPT LIAERGDCTFVSKARLAQKSGALALIVCDNVPGSSGETQPMFAMSGDGINDVKIPVVFMY SQEFAKLSKVMQRRPHMKIRIMQMIEFKRWQMAKESKANATETTPSASASTTTTTTKKSK PEKEL >Ra766

RI PT

FAALSRLSGEPIFERKAHASMDALWKLRNRGSDLMGTVLNVHNGDWVRRDSGVGAGIDSY YEYCLKSYVLXGHGKYLARFNRHYSAVMKYISQGPMLLDVHMHRPHAKSRNFMDALLAFW PGLQVLSGDLKPAVQTHEMLYQVMQMHTFIPEAFTFDFSDSLGTASTAXXEFIESTYFLY

KATGDHYYLQVAKKALQTLQKHARVDCGFAAVNDVRTGKHEDRMDSFVLSETIKYLFLIF

SC

ADPSELILDLDSFIFTTEAHLLPLSLGQLTNVTTESIDQDDHVAMDYMRSCPSPNKLFPE

TVRRPLRDLVTGVCPRIPNSKRLRALDFQSSNADHLRTVYDMGISMVSLGEGKVQLMHSF

AC C

EP

TE D

M AN U

YNAKS

Insect midgut α-mannosidases from family 38 and 47 with emphasis on those of Tenebrio molitor.

α-Mannosidases are enzymes which remove non-reducing terminal residues from glycoconjugates. Data on both GH47 and GH38 (Golgi and lysosomal) enzymes ...
1MB Sizes 1 Downloads 11 Views