ARTICLE IN PRESS

Engineering Escherichia coli for Functional Expression of Membrane Proteins Franz Y. Ho, Bert Poolman1 Department of Biochemistry, Groningen Biomolecular Sciences and Biotechnology Institute & Zernike Institute for Advanced Materials, University of Groningen, Groningen, The Netherlands 1 Corresponding author: e-mail address: [email protected]

Contents 1. Introduction 1.1 Green fluorescent protein as a folding reporter 1.2 Antibiotic resistance marker protein for selection 2. Preparation of Erythromycin-Sensitive E. coli Strain 3. Preparation of Expression Plasmid 3.1 Construction of expression vector for target protein–GFP–ErmC fusion 3.2 Expression test 4. Selecting Cells for Better Expression 5. Characterizing Evolved Strains 5.1 Basic characterizations 5.2 Plasmid curing 5.3 Functional assays 6. Summary Acknowledgments References

2 4 5 6 8 8 9 12 13 13 14 14 15 16 16

Abstract A major bottleneck in the characterization of membrane proteins is low yield of functional protein in recombinant expression. Microorganisms are widely used for recombinant protein production, because of ease of cultivation and high protein yield. However, the target proteins do not always obtain their native conformation and may end up in a nonfunctional state, in insoluble aggregates. For screening of functional protein, it is thus important to readily discriminate aggregated, mistargeted protein from globally well-folded, membrane-inserted protein. We developed a robust strategy for expression screening of functional proteins in bacteria, which is based on directed evolution. In this strategy, the C-terminus of the target membrane protein is tagged with two additional protein domains in tandem. The first one is green fluorescent protein (GFP), which functions as a reporter of the global folding state of the fusion protein. The other one is the erythromycin resistance protein (23S ribosomal RNA adenine N-6 Methods in Enzymology ISSN 0076-6879 http://dx.doi.org/10.1016/bs.mie.2015.01.003

#

2015 Elsevier Inc. All rights reserved.

1

ARTICLE IN PRESS 2

Franz Y. Ho and Bert Poolman

methyltransferase, ErmC), which confers a means to select for enhanced expression. By gradually increasing the antibiotic concentration in the medium, we force the cells to evolve in a way that allows more functional target-GFP–ErmC to be expressed. The acquired genomic mutations can be generic or membrane protein specific. This strategy is readily adopted for the expression of any protein and ultimately yields a wealth of genomic data that may provide insight into the factors that limit the production of given classes or types of proteins.

1. INTRODUCTION Integral membrane proteins contribute 15–30% of the open-reading frames found in the genomes of organisms from all domains of life (Bendtsen, Binnewies, Hallin, & Ussery, 2005; Wallin & von Heijne, 1998). However, our understanding of function–structure relationships of membrane proteins is low compared to water-soluble proteins (Granseth, Seppa¨la¨, Rapp, Daley, & Von Heijne, 2007). There are many difficulties in studying membrane proteins, and the challenge already starts with the expression of the genes and production of the proteins in a functional state. Recombinant expression in heterologous hosts is the most versatile strategy, but difficult to produce proteins often cause toxicity to the cells and end up themselves in insoluble aggregates (Bill et al., 2011). Expression conditions can be optimized but this is a laborious process (Francis & Page, 2010). Large-scale production of proteins is commonly achieved using microorganisms, because of the ease of growth and many genetic and biochemical tools are available. Several factors determine the production level and functionality of the proteins, such as (i) the toxicity the proteins evoke to the host cells (Wagner et al., 2007), (ii) the posttranslational modifications required for structure and/or functionality (Grisshammer & Tate, 1995), (iii) the difference in codon usage between the recombinant gene and the expression host (Angov, Hillier, Kincaid, & Lyon, 2008), (iv) limitations of protein synthesis precursors such as tRNA and amino acids (Marreddy et al., 2010; Puri et al., 2014), (v) overloading of foldase and chaperone activities (Tate, Whiteley, & Betenbaugh, 1999), (vi) saturation of the membrane protein insertion machinery (Loll, 2003; Wagner, Bader, Drew, & de Gier, 2006), and (vii) uncoordinated protein biosynthesis kinetics (Bill et al., 2011; De Marco, 2013). Membrane proteins have a complex biogenesis pathway, requiring chaperones and machineries for membrane insertion, protein folding, and oligomerization, but often the limiting factor(s) leading to low or nonfunctional expression is not understood. Rather than

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

3

screening numbers of expression conditions, it is easier to improve the production by evolving the host. In this method, we describe a simple procedure for engineering Escherichia coli by forcing the cells to produce more desired protein by fusing a folding indicator and antibiotic resistance marker to its C-terminus (Fig. 1A). The same strategy has been proven successful for the overexpression of membrane proteins in both the Gram-positive bacterium Lactococcus lactis and the Gram-negative bacterium E. coli (Gul, Linares, Ho, & Poolman, 2014; Linares, Geertsma, & Poolman, 2010).

A

B

25

50

20

45

ErmC GFP

40

15

35

10

30 5 25 0

C

10

−5

10

−4

−3

10

L-Arabinose

C

D L-Arabinose

(% w/v)

Fluorescence (a.u.)

N

Lactose transport activity (nmol per mg protein per min)

Target protein

L-Arabinose

2.0 × 10 −5 2.0 × 10 −4

10

−2

−1

0

10

(% w/v)

(% w/v)

1.0 × 10 −3 1.5 × 10 −3 4.0 × 10 −3 2.0 × 10 −1

10−5 10−4 10−3 10−2 10−1 In-gel fluorescence

Anti-His immunoblot

Figure 1 (A) Cartoon of fusion protein used for strain selection. The folding reporter GFP and the selection marker encoding the 23S ribosomal RNA adenine N-6methyltransferase (ErmC) are fused in tandem at the C-terminus of the target membrane protein. N and C indicate the N- and C-terminus of the fusion protein. (B) Transport activity of LacSΔIIA fused with GFP in E. coli MC1061 cells. The expression of LacSΔIIA was controlled by the concentration of L-arabinose. (C) Expression of LacSΔIIA–GFP fusion was analyzed by SDS-PAGE. Upper panel: in-gel fluorescence; lower panel: anti-His immunoblot. White and black arrows indicate the folded and misfolded fusion protein, respectively. (D) Confocal microscopy images of E. coli MC1061 cells expressing LacSΔIIA–GFP to different levels; the L-arabinose concentration is indicated above the panels. Upper panel: close-up of cells to indicate the distribution of the fluorescence (scale bar, 2 μm). Lower panel: overview of the culture (scale bar, 10 μm). Panels (B–D) are modified with permission from Geertsma, Groeneveld, Slotboom, and Poolman (2008); copyright (2008) by the National Academy of Sciences USA.

ARTICLE IN PRESS 4

Franz Y. Ho and Bert Poolman

1.1 Green fluorescent protein as a folding reporter Protein folding can start right after the nascent chain is formed, but in case of membrane proteins, it is delayed as translation and membrane insertion are coupled and folding is completed after the last transmembrane segments have left the translocon (Luirink, von Heijne, Houben, & de Gier, 2005). In multidomain proteins, domains are tethered by either structured or disordered linkers and the rate of folding of individual domains can differ (Arviv & Levy, 2012; George & Heringa, 2002; Jappelli, Luzzago, Tataseo, Pernice, & Cesareni, 1992; Zhang & Ignatova, 2009). In the present method, green fluorescent protein (GFP) is used as a global folding reporter, which requires fusion of the fluorescent protein to the C-terminus of the target membrane protein. Thus, the synthesis of the membrane protein precedes that of the fluorescent protein. The folding of a membrane protein is typically faster than the folding and maturation of GFP, which takes about 30–90 min (Evdokimov et al., 2006; Reid & Flynn, 1997; Waldo, Standish, Berendzen, & Terwilliger, 1999). Therefore, when the target membrane protein is misfolded, it is likely to drag the not yet fully synthesized, folded, and/or matured GFP into an aggregated state. In contrast, when the target membrane protein is properly folded, the GFP β-barrel is formed and the chromophore will mature. Proper folding of the protein protects the chromophore from quenching by water dipoles, paramagnetic oxygen, or cis–trans isomerization (Tsien, 1998). The maturation process involves a series of covalent rearrangements of the amino acids that form the tripeptide chromophore (Ser/Thr65, Tyr66, and Gly67) within an α-helix that is buried inside the hollow β-barrel. When the chromophore matures, GFP becomes SDS resistant, and the protein migrates faster in SDS-PAA than the fully denatured polypeptide. The apparent difference in migration is about 10 kDa, and this difference is also observed when GFP is fused to another protein (Geertsma, Groeneveld, et al., 2008). Thus, when a membrane protein is well folded, the C-terminal GFP will fully mature, and the whole fusion protein migrates faster on SDS-PAA than the misfolded membrane protein–GFP fusion. By analyzing protein expression on immunoblots, one can easily discriminate the well-folded and misfolded proteins (Drew, Lerch, Kunji, Slotboom, & de Gier, 2006; Geertsma, Groeneveld, et al., 2008). Moreover, the fluorescence reports the absolute amount of folded fusion protein, which can be observed not only in gel but also in vivo (Drew et al., 2006; Linares et al., 2010). We and others have observed that the in vivo activity of transport proteins correlates with the corresponding in-cell and in-gel GFP fluorescence of the

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

5

fusion proteins (Geertsma, Groeneveld, et al., 2008; Hibi et al., 2008; Schlegel et al., 2012). In brief, the glutamate transporter GltP from E. coli and lactose transporter LacSΔIIA from Streptococcus thermophilus have been fused to the N-terminus of GFP and expressed in E. coli. The measured transport rate of glutamate or lactose matches the GFP fluorescence intensity both in vivo and in SDS-PAA gels (Fig. 1), which is observed over a wide range of expression levels (Geertsma, Groeneveld, et al., 2008). This indicates that the fluorescence intensity of GFP can be used as a read-out of the in vivo activity of the target membrane protein. However, since the chromophore maturation requires aerial oxygen, the culture has to be well aerated when the protein is being expressed (see figure 1C in Linares et al., 2010). We emphasize that the fluorescence of GFP reports the global folding of the target molecule, but the membrane-inserted protein is not necessarily functional in every aspect. Factors like final folding steps, oligomerization, membrane lipid composition, and others can impact the observed activity (Grisshammer & Tate, 1995).

1.2 Antibiotic resistance marker protein for selection One of the key features of our strain engineering system is the tagging of the target membrane protein–GFP fusion with an antibiotic resistance marker at the C-terminus. Thereby, cells expressing folded, membrane-inserted, fluorescent fusion protein have a growth advantage when cultivated in the presence of the antibiotic. We have tested several antibiotic resistance markers, such as chloramphenicol acetyltransferase, tetracycline efflux protein, aminoglycoside phosphotransferase, and 23S ribosomal RNA adenine N-6-methyltransferase (ErmC). Of these antibiotic resistance markers, the water-soluble ErmC has the most favorable properties and did not strongly affect the expression or activity of the membrane proteins to which it was fused. Contrary to ErmC, all the other antibiotic resistance markers function as oligomers, which is likely to impact the protein to which they are fused (Gul et al., 2014). Proteome-wide topology studies in E. coli and S. cerevisiae indicate that 80% of the integral membrane proteins have their C-termini at the cytosolic side (Cin topology) (Daley et al., 2005; Kim, Mele´n, Osterberg, & Von Heijne, 2006; Von Heijne, 2006). Our strategy of using GFP as folding reporter and ErmC to select for increased expression is not readily applicable to proteins with the C-terminus on the outside, since GFP is not fluorescent when localized in the periplasm of, e.g., E. coli. The localization of the

ARTICLE IN PRESS 6

Franz Y. Ho and Bert Poolman

C-terminus of the target protein is readily predicted by various online protein topology prediction programs (Bernsel et al., 2008; Hennerdal & Elofsson, 2011). Thus, our selection system is based on expressing membrane protein fused with C-terminal inside GFP and ErmC.

2. PREPARATION OF ERYTHROMYCIN-SENSITIVE E. COLI STRAIN Compared to Gram-positive bacteria such as L. lactis, enterobacteriaceae including E. coli are intrinsically resistant to erythromycin (Andremont, Gerbaud, & Courvalin, 1986). Therefore, we engineered the erythromycin sensitivity of the MC1061 strain by knocking out the acriflavine resistance B (acrB) gene. AcrB is a member of the resistance nodulation cell division (RND) superfamily and forms the AcrAB-TolC tripartite drug efflux system. It recognizes a wide variety of toxic compounds, including antibiotics, and transports these compounds out of the cell (Pos, 2009; Tal & Schuldiner, 2009). In our initial experiments, we used drug efflux mutants prepared in the BW25113 strain (Gul et al., 2014). However, E. coli MC1061 (araD139, Δ(araA-leu)7697, ΔlacX74, galU,galK, hsr, hsm+, strA; Casadaban & Cohen, 1980) is generally a better host for membrane protein expression, and thus we inactivated acrB gene in this strain by targeted gene knock-out. Gene inactivation can be achieved by various methods. We used the recombination-mediated genetic engineering method (recombineering) to create E. coli MC1061ΔacrB, which is simple and efficient for introducing site-specific mutations. It utilizes the bacteriophage λ homologous recombination proteins, called RED, to modify the genome of E. coli by using linear double-stranded (ds) DNA fragments of around 50 nucleotides that are homologous to the target locus (Datsenko & Wanner, 2000). Here, we describe the background and procedure of “recombineering” briefly; for the detailed protocol, we refer to Sawitzke et al. (2007) and Thomason, Sawitzke, Li, Costantino, and Court (2014). Primer sequences (Table 1) for amplification of the DNA fragment, used for knocking out the acrB coding sequence, are designed according to Baba et al. (2006). Capital letters in the primer sequences are complementary to the sequences adjacent to the chromosomal acrB, while the small letters are sequences priming to pKD13. This plasmid carries a kanamycin resistance gene cassette flanked by Flp recognition targets (FRT) at both ends, which is recognized by yeast Flp recombinase (Datsenko & Wanner, 2000). Using

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

7

Table 1 Primers for inactivation of acrB gene Primer Sequence (50 to 30 )

Forward

TTACGCGGCCTTAGTGATTACACGTTGTATCAATGAT GATCGACAGTATGgtgtaggctggagctgcttc

Reverse

TCAGCCTGAACAGTCCAAGTCTTAACTTAAACAGGAG CCGTTAAGACATGctgtcaaacatgagaattaa

the primers listed in Table 1, the PCR-amplified DNA fragment for recombineering contains the kanamycin resistance marker gene flanked by FRT sites, with 50-nucleotide-long extensions corresponding to upstream and downstream of the chromosomal acrB coding region. After replacing the acrB gene by the FRT-flanked kanamycin resistance gene cassette, one can remove the marker by temporally expressing Flp to excise it from the genome, leaving only a short sequence of FRT as a scar. In principle, any antibiotic resistance gene without FRT sequences can be used for replacing the genomic sequence, if removal of the antibiotic marker is unnecessary (Sawitzke et al., 2007). 1. Transform pKD46 in E. coli MC1061; pKD46 is a temperature-sensitive plasmid with coding sequences for λ RED proteins. 2. Grow the transformed cells at 30 °C and shake the culture at 200 rpm. Induce the expression of λ RED proteins by the addition of 0.2% (w/v) L-arabinose. 3. Collect cells during mid-log phase and prepare electrocompetent cells. 4. Amplify the DNA fragment by PCR that corresponds to the acrB locus. 5. Transform the cells with the amplified DNA fragment by electroporation. 6. Select colonies on LB agar supplemented with 25 μg/ml kanamycin and confirm the genome modification by colony PCR. 7. Remove pKD46 plasmid by growing the cells on nonselective medium (LB agar without antibiotics) at 37 °C. 8. Test the erythromycin sensitivity of the parent and ΔacrB strain; the recommended concentration range is 1–500 μg/ml. E. coli MC1061 grows normal up to 100 μg/ml of erythromycin, whereas the ΔacrB strain is already sensitive at 5 μg/ml; other strains may have different erythromycin sensitivity. If the growth of ΔacrB cells in the presence of 5 μg/ml erythromycin is minimal after overnight incubation, the strain can be used for host evolution (Gul et al., 2014). Erythromycin sensitivity of MC1061 before and after acrB inactivation is shown in Fig. 2.

ARTICLE IN PRESS 8

Fraction of surviving cells

Franz Y. Ho and Bert Poolman

Erythromycin (µg/ml)

Figure 2 Sensitivity of E. coli strain MC1061 to erythromycin before and after inactivation of the acrB gene. Open squares: MC1061; black rhombus: MC1061ΔacrB. The error bars show standard deviation (SD) from three independent measurements.

3. PREPARATION OF EXPRESSION PLASMID 3.1 Construction of expression vector for target protein–GFP–ErmC fusion There are wide varieties of plasmids, which can be used for protein expression in E. coli. In the current method, we prefer the araBAD expression system for production of membrane proteins. It uses the PBAD promoter, a tightly regulated bacterial promoter system, which gives minimal expression in the absence of inducer and toxicity by misfolded protein is readily controlled. Furthermore, the dynamic range of expression upon induction by L-arabinose can be more than three orders of magnitude (Guzman, Belin, Carson, & Beckwith, 1995). Genes encoding EGFP and ErmC are first cloned at the 30 end of the multiple-cloning site of the pBAD vector, after which the coding sequence of the target protein is inserted at the 50 end in frame with the gene for the GFP–ErmC tandem. Another concern is the linker tethering the target membrane protein and the GFP. We tested the effect of the linker length and the folding reporter efficiency of the C-terminal fused GFP. We used a flexible (GGGS)n peptide sequence and observed no difference in whole-cell fluorescence from no linker to (GGGS)5 in case the fusion protein itself has already up to 32 amino acids between the last transmembrane segment and the GFP (Linares et al., 2010). Furthermore, for purification and immunodetection of the proteins, we routinely add a 10 His tag at

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

9

the C-terminus of the fusion protein; and for functional and structural studies, we engineer a TEV protease recognition peptide sequence in the linker region between the target protein and GFP (Geertsma, Groeneveld, et al., 2008; Gul et al., 2014). The expression plasmid can be converted into the ligation-independent cloning (LIC) compatible system, which facilitates the cloning procedure. For the details of designing and conversion of plasmids into LIC compatible vectors, we refer to Geertsma & Poolman, 2007. 1. Design primers for amplifying codon optimized gfp (GenBank: KF410615; we use gfp+ that encodes EGFP; Scholz, Thiel, Hillen, & Niederweis, 2000) and ermC from Staphylococcus aureus (GenBank: JF968525; Weisblum, 1995). The 10 His tag coding sequence can be prepared by annealing complementing oligonucleotides. They have to be in-frame after construction. 2. Clone the coding sequences of gfp, ermC, and the 10 His in the multiple-cloning site. 3. Clone the coding sequence of the target gene into the multiple-cloning site. Make sure that all the sequences are in frame.

3.2 Expression test Before selecting cells with increased expression, it is important to first find the optimal induction condition for the fusion protein. At least two parameters should be tested: the inducer concentration and the temperature of expression. In general, the fraction of properly folded protein is higher at low induction level and at low growth temperature, since excessive transcription and translation usually overload the chaperone and translocation systems. The optimal induction condition has to be determined experimentally, but we routinely perform the experiments at 20 °C and L-arabinose concentrations of 0.01–0.001% (w/v). The expression test is also useful after the selection process, i.e., to find the best performing clones and to benchmark their expression performance. 3.2.1 Bacterial culture for protein expression and determination of whole-cell fluorescence 1. Transform E. coli cells with pBAD bearing the gene for the target membrane protein fused with GFP (with or without ErmC). 2. Pick a single colony and inoculate into LB medium supplemented with 100 μg/ml ampicillin (to maintain the pBAD vector), and grow the culture overnight at 37 °C with shaking at 200 rpm for adequate aeration.

ARTICLE IN PRESS 10

Franz Y. Ho and Bert Poolman

3. Inoculate 1% (v/v) of overnight culture into fresh LB medium, supplemented with 100 μg/ml ampicillin, and grow the cells to A600 of 0.5–0.6 (1 cm light path) at 37 °C. 4. When the cells reach A600 of 0.5–0.6, induce protein expression by adding L-arabinose to the culture. Keep a fraction of cells for detection of un-induced expression. Initially, a broad range of induction conditions is tested, which can include varying the L-arabinose concentration, induction time, and growth temperature (Table 2). 5. At the end of the cultivation, measure the optical density of the cultures at 600 nm. 6. Collect cells by centrifugation at 5000  g for 15 min, and wash the cells by resuspension in 0.5 of culture volume of 50 mM potassium phosphate (KPi), pH 7 (wash buffer). Repeat the washing step and resuspend the cells in 0.2  of the original culture volume of wash buffer. 7. After the second wash, remove the supernatant and resuspend cells in wash buffer to equal protein content (e.g., 3–5 mg/ml total protein, given to the fact that A600 of 1 corresponds to about 0.3 mg/ml total protein). 8. Aliquot 100 μl of cell suspension (in triplicate) into wells of a 96-well plate, which is suitable for fluorescence measurements (black wells with transparent bottom). Use a fluorescence plate reader with suitable excitation and emission filters for GFP (488 nm excitation, 510 nm emission). 9. Measure the optical density of the cell suspension in the wells. These values are used for normalizing the fluorescence counts and subsequent gel-based analysis.

Table 2 Parameters of expression optimization Parameters Recommendation

Expression temperature

16–37 °C are commonly used e.g., 16, 20, 25, 30, and 37 °C

Time

1 h to overnight e.g., 1, 2, 4, 6 h, or overnight

Inducer concentration

104–101% (w/v) L-arabinose for the araBAD system; strains that catabolize L-arabinose require higher concentrations of the inducer (Guzman et al., 1995; Horazdovsky & Hogg, 1989) 20 μM to 0.4 mM IPTG for the lac promoter; strains that lack the lacY gene generally require a higher concentration of IPTG

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

11

3.2.2 Gel-based analysis of the expressed fusion constructs Although whole-cell fluorescence indicates the amount of globally folded membrane protein–GFP–ErmC fusion, the fluorescence signal can be misleading when part of the fusion protein is degraded and soluble GFP is measured. Therefore, it is necessary to confirm the full length of the fusion proteins by SDS-PAGE. Moreover, folded and misfolded fusion proteins can be discriminated by SDS-PAGE followed by Western blotting (Geertsma, Groeneveld, et al., 2008). The choice of lysis buffer depends on the target proteins and subsequent purification steps; 50 mM potassium phosphate, pH 7, supplemented with 1 mM MgCl2, 10% glycerol, 25 μg/ml DNaseI, and 1 mM PMSF is usually a good starting point. 1. Resuspend cells corresponding to 3–5 mg/ml in lysis buffer. 2. Break the cells by either beating with 0.1 μm glass beads (at 4 °C) or by probe sonication in an ice-water bath (control sonication power to prevent foaming and heating of the sample). 3. Mix 40 μl of cell lysate with 10 μl 5  Laemmli buffer (Laemmli, 1970), incubate the mixture at 37–50 °C for 5 min (temperature should be lower than 65 °C, as otherwise the GFP will denature). 4. Separate 5–15 μl of each samples by SDS-PAGE; samples should contain 20–100 μg of total cell protein. 5. After electrophoresis, rinse the gel with water and capture its fluorescence on a proper imaging system, such as ImageAnalyzer LAS from FujiFilm or Typhoon from GE Healthcare. In the fluorescence image, only properly matured GFP and thus overall folded fusion proteins are detected. 6. Use the same gel, transfer the separated proteins onto PVDF membrane by semidry electrotransfer (Bjerrum & Scha¨fer-Nielsen, 1986; Gershoni, 1988). 7. After the transfer, visualize the protein expression by Western blotting, using an anti-His antibody and an appropriate secondary antibody (Gershoni, 1988). 8. Both folded and misfolded proteins are visible on the Western blot. Different culture and induction conditions can be compared at this stage, and optimal starting conditions for expression screening are chosen. An example of an expression test is shown in Fig. 3. In the example, the cAMP receptor 1 (cAR1) from Dictyostelium was expressed in E. coli BW25113B and NG3 at different inducer (L-arabinose) concentrations. The NG3 strain was obtained by selecting for enhanced production of the glutamate transporter GltP in the BW25113B strain as described in this method. It carries

ARTICLE IN PRESS 12

Franz Y. Ho and Bert Poolman

Whole-cell fluorescence (a.u.)

A

L-Arabinose

(% w/v)

B L-Arabinose

(% w/v)

In-gel fluorescence

Anti-His tag immunoblot

Figure 3 Example of expression test. (A) In vivo cell fluorescence of E. coli BW25113B cells (open square) and its evolved derivative NG3 (closed circle) expressing the Dictyostelium discoideum cAMP receptor 1–GFP–ErmC fusion protein at different L-arabinose concentrations. The fluorescence intensity is normalized by cell density and error bars show the standard deviation (SD) of three independent measurements. (B) In-gel GFP fluorescence and immunoblots detected by anti-His tag antibody of the same gels are shown. Black and white arrows indicate the positions of the nonfluorescent (misfolded) and fluorescent (folded) protein species, respectively.

genomic mutations in the hns, ung, and cpxA genes when compared to the parental BW25113B strain (Gul et al., 2014).

4. SELECTING CELLS FOR BETTER EXPRESSION Improved expression strains are selected by directed evolution. The target protein is fused to GFP and ErmC. If the target protein is misfolded, it will drag GFP and ErmC in a nonfunctional state and growth will be

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

13

limited by the presence of erythromycin in the medium. Evolved cells that produce more folded protein will be more resistant to erythromycin and outcompete the parental strain. By culturing cells in the presence of inducer to trigger protein expression and gradually increasing the erythromycin concentration, one selects for better performing cells in terms of protein expression. We use GFP to identify among the erythromycin-resistant strains those that produce the highest fluorescence, and thus fusion protein that is inserted into the membrane rather than misfolded protein with some ErmC activity. 1. Pick a single colony and inoculate into LB medium supplemented with 100 μg/ml ampicillin (or appropriate antibiotics to maintain the plasmid). Grow overnight at 37 °C with vigorous agitation (200 rpm). 2. Dilute the overnight culture in 1–100 ratio. Grow at 37 °C to A600 0.4, and then lower the cultivation temperature to 25 °C; we occasionally perform the selection at 17 °C. 3. Induce protein expression at A600 0.5–0.6, by adding L-arabinose to a final concentration of 0.01% (w/v). 4. After 2 h, inoculate fresh LB-ampicillin medium with the induced culture at 2% (v/v); keep L-arabinose at 0.01% for protein expression, and add erythromycin at 5 μg/ml for selection. Continue the cultivation for 48 h. 5. Transfer 2% of the culture into fresh medium with the same supplements but an elevated concentration of erythromycin, e.g., 10 μg/ml, and continue growth for 48 h. 6. Repeat step 5 and increase the erythromycin concentration consecutively from 10 μg/ml to 20, 50, 100, and 200 μg/ml. 7. Optionally, one can continue the subculturing at 200 μg/ml erythromycin by repeating step 5 for one to five times. 8. At the end of the selection, the cells are plated onto LB agar supplemented with 100 μg/ml ampicillin, 0.01% (w/v) L-arabinose, and the highest concentration of erythromycin reached. 9. Pick the most fluorescent colonies from the plate for further analysis.

5. CHARACTERIZING EVOLVED STRAINS 5.1 Basic characterizations 5.1.1 Plasmid copy number, DNA sequencing, and transcript levels Recombinant protein yield can be increased by altered copy number or mutations within the plasmid. We estimate changes in plasmid copy number from gels or more precisely by quantitative PCR (Skulj et al., 2008). So far,

ARTICLE IN PRESS 14

Franz Y. Ho and Bert Poolman

we have never found changes in plasmid copy number or mutations in the plasmid. Whole-genome sequencing using next generation sequencing technologies (ultradeep sequencing) revealed multiple mutations in the genomes of E. coli and L. lactis (Gul et al., 2014; Linares et al., 2010). The transcription level of the target protein can be determined by qRTPCR (Ba´ez-Viveros et al., 2007).

5.2 Plasmid curing The evolved E. coli cells are subsequently cured from the expression plasmid, which is done by subculturing cells at 5–7 °C above the optimal growth temperature (42–44 °C) (Trevors, 1986). Depending on the properties of the plasmid (copy number, stability), the cells may have to be subcultured up to 50 times. Cells which are sensitive to the marker antibiotic are isolated. 1. Inoculate evolved strains in LB medium without the plasmid marker antibiotic; i.e., ampicillin in case of pBAD vectors. 2. Grow the cells at 37 °C with vigorous agitation until late log phase. 3. Transfer the cells to fresh LB medium at a 5–10% (v/v) inoculum, continue cultivation nonselectively at 42–44 °C with vigorous agitation. 4. Repeat step 3 and screen colonies until the cells are sensitive to ampicillin. 5. Purify isolated colonies and characterize the strains, following transformation with fresh plasmid. The evolved expression hosts can now be tested for the expression of different target proteins.

5.3 Functional assays The function of the expressed protein can sometimes already be tested in vivo. For example, it is possible to test the function of transport proteins by measuring the import or export of radiolabeled substrate (Geertsma, Groeneveld, et al., 2008). Similarly, it is possible to probe binding of ligands to a receptor in vivo when high-affinity ligands are available (Brodersen, Honore´, Pedersen, & Klotz, 1988; Detmers et al., 2000; Silhavy, Szmelcman, Boos, & Schwartz, 1975). Ultimately, one will have to purify the protein and determine if the protein isolated from the evolved host has genuine activity. For membrane transport proteins, we have previously described various simple methods to test their activity either in membrane vesicles or proteoliposomes (Geertsma, Nik Mahmood, et al., 2008; Mulligan et al., 2009; Poolman et al., 2005).

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

15

6. SUMMARY

Plasmid curing

Selection

Induction

Strain engineering

In this chapter, we describe a straightforward procedure for increasing the level of well-folded membrane protein by directed evolution of bacteria. The method involves selection of E. coli strains that tolerate a higher concentration of erythromycin when the target membrane protein, fused to

Figure 4 Schematic outline of the selection procedure. P, passage number.

ARTICLE IN PRESS 16

Franz Y. Ho and Bert Poolman

GFP and ErmC, is produced at elevated level. The evolved strains can have multiple mutations in their genome, which differ depending on the target protein. We emphasize that the method is not restricted to overexpressing membrane proteins but is applicable to any difficult to produce protein. We also present protocols for the basic characterization of the evolved strains. The complete procedure is summarized schematically in Fig. 4.

ACKNOWLEDGMENTS We thank Prof. Shimon Schuldiner for the gift of E. coli strain BW25113 and the isogenic drug efflux-deficient derivatives. For developing the system, we thank Dr. Eric Geertsma, Dr. Daniel Linares, and Dr. Nadia Gul for their pioneering work. We also thank the Netherlands Science Foundation (NWO; TOP-GO program grant, project number 700.10.53) and the Michael J. Fox foundation (Rapid Response Innovation Awards, 2014) for financial support.

REFERENCES Andremont, A., Gerbaud, G., & Courvalin, P. (1986). Plasmid-mediated high-level resistance to erythromycin in Escherichia coli. Antimicrobial Agents and Chemotherapy, 29(3), 515–518. http://dx.doi.org/10.1128/AAC.29.3.515, Updated. Angov, E., Hillier, C. J., Kincaid, R. L., & Lyon, J. A. (2008). Heterologous protein expression is enhanced by harmonizing the codon usage frequencies of the target gene with those of the expression host. PLoS One, 3(5), e2189. http://dx.doi.org/10.1371/ journal.pone.0002189. Arviv, O., & Levy, Y. (2012). Folding of multidomain proteins: Biophysical consequences of tethering even in apparently independent folding. Proteins, 80(12), 2780–2798. http:// dx.doi.org/10.1002/prot.24161. Baba, T., Ara, T., Hasegawa, M., Takai, Y., Okumura, Y., Baba, M., et al. (2006). Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: The Keio collection. Molecular Systems Biology, 2, 2006.0008. http://dx.doi.org/10.1038/msb4100050. Ba´ez-Viveros, J. L., Flores, N., Jua´rez, K., Castillo-Espan˜a, P., Bolivar, F., & Gosset, G. (2007). Metabolic transcription analysis of engineered Escherichia coli strains that overproduce L-phenylalanine. Microbial Cell Factories, 6, 30. http://dx.doi.org/ 10.1186/1475-2859-6-30. Bendtsen, J. D., Binnewies, T. T., Hallin, P. F., & Ussery, D. W. (2005). Genome update: Prediction of membrane proteins in prokaryotic genomes. Microbiology (Reading, England), 151(Pt 7), 2119–2121. http://dx.doi.org/10.1099/mic.0.28181-0. Bernsel, A., Viklund, H., Falk, J., Lindahl, E., von Heijne, G., & Elofsson, A. (2008). Prediction of membrane-protein topology from first principles. Proceedings of the National Academy of Sciences of the United States of America, 105(20), 7177–7181. http://dx.doi. org/10.1073/pnas.0711151105. Bill, R. M., Henderson, P. J. F., Iwata, S., Kunji, E. R. S., Michel, H., Neutze, R., et al. (2011). Overcoming barriers to membrane protein structure determination. Nature Biotechnology, 29(4), 335–340. http://dx.doi.org/10.1038/nbt.1833. Bjerrum, O. J., & Scha¨fer-Nielsen, C. (1986). Buffer systems and transfer parameters for semidry electroblotting with a horizontal apparatus. In M. J. Dunn (Ed.), Electrophoresis’86: Proceedings of the fifth meeting of the international electrophoresis society, London (pp. 315–327). Weinheim: VCH Verlagsgesellschaft.

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

17

Brodersen, R., Honore´, B., Pedersen, A. O., & Klotz, I. M. (1988). Binding constants for ligand–carrier complexes. Trends in Pharmacological Sciences, 9(7), 252–257. http://dx.doi. org/10.1016/0165-6147(88)90155-1. Casadaban, M. J., & Cohen, S. N. (1980). Analysis of gene control signals by DNA fusion and cloning in Escherichia coli. Journal of Molecular Biology, 138(2), 179–207. Daley, D. O., Rapp, M., Granseth, E., Mele´n, K., Drew, D., & von Heijne, G. (2005). Global topology analysis of the Escherichia coli inner membrane proteome. Science (New York, N.Y.), 308(5726), 1321–1323. http://dx.doi.org/10.1126/science.1109730. Datsenko, K., & Wanner, B. L. (2000). One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proceedings of the National Academy of Sciences of the United States of America, 97(12), 6640–6645. http://dx.doi.org/10.1073/ pnas.120163297. De Marco, A. (2013). Recombinant polypeptide production in E. coli: Towards a rational approach to improve the yields of functional proteins. Microbial Cell Factories, 12(1), 101. http://dx.doi.org/10.1186/1475-2859-12-101. Detmers, F. J., Lanfermeijer, F. C., Abele, R., Jack, R. W., Tampe, R., Konings, W. N., et al. (2000). Combinatorial peptide libraries reveal the ligand-binding mechanism of the oligopeptide receptor OppA of Lactococcus lactis. Proceedings of the National Academy of Sciences of the United States of America, 97(23), 12487–12492. http://dx.doi.org/ 10.1073/pnas.220308797. Drew, D., Lerch, M., Kunji, E., Slotboom, D.-J., & de Gier, J.-W. (2006). Optimization of membrane protein overexpression and purification using GFP fusions. Nature Methods, 3(4), 303–313. http://dx.doi.org/10.1038/nmeth0406-303. Evdokimov, A. G., Pokross, M. E., Egorov, N. S., Zaraisky, A. G., Yampolsky, I. V., Merzlyak, E. M., et al. (2006). Structural basis for the fast maturation of Arthropoda green fluorescent protein. EMBO Reports, 7(10), 1006–1012. http://dx.doi.org/ 10.1038/sj.embor.7400787. Francis, D. M., & Page, R. (2010). Strategies to optimize protein expression in E. coli. Current protocols in protein science/editorial board, John E. Coligan. . .[et al.]: Vol. 61 (pp. 5.24.1–5.24.29), chapter 5 (August). http://dx.doi.org/10.1002/0471140864. ps0524s61. Geertsma, E. R., Groeneveld, M., Slotboom, D.-J., & Poolman, B. (2008). Quality control of overexpressed membrane proteins. Proceedings of the National Academy of Sciences of the United States of America, 105(15), 5722–5727. Geertsma, E. R., Nik Mahmood, N. A. B., Schuurman-Wolters, G. K., & Poolman, B. (2008). Membrane reconstitution of ABC transporters and assays of translocator function. Nature Protocols, 3(2), 256–266. http://dx.doi.org/10.1038/nprot.2007.519. Geertsma, E. R., & Poolman, B. (2007). High-throughput cloning and expression in recalcitrant bacteria. Nature Methods, 4(9), 705–707. http://dx.doi.org/10.1038/nmeth1073. George, R., & Heringa, J. (2002). An analysis of protein domain linkers: Their classification and role in protein folding. Protein Engineering, 15(11), 871–879. Gershoni, J. M. (1988). Protein blotting: A manual. Methods of Biochemical Analysis, 33, 1–58. Granseth, E., Seppa¨la¨, S., Rapp, M., Daley, D. O., & Von Heijne, G. (2007). Membrane protein structural biology—How far can the bugs take us? Molecular Membrane Biology, 24(5–6), 329–332. http://dx.doi.org/10.1080/09687680701413882. Grisshammer, R., & Tate, C. G. (1995). Overexpression of integral membrane proteins for structural studies. Quarterly Reviews of Biophysics, 28(3), 315–422. Gul, N., Linares, D. M., Ho, F. Y., & Poolman, B. (2014). Evolved Escherichia coli strains for amplified, functional expression of membrane proteins. Journal of Molecular Biology, 426(1), 136–149. http://dx.doi.org/10.1016/j.jmb.2013.09.009. Guzman, L. M., Belin, D., Carson, M. J., & Beckwith, J. (1995). Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. Journal of Bacteriology, 177(14), 4121–4130.

ARTICLE IN PRESS 18

Franz Y. Ho and Bert Poolman

Hennerdal, A., & Elofsson, A. (2011). Rapid membrane protein topology prediction. Bioinformatics (Oxford, England), 27(9), 1322–1323. http://dx.doi.org/10.1093/ bioinformatics/btr119. Hibi, T., Aoki, S., Oda, K., Munemasa, S., Ozaki, S., Shirai, O., et al. (2008). Purification of the functional plant membrane channel KAT1. Biochemical and Biophysical Research Communications, 374(3), 465–469. http://dx.doi.org/10.1016/j.bbrc.2008.07.026. Horazdovsky, B. F., & Hogg, R. W. (1989). Genetic reconstitution of the high-affinity L-arabinose transport system. Journal of Bacteriology, 171(6), 3053–3059. Jappelli, R., Luzzago, A., Tataseo, P., Pernice, I., & Cesareni, G. (1992). Loop mutations can cause a substantial conformational change in the carboxy terminus of the ferritin protein. Journal of Molecular Biology, 227(2), 532–543. Kim, H., Mele´n, K., Osterberg, M., & von Heijne, G. (2006). A global topology map of the Saccharomyces cerevisiae membrane proteome. Proceedings of the National Academy of Sciences of the United States of America, 103(30), 11142–11147. http://dx.doi.org/10.1073/ pnas.0604075103. Laemmli, U. K. (1970). Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature, 227(5259), 680–685. Linares, D. M., Geertsma, E. R., & Poolman, B. (2010). Evolved Lactococcus lactis strains for enhanced expression of recombinant membrane proteins. Journal of Molecular Biology, 401(1), 45–55. http://dx.doi.org/10.1016/j.jmb.2010.06.002. Loll, P. J. (2003). Membrane protein structural biology: The high throughput challenge. Journal of Structural Biology, 142(1), 144–153. http://dx.doi.org/10.1016/S1047-8477(03) 00045-5. Luirink, J., von Heijne, G., Houben, E., & de Gier, J.-W. (2005). Biogenesis of inner membrane proteins in Escherichia coli. Annual Review of Microbiology, 59, 329–355. http://dx. doi.org/10.1146/annurev.micro.59.030804.121246. Marreddy, R. K. R., Geertsma, E. R., Permentier, H. P., Pinto, J. P. C., Kok, J., & Poolman, B. (2010). Amino acid accumulation limits the overexpression of proteins in Lactococcus lactis. PLoS One, 5(4), e10317. http://dx.doi.org/10.1371/journal. pone.0010317. Mulligan, C., Geertsma, E. R., Severi, E., Kelly, D. J., Poolman, B., & Thomas, G. H. (2009). The substrate-binding protein imposes directionality on an electrochemical sodium gradient-driven TRAP transporter. Proceedings of the National Academy of Sciences of the United States of America, 106(6), 1778–1783. http://dx.doi.org/10.1073/ pnas.0809979106. Poolman, B., Doeven, M. K., Geertsma, E. R., Biemans-Oldehinkel, E., Konings, W. N., & Rees, D. C. (2005). Functional analysis of detergent-solubilized and membranereconstituted ATP-binding cassette transporters. Methods in Enzymology, 400(1999), 429–459. http://dx.doi.org/10.1016/S0076-6879(05)00025-X. Pos, K. M. (2009). Drug transport mechanism of the AcrB efflux pump. Biochimica et Biophysica Acta, 1794(5), 782–793. http://dx.doi.org/10.1016/j.bbapap.2008.12.015. Puri, P., Wetzel, C., Saffert, P., Gaston, K. W., Russell, S. P., Cordero Varela, J., et al. (2014). Systematic identification of tRNAome and its dynamics in Lactococcus lactis. Molecular Microbiology, 93(5), 944–956. http://dx.doi.org/10.1111/mmi.12710. Reid, B. G., & Flynn, G. C. (1997). Chromophore formation in green fluorescent protein. Biochemistry, 36(22), 6786–6791. http://dx.doi.org/10.1021/bi970281w. Sawitzke, J. A., Thomason, L. C., Costantino, N., Bubunenko, M., Datta, S., & Court, D. L. (2007). Recombineering: In vivo genetic engineering in E. coli, S. enterica, and beyond. Methods in Enzymology, 421(06), 171–199. http://dx.doi.org/10.1016/S0076-6879(06) 21015-2. Schlegel, S., L€ ofblom, J., Lee, C., Hjelm, A., Klepsch, M., Strous, M., et al. (2012). Optimizing membrane protein overexpression in the Escherichia coli strain Lemo21(DE3).

ARTICLE IN PRESS Engineering Escherichia coli for Functional Expression

19

Journal of Molecular Biology, 423(4), 648–659. http://dx.doi.org/10.1016/j. jmb.2012.07.019. Scholz, O., Thiel, A., Hillen, W., & Niederweis, M. (2000). Quantitative analysis of gene expression with an improved green fluorescent protein. p6. European Journal of Biochemistry/FEBS, 267(6), 1565–1570. Silhavy, T. J., Szmelcman, S., Boos, W., & Schwartz, M. (1975). On the significance of the retention of ligand by protein. Proceedings of the National Academy of Sciences of the United States of America, 72(6), 2120–2124. Skulj, M., Okrslar, V., Jalen, S., Jevsevar, S., Slanc, P., Strukelj, B., et al. (2008). Improved determination of plasmid copy number using quantitative real-time PCR for monitoring fermentation processes. Microbial Cell Factories, 7, 6. http://dx.doi.org/10.1186/14752859-7-6. Tal, N., & Schuldiner, S. (2009). A coordinated network of transporters with overlapping specificities provides a robust survival strategy. Proceedings of the National Academy of Sciences of the United States of America, 106(22), 9051–9056. http://dx.doi.org/10.1073/ pnas.0902400106. Tate, C. G., Whiteley, E., & Betenbaugh, M. J. (1999). Molecular chaperones stimulate the functional expression of the cocaine-sensitive serotonin transporter. The Journal of Biological Chemistry, 274(25), 17551–17558. http://dx.doi.org/10.1074/jbc.274.25.17551. Thomason, L. C., Sawitzke, J. A., Li, X., Costantino, N., & Court, D. L. (2014). Recombineering: Genetic engineering in bacteria using homologous recombination. Current protocols in molecular biology/edited by Frederick M. Ausubel. . .[et al.]: Vol. 106 (pp. 1.16.1–1.16.39). http://dx.doi.org/10.1002/0471142727.mb0116s106. Trevors, J. (1986). Plasmid curing in bacteria. FEMS Microbiology Letters, 32(3–4), 149–157. http://dx.doi.org/10.1016/0378-1097(86)90286-7. Tsien, R. Y. (1998). The green fluorescent protein. Annual Review of Biochemistry, 67, 509–544. http://dx.doi.org/10.1146/annurev.biochem.67.1.509. Von Heijne, G. (2006). Membrane-protein topology. Nature Reviews. Molecular Cell Biology, 7(12), 909–918. http://dx.doi.org/10.1038/nrm2063. Wagner, S., Baars, L., Ytterberg, A. J., Klussmeier, A., Wagner, C. S., Nord, O., et al. (2007). Consequences of membrane protein overexpression in Escherichia coli. Molecular & Cellular Proteomics: MCP, 6(9), 1527–1550. http://dx.doi.org/10.1074/mcp.M600431MCP200. Wagner, S., Bader, M. L., Drew, D., & de Gier, J.-W. (2006). Rationalizing membrane protein overexpression. Trends in Biotechnology, 24(8), 364–371. http://dx.doi.org/10.1016/ j.tibtech.2006.06.008. Waldo, G. S., Standish, B. M., Berendzen, J., & Terwilliger, T. C. (1999). Rapid proteinfolding assay using green fluorescent protein. Nature Biotechnology, 17(7), 691–695. http://dx.doi.org/10.1038/10904. Wallin, E., & von Heijne, G. (1998). Genome-wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms. Protein Science: A Publication of the Protein Society, 7(4), 1029–1038. http://dx.doi.org/10.1002/pro.5560070420. Weisblum, B. (1995). Erythromycin resistance by ribosome modification. Antimicrobial Agents and Chemotherapy, 39(3), 577–585. Zhang, G., & Ignatova, Z. (2009). Generic algorithm to predict the speed of translational elongation: Implications for protein biogenesis. PLoS One, 4(4), e5036. http://dx.doi. org/10.1371/journal.pone.0005036.

Engineering Escherichia coli for functional expression of membrane proteins.

A major bottleneck in the characterization of membrane proteins is low yield of functional protein in recombinant expression. Microorganisms are widel...
859KB Sizes 7 Downloads 10 Views