Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 1 of 47

1

The SOX transcription factors as key players in pluripotent stem cells

Essam M. Abdelalim, Mohamed M. Emara, and Prasanna R. Kolatkar

Qatar Biomedical Research Institute, Qatar Foundation, Education City, 5825 Doha, Qatar.

Address correspondence to: Essam M. Abdelalim, PhD Qatar Biomedical Research Institute, Qatar Foundation, Education City, Doha 5825, Qatar Tel.: +974-44546432

Fax: +974-44541770

E-mail: [email protected]

1

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 2 of 47

2

Abstract Pluripotent stem cells (PSCs), including embryonic stem cells (ESCs), and induced PSCs (iPSCs) are able to self-renew and differentiate into a multitude of specialized cellular lineages. In these cells, the pluripotential identity is maintained by a group of transcription factors (TFs). Among these factors, SOX TFs play an essential role, not only in regulating pluripotency, but also in mediating self-renewal and differentiation. Some SOX TFs are highly expressed in undifferentiated PSCs, while others are upregulated upon differentiation to promote specific lineage differentiation. Further roles of SOX factors in pluripotency are highlighted through their critical involvement in iPSCs generation. To perform these multiple functions and activities, SOX TFs are strongly associated with a complex regulatory network(s) that involves the binding of SOX factors to variant trans-acting partners to activate or suppress specific genes. Although, SOX2 has attracted special attention as a critical factor in maintaining PSCs characteristics and as an integral component that is required to reprogram somatic cells into pluripotency, new reports widely appreciated that other SOX TFs, such as SOX1, SOX3, or re-engineered SOX7 and SOX17, can compensate for the absence of SOX2 and thus play a fundamental role during the reprogramming process as well as maintaining pluripotency. These findings indicate that the recent progress has greatly expanded our knowledge about the role of SOX factors in PSCs. Thus, in this review we summarize what is currently known about the roles of SOX factors in PSCs as well as their role in somatic cell reprogramming. Also, we intend to provide an update on their relationship with other factors in regulating the characteristics and early differentiation of PSCs. Key Words: ESCs, iPSCs, SOX, Pluripotency, Self-renewal, Reprogramming, Differentiation.

2

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 3 of 47

3

Introduction Embryonic stem cells (ESCs) [1, 2] and induced pluripotent stem cells (iPSCs) [3, 4] are characterized by their unique ability to self-renew and differentiate into all types of cells. Stem cell fate is a precariously balanced phenomenon that is dependent on a variety of factors including extracellular as well as intracellular molecules. The effector molecules themselves are heterogeneous and include proteins (transcription factors, chromatin modifiers, etc.), miRNAs, as well as a host of other moieties. Among these effector classes are the transcription factors (TFs), which are regulatory proteins that bind to a specific DNA sequence to mediate gene expression. Over the past few decades these factors have attracted a great deal of attention as functional proteins that play a significant role in regulating a large number of cellular processes. These processes include, but are not limited to, gene regulation, cell proliferation, cell survival, and cell development. The central role of TFs in mediating gene expression highlights their direct and/or indirect role in controlling pluripotency and stem cell fate [5-8]. TFs can mediate stem cell fate through different strategies. These include the activation of genes that maintain pluripotency or upregulation of various defined factors that direct stem cell differentiation to specific stem cell lineage [5-7, 9, 10]. Early in cell development, TFs make important decisions to decide if cells will remain as stem cells or if they will differentiate into the different germ layers. Central to this decision making process are several players including Oct4, Klfs, the Sox family, and a host of key factors that decide the balance of pluripotency or differentiation. SRY-related high-mobility-group box (SOX) family proteins comprise group of transcriptional regulators having a highly conserved high mobility group (HMG) domain similar to that of sex determining region Y protein (SRY), which mediates DNA binding [11-14]. Sox proteins are divided into groups SOXA-SOXH based on the degree of amino acid identity within

3

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 4 of 47

4 the HMG-box (Table 1). Several SOX proteins have been identified and found to act as critical TFs involved in embryonic development and cell fate decision [15, 16]. The SOXA group consists only of SRY, encoded in mammalian Y-chromosomes [17]; SOXB group consists of two sub-groups (SOXB1 and SOXB2); SOXB1 includes SOX1, SOX2, and SOX3 [18], whereas SOXB2 proteins include SOX14 and SOX21 [19]; SOXC includes SOX4, SOX11 and SOX12 [20]; SOXE includes SOX8, SOX9 and SOX10 [21]; and SOXF includes SOX7, SOX17 and SOX18 [22]; SOXD group includes SOX5, SOX6 and SOX13 [23]; SOXG (SOX15) [24] and SOXH (SOX30) [25] proteins are structurally related to SOXB1 and SOXD proteins, respectively (Table 1). All the structures of Sox HMG domains solved to date are highly similar including the bending of the DNA (Fig.1). The importance of TFs in the stem cell field increased significantly as a result of the breakthrough discovery by Shinya Yamanaka, who showed that overexpression of four transcription factors (OCT4, KLF4, SOX2, and C-MYC) enables reprogramming of somatic cells to specific type of cells that have the same characteristics and properties of natural pluripotent stem cells (PSCs) [3, 4]. These cells are called iPSCs and have the potential to selfrenew and differentiate into the three germ layers [26]. iPSCs were initially induced

by

transducing mouse [3] or human [4] fibroblasts with retroviral vectors expressing the above mentioned reprogramming factors. Since iPSCs discovery, several other factors have been used to elicit the most efficient approach for iPSCs generation. These factors range from small compounds that mimic TFs [27] to drugs that alter TFs or RNA molecules that overexpress reprogramming factors. Although many factors are important for ESC and iPSC maintenance or generation, the duo of OCT4 and SOX2 stands out as two of the most critical factors. Several

4

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 5 of 47

5 studies previously showed that the knockdowns of these factors result in inducing differentiation and losing pluripotency, hence their importance in stem cell regulation. In this review, we highlight the main role of SOX factors in mediating stem cell fate. We provide an overview of the SOX factors during preimplantation embryo development and their role to orchestrate the operation of self-renewal and differentiation in PSCs as well as their importance in the iPSC generation. Finally, we discuss the relationship between Sox factors and other factors in PSCs.

The role of SOX transcription factors in preimplantation embryos One of the first events that occurs during embryonic development is the formation of the blastocyst, which is a hollow sphere composed of an outer trophectoderm (TE) layer and a population of founder cells called inner cell mass (ICM). The ICM cells have the potential to differentiate into the three germ layers derived tissues (ectoderm, mesoderm, and endoderm) in a prevalent phenomenon known as pluripotency (Fig. 2). This makes the ICM a valuable source to derive ESCs, which are PSCs. Therefore, the genes expressed in the ICM are the same as those expressed in ESCs cultured in vitro [1, 2]. Although many factors have been found to be important for the development of preimplantation embryos, SOX TFs stands out as one of the most critical factors [28-30]. Out of the 30 SOX genes, which have been identified in mammals, the most well known of these factors is SOX2, which is expressed at all developmental stages and has been shown to be essential for early embryogenesis. During the early stages of development, SOX2 is expressed in both ICM and TE, whereas at later stages its expression becomes restricted to the ICM [31, 32]. At the late blastocyst stage, where ICM segregates into primitive endoderm, primitive ectoderm, and epiblast cells, SOX2 expression is only detectable

5

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 6 of 47

6 in pluripotent epiblast cells and multipotent cells of the extraembryonic ectoderm (ExEn) [31] but not in primitive endoderm cells [33, 34]. The critical role of SOX2 in the development of preimplantation embryo has been provided by the observation that Sox2 deletion in mouse zygote causes early embryonic death. This embryonic lethal phenotype has been found to be specifically due to the inability to form the epiblast without affecting TE formation [31]. These results have been supported by the work done by Keramari and his colleagues [28] who have related the unobserved effect of Sox2 deletion on TE formation to the presence of the maternal Sox2. The maternal form of SOX2 is found in oocytes and preimplantation embryo and persists until the blastocyst stage, suggesting that maternal SOX2 might mask a phenotype in TE during the early development of Sox2-deficient zygote [28]. Moreover, the knockdown of both maternal and zygotic Sox2 results in a complete arrest of embryos at the morula stage and suppression of TE formation [28]. This is compounded with the failure of Sox2-deficient embryos to produce ESCs, from ICM, trophoblast stem cells (TSCs), or TE [31]. Taken together these findings indicate that SOX2 is essential for the segregation of the TE from the ICM lineage during early embryogenesis. Another key member of the SOX family that plays a role in embryonic development is SOX17. Earlier studies have shown that Sox17 transcript starts to express, during mouse development, at embryonic day 6 in the ExEn [35, 36]. However, a recent study recorded an earlier expression of SOX17 at the morula and blastocyst stages, implying the important role of SOX17 in preimplantation embryos [33]. An overall expression of SOX17 is initially observed at the 32-cell stage, whereas at the late blastocyst stage, SOX17 expression becomes limited to the primitive endoderm lineages [33]. Indeed, the expression of SOX17 at this stage co-localizes and coincides well with the expression of the primitive endoderm marker GATA6, which has

6

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 7 of 47

7 previously been shown to be the master regulator of early extraembryonic development [37, 38]. Therefore it is not surprising that SOX17 has been found to be required for the formation of extraembryonic endoderm stem cell lines, called XEN cells [33, 39]. To that end, two Sox transcription factors, SOX2 and SOX17, have been shown to play different essential roles in the early development of the preimplantation embryo. However, the question remains as to whether other SOX transcription factors form a wide regulatory network along with SOX2 and SOX17 to mediate preimplantation embryonic development.

The role of SOX transcription factors in pluripotent stem cells Members of the SOX family exhibit a demarcated phenomenon in governing pluripotency, in which SOX2 is considered to be the most well studied factor. In PSCs, SOX2 appears closely linked to the other two core pluripotent transcription factors OCT4 and NANOG. The three proteins are known to be the key players in a network of intrinsic transcription factors that play a critical role in maintaining the self-renewal and pluripotent identities of PSCs [40] by regulating the expression of pluripotency genes and suppressing genes, which stimulate differentiation [5, 31, 41, 42]. Interestingly, it has been reported that although the NIH3T3 cells (differentiated dermal fibroblast cells) express low levels of OCT4, NANOG, and SOX2, these cells have stemness properties and are able to differentiate into other cell types [43]. This finding suggests that even in the presence of low levels of these crucial pluripotency regulators, the cells can sustain certain level of pluripotency. Previous data highlighted the role of SOX2 in this dynamic regulatory circuit. For instance, a number of research studies showed that SOX2 complexes with OCT4 in a cooperative interaction to maintain ESC pluripotency [44-49] while others showed that Nanog transcription factor is dependent on SOX2 and OCT4 to maintain the ESC

7

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 8 of 47

8 characteristics [41, 48]. The synergistic action between OCT4 and SOX2 extends to include the activation of several ESC-specific enhancers, that contain binding sites for both OCT4 and SOX2, such as FGF4 [44], UTF1 [45], FBX15 [46], NANOG [48], and LEFTY [47]. Remarkably, OCT4 and SOX2 bind to their own enhancers and thus activate an auto-regulatory loop to maintain ESC in a pluripotent state [49-51]. Taken together, these findings support the notion that SOX2, OCT4, and NANOG function predominantly to form a regulatory network that orchestrates pluripotency in PSCs. Within this network the three transcription factors (SOX2, OCT4, and NANOG) regulate the expression of themselves as well as other factors. For example, ectopic expression of SOX2 in embryonal carcinoma cells (ECCs) and ESCs leads to suppression of its own gene as well as Sox2/Oct4 target genes involving Oct4, Nanog and Fgf4 genes [52]. This kind of regulation appears to be dose dependent. Notably, the dose of Sox2 in ESCs appears to be crucial to maintain the undifferentiated state of these cells, as an increase or a decrease in SOX2 protein levels leads to ESCs differentiation. Elevated SOX2 shows different regulatory effects between mouse ESCs and human ESCs. In mouse ESCs, slight elevation in the levels of SOX2 protein (less than two fold increase) reduces the expression of Sox2/Oct4 target genes (Nanog and lefty) and cues ESC differentiation into neuroectoderm, mesoderm, and TE lineages but not the endoderm one [53]. By contrast, the overexpression of SOX2 in stable human ESC lines downregulates OCT4 and NANOG expressions and induces ESC differentiation into TE lineage by regulating the expression of the TE marker CDX2 [54]. Indeed, knockdown experiments in human or mouse ESCs supports the important role of SOX2 expression in this regulatory system. Deletion of Sox2 in mouse ESCs [42, 50] or its knockdown in human ESCs results in suppression of ESC pluripotency and induction of the differentiation into TE [55]. Another study

8

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 9 of 47

9 reported that knockdown of SOX2 in human ESCs induces their differentiation into trophectodermal and endodermal lineages [54]. It is worth to note that the discrepancy between these two observations has been explained to be as a result of using two different culturing systems; one with feeder layers [55] and the other with feeder-free culture system [54]. Like SOX2, any variations in OCT4 levels lead to changes in the stem cell fate decision. High expression levels of Oct4 induce the differentiation of the ExEn and mesoderm, while lower levels stimulate TE differentiation [56]. Interestingly, the deletion of Oct4 in mouse ESCs abrogates the expression of SOX2 that switch the cell from pluripotent state to a differentiated state [57]. Altogether, these data suggest that the undifferentiated PSCs should maintain appropriate levels of SOX2 protein to sustain the unique ability to self-renew. Additionally two other SOX TFs, SOX3 and SOX15, have also been reported to be expressed in undifferentiated human ESCs [58] and mouse ESCs [59], respectively. Although both proteins share a high homology with the Sox2 HMG DNA binding domain, they behave in an alternate manner in maintaining pluripotency. A recent report showed that the knockdown of SOX3, which is expressed in low levels in hESCs, has no effect on the pluripotency of these cells [58]. Interestingly, the knockdown of SOX2 in hESCs upregulates the expression of SOX3, and maintain the undifferentiated state, suggesting that SOX3 may compensate the absence of SOX2 to maintain the ESCs phenotype [58]. Moreover, concurrent suppression of SOX2 and SOX3 leads to loss of pluripotency and induction of differentiation [58]. These observations suggest a counter relationship between the levels of SOX2 and SOX3 in which human ESCs that express high levels of SOX2 and low levels of SOX3 have the ability to keep the cells under normal status of self-renewal and differentiation, whereas in the presence of low levels SOX2, and abundant levels of SOX3, hESCs maintain the pluripotency, but not self-renewal (Fig. 3). On

9

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 10 of 47

10 the other hand, mouse ESCs lacking SOX15 maintain normal self-renewal abilities and show no defects during differentiation process [59].

The role of SOX transcription factors in somatic cell reprogramming The discovery of iPSCs technology significantly increased the importance of SOX TFs in pluripotency. This technology is founded on the notion of overexpressing SOX2 along with other three transcription factors (OCT4, KLF4, and C-MYC) to enable the reprogramming of differentiated (somatic) cells to pluripotent cells (iPSCs) that closely resemble ESCs [3, 4]. Although, the generation of these PSCs is based on the presence of the cocktail factors mentioned above, SOX2 has been found to be necessary for the success of this process [3, 4]. The requirement of this activity of SOX2 is even stronger, because although C-MYC and KLF4 can be replaced or removed without loss of the reprogramming efficiency [60-63], the SOX2/OCT4 appears essential for the reprogramming process [63-65]. In addition, it has been previously shown that human iPSCs have been generated from human cord blood using OCT4 and SOX2 alone [66, 67]. Notwithstanding the above findings, it has been found that in the cells, which express appropriate levels of SOX2, such as neural progenitor cells (NPCs), SOX2 overexpression is not required for their reprogramming into iPSCs [68]. Thus, overexpression of SOX2 during the reprogramming of NPCs reduces the reprogramming efficiency [68], which confirms the fact that too high levels of SOX2 can induce the differentiation of pluripotent stem cells [54]. The mechanisms underlying the role of SOX2 during the cellular reprogramming process remain largely elusive. However, some reports shed the light on possible mechanisms. A previous study showed that SOX2/OCT4 increase the efficiency of iPSCs generation by a direct

10

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 11 of 47

11 activation of the miR-200 family [53, 69]. Another recent study reported that during the early stage of reprogramming process, transient SOX2-mediated inhibition of mammalian target of rapamycin (mTOR) enhances the reprogramming efficiency through a temporary increase in the autophagy process, which is required for iPSC generation [70]. In this study, SOX2 has been found to inhibit mTOR by recruiting the NuRD complex to the mTOR promoter, resulting in transcriptional suppression of mTOR [70]. Several other members of the SOX family have been tested for their ability to generate iPSCs [62]. Remarkably, it has been shown that SOX1 and SOX3, which are closely related factors to SOX2, can replace SOX2 during the reprogramming process [62]. Similarly, the replacement of SOX2 by SOX15 or SOX18, during reprogramming could generate iPSCs, but not as efficiently as members of SOXB1 subfamily. In contrast, neither SOX7 nor SOX17 permit reprogramming when they substituted SOX2 within the reprogramming factors mixture [62]. Interestingly, it has been found that a single amino acid replacement in the DNA binding/OCT interaction domains in the Sox17 gene can change its function, and the reengineered SOX17 (SOX17EK) is able to replace SOX2 during reprogramming process [71]. Furthermore, when the conserved glutamate in either SOX7 or SOX17 is mutated to lysine in the corresponding position in SOX2 HMG domain, both proteins are converted into efficient reprogramming factors and produced 10-50 times more colonies containing fully reprogrammed pluripotent cells than those observed with SOX2 [64, 72, 73]. On the other hand, no pluripotent colonies have been detected when similar mutations are introduced to other family members; SOX4, SOX5, SOX6, SOX8, SOX9, SOX11, SOX13, and SOX18. However, attaching the C-terminus of SOX17 to the pointmutated HMG domains of SOX18 and SOX4 (SOX18EKC17 and SOX4-EKC17) enable them to generate iPSCs, but with very low efficiency [73]. Collectively these data illustrate the core

11

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 12 of 47

12 role of SOX1B family members in somatic cell reprogramming and their high potential in efficient iPSCs generation. This, however, does not exclude the possibility that other SOX factors may play an important role in somatic cell reprogramming. Yet, such a possible role remains unclear although the recent comparison of the HMG domain sequence from Sox family members that demonstrated a phylogenetic evidence for the close relationship between SOX14, SOX15, and SOX21, to SOX1B family [74]. Thus other SOX factors require further investigation.

The role of SOX factors in the differentiation of pluripotent stem cells Some SOX factors, such as SOX2, SOX7, and SOX17, play an important regulatory role during the differentiation of PSCs. SOX7 and its closely related group member, SOX17, are involved in determining the endodermal fate [35, 75, 76] and are known to be upregulated during the differentiation process [36, 76]. In mouse ECs, SOX17 expression is detected in both ExEn and definitive endoderm (DE), whereas SOX7 is not expressed except in the ExEn [35]. Consistent with this observation, SOX7 has been found to be required for stimulating the master regulators of ExEn differentiation, GATA4 and GATA6, and in turn ExEn differentiation [75], whereas SOX17 has been shown to function in the differentiation of both DE [35] and ExEn [33, 36]. Beside these studies that have been done in mouse models, the important role of SOX7 and SOX17 in endoderm differentiation has been also highlighted in the developmental studies of Xenopus Sox7 (Xsox7) [77] and Sox17 (Xsox17) [78-80] as well as those done on Sox17 zebrafish (Zsox17) [81]. In human ESCs, overexpression of SOX17 promotes differentiation to the DE phenotype, while SOX7 overexpression induces cell differentiation to the ExEn [76]. The ExEn derived as a

12

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 13 of 47

13 result of SOX7 overexpression shows a gene profile similar to that of mouse XEN cells [39, 76]. Interestingly, during pancreatic beta cell differentiation, continued SOX17 expression stimulates the expression of PDX1 [76], which is a pancreatic progenitor marker [82]. Although the overexpression of SOX7 and SOX17 in human ESCs does not influence the expression of OCT4 and NANOG, during differentiation these genes are rapidly downregulated [76]. Conversely, the overexpression of SOX17, in mouse ESCs, inhibits pluripotency genes and induces differentiation of ESCs into extraembryonic cells. Furthermore, mouse ESCs lacking Sox17 maintain the expression of the pluripotency factors, such as OCT4, NANOG and SOX2, and interfere with the capability of these cells to differentiate into extraembryonic lineages [33]. Altogether, these findings suggest different roles of SOX17 between mouse and human ESCs, which may reflect intrinsic differences between both ESC types. This difference comes along with a previous study, which showed that mouse and human ESCs differ in the conditions required for the induction of DE SOX17-positive cells [83]. In contrast to SOX7 and SOX17, SOX2 is downregulated during normal differentiation process [42] where it works jointly with OCT4 to suppress the differentiation of either neuroectodermal or mesendodermal layers [84]. Any alterations in levels of either protein will lead to cell differentiation. It has been previously shown that the elevation of SOX2 protein levels stimulates the differentiation of ESCs into neuroectoderm, mesoderm, and TE [53], while high levels of OCT4 direct the differentiation into mesoderm and endoderm [56]. Of note, suppression of either OCT4 or SOX2 triggers the differentiation of ESCs into TE [42, 50, 56]. Thus, the upregulation of OCT4 and SOX2 affects the fate of ESCs differently. Taken together, these findings suggest that SOX2 and OCT4 protein levels should be tightly controlled since

13

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 14 of 47

14 aberrantly high protein levels of SOX2 or OCT4 switch these proteins from an activator to a repressor of ESC self-renewal. It has been shown that SOX2 activates the differentiation to neuroectoderm by inhibiting regulators that are found to be essential for the mesendodermal differentiation [58, 84]. For example, constitutive expression of SOX2 specifically inhibits brachyury, one of the main mesendodermal regulators, and drives the cells toward the neuroectoderm fate [84]. Interestingly, such inhibition is independent of OCT4, which is found to be necessary for the differentiation of cells into the mesendodermal lineage, by suppressing one of the SOXB1 family members, SOX1, that is also known to be a marker for neuroectoderm differentiation [84]. On the other hand, in the absence of OCT4 under undifferentiated culture conditions, SOX2 overexpression triggers SOX1, indicating the importance of SOX2 for NE differentiation. Perhaps the strongest support for the role of SOX2 in neuroectoderm differentiation comes from the finding of Takemoto and his colleagues [85], who demonstrated that in the absence of TBX6, a promoting factor for mesoderm differentiation, SOX2 started to be expressed in the paraxial mesoderm of the mutant mouse embryos and converts the differentiation of this compartment into the neuroectodermal lineage. Moreover, when TBX6 suppresses SOX2 enhancer, N1, SOX2 is repressed and the differentiation is committed towards the mesodermal lineage. Congruent with the above findings, overexpression of SOX2 is sufficient to suppress the differentiation of mesendoderm even under mesendoderm-enhancing culture conditions [58]. In mouse ESCs, it has been found that SOX3 and SOX11 have the opposite effect on the genes that expressed in neural precursor cell (NPC) and neurons [86]. Overexpression of SOX3 activates the genes expressed in the NPCs, while the overexpression of SOX11 activates genes expressed in the neurons. Also, in ESC-derived NPCs, Sox3 binds genes, which are subsequently activated upon neuronal differentiation through their

14

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 15 of 47

15 binding to Sox11 [86]. Furthermore, SOX21, another SOX factor, has been found to induce ESC differentiation if it is overexpressed in ESCs [87]. In this respect, it is logical to assume that SOX factors are differentially regulated to serve a role in determination and differentiation of a specific stem cell fate.

Relationship between SOX transcription factors and other factors in pluripotent stem cells SOX proteins perform their gene regulatory functions by interacting with partner trans-acting factors to form a regulatory network of transcription factors, chromatin remodeling factors, and co-activators that tightly controls pluripotency and differentiation of PSCs [88, 89]. This complex formation occurs when the functional binding site of SOX protein is associated with another partner protein binding site, which is essential for SOX-dependent transcriptional regulation [10]. Of this network, SOX2 was shown to interact with about 70 different proteins such as OCT4, NANOG, ESRRB, KLF4, SALL1, and SALL4 (Fig. 4A,B) [90-94]. It has been previously shown that SOX2, OCT4, and NANOG form autoregulatory and feedback loops, which control their own expression and thus control the expression of other regulatory factors to keep the balanced status between self-renewal and differentiation (Fig. 4A,B) [5, 7]. More importantly, this autoregulatory circuit has been found to be conserved in mouse as well as human ESCs and is controlled by OCT4/SOX2 interaction [5]. Although SOX2 is the most well known member of SOX family to complex with OCT4, other members of this family have also been reported to partner with OCT4 to determine stem cell fate. In ESCs, genome-wide TF binding experiments showed that there is extensive overlap in the target genes of Sox2, Oct4, and Nanog [5, 95]. Such 15

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 16 of 47

16 strong genomic overlap was studied using a variety of methods but the most conclusive have been the recent Chromatin Immunoprecipitation sequencing (ChIP-seq) experiments [95, 96]. ChIP-Chip analysis of hESCs showed that ~350 genes (out of ~18,000 genes interrogated) bind the three TFs OCT4, NANOG, and SOX2 [5]. However, ChIP-seq analysis of mESCs showed that 600 genes bind Oct4, Nanog, and Sox2 [95]. In addition, other complimentary experiments using genetic knockdowns as well as purified proteins have been used to show both cis and trans interactions involved in this cooperative mechanism. OCT4 (containing the POU5F1 domain) is known to bind the classical octamer motif (ATTTGCAT) and binds adjacent to SOX2 (HMG domain) motif (AACAAAG) predominantly during pluripotent state. Remarkably, the gap between the 2 binding motifs has been shown to be critical for accommodating different Sox family members to interact with OCT4 [97]. Indeed, individual motifs for binding OCT4 and SOX family members are essentially the same, however, it is the spacing between the motifs, which leads to co-assembly of different SOX family members with OCT4 and thereby lead to different types of cell fate outcomes [67]. In addition to these findings, a single-cell gene expression profiling of mouse blastocyst revealed a marked coexpression of SOX2, SOX17, and OCT4. More importantly, such defined expression has been observed in the ICM at an early developmental stage before lineage segregation [58, 98], suggesting an intriguing possibility that SOX2 can compete with SOX17 for OCT4 binding and thus affect stem cell fate. In agreement with this possible scenario, stable expression of SOX17 in ESC lines switches the binding preference of OCT4 from SOX2 to SOX17 [72]. Such an interesting swap in binding activity is recognized by a single base pair difference that distinguishes OCT4/SOX2 canonical motif binding from the opposing OCT4/SOX17 compressed motif binding (Fig. 5). The cooperative binding of OCT4 and SOX2 is dependent on the presence of a one base pair space between the

16

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 17 of 47

17 two binding motifs, whereas this interspanning base pair is absent in OCT4 and SOX17 binding mode. This interchangeable interaction mediates a developmental switch, where the cell divert from pluripotent state, in case of OCT4-SOX2 interaction, to the endodermal differentiation fate when OCT4 and SOX17 complex together [64, 71, 72]. The different SOX family members binding with diverse interfaces likely lead to recruitment of additional cell fate determining factors and thereby lead to specific cell developmental pathways. The precarious cell fate balance not only hinges on a single base pair difference in the cis regulatory element but also is affected by the presence of a single key SOX residue at the protein-protein interaction interface, which has been found to be crucial for partnering POU5F1 domain on the proper DNA element [11, 71, 99]. Importantly, the type of this single residue differs between SOX2 and SOX17 as shown by amino acid sequence alignment analysis. Sox2 harbors a basic lysine residue, which is highly conserved among other members of SOXB1 subfamily, whereas a conserved glutamate residue within subfamily SOXF is the candidate residue of SOX17 [71]. Recent biochemical studies using purified components as well as genomic and cell biology approaches have shown that mutating this particular residue can in fact impart the properties of the SOX containing this residue [72, 73]. Point mutation of the basic lysine amino acid, in SOX2/OCT4 interface, to acidic glutamate, significantly weakened Sox2 interaction with Oct4 DNA in mobility shift assays upon the canonical element and abolished its partnering ability as confirmed by Chip-seq analysis. In contrast, when the conserved glutamate in SOX17 was mutated to a lysine residue the intensity of SOX17/OCT4 complex noticeably increased upon the canonical cis element. Given these results, it is perhaps not surprising that SOX17 was converted into a potent induced pluripotent stem cell TF and SOX2 conversely was transformed into an endodermal TF. Likewise, similar mutations within SOX7 and SOX18, other

17

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 18 of 47

18 SOXF subfamily members, lead to loss of both proteins differentiation ability to endoderm and create new versions of SOX proteins that maintain the pluripotent state of cells. Consistently with the above findings, the binding of β-catenin to SOX17 C-terminal activation domain, which is known to be critical for endoderm formation, has also been shown to increase the efficiency of producing iPSCs when using mutated SOX17 [72]. Thus the interplay between the HMG domain as well as the C-terminal activation domain is critical for both partnering of proper POU5F1 domain and the subsequent level of activation. Other factors, which are important for the PSC maintainenace, are additionally regulated by SOX proteins. For example, in ESCs, targets of SOX2 are co-occupied by TFs Smad1 and Smad3 (Smad2/3), the downstream effectors of Transforming growth factor (TGF-β), which is crucial for ESC self-renewal [95, 100]. Also, in hESCs, it has been found that SOX2 is essential for transcription of tropomyosin-related kinase C (TRKC) and that the deficiency of SOX2 in undifferentiated hESCs downregulates TRKC expression [101]. These findings suggest that SOX2 regulates the maintenance of PSC state by involving several mechanisms.

Relationship between SOX factors and miRNAs in pluripotent stem cells In addition to its protein interaction network, mentioned above, SOX2 along with other pluripotent stem cell factors establish another circuit of associations with microRNAs (miRNAs). These miRNA-pluripotent stem cell factors interactions activate the expression of specific miRNAs families or clusters to help to maintain stem cell pluripotency. It has been well documented that the three top pluripotency regulatory proteins (SOX2, OCT4, and NANOG) bind to the promoter regions of pluripotent stem cell-specific miRNA clusters including the mouse miR-290-295, the human miR-371-373, and the conserved miR-302-367 and miR-106a18

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 19 of 47

19 363 clusters [102-104]. In hESCs, OCT4 and SOX2 have been found to bind to the conserved promoter region of the miR-302 in a ChIP assay. Interestingly, SOX2 binds to miR-302 in a region juxtaposed to the OCT4 binding site, in a way similar to their binding to target genes, confirming the synergistic regulatory mechanism of the two proteins. Indeed, both proteins have been found to be necessary for the expression and transcriptional activation of miR-302 [103]. This is likely to reflect the involvement of SOX2 and OCT4 in cell cycle mechanism to maintain pluripotency, because miR-302 was found to suppress the expression of a key cell cycle regulatory factor, cyclin D1, in hESCs [8, 103]. Another microRNA cluster to consider for a committed binding to SOX2 and OCT4 and could play a crucial role in pluripotency is miR-200 family. Members of this family are dramatically upregulated in undifferentiated ESCs and neuronal progenitor cells as compared to differentiated stages [105, 106]. During the undifferentiated state, high levels of mouse SOX2 directly activate the promoter of miR-200c/141 gene cluster to express miR-200c, which in-turn inhibits mouse SOX2 expression by a direct binding to the Sox2 3’URT. These series of interactions create a negative feedback loop that regulates SOX2 levels and might eventually lead to Sox2 silencing, and may be other pluripotent stem cell factors, and thus promote cells towards differentiation [106]. Furthermore the ability of SOX2 and OCT4 to bind to the promoter regions of specific miR-200 family clusters and directly activate miR-200s expression has been shown in a recent study using a dual-luciferase reporter assay. The transcriptional activation of miR-200 by SOX2 or OCT4 is accompanied by a promotion in iPSC generation, indicating the important role of Sox2/Oct4-miR-200 cluster interactions in plruripotent stem cells [69]. In line with this, the knockdown of miR-200c suppresses the expression of SOX2 as well as

19

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 20 of 47

20 other pluripotent stem cell factors in hESCs [107]. The crucial role of SOX2-miR200 clusters interaction in ESCs pluripotency requires further investigation. Beside their role in sustaining pluripotency, the microRNA-pluripotent stem cell factors interactions could also drive the cell to differentiate into different fates by targeting the mRNA of SOX2 and other pluripotent stem cell factors and negatively regulate their expression [104, 108]. For example miR-134, miR-296, and miR-470 binds to different coding sequences in Nanog, Oct4, and Sox2 and inhibit their expression [108]. Moreover, miR-145 targets the 3’UTRs of Sox2, Oct4, and Klf4 mRNAs and thus represses the ESC pluripotent state [109]. All the above mentioned examples show that through diverse mechanisms, the regulatory network of miRNA-pluripotency factors interactions are likely to be critical for the fidelity of pluripotent stem cell status.

Conclusions and perspectives The importance of SOX TFs in stem cell biology has recently received great interest and a substantial effort has been dedicated to understand the role of these factors in regulating the cell fate decision in PSCs. In particular, we have focused in this review on the existence and functions of Sox TFs in preimplantation embryos and PSCs, their important role as reprogramming factors to induce pluripotency, and their interaction with other trans-acting factors to perform specific functions in PSCs. It has become apparent that the expression patterns and the dose of Sox factors in PSCs play an important role in determining the destiny of these cells. Some SOX factors, such as SOX2, SOX3, and SOX15 are expressed in undifferentiated PSCs, where they play a role in maintaining their self-renewal and pluripotency (Table 2). Other SOX factors such as SOX7, SOX17, and SOX21 are upregulated upon differentiation of PSCs,

20

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 21 of 47

21 where they are responsible to drive the differentiation into specific lineages. Maintaining appropriate levels of Sox proteins in PSCs are crucial to sustain their abilities in either selfrenewal or differentiation. In this regard, it is interesting to address the mechanisms underlying the different roles of Sox TFs within the same pluripotent cell and determine whether other Sox TFs perform similar functions. Also, further studies are needed to clarify the mechanism by which SOX TFs define specificity for their target genes, particularly the identification of binding partners. Understanding these mechanisms will provide insights into how to control PSCs in vitro as well as their differentiation into certain lineages, which will eventually help in clinical applications. In PSCs, SOX TFs are part of an integrated network that is involved in controlling stem cell fate by activating genes that maintain self-renewal and pluripotency or repressing other genes that trigger differentiation (Table 2). It has been deeply discussed in this review how these SOX proteins gene regulatory functions are based on this circuit of interactions that include SOX factors association with partner transcription factors such as OCT4 and NANOG as well as their specific binding to the pluripotent stem cell-specific miRNA clusters. The details of this binding mechanism(s) and how SOX TFs define the specificity for their target genes, particularly the identification of binding partners, is a great question to be resolved. From the time iPSCs technology was first used, the role of SOX TFs in the generation of iPSCs has been highlighted. Importantly, the recent discovery of replacing the SOX2 with the reengineered Sox factors (SOX7EK and SOX17EK) has substantially improved the efficiency of the reprogramming process [71, 72] (Table 2) and open up new possibilities for the involvement of other SOX TFs in somatic cell reprogramming. This might help to reduce the number of the TFs required for the iPSC generation and exclude the oncogenic factors, such as C-MYC, from

21

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 22 of 47

22 the reprogramming cocktail. Therefore, other studies on iPSC generation are required to examine the combination of different TFs and their effect in the reprogramming process. Doing this kind of analysis will broaden our understanding of the reprogramming process and the transcriptional regulatory networks in PSCs.

Acknowledgment We would like to thank Dr. Balasubramanian Moovarkumudalvan from Qatar Biomedical Research Institute, Qatar Foundation, for helping us in preparing Figure 1 in this review.

Author Disclosure Statement No competing financial interests exist.

References 1.

Evans M. (2011). Discovering pluripotency: 30 years of mouse embryonic stem cells. Nat Rev

Mol Cell Biol 12:680-6. 2.

Thomson JA, J Itskovitz-Eldor, SS Shapiro, MA Waknitz, JJ Swiergiel, VS Marshall and JM

Jones. (1998). Embryonic stem cell lines derived from human blastocysts. Science 282:1145-7. 3.

Takahashi K and S Yamanaka. (2006). Induction of pluripotent stem cells from mouse embryonic

and adult fibroblast cultures by defined factors. Cell 126:663-76. 4.

Takahashi K, K Tanabe, M Ohnuki, M Narita, T Ichisaka, K Tomoda and S Yamanaka. (2007).

Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 131:861-72.

22

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 23 of 47

23 5.

Boyer LA, TI Lee, MF Cole, SE Johnstone, SS Levine, JP Zucker, MG Guenther, RM Kumar,

HL Murray, RG Jenner, DK Gifford, DA Melton, R Jaenisch and RA Young. (2005). Core transcriptional regulatory circuitry in human embryonic stem cells. Cell 122:947-56. 6.

Loh YH, Q Wu, JL Chew, VB Vega, W Zhang, X Chen, G Bourque, J George, B Leong, J Liu,

KY Wong, KW Sung, CW Lee, XD Zhao, KP Chiu, L Lipovich, VA Kuznetsov, P Robson, LW Stanton, CL Wei, Y Ruan, B Lim and HH Ng. (2006). The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells. Nat Genet 38:431-40. 7.

Niwa H. (2007). How is pluripotency determined and maintained? Development 134:635-46.

8.

Abdelalim EM. (2013). Molecular mechanisms controlling the cell cycle in embryonic stem cells.

Stem Cell Rev 9:764-73. 9.

Sarkar A and K Hochedlinger. (2013). The sox family of transcription factors: versatile regulators

of stem and progenitor cell fate. Cell Stem Cell 12:15-30. 10.

Kamachi Y and H Kondoh. (2013). Sox proteins: regulators of cell fate specification and

differentiation. Development 140:4129-44. 11.

Remenyi A, K Lins, LJ Nissen, R Reinbold, HR Scholer and M Wilmanns. (2003). Crystal

structure of a POU/HMG/DNA ternary complex suggests differential assembly of Oct4 and Sox2 on two enhancers. Genes Dev 17:2048-59. 12.

Jauch R, CK Ng, K Narasimhan and PR Kolatkar. (2012). The crystal structure of the Sox4 HMG

domain-DNA complex suggests a mechanism for positional interdependence in DNA recognition. Biochem J 443:39-47. 13.

Joint Center For Structural Genomics (Jcsg) PFSCB. (2012). Crystal structure of a hmg domain

of transcription factor sox-9 bound to DNA (Sox-9/dna) from homo sapiens at 2.77 Å resolution. In: Molecular Modeling Database (MMDB). NCBI. 14.

Palasingam P, R Jauch, CK Ng and PR Kolatkar. (2009). The structure of Sox17 bound to DNA

reveals a conserved bending topology but selective protein interaction platforms. J Mol Biol 388:619-30. 15.

Lovell-Badge R. (2010). The early history of the Sox genes. Int J Biochem Cell Biol 42:378-80. 23

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 24 of 47

24 16.

Kiefer JC. (2007). Back to basics: Sox genes. Dev Dyn 236:2356-66.

17.

Sekido R. (2010). SRY: A transcriptional activator of mammalian testis determination. Int J

Biochem Cell Biol 42:417-20. 18.

Kamachi Y, M Uchikawa, J Collignon, R Lovell-Badge and H Kondoh. (1998). Involvement of

Sox1, 2 and 3 in the early and subsequent molecular events of lens induction. Development 125:2521-32. 19.

Uchikawa M, Y Kamachi and H Kondoh. (1999). Two distinct subgroups of Group B Sox genes

for transcriptional activators and repressors: their expression during embryonic organogenesis of the chicken. Mech Dev 84:103-20. 20.

Dy P, A Penzo-Mendez, H Wang, CE Pedraza, WB Macklin and V Lefebvre. (2008). The three

SoxC proteins--Sox4, Sox11 and Sox12--exhibit overlapping expression patterns and molecular properties. Nucleic Acids Res 36:3101-17. 21.

Stolt CC and M Wegner. (2010). SoxE function in vertebrate nervous system development. Int J

Biochem Cell Biol 42:437-40. 22.

Francois M, P Koopman and M Beltrame. (2010). SoxF genes: Key players in the development of

the cardio-vascular system. Int J Biochem Cell Biol 42:445-8. 23.

Lefebvre V. (2010). The SoxD transcription factors--Sox5, Sox6, and Sox13--are key cell fate

modulators. Int J Biochem Cell Biol 42:429-32. 24.

Meeson AP, X Shi, MS Alexander, RS Williams, RE Allen, N Jiang, IM Adham, SC Goetsch,

RE Hammer and DJ Garry. (2007). Sox15 and Fhl3 transcriptionally coactivate Foxk1 and regulate myogenic progenitor cells. EMBO J 26:1902-12. 25.

Osaki E, Y Nishina, J Inazawa, NG Copeland, DJ Gilbert, NA Jenkins, M Ohsugi, T Tezuka, M

Yoshida and K Semba. (1999). Identification of a novel Sry-related gene and its germ cell-specific expression. Nucleic Acids Res 27:2503-10. 26.

Robinton DA and GQ Daley. (2012). The promise of induced pluripotent stem cells in research

and therapy. Nature 481:295-305.

24

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 25 of 47

25 27.

Xu Y, Y Shi and S Ding. (2008). A chemical approach to stem-cell biology and regenerative

medicine. Nature 453:338-44. 28.

Keramari M, J Razavi, KA Ingman, C Patsch, F Edenhofer, CM Ward and SJ Kimber. (2010).

Sox2 is essential for formation of trophectoderm in the preimplantation embryo. PLoS One 5:e13952. 29.

Goissis MD and JB Cibelli. (2014). Functional Characterization of SOX2 in Bovine

Preimplantation Embryos. Biol Reprod. 30.

Pan H and RM Schultz. (2011). Sox2 modulates reprogramming of gene expression in two-cell

mouse embryos. Biol Reprod 85:409-16. 31.

Avilion AA, SK Nicolis, LH Pevny, L Perez, N Vivian and R Lovell-Badge. (2003). Multipotent

cell lineages in early mouse development depend on SOX2 function. Genes Dev 17:126-40. 32.

Lorthongpanich C, SH Yang, K Piotrowska-Nitsche, R Parnpai and AW Chan. (2008).

Development of single mouse blastomeres into blastocysts, outgrowths and the establishment of embryonic stem cells. Reproduction 135:805-13. 33.

Niakan KK, H Ji, R Maehr, SA Vokes, KT Rodolfa, RI Sherwood, M Yamaki, JT Dimos, AE

Chen, DA Melton, AP McMahon and K Eggan. (2010). Sox17 promotes differentiation in mouse embryonic stem cells by directly regulating extraembryonic gene expression and indirectly antagonizing self-renewal. Genes Dev 24:312-26. 34.

Plusa B, A Piliszek, S Frankenberg, J Artus and AK Hadjantonakis. (2008). Distinct sequential

cell behaviours direct primitive endoderm formation in the mouse blastocyst. Development 135:3081-91. 35.

Kanai-Azuma M, Y Kanai, JM Gad, Y Tajima, C Taya, M Kurohmaru, Y Sanai, H Yonekawa, K

Yazaki, PP Tam and Y Hayashi. (2002). Depletion of definitive gut endoderm in Sox17-null mutant mice. Development 129:2367-79. 36.

Shimoda M, M Kanai-Azuma, K Hara, S Miyazaki, Y Kanai, M Monden and J Miyazaki. (2007).

Sox17 plays a substantial role in late-stage differentiation of the extraembryonic endoderm in vitro. J Cell Sci 120:3859-69.

25

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 26 of 47

26 37.

Koutsourakis M, A Langeveld, R Patient, R Beddington and F Grosveld. (1999). The

transcription factor GATA6 is essential for early extraembryonic development. Development 126:723-32. 38.

Cai KQ, CD Capo-Chichi, ME Rula, DH Yang and XX Xu. (2008). Dynamic GATA6 expression

in primitive endoderm formation and maturation in early mouse embryogenesis. Dev Dyn 237:2820-9. 39.

Kunath T, D Arnaud, GD Uy, I Okamoto, C Chureau, Y Yamanaka, E Heard, RL Gardner, P

Avner and J Rossant. (2005). Imprinted X-inactivation in extra-embryonic endoderm cell lines from mouse blastocysts. Development 132:1649-61. 40.

Loh KM and B Lim. (2011). A precarious balance: pluripotency factors as lineage specifiers. Cell

Stem Cell 8:363-9. 41.

Rodda DJ, JL Chew, LH Lim, YH Loh, B Wang, HH Ng and P Robson. (2005). Transcriptional

regulation of nanog by OCT4 and SOX2. J Biol Chem 280:24731-7. 42.

Masui S, Y Nakatake, Y Toyooka, D Shimosato, R Yagi, K Takahashi, H Okochi, A Okuda, R

Matoba, AA Sharov, MS Ko and H Niwa. (2007). Pluripotency governed by Sox2 via regulation of Oct3/4 expression in mouse embryonic stem cells. Nat Cell Biol 9:625-35. 43.

Dadheech N, A Srivastava, M Belani, S Gupta, R Pal, RR Bhonde, AS Srivastava and S Gupta.

(2013). Basal expression of pluripotency-associated genes can contribute to stemness property and differentiation potential. Stem Cells Dev 22:1802-17. 44.

Yuan H, N Corbi, C Basilico and L Dailey. (1995). Developmental-specific activity of the FGF-4

enhancer requires the synergistic action of Sox2 and Oct-3. Genes Dev 9:2635-45. 45.

Nishimoto M, A Fukushima, A Okuda and M Muramatsu. (1999). The gene for the embryonic

stem cell coactivator UTF1 carries a regulatory element which selectively interacts with a complex composed of Oct-3/4 and Sox-2. Mol Cell Biol 19:5453-65. 46.

Tokuzawa Y, E Kaiho, M Maruyama, K Takahashi, K Mitsui, M Maeda, H Niwa and S

Yamanaka. (2003). Fbx15 is a novel target of Oct3/4 but is dispensable for embryonic stem cell selfrenewal and mouse development. Mol Cell Biol 23:2699-708.

26

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 27 of 47

27 47.

Nakatake Y, N Fukui, Y Iwamatsu, S Masui, K Takahashi, R Yagi, K Yagi, J Miyazaki, R

Matoba, MS Ko and H Niwa. (2006). Klf4 cooperates with Oct3/4 and Sox2 to activate the Lefty1 core promoter in embryonic stem cells. Mol Cell Biol 26:7772-82. 48.

Kuroda T, M Tada, H Kubota, H Kimura, SY Hatano, H Suemori, N Nakatsuji and T Tada.

(2005). Octamer and Sox elements are required for transcriptional cis regulation of Nanog gene expression. Mol Cell Biol 25:2475-85. 49.

Tomioka M, M Nishimoto, S Miyagi, T Katayanagi, N Fukui, H Niwa, M Muramatsu and A

Okuda. (2002). Identification of Sox-2 regulatory region which is under the control of Oct-3/4-Sox-2 complex. Nucleic Acids Res 30:3202-13. 50.

Chew JL, YH Loh, W Zhang, X Chen, WL Tam, LS Yeap, P Li, YS Ang, B Lim, P Robson and

HH Ng. (2005). Reciprocal transcriptional regulation of Pou5f1 and Sox2 via the Oct4/Sox2 complex in embryonic stem cells. Mol Cell Biol 25:6031-46. 51.

Okumura-Nakanishi S, M Saito, H Niwa and F Ishikawa. (2005). Oct-3/4 and Sox2 regulate Oct-

3/4 gene in embryonic stem cells. J Biol Chem 280:5307-17. 52.

Boer B, J Kopp, S Mallanna, M Desler, H Chakravarthy, PJ Wilder, C Bernadt and A Rizzino.

(2007). Elevating the levels of Sox2 in embryonal carcinoma cells and embryonic stem cells inhibits the expression of Sox2:Oct-3/4 target genes. Nucleic Acids Res 35:1773-86. 53.

Kopp JL, BD Ormsbee, M Desler and A Rizzino. (2008). Small increases in the level of Sox2

trigger the differentiation of mouse embryonic stem cells. Stem Cells 26:903-11. 54.

Adachi K, H Suemori, SY Yasuda, N Nakatsuji and E Kawase. (2010). Role of SOX2 in

maintaining pluripotency of human embryonic stem cells. Genes Cells 15:455-70. 55.

Fong H, KA Hohenstein and PJ Donovan. (2008). Regulation of self-renewal and pluripotency by

Sox2 in human embryonic stem cells. Stem Cells 26:1931-8. 56.

Niwa H, J Miyazaki and AG Smith. (2000). Quantitative expression of Oct-3/4 defines

differentiation, dedifferentiation or self-renewal of ES cells. Nat Genet 24:372-6.

27

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 28 of 47

28 57.

Nichols J, B Zevnik, K Anastassiadis, H Niwa, D Klewe-Nebenius, I Chambers, H Scholer and A

Smith. (1998). Formation of pluripotent stem cells in the mammalian embryo depends on the POU transcription factor Oct4. Cell 95:379-91. 58.

Wang Z, E Oron, B Nelson, S Razis and N Ivanova. (2012). Distinct lineage specification roles

for NANOG, OCT4, and SOX2 in human embryonic stem cells. Cell Stem Cell 10:440-54. 59.

Maruyama M, T Ichisaka, M Nakagawa and S Yamanaka. (2005). Differential roles for Sox15

and Sox2 in transcriptional control in mouse embryonic stem cells. J Biol Chem 280:24371-9. 60.

Feng B, J Jiang, P Kraus, JH Ng, JC Heng, YS Chan, LP Yaw, W Zhang, YH Loh, J Han, VB

Vega, V Cacheux-Rataboul, B Lim, T Lufkin and HH Ng. (2009). Reprogramming of fibroblasts into induced pluripotent stem cells with orphan nuclear receptor Esrrb. Nat Cell Biol 11:197-203. 61.

Huangfu D, R Maehr, W Guo, A Eijkelenboom, M Snitow, AE Chen and DA Melton. (2008).

Induction of pluripotent stem cells by defined factors is greatly improved by small-molecule compounds. Nat Biotechnol 26:795-7. 62.

Nakagawa M, M Koyanagi, K Tanabe, K Takahashi, T Ichisaka, T Aoi, K Okita, Y Mochiduki, N

Takizawa and S Yamanaka. (2008). Generation of induced pluripotent stem cells without Myc from mouse and human fibroblasts. Nat Biotechnol 26:101-6. 63.

Wernig M, A Meissner, JP Cassady and R Jaenisch. (2008). c-Myc is dispensable for direct

reprogramming of mouse fibroblasts. Cell Stem Cell 2:10-2. 64.

Aksoy I and LW Stanton. (2013). Pluripotency-regulating networks provide basis for

reprogramming. Curr Mol Med 13:695-706. 65.

Huangfu D, K Osafune, R Maehr, W Guo, A Eijkelenboom, S Chen, W Muhlestein and DA

Melton. (2008). Induction of pluripotent stem cells from primary human fibroblasts with only Oct4 and Sox2. Nat Biotechnol 26:1269-75. 66.

Giorgetti A, N Montserrat, T Aasen, F Gonzalez, I Rodriguez-Piza, R Vassena, A Raya, S Boue,

MJ Barrero, BA Corbella, M Torrabadella, A Veiga and JC Izpisua Belmonte. (2009). Generation of induced pluripotent stem cells from human cord blood using OCT4 and SOX2. Cell Stem Cell 5:353-7. 28

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 29 of 47

29 67.

Meng X, A Neises, RJ Su, KJ Payne, L Ritter, DS Gridley, J Wang, M Sheng, KH Lau, DJ

Baylink and XB Zhang. (2012). Efficient reprogramming of human cord blood CD34+ cells into induced pluripotent stem cells with OCT4 and SOX2 alone. Mol Ther 20:408-16. 68.

Eminli S, J Utikal, K Arnold, R Jaenisch and K Hochedlinger. (2008). Reprogramming of neural

progenitor cells into induced pluripotent stem cells in the absence of exogenous Sox2 expression. Stem Cells 26:2467-74. 69.

Wang G, X Guo, W Hong, Q Liu, T Wei, C Lu, L Gao, D Ye, Y Zhou, J Chen, J Wang, M Wu, H

Liu and J Kang. (2013). Critical regulation of miR-200/ZEB2 pathway in Oct4/Sox2-induced mesenchymal-to-epithelial transition and induced pluripotent stem cell generation. Proc Natl Acad Sci U S A 110:2858-63. 70.

Wang S, P Xia, B Ye, G Huang, J Liu and Z Fan. (2013). Transient activation of autophagy via

Sox2-mediated suppression of mTOR is an important early step in reprogramming to pluripotency. Cell Stem Cell 13:617-25. 71.

Jauch R, I Aksoy, AP Hutchins, CK Ng, XF Tian, J Chen, P Palasingam, P Robson, LW Stanton

and PR Kolatkar. (2011). Conversion of Sox17 into a pluripotency reprogramming factor by reengineering its association with Oct4 on DNA. Stem Cells 29:940-51. 72.

Aksoy I, R Jauch, J Chen, M Dyla, U Divakar, GK Bogu, R Teo, CK Leng Ng, W Herath, S Lili,

AP Hutchins, P Robson, PR Kolatkar and LW Stanton. (2013). Oct4 switches partnering from Sox2 to Sox17 to reinterpret the enhancer code and specify endoderm. EMBO J 32:938-53. 73.

Aksoy I, R Jauch, V Eras, WB Chng, J Chen, U Divakar, CK Ng, PR Kolatkar and LW Stanton.

(2013). Sox transcription factors require selective interactions with Oct4 and specific transactivation functions to mediate reprogramming. Stem Cells 31:2632-46. 74.

Schmidt R and K Plath. (2012). The roles of the reprogramming factors Oct4, Sox2 and Klf4 in

resetting the somatic cell epigenome during induced pluripotent stem cell generation. Genome Biol 13:251.

29

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 30 of 47

30 75.

Futaki S, Y Hayashi, T Emoto, CN Weber and K Sekiguchi. (2004). Sox7 plays crucial roles in

parietal endoderm differentiation in F9 embryonal carcinoma cells through regulating Gata-4 and Gata-6 expression. Mol Cell Biol 24:10492-503. 76.

Seguin CA, JS Draper, A Nagy and J Rossant. (2008). Establishment of endoderm progenitors by

SOX transcription factor expression in human embryonic stem cells. Cell Stem Cell 3:182-95. 77.

Zhang C and MW Klymkowsky. (2007). The Sox axis, Nodal signaling, and germ layer

specification. Differentiation 75:536-45. 78.

Hudson C, D Clements, RV Friday, D Stott and HR Woodland. (1997). Xsox17alpha and -beta

mediate endoderm formation in Xenopus. Cell 91:397-405. 79.

Clements D and HR Woodland. (2000). Changes in embryonic cell fate produced by expression

of an endodermal transcription factor, Xsox17. Mech Dev 99:65-70. 80.

Sinner D, S Rankin, M Lee and AM Zorn. (2004). Sox17 and beta-catenin cooperate to regulate

the transcription of endodermal genes. Development 131:3069-80. 81.

Alexander J and DY Stainier. (1999). A molecular pathway leading to endoderm formation in

zebrafish. Curr Biol 9:1147-57. 82.

Abdelalim EM, A Bonnefond, A Bennaceur-Griscelli and P Froguel. (2014). Pluripotent stem

cells as a potential tool for disease modelling and cell therapy in diabetes. Stem Cell Rev 10:327-37. 83.

Mfopou JK, B Chen, L Sui, K Sermon and L Bouwens. (2010). Recent advances and prospects in

the differentiation of pancreatic cells from human embryonic stem cells. Diabetes 59:2094-101. 84.

Thomson M, SJ Liu, LN Zou, Z Smith, A Meissner and S Ramanathan. (2011). Pluripotency

factors in embryonic stem cells regulate differentiation into germ layers. Cell 145:875-89. 85.

Takemoto T, M Uchikawa, M Yoshida, DM Bell, R Lovell-Badge, VE Papaioannou and H

Kondoh. (2011). Tbx6-dependent Sox2 regulation determines neural or mesodermal fate in axial stem cells. Nature 470:394-8. 86.

Bergsland M, D Ramskold, C Zaouter, S Klum, R Sandberg and J Muhr. (2011). Sequentially

acting Sox transcription factors in neural lineage development. Genes Dev 25:2453-64. 30

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 31 of 47

31 87.

Mallanna SK, BD Ormsbee, M Iacovino, JM Gilmore, JL Cox, M Kyba, MP Washburn and A

Rizzino. (2010). Proteomic analysis of Sox2-associated proteins during early stages of mouse embryonic stem cell differentiation identifies Sox21 as a novel regulator of stem cell fate. Stem Cells 28:1715-27. 88.

Kamachi Y, M Uchikawa and H Kondoh. (2000). Pairing SOX off: with partners in the regulation

of embryonic development. Trends Genet 16:182-7. 89.

Kondoh H and Y Kamachi. (2010). SOX-partner code for cell specification: Regulatory target

selection and underlying molecular mechanisms. Int J Biochem Cell Biol 42:391-9. 90.

Gao Z, JL Cox, JM Gilmore, BD Ormsbee, SK Mallanna, MP Washburn and A Rizzino. (2012).

Determination of protein interactome of transcription factor Sox2 in embryonic stem cells engineered for inducible expression of four reprogramming factors. J Biol Chem 287:11384-97. 91.

Ho L, JL Ronan, J Wu, BT Staahl, L Chen, A Kuo, J Lessard, AI Nesvizhskii, J Ranish and GR

Crabtree. (2009). An embryonic stem cell chromatin remodeling complex, esBAF, is essential for embryonic stem cell self-renewal and pluripotency. Proc Natl Acad Sci U S A 106:5181-6. 92.

Wang J, S Rao, J Chu, X Shen, DN Levasseur, TW Theunissen and SH Orkin. (2006). A protein

interaction network for pluripotency of embryonic stem cells. Nature 444:364-8. 93.

Magdeldin S, T Yamamoto, I Tooyama and EM Abdelalim. (2014). New Proteomic Insights on

the Role of NPR-A in Regulating Self-Renewal of Embryonic Stem Cells. Stem Cell Rev. 94.

Abdelalim EM and I Tooyama. (2011). NPR-A regulates self-renewal and pluripotency of

embryonic stem cells. Cell Death Dis 2:e127. 95.

Chen X, H Xu, P Yuan, F Fang, M Huss, VB Vega, E Wong, YL Orlov, W Zhang, J Jiang, YH

Loh, HC Yeo, ZX Yeo, V Narang, KR Govindarajan, B Leong, A Shahab, Y Ruan, G Bourque, WK Sung, ND Clarke, CL Wei and HH Ng. (2008). Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell 133:1106-17. 96.

Whyte WA, DA Orlando, D Hnisz, BJ Abraham, CY Lin, MH Kagey, PB Rahl, TI Lee and RA

Young. (2013). Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell 153:307-19. 31

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 32 of 47

32 97.

Reményi A, K Lins, LJ Nissen, R Reinbold, HR Schöler and M Wilmanns. (2003). Crystal

structure of a POU/HMG/DNA ternary complex suggests differential assembly of Oct4 and Sox2 on two enhancers. Genes Dev 17:2048-59. 98.

Guo G, M Huss, GQ Tong, C Wang, L Li Sun, ND Clarke and P Robson. (2010). Resolution of

cell fate decisions revealed by single-cell gene expression analysis from zygote to blastocyst. Dev Cell 18:675-85. 99.

Jauch R and PR Kolatkar. (2013). What makes a pluripotency reprogramming factor? Curr Mol

Med 13:806-14. 100.

Mullen AC, DA Orlando, JJ Newman, J Loven, RM Kumar, S Bilodeau, J Reddy, MG Guenther,

RP DeKoter and RA Young. (2011). Master transcription factors determine cell-type-specific responses to TGF-beta signaling. Cell 147:565-76. 101.

Fong H, RC Wong and PJ Donovan. (2012). Transcriptional regulation of TRKC by SOX2 in

human embryonic stem cells. Stem Cell Res 8:206-14. 102.

Barroso-delJesus A, C Romero-Lopez, G Lucena-Aguilar, GJ Melen, L Sanchez, G Ligero, A

Berzal-Herranz and P Menendez. (2008). Embryonic stem cell-specific miR302-367 cluster: human gene structure and functional characterization of its core promoter. Mol Cell Biol 28:6609-19. 103.

Card DA, PB Hebbar, L Li, KW Trotter, Y Komatsu, Y Mishina and TK Archer. (2008).

Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embryonic stem cells. Mol Cell Biol 28:642638. 104.

Marson A, SS Levine, MF Cole, GM Frampton, T Brambrink, S Johnstone, MG Guenther, WK

Johnston, M Wernig, J Newman, JM Calabrese, LM Dennis, TL Volkert, S Gupta, J Love, N Hannett, PA Sharp, DP Bartel, R Jaenisch and RA Young. (2008). Connecting microRNA genes to the core transcriptional regulatory circuitry of embryonic stem cells. Cell 134:521-33. 105.

Gill JG, EM Langer, RC Lindsley, M Cai, TL Murphy, M Kyba and KM Murphy. (2011). Snail

and the microRNA-200 family act in opposition to regulate epithelial-to-mesenchymal transition and germ layer fate restriction in differentiating ESCs. Stem Cells 29:764-76. 32

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 33 of 47

33 106.

Peng C, N Li, YK Ng, J Zhang, F Meier, FJ Theis, M Merkenschlager, W Chen, W Wurst and N

Prakash. (2012). A unilateral negative feedback loop between miR-200 microRNAs and Sox2/E2F3 controls neural progenitor cell-cycle exit and differentiation. J Neurosci 32:13292-308. 107.

Huang HN, SY Chen, SM Hwang, CC Yu, MW Su, W Mai, HW Wang, WC Cheng, SC

Schuyler, N Ma, FL Lu and J Lu. (2014). miR-200c and GATA binding protein 4 regulate human embryonic stem cell renewal and differentiation. Stem Cell Res 12:338-53. 108.

Tay Y, J Zhang, AM Thomson, B Lim and I Rigoutsos. (2008). MicroRNAs to Nanog, Oct4 and

Sox2 coding regions modulate embryonic stem cell differentiation. Nature 455:1124-8. 109.

Xu N, T Papagiannakopoulos, G Pan, JA Thomson and KS Kosik. (2009). MicroRNA-145

regulates OCT4, SOX2, and KLF4 and represses pluripotency in human embryonic stem cells. Cell 137:647-58. 110.

Franceschini A, D Szklarczyk, S Frankild, M Kuhn, M Simonovic, A Roth, J Lin, P Minguez, P

Bork, C von Mering and LJ Jensen. (2013). STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res 41:D808-15.

33

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 34 of 47

34

Figure Legends

Figure 1: The superposition of DNA-bound SOX HMG domain structures including SOX2 (red) [11], SOX4 (blue) [12], SOX9 (yellow) [13], and SOX17 (green) [14].

Figure 2: Development of preimplantation embryos and generation of embryonic stem cells. During early development (prior to implantation), the zygote undergoes a number of mitotic divisions. In human, at the day 5-8 (3.5 days in mouse), the blastocyst stage embryo is formed, which consists of inner cell mass (ICM), surrounded by an outer layer of the trophectoderm (TE). In vivo, the ICM will form all the three germ layers, while the TE will form the placenta. Embryonic stem cells (ESCs) are derived from the ICM of the blastocyst stage embryos and under appropriate culture conditions in vitro, they can self-renew indefinitely while maintaining their pluripotency. ESCs depend on an intrinsic network of transcription factors, which are also expressed in the ICM, to maintain their pluripotent identity. ESCs are able to differentiate into the three germ layers (endoderm, mesoderm, and ectoderm).

Figure 3: The relationship between SOX2 and SOX3 in hESCs. Diagram represents the relationship that exists in undifferentiated hESCs between SOX2 and SOX3. Normally hESCs express high level of SOX2, and low level of SOX3. Suppressing SOX2 in hESCs induces activation of SOX3 expression leading to maintain the pluripotency, but not self-renewal. Inhibition of both factors (SOX2 and SOX3) in hESCs leads to suppression of pluripotency and self-renewal as well as induce hESC differentiation.

34

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 35 of 47

35 Figure 4: Outline of SOX2 regulatory network for mediating pluripotency. (A) A protein association network illustrates the most important interactions between SOX2 and other transcription factors that regulate pluripotency. Thicker lines represent stronger associations. This protein interaction map was done using STRING database version 9.1 based on medium confidence events [110]. (B) SOX2 partners with different transcription factors (TFs) to control the expression of different regulatory factors that maintain the balance between self-renewal and differentiation. This TF complex mediates stem cell fate through activation of genes that maintain self-renewal or suppressing them to upregulate specific genes that direct stem cell differentiation to specific cell lineages.

35

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 36 of 47

36

Figure 5: Differential assembly of OCT4 with SOX2 and SOX17. The preferential binding of SOX2 and SOX17 on canonical and compressed cis elements involved in pluripotency and endodermal programs, respectively. The compressed motif has a missing base at position 18, which is highlighted in the orange square.

Figure Legends

36

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof. Page 37 of 47

37

Figure 1: The superposition of DNA-bound SOX HMG domain structures including SOX2 (red)

[11], SOX4 (blue) [12], SOX9 (yellow) [13], and SOX17 (green) [14].

37

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 38 of 47

38

38

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 39 of 47

39

Figure 2: Development of preimplantation embryos and generation of embryonic stem cells. During early development (prior to implantation), the zygote undergoes a number of mitotic divisions. In human, at the day 5-8 (3.5 days in mouse), the blastocyst stage embryo is formed, which consists of inner cell mass (ICM), surrounded by an outer layer of the trophectoderm (TE). In vivo, the ICM will form all the three germ layers, while the TE will form the placenta. Embryonic stem cells (ESCs) are derived from the ICM of the blastocyst stage embryos and under appropriate culture conditions in vitro, they can self-renew indefinitely while maintaining their pluripotency. ESCs depend on an intrinsic network of transcription factors, which are also

39

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 40 of 47

40

expressed in the ICM, to maintain their pluripotent identity. ESCs are able to differentiate into

the three germ layers (endoderm, mesoderm, and ectoderm).

40

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 41 of 47

41

Figure 3: The relationship between SOX2 and SOX3 in hESCs. Diagram represents the relationship that exists in undifferentiated hESCs between SOX2 and SOX3. Normally hESCs express high level of SOX2, and low level of SOX3. Suppressing SOX2 in hESCs induces activation of SOX3 expression leading to maintain the pluripotency, but not self-renewal. Inhibition of both factors (SOX2 and SOX3) in hESCs leads to suppression of pluripotency and self-renewal as well as induce hESC differentiation.

41

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 42 of 47

42

Figure 4: Outline of SOX2 regulatory network for mediating pluripotency. (A) A protein association network illustrates the most important interactions between SOX2 and other transcription factors that regulate pluripotency. Thicker lines represent stronger associations. This protein interaction map was done using STRING database version 9.1 based on medium confidence events [110]. (B) SOX2 partners with different transcription factors (TFs) to control

42

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 43 of 47

43 the expression of different regulatory factors that maintain the balance between self-renewal and differentiation. This TF complex mediates stem cell fate through activation of genes that maintain self-renewal or suppressing them to upregulate specific genes that direct stem cell differentiation to specific cell lineages.

Figure 5: Differential assembly of OCT4 with SOX2 and SOX17. The preferential binding of SOX2 and SOX17 on canonical and compressed cis elements involved in pluripotency and endodermal programs, respectively. The compressed motif has a missing base at position 18, which is highlighted in the orange square.

43

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof.

Page 44 of 47

44

Table 1: Sox family proteins are divided into groups SoxA-SoxH based on the degree of amino acid identity within the HMG-box.

Group

Subgroup

Members

Reference

SRY

[17]

SOXB1

SOX1, SOX2, AND SOX3

[18]

SOXB2

SOX14 AND SOX21

[19]

SOXC

SOX4, SOX11 AND SOX12

[20]

SOXD

SOX5, SOX6 AND SOX13

[23]

SOXE

SOX8, SOX9 AND SOX10

[21]

SOXF

SOX7, SOX17 AND SOX18

[22]

SOXG

SOX15

[24]

SOXH

SOX30

[25]

SOXA SOXB

44

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) as been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ fro

Page 45 of 47

45 Table 2: The role of different SOX TFs in preimplanatation embryos, pluripotent stem cells, and iPSC generation SOX TFs SOX1

SOX2

Preimplantation embryos

PSC pluripotency

PSC self-renewal

 Is capable to replace SOX2 during the reprogramming process (highly efficient) [62]  Essential for reprogramming (high efficiency) [3, 4]

Not reported

Not reported

Not reported

 Expressed in preimplantation embryos and essential for embryo development [28, 29, 31]

 Essential for pluripotency in both hESCs and mESCs [42, 54, 55]

 Essential for self-renewal in both hESCs and mESCs [42, 54, 55]

Not reported

 Maintains pluripotency in the absence of SOX2 in hESCs [58]  No essential role in pluripotency of mESCs [59]

 Does NOT maintain selfrenewal in the absence of SOX2 in hESCs [58]  No essential role in selfrenewal of mESCs [59]

 Is capable to replace SOX2 during the reprogramming process (high efficiency) [62]

Not reported

Not reported

Not reported

Not reported

 No essential role in pluripotency of both hESCs and mESCs [58, 59]

 No essential role in selfrenewal of both hESCs and mESCs [58, 59]

 Not effective during the reprogramming process [62]  Reengineered SOX7 (SOX7EK) is capable to replace SOX2 during the reprogramming process (very high efficiency) [73]  Is capable to replace SOX2 during the reprogramming process (Moderate efficiency) [62]

 Expressed in preimplantation embryos and regulates differentiation during development [33]

 No essential role in pluripotency

 No essential role in selfrenewal

Not reported

Not reported

Not reported

SOX3

SOX7

SOX15

SOX17

iPSC generation

SOX18 45

 Not effective during the reprogramming process [62]  Reengineered SOX17 (SOX17EK) is capable to replace SOX2 during the reprogramming process (very high efficiency) [71, 73]  Is capable to replace SOX2 during the reprogramming process (Low efficiency) [62]  Reengineered SOX18 (SOX18EKC17) is

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) as been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ fro

Page 46 of 47

46

capable to replace SOX2 during the reprogramming process (Moderate efficiency) [73]

46

Stem Cells and Development The SOX transcription factors as key players in pluripotent stem cells (doi: 10.1089/scd.2014.0297) This article has been peer-reviewed and accepted for publication, but has yet to undergo copyediting and proof correction. The final published version may differ from this proof. Page 47 of 47

47

47

The SOX transcription factors as key players in pluripotent stem cells.

Pluripotent stem cells (PSCs), including embryonic stem cells (ESCs), and induced PSCs (iPSCs) are able to self-renew and differentiate into a multitu...
939KB Sizes 3 Downloads 4 Views