Identification and analysis of CYP450 genes from transcriptome of Lonicera japonica and expression analysis of chlorogenic acid biosynthesis related CYP450s Xiwu Qi1 ,2 , Xu Yu1 ,2 , Daohua Xu1 , Hailing Fang1 ,2 , Ke Dong3 , Weilin Li1 ,2 and Chengyuan Liang1 ,2 1

Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China The Jiangsu Provincial Platform for Conservation and Utilization of Agricultural Germplasm, Nanjing, China 3 Department of Biological Sciences, College of Natural Sciences, Seoul National University, Seoul, South Korea 2

ABSTRACT

Submitted 21 April 2017 Accepted 17 August 2017 Published 12 September 2017 Corresponding authors Weilin Li, [email protected] Chengyuan Liang, [email protected] Academic editor David Hyten Additional Information and Declarations can be found on page 16 DOI 10.7717/peerj.3781 Copyright 2017 Qi et al. Distributed under Creative Commons CC-BY 4.0

Background. Lonicera japonica is an important medicinal plant that has been widely used in traditional Chinese medicine for thousands of years. The pharmacological activities of L. japonica are mainly due to its rich natural active ingredients, most of which are secondary metabolites. CYP450s are a large, complex, and widespread superfamily of proteins that participate in many endogenous and exogenous metabolic reactions, especially secondary metabolism. Here, we identified CYP450s in L. japonica transcriptome and analyzed CYP450s that may be involved in chlorogenic acid (CGA) biosynthesis. Methods. The recent availability of L. japonica transcriptome provided opportunity to identify CYP450s in this herb. BLAST based method and HMM based method were used to identify CYP450s in L. japonica transcriptome. Then, phylogenetic analysis, conserved motifs analysis, GO annotation, and KEGG annotation analyses were conducted to characterize the identified CYP450s. qRT-PCR was used to explore expression patterns of five CGA biosynthesis related CYP450s. Results. In this study, 151 putative CYP450s with complete cytochrome P450 domain, which belonged to 10 clans, 45 families and 76 subfamilies, were identified in L. japonica transcriptome. Phylogenetic analysis classified these CYP450s into two major branches, A-type (47%) and non-A type (53%). Both types of CYP450s had conserved motifs in L. japonica. The differences of typical motif sequences between A-type and non-A type CYP450s in L. japonica were similar with other plants. GO classification indicated that non-A type CYP450s participated in more molecular functions and biological processes than A-type. KEGG pathway annotation totally assigned 47 CYP450s to 25 KEGG pathways. From these data, we cloned two LjC3Hs (CYP98A subfamily) and three LjC4Hs (CYP73A subfamily) that may be involved in biosynthesis of CGA, the major ingredient for pharmacological activities of L. japonica. qRT-PCR results indicated that two LjC3Hs exhibited oppositing expression patterns during the flower development and LjC3H2 exhibited a similar expression pattern with CGA concentration measured by HPLC. The expression patterns of three LjC4Hs were quite different and the expression pattern of LjC4H3 was quite similar with that of LjC3H1.

OPEN ACCESS

How to cite this article Qi et al. (2017), Identification and analysis of CYP450 genes from transcriptome of Lonicera japonica and expression analysis of chlorogenic acid biosynthesis related CYP450s. PeerJ 5:e3781; DOI 10.7717/peerj.3781

Discussion. Our results provide a comprehensive identification and characterization of CYP450s in L. japonica. Five CGA biosynthesis related CYP450s were cloned and their expression patterns were explored. The different expression patterns of two LjC3Hs and three LjC4Hs may be due to functional divergence of both substrate and catalytic specificity during plant evolution. The co-expression pattern of LjC3H1 and LjC4H3 strongly suggested that they were under coordinated regulation by the same transcription factors due to same cis elements in their promoters. In conclusion, this study provides insight into CYP450s and will effectively facilitate the research of biosynthesis of CGA in L. japonica.

Subjects Molecular Biology, Plant Science Keywords Lonicera japonica, CYP450, CGA biosynthesis, C3H, C4H

INTRODUCTION Cytochrome P450 monooxygenases (CYP450s) are a large and complex superfamily which can be found in almost all living organisms (Nelson, 1999). Plant CYP450s are heme-containing enzymes that take part in a wide variety of reactions of both primary and secondary metabolism (Kumar et al., 2014), including the production of fatty acids, sterols, plant hormones, flavonoids, terpenoids, lignin, signaling molecules, and other biological molecules (Schuler & Werck-Reichhart, 2003). Lonicera japonica Thunb. is a perennial evergreen vine belonging to the family Caprifoliaceae. L. japonica is a medicinal plant of great importance in traditional Chinese medicine that has been used for thousands of years (Shang et al., 2011). There are more than 500 traditional Chinese medicine prescriptions containing L. japonica (Shang et al., 2011). Modern pharmacological studies have indicated that the extracts of L. japonica possess many biological and pharmacological activities, such as anti-inflammatory, antiviral, antibacterial, antioxidant, hepato-protective, anti-tumor, and other activities (Xiang et al., 2001; Yoo et al., 2008). The active compounds of L. japonica have been extensively studied. Essential oils (Schlotzhauer, Pair & Horvat, 1996), phenolic acids (Lu, Jiang & Chen, 2004), flavone (Chen et al., 2005), triterpenoid saponins (Chai, Li & Li, 2005), iridoilds and inorganic elements as the main compositions were isolated and identified in L. japonica. Among all these products, chlorogenic acid (CGA) is the major ingredient for pharmacological activities and its content is typically used as the main indicator of quality for evaluating L. japonica (Chinese Pharmacopoeia Commission, 2010). As one of the most important secondary metabolites in plants, CGA is often used in medicines and foods for its high anti-oxidative activity (Zucker & Levy, 1959). The biosynthetic pathway of CGA has been investigated in many plants and is catalyzed by a series of enzymes (Niggeweg, Michael & Martin, 2004). Cinnamate 4-hydroxylase (C4H) and p-coumarate 30 -hydroxylase (C3H) are two CYP450s that participate in the two steps of hydroxylation in CGA biosynthetic pathway (Gabriac et al., 1991; Schoch et al., 2001). In L. japonica, a CYP98A subfamily gene encoding LjC3H was isolated and characterized.

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

2/20

By using heterologous expressed LjC3H in vitro assay, a recent study revealed that the recombinant protein was effective in converting p-coumaroylquinate to CGA (Pu et al., 2013). Two C4Hs belonging to the CYP73A subfamily were also cloned in L. japonica. Expression and activity analysis suggested that LjC4H2 may be one of the critical genes that regulate CGA content in L. japonica (Yuan et al., 2014). The studies of L. japonica have been focused on the identification of active compounds and pharmacological activity assays. In recent years, with the technological advancement in molecular biology, especially the development of next-generation sequencing technology, great progress has been made in the identification of active compounds involved in the biosynthesis processes in L. japonica (Yuan et al., 2012; He et al., 2013). In this study, bioinformatics tools were used to identify and analyze the CYP450 genes based on transcriptome data of L. japonica. We identified two LjC3Hs and three LjC4Hs from the CYP450 candidate genes, which including one previously reported LjC3H and two LjC4Hs genes. We further cloned the five CYP450 genes and analyzed their transcriptional patterns in different developmental stages flowers. The results provided here will expand CYP450s information and could effectively facilitate CGA biosynthetic studies in L. japonica.

MATERIALS AND METHODS Identification of CYP450 genes in L. japonica The transcriptome data of L. japonica generated from different sequencing platforms including 454 GS-FLX, Illumina HiSeq2000, and Illumina GA II was downloaded from the NCBI SRA database with accession numbers SRR290309, SRR342027, SRR576924, SRR576925 and SRR766791. Four datasets were assembled and annotated. To identify putative CYP450 genes, both Hidden Markov Model (HMM) method and BLAST method were used. For HMM method, P450.hmm file which represents the Hidden Markov Model of the cytochrome P450 family was initially downloaded from Pfam (http://pfam.xfam.org/), and then, HMMER3 software (Eddy, 2011) was used to search P450.hmm against L. japonica deduced amino acid database. For BLAST method, 19,047 full length plant CYP450 sequences were retrieved from UniProt (http://www.uniprot.org/). These sequences were used as queries to tblastn against L. japonica transcriptome assembly with an E-value cutoff of 1e−5. After filtering out the repeated results, the coding sequences of the resultant subjects were retrieved. Finally, results from the two methods were integrated and corrected manually. The identification methods were conducted for the four datasets of L. japonica and the results were also integrated and corrected. The corrected L. japonica CYP450s were further submitted to NCBI Conserved Domain Search (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) to predict the conserved domain. Sequences with complete cytochrome P450 domain were selected for further analysis.

Classification and characterization of L. japonica CYP450 genes L. japonica CYP450s were classified into different families and subfamilies according to the sequence similarity using sequences from Cytochrome P450 Homepage as reference sequences. If the amino acid sequences of L. japonica CYP450s showed >40%, >55%, or

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

3/20

>95% sequence similarity with reference sequences, they were classified into the same family, subfamily, or allelic variant, respectively (Nelson, 2009). The deduced amino acid sequences of L. japonica CYP450s were subjected to Multiple Expectation Maximization for Motif Elicitation (MEME, http://meme-suite.org/) (Bailey et al., 2009) analysis for identification of conserved motifs. Sequences of the four conserved CYP450 motifs including heme-binding region, PERF motif, K-helix region and I-helix region were extracted and then subjected to WEBLOGO (http://weblogo.berkeley.edu/) (Crooks et al., 2004) to create the sequence logos.

Phylogenetic analysis of predicted CYP450 genes A total of 63 representative sequences from plant CYP450 families were selected for phylogenetic analysis with 151 L. japonica CYP450 sequences. Specifically, CYP450 sequences whose functions had already been identified were preferentially selected. Multiple sequence alignment was performed using MUSCLE 3.6 software (Edgar, 2004). The result of alignment was imported to MEGA4 (Tamura et al., 2007) and phylogenetic analysis was performed. The phylogenetic tree was constructed using the Neighbor-Joining algorithm with the Poisson model and pairwise deletion. Bootstrap testing with 1,000 replications was used to test the phylogenetic tree. The Newick format file of bootstrap consensus tree was exported and then modified using EvolView (http://www.evolgenius.info/evolview/) (Zhang et al., 2012).

Gene ontology and KEGG pathway analysis Blast2GO (http://www.blast2go.com/) (Conesa et al., 2005) was used to perform Gene ontology (GO) annotation of L. japonica CYP450s. These predicted genes were functionally categorized according to three different criterions including cellular component, molecular function and biological process. The GO terms of all L. japonica CYP450s were extracted and subjected to Web Gene Ontology Annotation Plot (WEGO, http: //wego.genomics.org.cn/cgi-bin/wego/index.pl) (Ye et al., 2006) to plot GO annotation results. KEGG annotation that maps the L. japonica CYP450s to possible KEGG pathway for biological interpretation of systemic functions was also conducted using Blast2GO.

Extraction and quantification of CGA The L. japonica used for this study was maintained at the Germplasm Nursery in Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, Jiangsu Province. Flower buds and flowers samples for CGA and RNA extraction were collected at five stages: young alabastrum (YA, ≤1.5 cm), green alabastrum (GA, 2.0–3.0 cm), while alabastrum (WA, 3.2–4.4 cm), silvery flower (SF, about 5 cm), and golden flower (GF, about 5 cm). The extraction and quantification of CGA were conducted as described in Chinese Pharmacopoeia with minor modifications (Chinese Pharmacopoeia Commission, 2010). Briefly, dried buds or flowers were separately comminuted with a miler, and 0.2 g of each solid sample (40 mesh) was extracted with 25 mL of 50% aqueous methanol by ultrasonication (250 W, 35 kHz) for 30 min. After cooling to room temperature, the extracts were replenished to earlier weights with 50% aqueous methanol. Then, 5 mL of the extracts were diluted to 25 mL with 50% aqueous methanol and filtered with 0.45 µm

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

4/20

Millipore filter membranes. An Agilent 1200LC series HPLC system was used to analyze the CGA levels. Separations were performed on an Agilent TC-C18 reserved-phase column (5 µm, 250 mm × 4.6 mm) at 25 ◦ C. The mobile phase was composed of acetonitrile-0.4% H3 PO4 (13:87). The flow rate was 1 mL/min and fractions were monitored at 327 nm. Components were identified by comparison of the retention times of the eluting peaks to those of commercial standards under the same conditions.

RNA extraction and qRT-PCR Total RNA from five samples was extracted using RNAiso Plus (Takara, Tokyo, Otsu, Shiga, Japan) according to the manufacturer’s instructions. RNA quality and concentration were measured using a ND-1000 UV spectrophotometer (Nanodrop Technologies, Wilmington, DE, USA). First-strand cDNA was synthesized using 3 µg of total RNA with M-MLV reverse transcriptase (Promega, USA) in a 25 µl reaction system. For quantitative real-time reverse transcriptional PCR (qRT-PCR), each reaction was prepared according to the R Premix Ex TaqTM II (Takara) and 2 µl of manufacturer’s instructions using SYBR diluted cDNA as a template. The qRT-PCR reactions were conducted on the qTOWER2.2 Real Time PCR Systems (Analytik, Jena, Germany). The L. japonica actin gene was used as a control to normalize the relative expression levels of target genes. Gene-specific primers used for qRT-PCR were listed on Table S1. All results were representative of three independent experiments.

RESULTS Identification and classification of CYP450 genes in L. japonica Cytochrome P450 is one of the most massive gene superfamilies that is comprised of a number of families and subfamilies. In the present study, by integrating the results from different datasets and manual correction, we in total identified 151 putative CYP450s with complete cytochrome P450 domain in L. japonica. Among them, nine CYP450s had been previously reported and the other 142 CYP450s were identified here for the first time in L. japonica. Based on sequence similarity, we classified the 151 CYP450 genes from L. japonica into 10 clans consisting of 45 families and 76 subfamilies (Table 1). Among them, the CYP71 clan, which represents the whole set of A-type CYP450 genes, contains 71 genes belonging to 19 families (CYP71, CYP73, CYP75–CYP84, CYP89, CYP92, CYP93, CYP98, CYP701, CYP706, and CYP736). The non-A type CYP450 genes of L. japonica contains the remaining 80 genes, which belongs to nine CYP clans (CYP51, 72, 74, 85, 86, 97, 710, 711, and 727) and 26 families (CYP51, CYP72, CYP714, CYP715, CYP721, CYP734, CYP749, CYP74, CYP85, CYP87, CYP88, CYP90, CYP707, CYP716, CYP722, CYP724, CYP728, CYP729, CYP86, CYP94, CYP96, CYP704, CYP97, CYP710, CYP711, and CYP727). The largest CYP family of L. japonica is CYP71 and CYP72, containing 17 and 18 members, respectively.

Phylogenetic analysis of predicted CYP450s in L. japonica Representative members of each plant CYP450 family were selected and used to conduct phylogenetic analysis with 151 CYP450s from L. japonica. The predicted CYP450s were

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

5/20

Table 1 List of predicted CYP450s with complete cytochrome P450 domain from L. japonica. Type

Clan

Family

Subfamily

Gene ID

Type

Clan

Family

Subfamily

Gene ID

non-A

51

CYP51

CYP51G

m183961

non-A

72

CYP72

CYP72A

m61801

non-A

51

CYP51

CYP51G

m52657

non-A

72

CYP72

CYP72A

m25640

A

71

CYP71

CYP71B

m62714

non-A

72

CYP72

CYP72A

m206268

A

71

CYP71

CYP71D

m153867

non-A

72

CYP72

CYP72A

m132911

A

71

CYP71

CYP71D

m20042

non-A

72

CYP72

CYP72A

m20456

A

71

CYP71

CYP71D

m123612

non-A

72

CYP72

CYP72A

m161676

A

71

CYP71

CYP71D

m203354

non-A

72

CYP72

CYP72A

m16935

A

71

CYP71

CYP71D

m30084

non-A

72

CYP72

CYP72A

m216352

A

71

CYP71

CYP71D

m75810

non-A

72

CYP72

CYP72A

m196797

A

71

CYP71

CYP71D

m124427

non-A

72

CYP72

CYP72A

m21341

A

71

CYP71

CYP71D

m94101

non-A

72

CYP72

CYP72A

m194714

A

71

CYP71

CYP71D

m117052

non-A

72

CYP72

CYP72A

m76011

A

71

CYP71

CYP71AP

m198376

non-A

72

CYP72

CYP72A

m178417

A

71

CYP71

CYP71AU

m112981

non-A

72

CYP72

CYP72D

m62754

A

71

CYP71

CYP71AU

m12680

non-A

72

CYP72

CYP72D

m75640

A

71

CYP71

CYP71AU

m33704

non-A

72

CYP714

CYP714A

m189781

A

71

CYP71

CYP71BC

m197010

non-A

72

CYP714

CYP714E

m200247

A

71

CYP71

CYP71BE

m79469

non-A

72

CYP714

CYP714E

m125702

A

71

CYP71

CYP71BG

m82900

non-A

72

CYP714

CYP714E

m205273

A

71

CYP73

CYP73A

m177245

non-A

72

CYP714

CYP714E

m19972

A

71

CYP73

CYP73A

m13469

non-A

72

CYP714

CYP714E

m17561

A

71

CYP73

CYP73A

m8810

non-A

72

CYP715

CYP715A

m34769

A

71

CYP75

CYP75B

m13120

non-A

72

CYP721

CYP721A

m85505

A

71

CYP76

CYP76A

m204536

non-A

72

CYP734

CYP734A

m842

A

71

CYP76

CYP76A

m184619

non-A

72

CYP749

CYP749A

m139970

A

71

CYP76

CYP76A

m18954

non-A

74

CYP74

CYP74A

m4277

A

71

CYP76

CYP76A

m155830

non-A

74

CYP74

CYP74A

m50137

A

71

CYP76

CYP76A

m197465

non-A

74

CYP74

CYP74A

m67219

A

71

CYP76

CYP76B

m31162

non-A

74

CYP74

CYP74B

m52145

A

71

CYP76

CYP76B

m1881

non-A

74

CYP74

CYP74B

m19408

A

71

CYP76

CYP76B

m156655

non-A

85

CYP85

CYP85A

m206529

A

71

CYP76

CYP76B

m122126

non-A

85

CYP85

CYP85A

m42514

A

71

CYP76

CYP76Y

m139737

non-A

85

CYP87

CYP87D

m191193

A

71

CYP76

CYP76Y

m19896

non-A

85

CYP88

CYP88A

m84626

A

71

CYP77

CYP77A

m194721

non-A

85

CYP90

CYP90A

m21741

A

71

CYP77

CYP77B

m148606

non-A

85

CYP90

CYP90B

m8267

A

71

CYP78

CYP78A

m187124

non-A

85

CYP90

CYP90C

m119587

A

71

CYP78

CYP78A

m152788

non-A

85

CYP90

CYP90D

m88205

A

71

CYP79

CYP79D

m32635

non-A

85

CYP707

CYP707A

m212742

A

71

CYP79

CYP79D

m230122

non-A

85

CYP707

CYP707A

m213600

A

71

CYP80

CYP80C

m37356

non-A

85

CYP707

CYP707A m47109 (continued on next page)

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

6/20

Table 1 (continued) Type

Clan

Family

Subfamily

Gene ID

Type

Clan

Family

Subfamily

Gene ID

A A

71 71

CYP81 CYP81

CYP81B CYP81C

m211982 m12729

non-A non-A

85 85

CYP707 CYP707

CYP707A CYP707A

m35702 m17557

A

71

CYP81

CYP81E

m131282

non-A

85

CYP716

CYP716A

m191349

A

71

CYP81

CYP81E

m61839

non-A

85

CYP716

CYP716A

m57776

A

71

CYP81

CYP81E

m61297

non-A

85

CYP716

CYP716A

m12551

A

71

CYP82

CYP82C

m99205

non-A

85

CYP716

CYP716C

m77065

A

71

CYP82

CYP82C

m76311

non-A

85

CYP716

CYP716C

m23342

A

71

CYP82

CYP82D

m169021

non-A

85

CYP716

CYP716D

m141170

A

71

CYP82

CYP82D

m39884

non-A

85

CYP716

CYP716E

m153199

A

71

CYP82

CYP82D

m211151

non-A

85

CYP716

CYP716E

m200248

A

71

CYP82

CYP82U

m215270

non-A

85

CYP722

CYP722A

m120593

A

71

CYP82

CYP82U

m56636

non-A

85

CYP722

CYP722C

m202676

A

71

CYP82

CYP82U

m82069

non-A

85

CYP724

CYP724A

m206239

A

71

CYP82

CYP82U

m61602

non-A

85

CYP728

CYP728B

m166264

A

71

CYP83

CYP83F

m86843

non-A

85

CYP729

CYP729A

m77833

A

71

CYP83

CYP83F

m64245

non-A

86

CYP86

CYP86A

m71018

A

71

CYP83

CYP83F

m109275

non-A

86

CYP86

CYP86A

m6298

A

71

CYP83

CYP83F

m91426

non-A

86

CYP86

CYP86C

m120347

A

71

CYP84

CYP84A

m176881

non-A

86

CYP94

CYP94A

m145786

A

71

CYP84

CYP84A

m124488

non-A

86

CYP94

CYP94B

m38093

A

71

CYP89

CYP89A

m131845

non-A

86

CYP94

CYP94C

m59371

A

71

CYP92

CYP92A

m14848

non-A

86

CYP94

CYP94C

m6650

A

71

CYP92

CYP92B

m61326

non-A

86

CYP94

CYP94D

m102827

A

71

CYP93

CYP93B

m79556

non-A

86

CYP94

CYP94D

m100765

A

71

CYP98

CYP98A

m184946

non-A

86

CYP94

CYP94F

m175121

A

71

CYP98

CYP98A

m43608

non-A

86

CYP96

CYP96A

m794

A

71

CYP701

CYP701A

m27329

non-A

86

CYP96

CYP96A

m21366

A

71

CYP701

CYP701A

m150262

non-A

86

CYP704

CYP704A

m94230

A

71

CYP706

CYP706C

m115920

non-A

97

CYP97

CYP97A

m56546

A

71

CYP736

CYP736A

m18282

non-A

97

CYP97

CYP97B

m17072

A

71

CYP736

CYP736A

m135731

non-A

97

CYP97

CYP97C

m3461

A

71

CYP736

CYP736A

m182725

non-A

710

CYP710

CYP710A

m92981

non-A

72

CYP72

CYP72A

m51504

non-A

711

CYP711

CYP711A

m201472

non-A

72

CYP72

CYP72A

m55535

non-A

727

CYP727

CYP727B

m144680

non-A

72

CYP72

CYP72A

m11850

classified into two major branches, A-type (47%) and non-A type (53%) (Fig. 1). There were 10 clans in plants CYP450s. Four clans contained multiple families and were designated by their lowest-numbered family members, CYP71, CYP72, CYP85 and CYP86. The other six clans were designated by their only family, CYP51, CYP74, CYP97, CYP710, CYP711 and CYP727. In L. japonica, all 10 clans were identified. Genes belonging to same clan clustered as one clade. For example, the 72 clan, which comprised 28 CYP450s belonged to six families, were clustered as one clade with the eight representative CYP450s. The CYP71

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

7/20

CYP72A-m 216352|Lj CYP72 A-m 16935 |Lj 676 |Lj CYP72A-m 161 178 417 |Lj CYP72A-m 10 |At CYP7 2A 6| Lj -m 20 45 j CYP7 2A 29 11 |L -m 13 |Lj CYP7 2A 62 68 A-m 20 5 0 4 |Lj CYP72 51 j 2 A-m 5 6 4 0 |L 2 CYP7 j |L -m 1 72A 180 CYP -m 6 8 5 0 |Lj 72A 1 j CYP -m 1 5 3 5 |L j 5 72A CYP 2 A-m 5 7 5 4 |L j |L 7 62 CYP 2 D -m 5 6 4 0 7 P7 |At | Lj -m CY A1 2D 21 5505 P7 P7 CY CY -m 8 1 | At | Lj A 4A 42 21 P7 YP7 3 -m 8 | At CY C 3 4 A B1 g j P7 7 0 9 2 2 | P 0 | L 7 P CY A CY 4 9 3 9 9 | At 1 t 1 j P7 CY A-m 3 5 A 1 | A 9 | L j 4 9 P7 5 A 7 6 t P7 CY P7 1 3 4 1 | A 1 | L m CY 8 CY A- 1 4 A 9 7 15 7 18 P7 CYP -m CY A 14 P7 CY

cla 72

n

n

CY CY P7 P 1 CY 7 1 4 E4 CY P7 E- m 1 CY P7 1 4 m 2 2 5 P7 1 4 E-m 0 0 7 0 CY 1 4 E E-m 1 9 2 4 2 | L P9 -m 1 9 7 7 | j CY 7 B 2 0 7 5 2 | Lj P9 -m 5 2 6 1 Lj 7 1 7 7 3 | Lj C CCY 0 P9 YP9 m 3 4 7 2 | Lj |L 7 7 CY A-m A3 | 6 1 | L j A P 5 j CY 7 3 2 6 5 4 t P A 6 CY 7 3 1 1 | O | Lj A1 s P7 | 3 O CY CYP7 0 A1 |O s P7 0 04 A1 s 4A |At -m 9 CYP CYP8 4 2 3 0 6 86C |Lj -m 1 B1 |At CYP 86A 20347 -m 6 CYP |Lj 86A -m 7 2 9 8 |Lj CYP9 6 A-m 1 0 1 8 |L j CYP96 2 1 3 6 6 |L j A18 |V CYP9 v 6A-m 79 CYP9 4F -m 17 51 4| Lj 21 |Lj CYP9 4C -m 59 37 1|Lj CYP94 C1 |At CYP94C-m 665 0|Lj CYP9 4B-m 3809 3|Lj CYP94A-m 145786|Lj CYP94D-m 100765|Lj CYP94D-m 102827|Lj CYP77 B1|At 606 |Lj CYP77B-m 148 194 721 |Lj CYP77A-m A3 |Os CYP7 23 2| At 9A CYP8 j 18 45 |L -m 13 CYP8 9A 71 9A1| Cj CYP |At j 0 1 A3 CYP7 5 0 2 6 2 |L j 1 2 9 |L j A-m 701 273 CYP A-m 8 8 1 0 |L j 701 -m 9 |L A CYP 6 3 4 7 CYP A-m 1 3 |At A5 |Lj 73 CYP CYP7 3 7 2 4 5 t 17 3 |A Lj -m 3A 8A 946| j 7 9 P L P CY CY 1 8 4 6 0 8 | 3 -m t 8 A -m 4 5 | A | Lj 9 P 4 A A CY P9 8 P7 8 7 1 2 8 | Lj j CY CY 1 8 2 7 8 2 | L j -m 1 5 1 2 | L 5 8A 0 P7 A-m 2 3 6 3 CY 7 8 -m 3 2 | At t j P 2 D m CY 7 9 D - 9 A 2 | A 6 | L Lj P 7 A 9 2 | CY YP7 CYP 0 3 2 1 5 5 C P7 1 2 6 6 CY B-m 1 5 6 -m P7 6 B CY P7 CY

74

cla

n

71

0

Lj 4| Tc 6 1| 62 5 A 1 6 Pt j 2 m | |L P7 B- D 2 93 CY 7 2 8 2 8 | At 0 5 j P P7 1 8 12 t |L CY CY P7 A-m 1 | A 6 7 6 CY 2 2 2 A 2 0 2 P7 P7 2 -m | Pt | Lj Y 7 C CY 2 C A1 5 5 2 3 17 P7 7 3 t j CY CYP 7 A-m 1 | A 2|L 7 0 7 0 7 A 3 5 7 0 2 |Lj P CY CYP A-m 1 2 7 4 |Lj 2 00 07 P7 7 A-m 2 1 3 6 |Lj Y 0 C -m P7 109 CY 7 0 7 A -m 4 7 j P 6 |L CY 7 0 7 A 8 4 6 2 P CY 8 8 A-m |At j 3 3 3 |L CYP YP8 8 A 778 C -m A |Os 729 CYP P7 2 9 A1 t CY A1 |A 2 0 At CYP7 8A2| CYP70 2| At CYP8 7A 93 |Lj -m 19 11 CYP8 7D 20 65 29 |Lj -m CYP8 5A 425 14| Lj CYP85A-m CYP85A1|At CYP72 0A1|At CYP90D-m 88205|Lj CYP90C-m 119587|Lj CYP90A-m 21741|Lj CYP9 0B-m 8267 |Lj CYP90B1|At CYP72 4A -m 206 239 |Lj CYP7 24 A1 |At

97

cla

51

CYP72A-m 196797|Lj CYP72A-m 21341|Lj CYP7 2A-m 1947 14|Lj CYP72A-m 760 11|Lj CYP72 7B -m 144 680 |Lj CYP7 27 B2 |Pt CYP7 11 A-m 20 14 72 |L CYP7 j 11 CYP51 A1 |At G2| CYP5 1 G-m At CYP 1839 5 6 1 |L CYP 1 G-m 5 j 265 74A 7 |Lj CYP -m 5 CYP 7 4 A-m 0 1 3 7 |L j 672 74A 1 9 |L -m C j CY YP7 4 4 2 7 7 P7 |L A |A j 4 CY t B-m CY P7 4 B 52 1 P7 1 0 m 1 9 4 5 |L j CY CYP A-m 4 0 8 7 CY P7 1 1 0 A 9 2 9 |Lj CY P7 1 6 A-m 1 | A 8 1 | L j t P7 6 A 57 1 CY CY 6 A m 1 7 7 6 C P7 P7 1 -m 1 2 5 5 | Lj C YP7 1 6 6 A 9 1 1 | L C YP7 1 6 D -m 1 | A 3 4 9 j |L CY YP7 1 6 C-m 1 4 t j 11 P7 1 6 C7 1 6 E- m 2 7 0 6 7 0 E- m 1 3 3 5 | | Lj m 5 3 4 2 Lj 20 |L 1 02 99 j 4 8 | Lj |L j

727 71 1

non-A type

85

cla

8 6 cl an

n

j |L 2 7 Lj | 44 0 j 12 81 4|L j -m 7 5 0 8 2 | L j 1 D -m 3 0 6 1 4 | L P7 1 D -m 1 2 3 3 5 j CY YP7 1 D -m 0 3 2 | L Lj C P7 1 D m 2 0 0 4 7 | CY P7 1 D - m 2 3 8 6 15 CY 7 l D P E 1 CY P7 D -m A1 | j CY P7 1 7 2 6 2 | Os 8 2 | L CY CYP 9 9 A 1 8 2 j P m Pg CY 3 6 A- A1 2 | 7 3 1 |L P7 7 3 6 1 3 5 2 5 |Lj P CY 7 Y 2 C A- m 3 6 -m 1 8 P7 j CY 7 3 6 A 1 |At A P 8 1 |L CY YP8 4 1 7 6 8 j C 8 8 |L -m 244 84A 1 8 |Lj CYP 4 A-m 8 484 CYP 2 A-m 1 t 9 CYP P9 2 A2 |N 6 |Lj CY 6132 2 B-m 0| Lj CYP9 B-m 13 12 CYP75 5B1| At CYP7 A5 |At CYP7 05 A1 |At CYP7 12 795 56| Lj CYP93B-m CYP93D 1|At CYP81 E-m 61839 |Lj CYP81E-m 61297|Lj

C CY YP C P7 7 6 CY YP 6 B B-m P7 7 6 -m 1 Y 6 Y -m 3 1 8 8 -m 1 1 1 | CY 6 P8 CYP 1 3 9 8 9 2 | Lj 0 C 8 0 9 7 6 Lj CY CY -m C1 3 7 | Lj P CY 7 6 A P7 3 7 3 | Pt | Lj 6 P CY 7 6 A -m 1 G1 | 5 6 | L A 9 P j CY 7 6 A m 2 0 7 4 6 t -m P7 45 5| CY 6 A-m 1 8 4 3 6 | Lj 6 P7 L 6A 155 19| j Lj 8 m 3 C CY P7 0 YP7 0 1 8 9 5 0 | Lj 6 6 4 CYP C-m 1 A1 |A |Lj t 1 8 CYP 2 U-m 5 9 2 0 82 |L 82U -m 5 0 6 9 |L j CYP 6 82U j -m 6 6 3 6 |L CYP j 82U 160 -m 2 2 |Lj CYP8 1 2 C-m 5 2 7 0 |L j 7 CYP82 6 3 1 1 |Lj C4| A CYP8 t 2C-m 99 20 CYP8 2D -m 16 90 5| Lj 21 |Lj CYP8 2D -m 21 11 51 |Lj CYP82 D-m 39 88 4|Lj CYP81B-m 211 982 |Lj CYP8 1C-m 1272 9|Lj CYP81D1 1|At CYP81E-m 131282|Lj

C CYP YP8 3 B 1 CY 8 3 F-m |At P CY 8 3 F-m 1 0 9 2 P8 7 5 |L 9 3F CY j -m 1 4 2 6 P 8 64 |Lj CY 3F 2 CY P7 1 B -m 8 6 4 5 |L j CY P7 1 C-m 8 4 3 |Lj CY P7 1 B-m 1 9 7 P7 BG 6 2 7 0 1 0 1 | 1 CY CY AP- m 8 2 4 | L Lj 90 j CY P7 P7 1 m 1 C P7 1 AU A1 2 9 8 3 0 | Lj CY YP7 1 A -m | At 7 6 | Lj C P7 1 A U-m 1 2 CY YP7 1 B U-m 1 1 6 8 0 P7 1 D E3 29 |L 1 D -m m 7 3 7 8 1 j - m 1 1 9 4 0 4 | | Lj L 6 j 94 70 9| 1 0 5 2 Lj 1 | | Lj Lj

7 1 clan

A type

Figure 1 Phylogenetic analysis of predicted CYP450s in L. japonica and the representative members of CYP450 families.

clan that comprised 71 members belonging to 19 families was the largest clan. Three clans, CYP710, CYP711 and CYP727, had only one member identified for each clan.

Conserved motifs analysis of L. japonica CYP450s Plant CYP450s shared some typical conserved motifs including heme-binding region, PERF motif, K-helix region and I-helix region, which were important for catalytic activities (Paquette, Jensen & Bak, 2009). The L. japonica CYP450s were divided into A-type and non-A type according to phylogenetic analysis. The deduced amino acid sequences were subjected to MEME to analyze the conserved motifs. The consensus sequences of the heme-binding region, also known as ‘‘P450 signature’’, were ‘‘PFGXGRRXCPG’’ and ‘‘XFXXGXRXCXG’’ for A-type and non-A type CYP450s, respectively (Fig. 2). The cysteine residues in this motif of two types of CYP450s were universally conserved, which links the heme iron to the apoprotein. The consensus sequences of the PERF motif were

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

8/20

18

17

16

15

14

13

12

11

9

10

8

7

6

5

4

3

2

13

12

11

9

8

7

6

5

3

2

10

C

MAS V

N

E F G I H V

I

F

T

16

15

14

S

13

12

11

10

S

9

8

7

6

5

4

3

Y

T

A

D

A

C

LTW

SL VA

S

TT

I

T

F S FA

A I S M LC A AVS I SVG M T

Y G S

V

V

S

M

N

L

C

S

L

16

I

I

A

15

A

H

14

A K

N

S

13

I

Y

V

S A

E

Q

H

R

V

M

F A

12

I

M M

11

I

0 N

10

bits

T

Y

V

AG TT

YF FL L V

C

S

I-helix region

1L

I

Q

SM

Q

9

C

A

M

E LRL P

Y

S

C

K

A F

8

Q

C

W

D

L

N

AV

K

7

W

D

T

18

I

I V I L

G

K

A

6

L

W

T

V

Y

S

W S

R

M

T

3 2

R

M

Q

TQ

4

I G T T V FV

17

A

E

V

Y N

T

H

R

C

F

Q

R

S D

T

P

4

I M

H

P

12

9

R T

A

S

16

S

N

G

A

S

7

4

3

2

1

V

10

I N N E

F

8

I

V

S

6

I

5

F

0 N

I

N

Q

M

A H N F S E L R V S C

F

EA

T

V

LS V VT

15

A F I T V M TS M

14

GAS

V

I L NV

W

S E

V TS TL SA 13

F DL 1L M I L ME

1

bits bits

16

11

T

D AGTE

2

11

3

N

L Y V I

LK

0 N

V

9

6

5

4

1

R R

I-helix region

4

M

2

I

M

M

P

R

K-helix region

3

L S V F T T

R

L

Y

D

E

K N E

L K

R

3

L

Q

3

2

1

V

M

L M

15

N

Y

14

C

N

F

I

K M

F

Y D SK I G M

S H

L K

A

A

W N

2

A R F S

L

M

F M

A S

H

L N

M

G

H

T

L

KN

N

V

S

Q

C K V S

E K P

4

A V AG S

13

R

E

10

0F N

KC S

I

12

Q

1

KETLRLHPP

AVI VI

8

2

R

L

1

12

11

10

0 N

C

H

H

QE

I P

V

L

T

Q

D

1

T

V

Y

Y

9

4

3

2

1

YL

3

2

2

F

G

V

Y

K T

R

H

L

T

T

E

G

D

15

W

R

S

D

I

E N K

Q S

N

14

D

F

A

D E M G N

D

Q

S

C

13

S

C

N

I

I

A

G

L

S

T

M

V

S

N

Q

PERF motif

3

L

K

R

K

A

SA

L NW

S

5

bits 21

20

19

18

17

C

4

F PERF

EE

LT

N

E

D A M Q S S

P

V

0 N

A

L

G

F

LA

V

T

S

Q

16

15

14

G

F GRCG A WG D EF P RF

FY P G P

I M P L

H

1

A

G

L

13

7

A

R

2

I

M

M

Q

L

T

1

L

AT

K-helix region

4 bits

6

5

3

2

1

W

P

D

K

D S

bits

K

I

MS

F

I V

3

PERF motif

E

0 N

M

T

A L F

L P

A

A

8

1

V

7

2

M S

V

7

bits

3

G

G

6

4

ST

4

0 N

A S A

V M

V

12

H Q

A

S

11

G

9

K

RYM

Y H L

E

L

I

10

D N

S A S

Heme-binding region

4

PFG GRR CPG

I EL FL

8

F

2

5

bits

3

non-A type

4

Heme-binding region

4

1

B

A-type

1

A

C

Figure 2 Weblogos of conserved motifs identified in A-type (A) and non-A type (B) CYP450s from L. japonica.

also different for two types of CYP450s in L. japonica, which are ‘‘PERF’’ for A-type and ‘‘PXRX’’ for non-A type. The R residues in the PERF motif and E and R residues in the K-helix region were universally conserved, which form a salt bridge that has been proposed to be involved in locking the Cys-pocket in position and assuring the stable association of heme with the protein. The threonine residues in the I-helix region which is involved in oxygen activation was highly conserved in both A-type and non-A type CYP450s. In general, sequences of the typical motifs were conserved in L. japonica CYP450s, and the differences between A-type and non-A type CYP450s in L. japonica were similar with other plants (Chen et al., 2014).

Gene ontology classification of L. japonica CYP450s Gene ontology (GO) is a classification system for standardized gene functions which classifies genes into three main independent GO categories: cellular component, molecular function and biological process. In this study, GO assignments were conducted to classify the functions of CYP450s from L. japonica using Blast2GO. Results indicated that all 151 CYP450s were mapped to one or more GO terms, of which 145 were assigned to the ‘‘cellular component’’, 151 to the ‘‘molecular function’’, and 151 to the ‘‘biological process’’ (Fig. 3). Of these categories, cell, binding, catalytic, and metabolic process were the largest subcategories. Comparison of the GO classification between the A-type and non-A type CYP450s, we found that non-A type CYP450s participated in more molecular

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

9/20

75

71 80 53 60

50

35 40

25

17 20

0

0 0

macr o

mole

Number of genes

A type non-A type

c cell ell enve part cular lo comp pe le orga nelle organe x lle mem br orga nelle ane pa bindi rt ng c dem atalytic ethy hydr lase olase oxido l redu yase struc c t u ral m tase anat olecu omic al str t l uctur ransfera e se cellu e b f i o o r l cellu lar comp ogical re mation lar co g o mpo nent bio ulation nent g orga enesis deve cellula nization r estab lo lishm pmental process ent o p f loca rocess lizati o gro n multi meta localiza wth cellu lar o rgan bolic proction isma l p ess pigm rocess en repro reprod tation ducti uctio resp onse ve proce n ss to sti mulu s

Percent of genes

100

Cellular Component Molecular Function

Biological Process

Figure 3 Gene ontology annotation of A-type and non-A type CYP450s in L. japonica.

functions and biological processes than A-type. For example, GO terms of non-A type CYP450s in molecular function category included demethylase, hydrolase, lyase, and transferase; however, no A-type CYP450s was assigned to these subcategories. In biological process category, non-A type CYP450s participated in more biological processes than A-type, including anatomical structure formation, cellular component organization, developmental process, establishment of localization, growth, localization, multicellular organismal process, and reproduction. The GO annotation provided a valuable clue to investigate the functions of CYP450s in L. japonica.

KEGG pathway analysis of L. japonica CYP450s In order to further understand the biological functions of CYP450s in L. japonica, pathway-based analysis was performed. Given that a CYP450 could be assigned to one or more KEGG pathways as well as GO terms, 47 (31.1%) CYP450s were totally assigned to 25 KEGG pathways (Fig. 4). The 25 pathways could be mainly grouped into six classes, including lipid metabolism, amino acid metabolism, metabolism of cofactors and vitamins, metabolism of terpenoids and polyketides, biosynthesis of other secondary metabolites, and xenobiotics biodegradation and metabolism. In the class of ‘biosynthesis of other secondary metabolites’, after removing duplicate hits, ten CYP450s (CYP73A-m13469, CYP73A-m177245, CYP73A-m8810, CYP75B-m13120, CYP76Am155830, CYP78A-m152788, CYP93B-m79556, CYP98A-m184946, CYP98A-m43608 and CYP736A-m18282) were found to be involved in the biosynthesis of phenolic compounds

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

10/20

0

Lipid metabolism

Amino acid metabolism Metabolism of cofactors and vitamins Metabolism of terpenoids and polyketides

Biosynthesis of other secondary metabolites

Xenobiotics biodegradation and metabolism

2

4

6

8

10

12

Fatty acid degradation Steroid biosynthesis Steroid hormone biosynthesis Arachidonic acid metabolism Linoleic acid metabolism alpha-Linolenic acid metabolism Phenylalanine metabolism Tryptophan metabolism Cyanoamino acid metabolism Retinol metabolism Ubiquinone and other terpenoid-quinone biosynthesis Monoterpenoid biosynthesis Diterpenoid biosynthesis Carotenoid biosynthesis Phenylpropanoid biosynthesis Stilbenoid, diarylheptanoid and gingerol biosynthesis Flavonoid biosynthesis Flavone and flavonol biosynthesis Isoflavonoid biosynthesis Indole alkaloid biosynthesis Caffeine metabolism Aminobenzoate degradation Metabolism of xenobiotics by cytochrome P450 Drug metabolism - cytochrome P450 Biosynthesis of antibiotics

KEGG pathway

Number of sequence

Figure 4 KEGG pathway analysis of predicted CYP450s in L. japonica.

including phenylpropanoid, stilbenoid, diarylheptanoid and gingerol, flavonoid, flavone and flavonol, and isoflavonoid. All ten CYP450s belonged to CYP71 clan. In the class of ‘metabolism of terpenoids and polyketides’, nine CYP450s (CYP72A-m132911, CYP72Am20456, CYP72A-m206268, CYP72D-m62754, CYP72D-m75640, CYP714A-m189781, CYP714E-m17561, CYP714E-m205273 and CYP734A-m842) were found to be involved in ‘monoterpenoid biosynthesis’, all of which belonged to CYP72 clan. Three CYP450s (CYP701A-m150262, CYP701A-m27329 and CYP728B-m166264) were found to be involved in ‘diterpenoid biosynthesis’, among them, two belonged to CYP71 clan and one belonged to CYP85 clan. Five CYP450s (CYP707A-m213600, CYP707A-m35702, CYP707A-m47109, CYP707A-m212742 and CYP728B-m166264) were found to be involved in ‘carotenoid biosynthesis’, all of which belonged to CYP85 clan.

CYP450s involved in CGA biosynthesis CGA is the most major active ingredient in L. japonica and the biosynthetic pathway of CGA has been investigated in many plants. In CGA biosynthetic pathway, C4H and C3H are the two CYP450-encoded enzymes that participate in the two steps of hydroxylation. In the present study, three C4H and two C3H genes were identified and cloned from L. japonica. Among them, two LjC4Hs and one LjC3H have been previously reported. The newly identified C4H and C3H were designated as ‘LjC4H3’ (GenBank accession number: KX845341) and ‘LjC3H2’ (GenBank accession number: KX845342), respectively. The C4Hs belonged to CYP73A subfamily and C3Hs belonged to CYP98A subfamily. Phylogenetic

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

11/20

47 100 53 55 100 80 66

81 100 91 92 98 57

C4H|Ms C4H|Ca C4H|Ps C4H|Gm C4H|Cr C4H3|Lj C4H1|Lj C4H|At C4H2|Lj C3dH1-2|Sm C3H1|Cc C3H1|Lj CYP98A3|At CYP98A44|Tp C3H2|Lj C3dHi2|Ob 100 C3dH|Ot

C4H

C3H

0.05

Figure 5 Phylogenetic analysis of C3Hs and C4Hs from L. japonica and other plants. LjC3Hs were labeled by black dots and LjC4Hs were labeled by black triangles. Protein sequences were downloaded from UniProt with accession numbers as follows: C3H1|Cc (A4ZKM5), CYP98A3|At (O22203), C3dH1-2|Sm (D8SCG3), C3dHi2|Ob (Q8L5H7), CYP98A44|Tp (C9EGT6), C3dH|Ot (T1NXG3), C4H|At (P92994), C4H|Cr (P48522), C4H|Ps (Q43067), C4H|Ms (P37114), C4H|Ca (O81928), C4H|Gm (Q42797).

analysis indicated that two clades were clustered for C4Hs and C3Hs from L. japonica and other plants (Fig. 5). Because CGA was mainly accumulated in flower bud of L. japonica, buds and flowers in different developmental stages were selected to explore the relationship of C4H and C3H expressions and CGA contents. HPLC analysis was used to measure CGA concentrations in different developmental stages of buds and flowers. As shown in Fig. 6, the percentage of CGA contents decreased during the flower development. Nevertheless, with the increase of bud or flower weights, the total CGA contents increased from young alabastrum (YA) to while alabastrum (WA) stage and reached peak at the WA stage. After flowering, the total CGA contents decreased quickly during flower development. Furthermore, qRT-PCR was conducted to analyze the transcriptional levels of CGA biosynthetic genes in the different developmental stages of buds and flowers, including the five CYP450s identified in this study. The two LjC3Hs exhibited oppositing expression patterns, the transcriptional levels of LjC3H1 increased but that of LjC3H2 decreased during the flower development (Fig. 7). The expression patterns of three LjC4Hs were quite different and the relative expression levels of LjC4H3 was obviously higher than those of the other two (Fig. 7). Interestingly, the expression patterns of LjPAL1, LjC4H3, LjC3H1 and LjHQT were quite similar, which

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

12/20

8

CGA contents (%)

7 6 5 4 3 2 1 0 YA

GA

WA

SF

GF

Figure 6 CGA contents of buds and flowers in different developmental stages L. japonica. YA-young alabastrum, GA-green alabastrum, WA-white alabastrum, SF-silvery flower, and GF-golden flower.

0.02

SF

F

0.01 0.005

GA

WA

SF

WA

SF

0.2 0.15 0.1 0.05

Relative expression

0.02 0.01

GA

WA

SF

GF

SF

GF

LjC4H1

0.06 0.05 0.04 0.03 0.02 0.01 0

GA

WA

SF

GF

YA

H

Lj4CL2

0.05 0.04 0.03 0.02 0.01 0

YA

J

LjC3H2

0.035 0.03 0.025 0.02 0.015 0.01 0.005 0

0.03

YA

G

LjC4H3

0.25

GF

0.04

GF

0 GA

0.05

0 YA

GF

Relative expression

0.04

D

LjPAL3

0.06

Relative expression

WA

LjC4H2

0.015

YA

Relative expression

Relative expression

0.06

Relative expression

GA

Rellative expression

Relative expression

0.1 0.08

0

I

C

LjPAL2

0.12

0 YA

E

B

LjPAL1

0.35 0.3 0.25 0.2 0.15 0.1 0.05 0

Relative expression

Relative expression

A

YA

GA

WA

SF

GF

GA

WA

SF

GF

SF

GF

LjC3H1

0.2 0.15 0.1 0.05 0 YA

GA

WA

LjHQT

0.8 0.6 0.4 0.2 0

YA

GA

WA

SF

GF

YA

GA

WA

Figure 7 Transcriptional analyses of CGA biosynthetic pathway genes in buds and flowers of L. japonica at different developmental stages.

exhibited a trend of decreasing first and then increasing. Considering the gene expressions with CGA contents, only LjC3H2 exhibited a similar pattern with CGA concentrations.

DISCUSSION L. japonica is an important medicinal plant that has been widely used in traditional Chinese medicine for thousands of years. The pharmacological activities of this medicinal plant are mainly due to its rich natural active ingredients, most of which are secondary

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

13/20

metabolites. CYP450s are a large, complex, and widespread superfamily that participate in many metabolic reactions, especially secondary metabolism. The identification and characterization of CYP450s in L. japonica will effectively facilitate the study of natural active compounds biosynthesis. In this study, we identified 151 putative CYP450s with complete cytochrome P450 domain from transcriptome data of L. japonica. According to the classification criteria, the 151 CYP450s were classified into 10 clans consisting of 45 families and 76 subfamilies. Next, we conducted phylogenetic analysis, conserved motifs analysis, GO annotation, and KEGG annotation to characterize the identified CYP450s. As mentioned above, nine CYP450s have been previously reported in L. japonica, which were also identified among the 151 CYP450s of this study. These results indicated that the identified CYP450s from the L. japonica transcriptome data in this study were quite comprehensive. The evolution of plant CYP450s can be divided into three major groups: CYP450s involved in sterol and carotenoid biosynthesis were the most ancient, CYP450s involved in adaptation to land environment were the next oldest, and CYP450s involved in biosynthesis of plant secondary metabolites were the most recent to evolve (Morant et al., 2007; Nelson et al., 2008). In this study, ten CYP450s (CYP73A-m13469, CYP73Am177245, CYP73A-m8810, CYP75B-m13120, CYP76A-m155830, CYP78A-m152788, CYP93B-m79556, CYP98A-m184946, CYP98A-m43608, and CYP736A-m18282) were annotated to participate in the biosynthesis of phenolic compounds, a most common type of secondary metabolite in plants, including phenylpropanoid, stilbenoid, flavonoid, and isoflavonoid. All ten CYP450s belonged to CYP71 clan. As earlier reported, the most recently evolved CYP450 group comprises the highly proliferated clan 71. This clan includes CYP450s involved in the biosynthesis of the majority of plant secondary metabolites involved in adaptation to abiotic and biotic stress (Morant et al., 2007), with which our present findings are in agreement. Five CYP450s (CYP707A-m213600, CYP707A-m35702, CYP707A-m47109, CYP707A-m212742, and CYP728B-m166264) were found to be involved in carotenoid biosynthesis, all of which belonged to the CYP85 clan. These CYP450s belonged to the oldest group with a function that preceded the colonization of land by plants (Morant et al., 2007). CGA is the major active ingredient in L. japonica, and the biosynthetic pathway of CGA has been investigated in many plants. In CGA biosynthetic pathway, C4H and C3H are two CYP450 encoded enzymes that participate in the two steps of hydroxylation (Gabriac et al., 1991; Schoch et al., 2001). In L. japonica, a gene encoding LjC3H has been isolated and characterized by Pu et al. (2013), and was identified as CYP98A subfamily member. In vitro assay using heterologous expressed LjC3H revealed that the recombinant protein was effective in converting p-coumaroylquinate to CGA. Southern blotting suggested that the gene was present in the genome in two copies, but unfortunately, only one copy of LjC3H was obtained. In this study, two LjC3Hs were identified and cloned from L. japonica, both of which belonged to the CYP98A subfamily. Among the two LjC3Hs, one was same as the LjC3H reported by Pu et al. (2013), the other is a newly identified gene and is hereby designated LjC3H2. These results suggested that the newly identified LjC3H2 was the other copy of LjC3H in the genome of L. japonica. Two C4Hs were also cloned in L. japonica

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

14/20

by Yuan et al. (2014), which belonged to the CYP73A subfamily. Expression and activity analysis suggested that LjC4H2 may be one of the critical genes that regulate CGA content in L. japonica. In our study, three C4Hs were identified and cloned from L. japonica, including the previously reported two genes. The newly identified LjC4H was designated as LjC4H3, which showed high degree of sequence homology with LjC4H1. Phylogenetic analysis showed that LjC4H1 and LjC4H3 clustered to one clade. This result suggested that these two genes may be generated by recent gene duplication. In the present study, the expression patterns of two LjC3Hs and three LjC4Hs were quite different during the flower development. This phenomenon that different members of the same family exhibit different expression patterns during development was also observed in other plants (Bi et al., 2011; Qi et al., 2014), which might be caused by functional divergence of both substrate and catalytic specificity during plant evolution (Helariutta et al., 1996; Xu et al., 2009). Considering the gene expressions with CGA contents, only LjC3H2 exhibited a similar pattern with CGA concentrations in our study. This result was similar with that of coffee (Lepelley et al., 2007). In coffee, transcriptional levels of CGA biosynthetic genes and CGA contents were measured during grain development and C3H1 showed a similar expression pattern with CGA concentrations. Both the CGA concentrations and C3H expression pattern were similar with those of L. japonica, respectively. However, in this study, the expression patterns of LjC3H1 and three LjC4Hs were inconsistent with CGA contents during flower development. The reason for this phenomenon could be that C3H and C4H not only participated in CGA biosynthesis, but were also involved in other metabolites. The product catalyzed by C4H was a common precursor in phenylpropanoid metabolism, including flavonoids, anthocyanins, condensed tannins, and isoflavonoids (Winkel-Shirley, 2001). C3H was also a key enzyme in lignin biosynthesis (Boerjan, Ralph & Baucher, 2003). It is likely that the complexity of the metabolic pathways led to the inconsistency between gene expressions and product contents. In this study, the expression patterns of LjPAL1, LjC4H3, LjC3H1 and LjHQT were quite similar during flower development. The co-expression patterns of these four genes strongly suggested that they were under coordinated regulation by the same transcription factors due to similar cis elements in their promoters (Bi et al., 2011). In apple, anthocyanin biosynthetic genes including CHS, CHI, F3H, DFR, LDOX and UFGT showed similar expression patterns during fruit development, which were coordinately regulated by a MYB transcription factor, MdMYB10 (Espley et al., 2007). Fruit-specific ectopic expression of AtMYB12 in tomato led to upregulation of all biosynthetic genes required for the production of flavonols and their derivatives, including PAL, C4H, 4CL, CHS, CHI, F3H, F3’H, FLS, ANS, C3H, HCT, HQT, GT, and RT ; and, in addition, led to the increase of flavonols and their derivatives (Luo et al., 2008). In pine and eucalyptus, xylem-associated MYB transcription factors could bind to the AC elements and activate the transcription of the lignin biosynthetic genes (Patzlaff et al., 2003; Goicoechea et al., 2005). Moreover, the rice genome sequence analysis revealed that ACII motif existed in the promoters of many lignin biosynthetic genes, including PAL, 4CL, C4H, C3H, CCoAOMT, CCR, and CAD, suggesting that they were under coordinated regulation by the same transcription factors (Bi et al., 2011).

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

15/20

CONCLUSIONS In this study, we identified 151 putative CYP450s with complete cytochrome P450 domain in L. japonica transcriptome, 142 of which were identified here for the first time. According to the classification criteria, the 151 CYP450s were classified into 10 clans consisting of 45 families and 76 subfamilies. Next, we conducted phylogenetic analysis, conserved motifs analysis, GO annotation, and KEGG annotation to characterize the identified CYP450s. From these data, we cloned two LjC3Hs (CYP98A subfamily) and three LjC4Hs (CYP73A subfamily) genes that may be involved in biosynthesis of CGA, including the newly identified LjC3H2 and LjC4H3. Furthermore, qRT-PCR and HPLC results indicated that only LjC3H2 exhibited a similar expression pattern with CGA concentration. Different members of the same family exhibited different expression patterns during development that may be due to functional divergence of both substrate and catalytic specificity during plant evolution. The co-expression pattern of LjPAL1, LjC4H3, LjC3H1 and LjHQT strongly suggested that they were under coordinated regulation by the same transcription factors due to same cis elements in their promoters. In conclusion, this study provides insight into CYP450s and will effectively facilitate the research of biosynthesis of CGA in L. japonica.

ADDITIONAL INFORMATION AND DECLARATIONS Funding The work was supported by the National Natural Science Foundation of China (31500249), Six Talent Peaks Project in Jiangsu Province (2015-NY-032), the Natural Science Foundation of Jiangsu Province (BK20161381), and the Foundation of Jiangsu Key Laboratory for the Research and Uti1ization of Plant Resources (SQ201401). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Grant Disclosures The following grant information was disclosed by the authors: National Natural Science Foundation of China: 31500249. Six Talent Peaks Project in Jiangsu Province: 2015-NY-032. Natural Science Foundation of Jiangsu Province: BK20161381. Foundation of Jiangsu Key Laboratory for the Research and Uti1ization of Plant Resources: SQ201401.

Competing Interests The authors declare there are no competing interests.

Author Contributions • Xiwu Qi performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables. • Xu Yu and Daohua Xu performed the experiments. • Hailing Fang analyzed the data.

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

16/20

• Ke Dong contributed reagents/materials/analysis tools. • Weilin Li and Chengyuan Liang conceived and designed the experiments, reviewed drafts of the paper.

DNA Deposition The following information was supplied regarding the deposition of DNA sequences: The LjC4H3 and LjC3H2 sequences described here are accessible via GenBank accession numbers KX845341 and KX845342.

Data Availability The following information was supplied regarding data availability: The raw data used for transcriptomic assemblies was downloaded from NCBI SRA with accession numbers SRR290309, SRR342027, SRR576924, SRR576925 and SRR766791 (https://ftp.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR290/SRR290309/ SRR290309.sra https://ftp.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR342/SRR342027/ SRR342027.sra https://ftp.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR576/SRR576924/ SRR576924.sra https://ftp.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR576/SRR576925/ SRR576925.sra https://ftp.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR766/SRR766791/ SRR766791.sra).

Supplemental Information Supplemental information for this article can be found online at http://dx.doi.org/10.7717/ peerj.3781#supplemental-information.

REFERENCES Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. 2009. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Research 37:W202–W208 DOI 10.1093/nar/gkp335. Bi C, Chen F, Jackson L, Gill BS, Li W. 2011. Expression of lignin biosynthetic genes in wheat during development and upon infection by fungal pathogens. Plant Molecular Biology Reporter 29:149–161 DOI 10.1007/s11105-010-0219-8. Boerjan W, Ralph J, Baucher M. 2003. Lignin biosynthesis. Annual Review of Plant Biology 54:519–546 DOI 10.1146/annurev.arplant.54.031902.134938. Chai XY, Li SL, Li P. 2005. Quality evaluation of Flos Lonicerae through a simultaneous determination of seven saponins by HPLC with ELSD. Journal of Chromatography A 1070:43–48 DOI 10.1016/j.chroma.2005.02.031. Chen J, Li SL, Li P, Song Y, Chai XY, Ma DY. 2005. Qualitative and quantitative analysis of active flavonoids in Flos Lonicerae by capillary zone electrophoresis coupled with solid–phase extraction. Jorunal of Separation Science 28:365–372 DOI 10.1002/jssc.200400024.

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

17/20

Chen H, Wu B, Nelson DR, Wu K, Liu C. 2014. Computational identification and systematic classification of novel cytochrome P450 genes in Salvia miltiorrhiza. PLOS ONE 9:e115149 DOI 10.1371/journal.pone.0115149. Chinese Pharmacopoeia Commission. 2010. The pharmacopoeia of the People’s Republic of China, 2010 ed. Beijing: China Medical Science Press, 205–206. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. 2005. Blast2go: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21:3674–3676 DOI 10.1093/bioinformatics/bti610. Crooks GE, Hon G, Chandonia JM, Brenner SE. 2004. WebLogo: a sequence logo generator. Genome Research 14:1188–1190 DOI 10.1101/gr.849004. Eddy SR. 2011. Accelerated profile HMM searches. PLOS Computational Biology 7:e1002195 DOI 10.1371/journal.pcbi.1002195. Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32:1792–1797 DOI 10.1093/nar/gkh340. Espley RV, Hellens RP, Putterill J, Stevenson DE, Kutty-Amma S, Allan AC. 2007. Red colouration in apple fruit is due to the activity of the MYB transcription factor, MdMYB10. Plant Journal 49:414–427 DOI 10.1111/j.1365-313X.2006.02964.x. Gabriac B, Werck-Reichhart D, Teutsch H, Durst F. 1991. Purification and immunocharacterization of a plant cytochrome p450: the cinnamic acid 4-hydroxylase. Archives of Biochemistry and Biophysics 288:302–309 DOI 10.1016/0003-9861(91)90199-S. Goicoechea M, Lacombe E, Legay S, Mihaljevic S, Rech P, Jauneau A, Lapierre C, Pollet B, Verhaegen D, Chaubet-Gigot N, Grima-Pettenati J. 2005. EgMYB2, a new transcriptional activator from Eucalyptus xylem, regulates secondary cell wall formation and lignin biosynthesis. Plant Journal 43:553–567 DOI 10.1111/j.1365-313X.2005.02480.x. He L, Xu X, Li Y, Li C, Zhu Y, Yan H, Sun Z, Sun C, Song J, Bi Y, Shen J, Cheng R, Wang Z, Xiao W, Chen S. 2013. Transcriptome analysis of buds and leaves using 454 pyrosequencing to discover genes associated with the biosynthesis of active ingredients in Lonicera japonica Thunb. PLOS ONE 8:e62922 DOI 10.1371/journal.pone.0062922. Helariutta Y, Kotilainen M, Elomaa P, Kalkkinen N, Bremer K, Teeri TH, Albert VA. 1996. Duplication and functional divergence in the chalcone synthase gene family of Asteraceae: evolution with substrate change and catalytic simplification. Proceedings of the National Academy of Sciences of the United States of America 93:9033–9038 DOI 10.1073/pnas.93.17.9033. Kumar MS, Babu PR, Rao KV, Reddy VD. 2014. Organization and classification of cytochrome P450 genes in castor (Ricinus communis L.). Proceedings of the National Academy of Sciences India Section B 84:131–143 DOI 10.1007/s40011-013-0192-8. Lepelley M, Cheminade G, Tremillon N, Simkin A, Ciaillet V, McCarthy J. 2007. Chlorogenic acid synthesis in coffee: an analysis of CGA content and real-time RTPCR expression of HCT, HQT, C3H1, and CCoAOMT1 genes during grain development in C. canephora. Plant Science 172:978–996 DOI 10.1016/j.plantsci.2007.02.004.

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

18/20

Lu HT, Jiang Y, Chen F. 2004. Application of preparative high-speed counter-current chromatography for separation of chlorogenic acid from Flos Lonicerae. Journal of Chromatography A 1026:185–190 DOI 10.1016/j.chroma.2003.11.004. Luo J, Butelli E, Hill L, Parr A, Niggeweg R, Bailey P, Weisshaar B, Martin C. 2008. AtMYB12 regulates caffeoyl quinic acid and flavonol synthesis in tomato: expression in fruit results in very high levels of both types of polyphenol. Plant Journal 56:316–326 DOI 10.1111/j.1365-313X.2008.03597.x. Morant M, Jørgensen K, Schaller H, Pinot F, Møller BL, Werck-Reichhart D, Bak S. 2007. CYP703 is an ancient cytochrome P450 in land plants catalyzing in-chain hydroxylation of lauric acid to provide building blocks for sporopollenin synthesis in pollen. Plant Cell 19:1473–1487 DOI 10.1105/tpc.106.045948. Nelson DR. 1999. Cytochrome P450 and the individuality of species. Archives of Biochemistry and Biophysics 369:1–10 DOI 10.1006/abbi.1999.1352. Nelson DR. 2009. The cytochrome P450 homepage. Human Genomics 4:59–65 DOI 10.1186/1479-7364-4-1-59. Nelson DR, Ming R, Alam M, Schuler MA. 2008. Comparison of cytochrome P450 genes from six plant genomes. Tropical Plant Biology 1:216–235 DOI 10.1007/s12042-008-9022-1. Niggeweg R, Michael AJ, Martin C. 2004. Engineering plants with increased levels of the antioxidant chlorogenic acid. Nature Biotechnology 22:746–754 DOI 10.1038/nbt966. Paquette SM, Jensen K, Bak S. 2009. A web-based resource for the Arabidopsis P450, cytochromes b5, NADPH-cytochrome P450 reductases, and family 1 glycosyltransferases (http://www.P450.kvl.dk). Phytochemistry 70:1940–1947 DOI 10.1016/j.phytochem.2009.08.024. Patzlaff A, McInnis S, Courtenay A, Surman C, Newman LJ, Smith C, Bevan MW, Mansfield S, Whetten RW, Sederoff RR, Campbell MM. 2003. Characterisation of a pine MYB that regulates lignification. Plant Journal 36:743–754 DOI 10.1046/j.1365-313X.2003.01916.x. Pu G, Wang P, Zhou B, Liu Z, Xiang F. 2013. Cloning and characterization of Lonicera japonica p-coumaroyl ester 3-hydroxylase which is involved in the biosynthesis of chlorogenic acid. Bioscience Biotechnology and Biochemistry 77:1403–1409 DOI 10.1271/bbb.130011. Qi X, Shuai Q, Chen H, Fan L, Zeng Q, He N. 2014. Cloning and expression analyses of the anthocyanin biosynthetic genes in mulberry plants. Molecular Genetics and Genomics 289:783–793 DOI 10.1007/s00438-014-0851-3. Schlotzhauer WS, Pair SD, Horvat RJ. 1996. Volatile constituents from the flowers of Japanese honeysuckle (Lonicera japonica). Journal of Agricultural and Food Chemistry 44:206–209 DOI 10.1021/jf950275b. Schoch G, Goepfert S, Morant M, Hehn A, Meyer D, Ullmann P, Werck-Reichhart D. 2001. CYP98A3 from Arabidopsis thaliana is a 30 -hydroxylase of phenolic esters, a missing link in the phenylpropanoid pathway. Journal of Biological Chemistry 276:36566–36574 DOI 10.1074/jbc.M104047200.

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

19/20

Schuler MA, Werck-Reichhart D. 2003. Functional genomic of P450s. Annual Review of Plant Biology 54:629–667 DOI 10.1146/annurev.arplant.54.031902.134840. Shang X, Pan H, Li M, Miao X, Ding H. 2011. Lonicera japonica Thunb.: ethnopharmacology, phytochemistry and pharmacology of an important traditional Chinese medicine. Journal of Ethnopharmacology 138:1–21 DOI 10.1016/j.jep.2011.08.016. Tamura K, Dudley J, Nei M, Kumar S. 2007. MEGA4: molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Molecular Biology and Evolution 24:1596–1599 DOI 10.1093/molbev/msm092. Winkel-Shirley B. 2001. Flavonoid biosynthesis. a colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiology 126:485–493 DOI 10.1104/pp.126.2.485. Xiang T, Xiong QB, Ketut AI, Tezuka Y, Nagaoka T, Wu LJ, Kadota S. 2001. Studies on the hepatocyte protective activity and the structure-activity relationships of quinic acid and caffeic acid derivatives from the flower buds of Lonicera bournei. Planta Medica 67:322–325 DOI 10.1055/s-2001-14337. Xu Z, Zhang D, Hu J, Zhou X, Ye X, Reichel KL, Stewart NR, Syrenne RD, Yang X, Gao P, Shi W, Doeppke C, Sykes RW, Burris JN, Bozell JJ, Cheng MZ, Hayes DG, Labbe N, Davis M, Stewart Jr CN, Yuan JS. 2009. Comparative genome analysis of lignin biosynthesis gene families across the plant kingdom. BMC Bioinformatics 10(Suppl 11):S3 DOI 10.1186/1471-2105-10-S11-S3. Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, Wang J, Li S, Li R, Bolund L, Wang J. 2006. WEGO: a web tool for plotting GO annotations. Nucleic Acids Research 34:W293–W297 DOI 10.1093/nar/gkl031. Yoo HJ, Kang HJ, Song YS, Park EH, Lim CJ. 2008. Anti-angiogenic, antinociceptive and anti-inflammatory activities of Lonicera japonica extract. Journal of Pharmacy & Pharmacology 60:779–786 DOI 10.1211/jpp.60.6.0014. Yuan Y, Song L, Li M, Liu G, Chu Y, Ma L, Zhou Y, Wang X, Gao W, Qin S, Yu J, Wang X, Huang L. 2012. Genetic variation and metabolic pathway intricacy govern the active compound content and quality of the Chinese medicinal plant Lonicera japonica thumb. BMC Genomics 13:195 DOI 10.1186/1471-2164-13-195. Yuan Y, Wang Z, Jiang C, Wang X, Huang L. 2014. Exploiting genes and functional diversity of chlorogenic acid and luteolin biosyntheses in Lonicera japonica and their substitutes. Gene 534:408–416 DOI 10.1016/j.gene.2012.09.051. Zhang H, Gao S, Lercher MJ, Hu S, Chen WH. 2012. EvolView, an online tool for visualizing, annotating and managing phylogenetic trees. Nucleic Acids Research 40:W569–W572 DOI 10.1093/nar/gks576. Zucker M, Levy CC. 1959. Some factors which affect the synthesis of chlorogenic acid in disks of potato tuber. Plant Physiology 34:108–112 DOI 10.1104/pp.34.2.108.

Qi et al. (2017), PeerJ, DOI 10.7717/peerj.3781

20/20

Identification and analysis of CYP450 genes from transcriptome of Lonicera japonica and expression analysis of chlorogenic acid biosynthesis related CYP450s.

Lonicera japonica is an important medicinal plant that has been widely used in traditional Chinese medicine for thousands of years. The pharmacologica...
2MB Sizes 1 Downloads 11 Views