ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article
Revised

Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1

[version 3; peer review: 5 not approved]
Previously titled: Expression of modified enhanced green fluorescent polyarginine protein in Saccharomyces cerevisiae INVSc1
PUBLISHED 25 Nov 2024
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Cell & Molecular Biology gateway.

Abstract

Green Fluorescent Protein (EGFP) is widely used as a reporter gene, aiding in protein recovery and transduction studies. In this study, EGFP was tagged with eleven arginine residues (PolyR) and six histidine residues (His-tag) for purification. The aim was to enhance the synthesis of EGFP-PolyR in Saccharomyces cerevisiae and evaluate the effects of polyarginine modification on protein stability and expression levels. The expression of EGFP and EGFP-PolyR in S. cerevisiae was assessed through fluorescence measurements and protein levels. Structural analyses were conducted using in silico tools to investigate changes in beta strands and helices, which were validated through Western blots. Results showed that EGFP-PolyR maintained similar fluorescence levels to EGFP, but with notable structural changes. EGFP-PolyR's final beta strand terminates at Ala228, compared to Gly229 in EGFP, affecting the beta sheet's stability. Structural modifications also included altered helix lengths, with a longer helix 10 and shorter helix 9 in EGFP-PolyR. These alterations, along with shifts in helix-helix interactions, contribute to destabilization. Additionally, EGFP-PolyR exhibited unique gamma coils absent in EGFP, further differentiating its structure. The structural changes led to decreased protein expression and solubility, as indicated by Western blot analysis, with EGFP-PolyR showing significantly lower expression levels. The findings suggest that EGFP-PolyR is prone to aggregation and misfolding, characteristics often associated with aggregation-prone proteins.In conclusion, the polyarginine modification significantly impacts the structural integrity, stability, and solubility of EGFP. While fluorescence is retained, these changes hinder protein detectability and purification, highlighting the importance of considering structural alterations when modifying reporter proteins for experimental use.

Keywords

Saccharomyces cerevisiae INVSc1 protein expression, EGFP protein, protein transductions.

Revised Amendments from Version 2

In this version, There are several changes and additions to the paper :
1.                  We have added Dr. Syahputra Wibowo as an author because he conducted the bioinformatics analysis for this revision. 
2.                  There are several parts that we decided to change this paper, namely:
-          We have changed the title of this manuscript to reflect the additional data we have included.
-          Introduction part, we changed it according to the bioinformatics analysis results.
-          In the methodology section, we have incorporated protein structure analysis and structural analysis, thus completing the bioinformatic analysis.
-          In the results section, we have incorporated additional bioinformatic analysis images and comprehensive structural analysis tables for EGFP and EGFP-PolyR.
-          In the discussion section, the text has been modified and augmented in accordance with the incorporation of bioinformatic analysis data.
-          In conclusion, the conclusion was also modified in accordance with the alterations made to other sections of the manuscript.
-          In the extended data section, the figure previously published on figshare has been updated, and a new link has been included in this paper.
-          In the reference section, some references were deleted and new ones were added in accordance with the revisions made to this paper.

To read any peer review reports and author responses for this article, follow the "read" links in the Open Peer Review table.

Introduction

Enhanced green fluorescent protein (EGFP) has often been used as a reporter for protein transport, especially in eukaryotic cells such as mammalian and yeast cells. EGFP was derived from green fluorescent protein (GFP), a commonly used reporter protein that does not require a substrate for it to fluoresce. The EGFP gene is a mutated version of the GFP gene, with P64L and S65T mutations. These mutations cause a higher emission intensity in EGFP relative to that of the wild-type GFP, and better thermal stability of the EGFP protein, especially in eukaryotic cells.1 EGFP can be used as a marker in optimizing protein isolation procedures and their functions. The fluorescence makes it easier for us to observe the localization of the expression of the protein. The intensity of EGFP fluorescence can also indicate the level of expression of the protein in a system. EGFP can also be used to study protein transfer, such as analysis of the work of a protein transduction system for cell reprogramming purposes.

One of the uses of cell reprogramming is manufacturing induced pluripotent stem cells. Induced pluripotent stem cells are expected to be used as therapeutic cells for patients. Cell reprogramming can be done using protein transduction, viral transduction, or chemicals. Cell reprogramming by viral transduction can activate oncogenic genes, whereas chemical methods are highly unspecific and cannot be interpreted clearly.2,3 On the other hand, protein transduction is reputed as a harmless method to deliver protein precisely. However, protein transduction has its disadvantages, such as low transduction efficiency in host cells. Therefore, research is needed to increase transduction proteins by modifying the target protein.

Polyarginines are short polypeptides classified as non-amphipathic cell-penetrating peptides (NaCPP) and have cationic characteristics, including binding anionic lipid membranes; therefore, polyarginine can enter through the plasma membrane of eukaryotic cells. As a result, the NaCPP class is less toxic than the primary amphipathic and secondary cell-penetrating peptide (CPP). The NaCPP class uses endocytosis mechanisms with low concentrations for cellular uptake.4 The amount of added arginine amino acid residues affect the cellular uptake’s efficiency level, whereas more arginine residues have a better efficiency level. Adding more than 15 amino acid residues decreases intracellular delivery efficiency.5 Proteins fused to polyarginine are more efficiently transduced into human iPS cells compared to those without the addition of CPP.6

In this research, we designed and constructed two proteins, EGFP and EGFP-PolyR. EGFP-PolyR is an EGFP protein with the addition of eleven arginine residues (polyarginine – polyR) at the C terminal of EGFP to be used as a transduction protein model for cell reprogramming purposes. The gene encoding EGFP and EGFP-PolyR was synthetically constructed and expressed in Saccharomyces cerevisiae INVSc1. The study aimed to investigate the pattern of EGFP-PolyR protein expression in S. cerevisiae INVSc1 compared to EGFP. This paper aims to contribute to the current understanding of structural modifications when optimizing reporter proteins for experimental applications.

Methods

Plasmid design and construction

The gene encoding EGFP (GenBank Accession #U55762) and EGFP- PolyR (was synthetically made by IDT Custom Gene Synthesis cloned in a plasmid called pEGFP- PolyR, in such a way that it allowed sub-cloning of the gene with and without the polyR sequence, by using a different combination of restriction enzymes. In addition to the 11× arginine residues (polyR), codons encoding 6× histidine residues (6×His-tag) were also added downstream to the polyR codons. The synthetic gene was sub-cloned into pYES2CT, a Saccharomyces cerevisiae expression plasmid, during which the polyarginine codon either remained fused to the EGFP gene or was removed, creating recombinant pYES2CT pYES2CT-EGFP-PolyR or pYES2CT- EGFP, respectively. Since the pYES2CT plasmid has its own 6×His-tag downstream to its Multiple Cloning Sites (MCS), positioned in-frame with the EGFP and EGFP-PolyR codons, the expressed EGFP was expected to have one 6×His-tag, whereas the EGFP-PolyR were expected to have two.

The intended digested fragments sized 821 bp (EGFP-PolyR) and 733 bp (EGFP), were gel-purified using GenepHlow Gel/PCR kit (DFH004) (GeneAid, Taiwan) with 20 μL elution buffer. The DNA concentration was measured using a BioDrop-Duo UV/Vis spectrophotometer (Thomas Scientific, United States). Then, the DNA was stored at -20°C. Ligation was carried out following the T4 DNA ligase procedure as described by the manufacturer (Promega, United States). The ligation was performed at 4°C overnight, with a plasmid-insert volume ratio of 1:10.

Transformation of E. coli TOP10

The transformation into Escherichia coli TOP10 was carried out following the method of Abyadeh et al. (2017).7 Recombinant E. coli cells were cultured in LB with 100 μg/mL ampicillin. Transformation by heat shock was carried out using the Scilogex SCI-100HCM-Pro Digital Thermal Mixer (United States) at 42°C for 90 seconds. The transformants were selected on LB agar with100 mg/mL ampicillin (LB Agar+Amp). Colony PCR was performed on selected transformants with pYES2CT-specific primers (Forward: 5′-AATATACCTCTATACTTTAACGTC-3′; Reverse 5′-GAGGGCGTGAATGTAAGCGTGAC-3′) to verify the presence of intended inserts. PCR was done using Biometra Tone Thermal Cyclers (Analytik Jena AG, Germany) and carried out following the procedure from MyTaq Red Mix (Bioline, UK) manufacturer, for 35 cycles, with predenaturation (95°C) for 1 minute, denaturation for 15 seconds, annealing (55°C) for 15 seconds, elongation (72°C) for 10 seconds, and post elongation for 5 minutes.

Transformation of Saccharomyces cerevisiae INVSc1

pYES2CT-EGFP- PolyR and pYES2CT-EGFP plasmids were isolated using ‘WizardPlus SV minipreps DNA purification kit (A1330) (Promega, United States)’ after being propagated in E. coli TOP10. Isolated plasmids were used to transform Saccharomyces cerevisiae INVSc1 with the lithium acetate (LiAc) method.8 Transformants were selected on minimal medium (MM) agar without uracil. Plasmids were isolated from transformant colonies using the MP Biomedicals yeast plasmid isolation kit (MP Biomedicals, United States, Cat. 112069400). PCR was carried out to confirm the success of the transformation using Biometra Tone Thermal Cyclers (Analytik Jena AG, Germany) and carried out following the procedure from MyTaq Red Mix (Bioline, UK) manufacturer, for 35 cycles, with an annealing temperature of 55°C. Colony PCR was performed on selected transformants with pYES2CT-specific primers (Forward: 5′-AATATACCTCTATACTTTAACGTC-3′; Reverse 5′-GAGGGCGTGAATGTAAGCGTGAC-3′), to verify the presence of intended inserts. PCR was done using Biometra Tone Thermal Cyclers (Analytik Jena AG, Germany) and carried out following the procedure from MyTaq Red Mix (Bioline, UK) manufacturer, for 35 cycles, with predenaturation (95°C) for 1 minute, denaturation for 15 seconds, annealing (55°C) for 15 seconds, elongation (72°C) for 10 seconds, and post elongation for 5 minutes.

Expression of the modified protein in Saccharomyces cerevisiae

S. cerevisiae’s fast-growing diploid strain INVSc1 (Invitrogen Corporation) was used as a host cell for protein expression. The yeasts were cultured in yeast-extract peptone dextrose (YEPD) medium. Recombinant cells of S. cerevisiae INVSc1 were cultured in a minimal medium (MM) without uracil. Protein expression followed the user guide of the pYES2CT plasmid (Invitrogen, 2003), with several modifications. A single colony of recombinant S. cerevisiae INVSc1 carrying pYES2CT-EGFP or pYES2CT-EGFP- PolyR was inoculated into 3 mL of MM medium; the culture was then incubated at 30°C 230 rpm for 48 hours. After that, a 1-mL culture was taken to measure its OD600 value. Culture with OD600 = 0.4 were subcultured into 100 mL of induction medium (MM medium containing 2% galactose and 1% raffinose). The cultures were further incubated after adding the induction medium for 48 to 72 hours. The culture was centrifuged at 5000 rpm for 10 minutes to harvest and stored in a freezer at -80 °C until further use.

Fluorescent microscope visualization of EGFP and EGFP-PolyR Expression

A 1 mL 10× diluted culture was centrifuged at 5000 rpm for five minutes. The pellet was then fixed with 500 μL 3.7% formaldehyde for 30 minutes. The cell suspension was centrifuged at 5000 rpm for 5 minutes, and the supernatant was discarded. as the remaining pellet was resuspended in 1 mL PBS 1×. A 50-μL fixed-cell suspension was placed in a well of 96-well plates (Thermo Scientific) to visualize the EGFP fluorescence under an inverted fluorescent microscope (Zeiss Axiovert 40 CFL, UK), (40× magnification, λex 488 nm and λem 507 nm). Fluorescent intensity was measured using ImageJ software, and delta intensity was calculated as follows:

Intensityt=IntensitytIntensity0
where,

Intensityt is the fluorescent intensity at time t post-inoculation to induction medium

Intensity0 is the fluorescent intensity at time 0 post-inoculation to induction medium

Statistical analysis

All data were expressed as mean ± standard deviation. All statistical analyses were performed using SPSS software (IBM SPSS Statistics 24). The differences between groups were analyzed using one-way analysis of variance (ANOVA) after passed the homogeneity test and normality test, then proceeded to the Bonferroni post-hoc test. A p-value < 0.05 was considered as statistically significant.

Detection of expressed protein

Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) was carried out following the modified Iqbal and Ahmad (2016) method. The protein sample was mixed with 6× SDS gel loading buffer (600 mM Tris-Cl pH 6.8, 12% SDS, 30% glycerol (v/v), 0.03% bromophenol blue, and 20 mM DTT) and ran on 10 % SDS-PAGE gel at 100 V for 110 minutes until the bromophenol blue dye reached approximately 0.5 cm from the lower limit of the gel.9

Western blotting was also carried out following the method of Iqbal and Ahmad (2016), with several modifications. Protein was transferred from gel to membrane (transfer buffer: 300 mM glycine, 300 mM Tris, 0.05% SDS) using Pierce Power Blotter (Thermo Scientific, United States) set at 25 kV, 1.3 A for 25 minutes. Then, the membrane was washed six times with PBST (PBS, 1% Tween-20) before applying the primary antibody (Anti-6×His-Tag monoclonal antibody (Invitrogen, catalog number MA1-21315) in blocking buffer (5% skim milk (Merck, Germany) in PBST), with 1:3000 ratio). Then the membrane was incubated at 4°C overnight before applying the secondary antibody (Goat anti-mouse IgG HRP, Invitrogen, catalog number 31430 in blocking buffer, with a ratio of 1:20,000 ratio) for 60 minutes with shaking (70 rpm). Then, the substrate (SuperSignal West Femto Luminol, Thermo Scientific, United States) was used. The duration of the exposure was 20 minutes.

Protein structure modeling

Protein structures were predicted using AlphaFold3,10 a deep learning-based tool for accurate protein structure prediction. The input sequence for each protein was provided to AlphaFold3, which was run using the default settings. The predictions were performed on a high-performance computing cluster to ensure computational efficiency and accuracy. The output from AlphaFold2 included a predicted structure with confidence scores, which were used to assess the reliability of the model.

Structural analysis

Ramachandran Plot Analysis: The stereochemical quality of the AlphaFol3-predicted structures was assessed using the Ramachandran plot, which was generated using PROCHECK. PROCHECK is a widely used tool for evaluating the geometry of protein structures, specifically analyzing the distribution of phi (φ) and psi (ψ) dihedral angles. The structural models generated by AlphaFold2 were analyzed using PROMOTIF, a tool available through the PDBSum web server.11 PROMOTIF was employed to examine key structural features and motifs within the predicted protein structures. This included the identification of secondary structure elements, analysis of protein-protein and protein-ligand interfaces, and evaluation of the overall geometry of the model.

Results

Construction of Plasmid pYES2CT-EGFP and pYES2CT-EGFP-PolyR and Transformation into Saccharomyces cerevisiae INVSc1

After digests and ligation processes were successfully performed (Figure 1 and Figure 2A), the ligation products, hereby called pYES2CT-EGFP and pYES2CT-EGFP-PolyR, for plasmid carrying egfp and egfp-poly R respectively, were propagated in E. coli TOP10. Plasmids isolated from this E.coli TOP10 that had been confirmed to have the correct size ( Figure 2B) and carried the genes ( Figure 2C) were then used to transform S. cerevisiae INVSc1. Screening of S. cerevisiae INVSc1 transformants was performed by PCR of purified plasmids. PCR results of positive colonies are presented in Figures 2D and 3.

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure1.gif

Figure 1. The cloning process of genes encoding for EGFP and EGFP-PolyR into pYES2CT plasmid.

Insertion of EGFP-PolyR gene was performed using the enzymes BamH1 (position 519) and Not1 (position 577) (A), whereas EGFP gene was inserted into pYES2CT plasmid using BamHI and EcoRI, thus leaving the PolyR fragment out (B).

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure2.gif

Figure 2. pEGFP-Poly R (lane 3) and pYES2CT (lane 9) were cut with restriction enzymes NotI and BamHI, where pEGFP-Poly R produced egfp-polyR gene (822 bp band), and pYES2CT became linearized with size 5932 bp.

pEGFP-Poly R (lane 5) and pYES2CT (lane 7) were cut with restriction enzymes EcoRI and BamHI, where pEGFP-Poly R produced Egfp gene (733 bp band), and pYES2CT produced vector (5905 bp band) (A). The 822 and 733 bp DNA were isolated from gel, and ligated to the 5932 and 5905 bp bands, respectively. Single digest (BamHI) of plasmids isolated from E.coli TOP10 transformed with pYES2CT-EGFP (lane 3 and 4) and pYES2CT-EGFP-Poly R (lane 5 and 6), in comparison to wild type pYES2CT (lane 2), showed that the plasmid sizes were correct (B). Also, colony PCR of E.coli confirmed the presence of the intended genes as 1037bp and 1098bp bands, representing the Egfp and Egfp-polyR , respectively (C). Transformed yeast colony screening was performed by PCR of plasmids purified from the yeast, using pYES2CT-specific primers. As a control, pYES2CT plasmid was used (lane K). Lanes 1-15 showed PCR amplification results of plasmids isolated from S. cerevisiae culture, transformed with pYES2CT (lanes 1-5), pYES2CT-EGFP-Poly R (lanes 6-10), and pYES2CT-EGFP (lanes 11-15). Amplification of pYES2CT without insert (lane 1-5) shows the same size amplicons as a positive control (K), which are 334 bp, amplification of pYES2CT-EGFP-Poly R produced amplicons of 1097 bp (lane 6-10), while the pYES2CT-EGFP produced amplicons of 1036 bp (lane 11-15) (D). Lane M is GeneRuler 1kb DNA Ladder (Thermo Scientific SM0312).

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure3.gif

Figure 3. Positive E. coli TOP10 and S. cerevisiae INVSc1 transformants are carrying plasmids pYES2CT-EGFP (A3 and B3, respectively) and pYES2CT-EGFP-PolyR (A4 and B4, respectively).

Transformation positive controls are shown in A1 and B1, whereas the negative controls are in A2 and B2.

The role of EGFP and EGFP-PolyR addition on the growth curve of Saccharomyces cerevisiae INVSc1

The growth curve showed that post-induction S. cerevisiae INVSc1 stayed longer at the exponential phase when not carrying any plasmid in comparison to the same strain with pYES2CT plasmids ( Figure 4). The plasmid-carrying yeasts seemed to reach stationary phase 24 hours after inoculation to induction medium ( Figure 4B and C), whereas the wild-type culture stayed at exponential phase for up to 64 hours ( Figure 4A) before slowing down its growth. There is a significant decrease of the growth curve in S. cerevisiae INVSc1 carrying the plasmids EGFP and EGFP-PolyR, compared to the wildtype. The significant decrease was seen from 48 – 72 hours ( Figure 4D).

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure4.gif

Figure 4. Growth curve comparison of Saccharomyces cerevisiae INVSc1 without plasmid (A), carrying pYES2CT- EGFP (B), carrying pYES2CT-EGFP-PolyR plasmid (C), in the induction medium (containing 2% galactose and 1% raffinose). The growth comparison of Saccharomyces cerevisiae INVSc1 on 48 hours and 72 hours (D).

Optical densities were measured at λ=660nm, 48 hours, and 72 hours post-inoculation to induction medium of the wild-type and recombinant S. cerevisiae (D). The result reported are mean of four replicates while the error bar indicates its standard deviation. Mean data accompanied by different letter in each time point are significantly different (Bonferroni test, p < 0.05).

The fluorescence produced post-inoculation to induction medium by each recombinant yeast and wild-type yeast is presented in Figure 5A. The Δintensity was calculated and presented as a graph in Figure 5B. There was a significant difference between wildtype fluresens expression with recombinant yeast carrying pYES2CT EGFP and pYES2CT EGFP PolyR-plasmid on 48 hours and 72 hours. The addition of pYESC2 EGFP and pYES2CT EGFP PolyR plasmids significantly inhibited the proliferation of S. cerevisiae starting at 48 hours.

Western blotting using anti-his-tag mouse monoclonal antibody (primary antibody) and Goat anti-mouse IgG HRP (secondary antibody) proved that the EGFP-PolyR and EGFP proteins were expressed in S. cerevisiae INVSc1 at similar ( Figure 6).

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure5.gif

Figure 5. The fluorescence expressed by S. cerevisiae INVSc1 carrying no plasmid (wild-type), and carrying pYES2CT-EGFP and pEGFP-PolyR plasmids at 48, and 72 hours post-inoculation to induction medium (A), the calculated Δintensity of each yeast at 48- and 72-hour post inoculation to induction medium (B).

The result reported are mean of six replicates while the error bar indicates its standard deviation. Mean data accompanied by different letter in each time point are significantly different (Bonferroni test, p < 0.05).

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure6.gif

Figure 6. Western blotting of protein expressed at 48 hours post-inoculation to induction medium by S. cerevisiae INVSc1 carrying no plasmid (Lane 1 and 2), pYES2CT-EGFP (Lane 3 and 4), and pYES2CT-EGFP-PolyR (Lanes 5 and 6).

The Western blot was performed using an anti-his-tag antibody as the primary antibody.

Bioinformatic analysis

The beta strand structures of EGFP-Poly R and EGFP exhibit a critical difference in the length of the final beta strand. In EGFP-Poly R, the last beta strand ends at Ala228, whereas in EGFP, it extends one residue further to Gly229. This difference could impact the structural stability of the beta sheet, as the absence of the glycine residue in EGFP-Poly R might reduce the flexibility and stability of this structural element. In addition to differences in beta strands, there are notable variations in the helical structures of EGFP and EGFP-Poly R. EGFP features a total of 11 helices, including both G and H types, while EGFP-Poly R has only 10 helices, missing one compared to EGFP (Table 1 and Table 2). This reduction in helices could significantly affect the overall stability and integrity of the protein structure, as helices play a crucial role in maintaining protein stability. Notably, helix 10 in EGFP spans from Leu251 to Ser253, comprising 3 residues, whereas in EGFP-Poly R, it extends to 5 residues, from Pro265 to Leu269 (Figure 7 and Figure 8). This extension may reflect a structural adjustment due to the additional arginine residues in the Poly R sequence. Conversely, helix 9 in EGFP-Poly R, spanning from Pro245 to Trp248, shows decreased length and stability with a lower pitch and rise per residue compared to EGFP, suggesting potential instability. Both EGFP and EGFP-Poly R deviate from ideal helical parameters, but EGFP-Poly R exhibits slightly greater deviations in helix 9, indicating possible structural strain from the Poly R modification. The extension of helix 10 by two additional residues in EGFP-Poly R might be a direct consequence of this modification, altering the local structural environment. These differences in the beta strands and helices between EGFP and EGFP-Poly R likely contribute to the overall destabilization of the protein. The shortening of the final beta strand, combined with changes in helix lengths and stability, particularly the extension of helix 10 could lead to misfolding or reduced structural integrity of EGFP-Poly R (Figure 8). Additionally, the interactions between helices show some important differences. The helix-helix interactions between EGFP and EGFP-Poly R reveal subtle yet significant differences that could impact the stability of the protein. Both proteins share similarities in certain interactions, such as those between helices A1 and A5, A1 and A6, A2 and A3, and A5 and A6, with nearly identical distances and angles. However, differences arise in the interactions between helices A5 and A8, and A8 and A9. In EGFP, the A5-A8 interaction has a distance of 10.6 Å with an angle of -102.9°, while in EGFP-Poly R, this distance shortens to 10.0 Å with an angle of -100.5°. More notably, the A8-A9 interaction shows a distance of 2.6 Å with an angle of -79.3° in EGFP, but in EGFP-Poly R, the distance increases to 3.2 Å with a significant angle shift to -94.9°, and an increase in the number of interacting residues from 4 to 6. These differences suggest altered packing and potential destabilization of the protein structure in EGFP-Poly R. The changes in helix-helix interaction, particularly between A8 and A9, could contribute to a less stable conformation in EGFP-Poly R.

Table 1. Comprehensive structural of EGFP.

Structure typeStartEndSequence Turn Type/Type Length/No. Resid Class/ H-Bond/Topology/Interaction Type/Helices
Beta Hairpins
Val12Val23 12 Strands 2:2 IP
His26Asp37 12 Strands 4:4 I
Tyr93Phe101 9 Strands 3:5
Asn106Glu116 11 Strands 2:2 IIP
His149Asp156 8 Strands 4:4 I
Gly161Asn171 11 Strands 3:5
His200Ser209 10 Strands 10:10
Beta Bulges
Phe101Asp104Gly105ClassicAntiparallel H-Bond: Yes
Ile172Gly175Ser176G1Antiparallel H-Bond: Yes
Strands
Val12Val23 12 Residues
His26Asp37 12 Residues
Lys42Cys49 8 Residues
Tyr93Phe101 9 Residues
Asn106Glu116 11 Residues
Thr119Ile129 11 Residues
His149Asp156 8 Residues
Gly161Asn171 11 Residues
Val177Pro188 12 Residues
His200Ser209 10 Residues
His218Gly229 12 Residues
Helices
Lys4Phe9 HelixG6 Residues
Trp58Leu61 HelixG4 Residues
Val62Leu65 HelixG4 Residues
Gln70Phe72 HelixG3 Residues
Asp77His82 HelixG6 Residues
Phe84Ser87 HelixH4 Residues
Lys157Lys159 HelixH3 Residues
Met234Arg243 HelixH10 Residues
Pro245Trp248 HelixH4 Residues
Leu251Ser253 HelixG3 Residues
Pro265Gly268 HelixG4 Residues
Helix-Helix Interactions A1 A5 G Gn c1 interacting residue
A1 A6 G Hc 31 interacting residue
A2 A3 G Hc n4 interacting residues
A5 A6 G Hc n1 interacting residue
A8 A9 G HI2 interacting residues
Beta Turns
Asp22Gly25DVNGIV H-Bond: Yes
Val23His26VNGHI H-Bond: Yes
Asp37Tyr40DATYI H-Bond: Yes
Ala38Gly41ATYGI
Cys49Gly52CTTGI H-Bond: Yes
Leu65Gly68LTYGI
Thr66Val69TYGVI H-Bond: Yes
Tyr67Gln70YGVQIV
Ala88Glu91AMPEVIa1
Met89Gly92MPEGI
Phe101Asp104FKDDIV H-Bond: Yes
Phe115Asp118FEGDIV
Glu116Thr119EGDTII' H-Bond: Yes
Lys132Gly135KEDGI H-Bond: Yes
Gly135Leu138GNILIV
Asn136Gly139NILGI
Ile137His140ILGHI H-Bond: Yes
Ile172Gly175IEDGI H-Bond: Yes
Asp211Glu214DPNEI H-Bond: Yes
Gly229Leu232GITLIV H-Bond: Yes
Gamma Turns
His82Phe84HDFINVERSE
Glu143Asn145EYNINVERSE
Gly268Asp270GLDINVERSE
Thr272Thr274TRTINVERSE
Gly275His277GHHINVERSE
His276His278HHHINVERSE
His278His280HHHINVERSE

Table 2. Comprehensive Structural of EGFP-POLY R.

Structure typeStartEndSequenceType/Turn TypeLength/No. ResidClass/Interaction/H-Bond/Topology
Beta Sheet Mixed11 strandsTopology: -1 -1 5X -1-1 5-1 -1 3 1
Beta Hairpins
Val12Val23 Hairpin12 residues2:2 IP
His26Asp37 Hairpin12 residues4:4 I
Tyr93Phe101 Hairpin9 residues3:5
Asn106Glu116 Hairpin11 residues2:2 IIP
His149Asp156 Hairpin8 residues4:4 I
Gly161Asn171 Hairpin11 residues3:5
His200Ser209 Hairpin10 residues10:10
Beta Bulges
Phe101Asp104Gly105BulgeAntiparallel H-Bond: Yes
Ile172Gly175Ser176BulgeAntiparallel H-Bond: Yes
Strands
Val12Val23 Strand12 residues
His26Asp37 Strand12 residues
Lys42Cys49 Strand8 residues
Tyr93Phe101 Strand9 residues
Asn106Glu116 Strand11 residues
Thr119Ile129 Strand11 residues
His149Asp156 Strand8 residues
Gly161Asn171 Strand11 residues
Val177Pro188 Strand12 residues
His200Ser209 Strand10 residues
His218Ala228 Strand11 residues
Helices
Lys4Phe9 Helix6 residues
Trp58Leu61 Helix4 residues
Val62Leu65 Helix4 residues
Gln70Phe72 Helix3 residues
Asp77His82 Helix6 residues
Phe84Ser87 Helix4 residues
Lys157Lys159 Helix3 residues
Met234Arg243 Helix10 residues
Pro245Trp248 Helix4 residues
Pro265Leu269 Helix5 residues
Helix-Helix Interactions A1 A5 G Gn c1 interacting residue
A1 A6 G HI c3 interacting residues
A2 A3 G Hc n4 interacting residues
A5 A6 G Hc n1 interacting residue
A8 A9 G HI2 interacting residues
Beta Turns
Asp22Gly25DVNGTurnIV H-Bond: Yes
Val23His26VNGHTurnI H-Bond: Yes
Asp37Tyr40DATYTurnI H-Bond: Yes
Ala38Gly41ATYGTurnI
Cys49Gly52CTTGTurnI H-Bond: Yes
Leu65Gly68LTYGTurnIV H-Bond: Yes
Thr66Val69TYGVTurnI H-Bond: Yes
Tyr67Gln70YGVQTurnIV
Ala88Glu91AMPETurnVIa1
Met89Gly92MPEGTurnI
Phe101Asp104FKDDTurnII H-Bond: Yes
Phe115Asp118FEGDTurnIV
Glu116Thr119EGDTTurnII H-Bond: Yes
Lys132Gly135KEDGTurnI H-Bond: Yes
Gly135Leu138GNILTurnIV
Asn136Gly139NILGTurnI
Ile137His140ILGHTurnI H-Bond: Yes
Ile172Gly175IEDGTurnI H-Bond: Yes
Asp211Glu214DPNETurnI H-Bond: Yes
Gamma Turns
His82Phe84HDFTurnINVERSE
Glu143Asn145EYNTurnINVERSE
Phe257Gly259FEGTurnINVERSE
Leu269Ser271LDSTurnINVERSE
Ser272Arg275TRTTurnINVERSE
Thr277Arg278GRRTurnINVERSE
Arg276Gly277TGRTurnINVERSE
cebd1182-3eef-4d1e-832c-9b34455e01e6_figure7.gif

Figure 7. Bioinformatic analysis of EGFP (yellow = 6x HisTag).

cebd1182-3eef-4d1e-832c-9b34455e01e6_figure8.gif

Figure 8. Bioinformatic analysis of EGFP-Poly R (red = Poly R and yellow = 6x HisTag).

The comparison of beta turns between EGFP and EGFP-Poly R reveals several unique structural differences that highlight how modifications in the sequence can influence the protein's conformation. One notable difference is observed in the Leu65-Gly68 turn. EGFP exhibits a phi angle of -89.7° and a psi angle of -39.1°, while EGFP-Poly R shows a significantly more negative phi angle of -102.1° and a psi angle of -35.1°. This change in the phi angle suggests a more pronounced deviation in the turn's geometry in EGFP-Poly R compared to EGFP. Additionally, the chi1 angles are 51.1° for EGFP and 47.1° for EGFP-Poly R, reflecting an adjustment in the side-chain orientation due to the extended polyarginine sequence. Another distinct difference is found in the Phe115-Asp118 turn. EGFP has a phi angle of -125.8° and a psi angle of 100.9°, whereas EGFP-Poly R shows a less negative phi angle of -121.3° and a psi angle of 98.3°. The chi1 angles for EGFP are -65.5°, while for EGFP-Poly R, they are -66.0°. The shift in the phi and psi angles in EGFP-Poly R indicates a possible alteration in the turn’s local conformation, potentially affecting interactions with neighboring regions. The Asp211-Glu214 turn presents another unique aspect. EGFP has a phi angle of -63.7° and a psi angle of -16.3°, whereas EGFP-Poly R shows slightly different angles with a phi of -64.6° and a psi of -16.1°. This minimal difference in phi and psi angles, along with chi1 angles of 26.5° for EGFP and 26.8° for EGFP-Poly R, highlights subtle adjustments in the beta turn’s configuration. Another differences in the Leu65-Gly68 turn, in EGFP, this turn is of the I type, suggesting a more regular and stable loop structure at this position. In contrast, in EGFP-Poly R, the same region forms an IV type turn, indicating increased flexibility or a less well-defined structure, which could lead to differences in the local stability of the protein. The I type turn is a common secondary structure element in proteins where four consecutive residues create a turn in the polypeptide chain. In this type of turn, the first and fourth residues are hydrogen-bonded, creating a tight loop. The phi (Φ) and psi (Ψ) angles typically found in I type turns are around -60° for Φ and -30° to -60° for Ψ. These angles contribute to the formation of a stable turn, often found in beta-hairpins or at the end of beta strands. The IV type turn is a more flexible and less common turn compared to type I. It doesn’t conform to a specific set of phi and psi angles and is often seen as a more irregular or less tightly constrained turn. This flexibility means that IV type turns can vary more in their structure, leading to different folding patterns or conformations in the protein. Because of its variability, an IV type turn can introduce local flexibility or disorder in a protein structure, potentially affecting the overall stability of the region. Flexible or irregular turn like the IV type could make the protein more prone to misfolding or less resistant to environmental stressors.12

The last one is the difference of gamma turns between EGFP and EGFP-Poly R reveals significant structural differences that likely influence their stability and functionality. EGFP features seven gamma turns (Table 1), whereas EGFP-Poly R contains nine, indicating that the poly-arginine modification introduces additional structural elements (Table 2). Among the common gamma turns, the His82 to Phe84 turn shows minor differences. In EGFP, the phi angle is -85.6°, psi is 82.4°, chi1 is -168.5°, with a CA distance of 6.0 Å. For EGFP-Poly R, the phi angle is -86.3°, psi is 83.0°, chi1 is -169.3°, and the CA distance remains 6.0 Å. Similarly, the Glu143 to Asn145 turn in EGFP has a phi angle of -84.5°, psi of 65.2°, chi1 of -171.6°, with a CA distance of 5.6 Å. In EGFP-Poly R, the corresponding values are phi -83.9°, psi 65.6°, chi1-170.0°, and CA distance 5.6 Å. EGFP also has unique gamma turns not present in EGFP-Poly R. The Gly268 to Asp270 turn, with a phi angle of -84.6°, psi of 69.5°, chi1 of -70.8°, and a CA distance of 5.8 Å, is missing in EGFP-Poly R. Its absence may lead to reduced flexibility and altered local folding. Similarly, the Thr272 to Thr274 turn, with phi of -72.7°, psi of 62.5°, chi1 of -78.9°, and CA distance of 5.4 Å, is also not found in EGFP-Poly R. The Gly275 to His277 turn, with phi of -65.9°, psi of 95.2°, chi1 of -86.4°, and a CA distance of 5.8 Å, is absent in EGFP-Poly R. On the other hand, EGFP-Poly R includes several unique gamma turns not observed in EGFP. The Phe257 to Gly259 turn, with phi of -61.9°, psi of 76.0°, chi1 of -74.2°, and a CA distance of 5.0 Å, introduces new conformational possibilities. The Leu269 to Ser271 turn, with phi of -75.5°, psi of 58.9°, chi1 of -152.5°, and a CA distance of 5.5 Å. The Thr272 to Thr274 turn in EGFP-Poly R, with phi of -56.1°, psi of 83.6°, chi1 of -80.3°, and a CA distance of 5.7 Å, differs from its counterpart in EGFP. Then Gly275 to Arg277 turn, with phi of -82.1°, psi of 91.4°, chi1 of -94.9°, and a CA distance of 6.0 Å, reflects a structural modification that might influence the protein's overall folding.

Discussion

The addition of PolyR residues did not affect the expression of EGFP fluorescence in the Saccharomyces cerevisiae cell

In this study, PolyR was added to EGFP protein to function as a cell-penetrating peptide (CPP), which is commonly used as a carrier of protein into cells. Arginine-rich CPP has been used as a carrier for the intracellular delivery of various bioactive molecules such as proteins, peptides, and nucleic acids. The number of arginine residues used as CPP can affect the efficiency of cellular uptake and cytosolic release. It is well understood that high cellular uptake efficiency is achieved with 8-12 residues.13 Polyarginine as CPP also plays an essential role in forming hydrogen bonds with cell membrane protein groups.14

The expression of EGFP protein, as inferred from the fluorescence and Western blot data, indicated that the induction of the recombinant cultures was successful. Analysis performed at 48 and 72 hours post-inoculation to induction medium showed no increase in fluorescent intensity with time (Figure 5B). Thus, in the future, this could be used as a reference supporting that it would only take 48 hours of culturing for expression, hence being more time-efficient.

It is known that the addition of a CPP sequence can have a negative influence on recombinant protein yield. 15 It was reported that ten arginines (R10) residues were added to the superfolder GFP (sfGFP) N-terminal in pBAD plasmid containing arabinose-inducible sfGFP gene with a C-terminal His tag. No recombinant R10-sfGFP was expressed by E. coli TOP10. However, the unmodified sfGFP gene expressed over 30 mg/L protein under the same condition.16 It was also reported that although the fluorescent protein obtained after purification showed a single band on SDS-PAGE, mass spectrometry (MS) analysis revealed truncation of the R10, leaving only two arginine residues present at the C-terminus upon expression when the position of R10 and His-tag was swapped relative to the sfGFP gene (i.e. His tag at the N-terminus and R10 at the C-terminus). 17

In contrast to these previous findings, our result indicated that the addition of a PolyR group to the C-terminal of EGFP carrying C-terminal 6×His-tag did not seem to interfere with protein expression in S. cerevisiae INVSc1, as shown by the similar fluorescence (Figure 5B) and protein (Figure 6) levels expressed by yeasts carrying pYES2CT-EGFP band pYES2CT-EGFP-PolyR. In combination with the data from the proteins tertiary structural predictions and alignment, this expression result indicated that the conformational change caused by the addition of PolyR did not affect the protein expression.

It would be interesting to observe whether relocating the 6×His-tag or the PolyR to different terminals would improve the protein expression as well as provide ease of purification since it is feared that the PolyR can interfere with the binding of 6×His-tag to Ni-NTA resin to be used for purification or the anti-His-tag antibody used for detection of expression.

Modified protein plasmid insertion causes a decrease in the growth curve of S. cerevisiae

The lower growth curve of the transformant S. cerevisiae compared to the wild-type can be due to the growth inhibition factors of S. cerevisiae activated due to the addition of MM induction medium (2% galactose and 1% raffinose). Adding raffinose to the growth medium of S. cerevisiae can activate the programmed cell death formation of acetic acid. Guaragnella's study (2013) suggests that S. cerevisiae is resistant to programmed cell death via acetic acid because S. cerevisiae can compensate by activating retrograde mitochondria. Although programmed cell death is activated, they are not necessarily inhibited by growth.16 In addition, previous studies have shown that high-level expression of a EGFP protein localized to an intracellular compartment is expected to cause cellular defects because it overloads localization processes and growth defects.17,18 This causes a significant decrease in the growth curve of yeast containing plasmids EGFP and EGFP PolyR. Even so, the low growth curve of the transformant S. cerevisiae seems does not affect the expression of EGFP fluorescence proteins (Figures 5A and 5B and 2). Previous research has shown that PolyR expression in cells can cause protein translation stalls.19 Results in experiments showed that the addition of PolyR at the C terminal did not cause the EGFP protein translation stall, only a decrease of the growth curve significantly. This is possible because our plasmid vectors have been designed to express them in sufficient quantities within the cell.

Structural alterations of EGFP-Poly R lead to reduced Western blot detection

The reduced detection of EGFP-Poly R in Western blot assays, compared to native EGFP, stems from significant structural alterations caused by the poly-arginine modifications. EGFP-Poly R exhibits a truncated final beta strand and a reduced number of helices, leading to decreased stability and increased susceptibility to misfolding and aggregation. Additionally, changes in the lengths and angles of helices, as well as the introduction of new gamma turns, contribute to structural strain and potential instability.20 These modifications result in reduced solubility and altered epitope accessibility, making EGFP-Poly R more prone to proteolytic cleavage during sample preparation. Consequently, while EGFP-Poly R is expressed within cells, its aggregated and unstable forms are less detectable in Western blots, unlike the more stable and detectable native EGFP.21 Histag addition can cause protein instability. It would seem prudent to suggest that some cases of histag may result in increased aggregation, disruption to protein folding, proteolytic changes of cleavage, and a combination of these factors, which could be 8X more significant than 6X.2224

Conclusions

It is imperative to stress that poly-arginine causes significant changes with regard to the structure of EGFP and affects the stability and solubility of the protein. However, it is rather reassuring that the fluorescence of EGFP-PolyR is preserved at similar levels. However, the changes in the secondary structure elements, such as beta-strands, helices, and turns, as well as the increase in protein instability, should not be disregarded as they do influence the detectability and overall usefulness of the antibody in protein immobilization and isolation/analysis. Which proves the fact that structural alteration should be taken into account when optimizing the reporter proteins for the experimental purposes.

Comments on this article Comments (0)

Version 3
VERSION 3 PUBLISHED 03 Jan 2023
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Tandio Saputro SS, Wahyunita K, Nurhasanah A et al. Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.12688/f1000research.123181.3)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 3
VERSION 3
PUBLISHED 25 Nov 2024
Revised
Views
13
Cite
Reviewer Report 19 Feb 2025
Francine M Perrine-Walker, The University of Sydney, Sydney, New South Wales, Australia 
Not Approved
VIEWS 13
The manuscript by Tandio Saputro and colleagues titled ‘Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1’ investigated how the reporter, Green Fluorescent Protein (EGFP) could be structurally modified by tagging it with ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Perrine-Walker FM. Reviewer Report For: Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.5256/f1000research.172954.r360640)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
10
Cite
Reviewer Report 14 Dec 2024
Stéphane Guyot, UMR Procédés Alimentaires et Microbiologiques, L'Institut Agro Dijon, Dijon, Bourgogne-Franche-Comté, France 
Not Approved
VIEWS 10
The new comments are presented in bold and are based on the previous report and the new version of the paper. In light of the new version of the paper with tracked changes and no direct response to the comments ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Guyot S. Reviewer Report For: Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.5256/f1000research.172954.r343340)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 2
VERSION 2
PUBLISHED 27 Mar 2024
Revised
Views
14
Cite
Reviewer Report 24 Jul 2024
Yide Huang, Fujian Normal University, Fuzhou, Fujian, China 
Not Approved
VIEWS 14
The authors attempted to express EGFP and EGFP-polyR proteins with a 6×His tag in Saccharomyces cerevisiae for subsequent protein transduction studies. They synthesized EGFP and EGFP-polyR sequences, cloned them into the S. cerevisiae expression vector pYES2CT, and introduced the ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Huang Y. Reviewer Report For: Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.5256/f1000research.163692.r300366)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
15
Cite
Reviewer Report 23 Jul 2024
Stéphane Guyot, UMR Procédés Alimentaires et Microbiologiques, L'Institut Agro Dijon, Dijon, Bourgogne-Franche-Comté, France 
Not Approved
VIEWS 15
The main aim of this paper is to propose a method for expressing a EGFP carrying C-terminal PolyR plus 6xHis-tag in the diploid Saccharomyces cerevisiae INVSc1 strain. Such an engineering approach is proposed to study and improve the transduction process ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Guyot S. Reviewer Report For: Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.5256/f1000research.163692.r300364)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 1
VERSION 1
PUBLISHED 03 Jan 2023
Views
26
Cite
Reviewer Report 06 Nov 2023
Hisao Moriya, Okayama University, Okayama, Japan 
Not Approved
VIEWS 26
In this paper, the authors attempted to express EGFP-polyR in S. cerevisiae. They concluded that EGFP was successfully expressed and that the addition of polyR had no effect on the expression level of EGFP.

However, the expression ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Moriya H. Reviewer Report For: Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.5256/f1000research.135264.r177225)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
52
Cite
Reviewer Report 03 Nov 2023
Khomaini Hasan, Faculty of Medicine, Universitas Jenderal Achmad Yani, Cimahi, Indonesia 
Not Approved
VIEWS 52
Pramono and Co-workers study on expression of recombinant Enhanced Green Fluorescent Protein (EGFP) with some modification with the addition of poly-R in order to help protein penetration within cells. In general, the work can be significantly improved, especially in terms ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Hasan K. Reviewer Report For: Structural and Functional Implications of Polyarginine Addition to Green Fluorescent Protein Expression in Saccharomyces cerevisiae INVSc1 [version 3; peer review: 5 not approved]. F1000Research 2024, 12:1 (https://doi.org/10.5256/f1000research.135264.r159002)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 3
VERSION 3 PUBLISHED 03 Jan 2023
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.