Molecular Dynamics Simulations of the Temperature Induced Unfolding of Crambin Follow the Arrhenius Equation.

Andrew R. Dalby; Mohd Shahir Shamsir

doi:10.12688/f1000research.6831.1

Home Browse Molecular Dynamics Simulations of the Temperature Induced Unfolding...

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Molecular Dynamics Simulations of the Temperature Induced Unfolding of Crambin Follow the Arrhenius Equation.

[version 1; peer review: 1 approved, 2 approved with reservations]

Andrew R. Dalby¹, Mohd Shahir Shamsir²

PUBLISHED 20 Aug 2015

Author details Author details

¹ Faculty of Science and Technology, University of Westminster, London, W1W 6UW, UK
² Faculty of Biosciences and Medical Engineering, Universiti Teknologi Malaysia, Skudai, Johor, 81310, Malaysia

OPEN PEER REVIEW

REVIEWER STATUS

Abstract

Molecular dynamics simulations have been used extensively to model the folding and unfolding of proteins. The rates of folding and unfolding should follow the Arrhenius equation over a limited range of temperatures. This study shows that molecular dynamic simulations of the unfolding of crambin between 500K and 560K do follow the Arrhenius equation. They also show that while there is a large amount of variation between the simulations the average values for the rate show a very high degree of correlation.

Keywords

Molecular dynamics, Arrhenius, unfolding rate

Corresponding author: Andrew R. Dalby

Competing interests: No competing interests were disclosed.

Grant information: The author(s) declared that no grants were involved in supporting this work.

Copyright: © 2015 Dalby AR and Shamsir MS. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Dalby AR and Shamsir MS. Molecular Dynamics Simulations of the Temperature Induced Unfolding of Crambin Follow the Arrhenius Equation. [version 1; peer review: 1 approved, 2 approved with reservations]. F1000Research 2015, 4:589 (https://doi.org/10.12688/f1000research.6831.1) First published: 20 Aug 2015, 4:589 (https://doi.org/10.12688/f1000research.6831.1) Latest published: 20 Aug 2015, 4:589 (https://doi.org/10.12688/f1000research.6831.1)

Introduction

Molecular dynamics (MD) simulations have become an important tool in understanding chemical and biochemical processes at the molecular level. Through the use of Newtonian mechanics and an empirically derived force-field, simulations have been used to investigate the interactions of drug molecules and their targets as well as to predict the behaviour of proteins and peptides (Vasquez et al., 1994). Originally simulations were carried out in vacuo, but now with the increasing power of computers, simulations are usually carried out using periodic boundary conditions in water.

Increasingly realistic simulations lead to a better understanding of processes such as protein folding and unfolding at the molecular level (Lindorff-Larsen et al., 2012; Scheraga et al., 2007). The literature on molecular dynamics simulations for protein folding and unfolding is extensive. This can include very large simulations that can be simplified using coarse-graining, down to simulations of small proteins or peptides. One area where MD simulations have been particularly widely used is in the simulation of prion proteins and the protein mis-folding diseases associated with them (Shamsir & Dalby, 2005; Shamsir & Dalby, 2007; van der Kamp & Daggett, 2010; van der Kamp & Daggett, 2011).

Molecular dynamics simulations give us a detailed view of protein folding and unfolding pathways and their rates of unfolding (Daggett, 2002). Temperature is often used to accelerate protein unfolding and it has been shown that this does not affect the protein unfolding pathway (Day et al., 2002). It is often difficult to relate the simulated results to experimental data. A review of simulated folding times has shown that the times predicted by MD and the experimental rates for thermal unfolding are in good agreement (Snow et al., 2005). Atomic force microscopy data is another possibility and this has been used to investigate protein folding of T4 lysozyme (Peng & Li, 2008).

The rate of protein folding at increasing temperatures should be described by the Arrhenius equation over a limited range of temperatures (Alberty):

k = A e^{\frac{E_{a}}{R T}}

Where k is the rate of the reaction, A is a constant (pre-exponential factor) E_a is the energy of activation, T is the Temperature in Kelvin and R is the Universal Gas Constant.

A rearrangement of the Arrhenius equation taking natural logarithms gives the linear function:

\ln (k) = \frac{- E_{a}}{R} \frac{1}{T} + \ln (A)

A plot of the natural logarithm of the rate ln(k) against 1/Temperature will be a straight line if the simulations obey the Arrhenius equation.

The Arrhenius equation has been used in solid-state chemistry calculations but currently no studies have tested whether it is valid in MD simulations of protein folding (Huwe et al., 1999). This paper presents a MD simulation study using a small protein (crambin) to test whether the models do agree with the predicted linear behaviour.

Materials and methods

Crambin was chosen as the model protein for simulation of its size as it only has 46 amino acids (Caves et al., 1998). A high resolution crystal structure of crambin (3NIR.pdb) was downloaded from the RCSB Protein Databank (Rose et al., 2011; Schmidt et al., 2011). Molecular dynamics simulations were carried out using Gromacs 4.6.4 on an Ubuntu 12.04 machine with GPU acceleration (Hess et al., 2008).

The protein was solvated using periodic boundary conditions and a surrounding distance of 4nm around the protein. Simulations were run at 500K to 560K at 10K intervals using the OPLS forcefield. At temperatures of 570K and above the simulations fail to complete. The models were initially equilibrated using the canonical (nvt) and isothermal-isobaric (npt) ensembles. At 500K the simulations were run for 10ns with a time-step of 2fs. At 560K the simulations were run for 1ns with a time-step of 2fs. Secondary structure was calculated using DSSP (Dodge et al., 1998) (the data is available in dssp_files.zip). RMSD deviations from the original crystal structure were calculated in Gromacs and displayed in Grace (the data is available in rmsd_grace_datafiles.zip). All of the scripts for equilibration dynamics and analysis can be found in the accompanying data files (the Gromacs files are in gromacs_files.zip and the shell script to run all the simulations and analysis is gromacs_runs_complete_analysis.sh).

The statistical analysis of the rate data was carried out using SPSS version 22 (IBM_Corp., 2013). The line of best fit to the mean data was fitted using linear regression (the data files are available as md_arrhenius_crambin_averaged.sav, md_arrhenius_crambin_averaged.spv). The line of best fit for the complete dataset was calculated using the general linear model (the data files are available as md_arrhenius_crambin.sav, md_arrhenius_crambin.spv). All of the scripts and data for the statistical analysis are available in the supplementary materials.

Results

Over these time periods crambin did not unfold as much as had been expected. As it is such a small protein it seems to be particularly stable to rises in temperature. There were three possible end points that could be used in determining the rate:

1) The unfolding of the C-terminal final bend (Figure 1, the green region from residues 36-38).
2) The loss of the beta sheet (Figure 1, the two red regions residues 2-4 and 33-34).
3) The increase of the root mean squared deviation (RMSD) to 0.4nm from the crystal structure.

Figure 1. An Example DSSP Secondary Structure Plot (from 520K run 1).

It was not clear which of the measures would be the most reliable and so time was taken for all three. There were however missing values because the end points were not reached during the simulations. This was particularly apparent in the 540K simulations, which seem to be anomalous as can be seen in the boxplots for the reaction times from the simulations (Figure 2A–C). The boxplots show the expected downward trend, although there is considerable variation between the times taken for the repeated simulations. This variability declines at higher temperatures as rates become faster. A summary of the times to the three different end points are given in Table 1–Table 3.

Figure 2.

A: Boxplot of the times for the loss of the final bend from the secondary structure in picoseconds. Outliers are labelled. B: Boxplot of the times for the loss of the beta sheet in picoseconds. Outliers are labelled as circles or numbers if they are extreme. C: Boxplot of the times for the RMSD to go above 0.4nm from the crystal structure in picoseconds.

Table 1. Times until the loss of the bend from residue 36-38 in the protein in picoseconds.

Temperature	Mean (ps)	Standard Error (ps)	95% Confidence Interval (ps)
500	2340	480	1252 to 3427
510	2190	341	1418 to 2961
520	1470	219	974 to 1965
530	679	142	357 to 1001
540	420	61	157 to 683
550	415	42	319 to 510
560	328	71	163 to 492

Table 2. Times for loss of the beta sheet from the protein in picoseconds.

Temperature	Mean (ps)	Standard Error (ps)	95% Confidence Interval (ps)
500	1235	163	865 to 1604
510	1195	274	575 to 1815
520	517	151	175 to 859
530	400	111	149 to 651
540	577	122	278 to 876
550	186	25	129 to 243
560	135	25	78 to 192

Table 3. Time for the RMSD to go above 0.4nm from the initial crystal structure in picoseconds.

Temperature	Mean (ps)	Standard Error (ps)	95% Confidence Interval (ps)
500	1563	361	746 to 2380
510	1313	253	741 to 1885
520	859	147	528 to 1190
530	774	101	544 to 1003
540	553	76	341 to 764
550	489	77	314 to 664
560	391	64	242 to 540

The Arrhenius plot can be constructed using either the mean values for the different temperatures or all of the values for the repeats. In both cases a linear model is produced but the correlation is much stronger for the averaged values, which also give a narrower confidence interval for the line of best fit (Figure 3A–C). The wider confidence intervals for all of the data and the extent of the scatter can be seen in Figure 4A–C.

Figure 3.

A: The Arrhenius plot for the unfolding of the final bend using the averaged data. B: The Arrhenius plot for the unfolding of the beta sheet using the averaged data. C: The Arrhenius plot for the increase of the RMSD by 0.4nm from the crystal structure using the averaged data.

Figure 4.

A: The Arrhenius plot for the unfolding of the final bend using the complete data. B: The Arrhenius plot for the unfolding of the beta sheet using the complete data. C: The Arrhenius plot for the increase of the RMSD by 0.4nm from the crystal structure using the complete data.

The parameters for the lines of best fit using all of the data and only the mean values are given in Table 4 and Table 5.

Table 4. Lines of best fit for the Arrhenius equation using all of the data.

End Point		Value	Confidence interval	Coefficient of determination
Bend loss	Gradient	-9458	-11760 to -7245	52%
Bend loss	Intercept	32	27 to 36	52%
Beta sheet loss	Gradient	-10020	-13012 to -7027	41%
Beta sheet loss	Intercept	34	28 to 39	41%
RMSD > 0.4nm	Gradient	-5918	-7976 to -3860	35%
RMSD > 0.4nm	Intercept	25	22 to 29	35%

Table 5. Lines of best fit for the Arrhenius equation using the mean rates.

End Point		Value	Confidence interval	Coefficient of determination
Bend loss	Gradient	-10500	-13555 to -7444	94%
Bend loss	Intercept	34	28 to 40	94%
Beta sheet loss	Gradient	-10217	-14800 to -5635	87%
Beta sheet loss	Intercept	34	25 to 43	87%
RMSD > 0.4nm	Gradient	-6585	-7510 to -5660	98%

Discussion

All three of the end points used produce a linear model in the Arrhenius plot with a high degree of correlation. This suggests that in these cases the MD simulations are following Arrhenius behaviour and that these models can be used to predict rates of unfolding.

Of the three end points the RMSD variation is the easiest to calculate and least ambiguous, but it is also difficult to understand what this signifies at the protein level, when compared to using the disruption of secondary structure as a metric. The other advantage of using the unfolding of the secondary structure is that experimental values are available for unfolding of proteins and so if an appropriate end point can be found in the simulations then the gradients of the Arrhenius plot can be used to calculate the activation energy for unfolding.

This study also highlights the high degree of variability between the trajectories of different simulations. This variability is very high at lower temperatures. Simulations were repeated ten times and this resulted in values for the standard errors for the time of unfolding that could be up to 25% of the time. These standard errors are large and so the number of simulations that are run needs to be increased in order to reduce them. As the standard errors falls with the square root of the sample size this would mean a 4-fold increase in the number of simulations is needed to reduce the standard error by half. This would suggest that more reliable results could be obtained by carrying out 40 simulations at each of the temperatures, which is a considerable additional computational burden.

The large amount of variation also affected the quality of the lines of best fit to the Arrhenius equation. The coefficient of determination was much lower when considering all of the data but nonetheless there is clear evidence of the simulations following Arrhenius behaviour. The confidence intervals for the linear models using all the data and the average data are similar. The variation in the gradient is too large for making comparisons with experimentally derived energies of unfolding and this is another reason why a larger number of simulations will be needed in future studies.

There was a surprising degree of agreement in the slopes and intercepts of the bend loss and beta sheet loss end-points. This suggests that the energies involved in stabilising the beta sheet and final bend are similar. This consistency is encouraging and suggests that detailed energy predictions will be possible from MD simulations.

Crambin was not an ideal case for using in this study. Although it is very small and allows the simulations to be run in a shorter time, the protein does not unfold very much and so longer time-scales are needed within the simulations. Prion protein is another possible model that could be used to test unfolding as this has been shown to unfold over short simulation times (< 10ns) (Shamsir & Dalby, 2005). The other alternative is longer simulations times in order to produce clearer and less ambiguous end-points for the simulations.

Data and software availability

All of the code and data required for carrying out the simulations is available from http://dx.doi.org/10.5281/zenodo.20550. The shell script used to perform the simulations is available from http://dx.doi.org/10.5281/zenodo.20544. The dssp plots and the RMSD graphs for the simulations are available from http://dx.doi.org/10.5281/zenodo.20548. The SPSS data files and output files that include the details of how the analysis was carried out are available from http://dx.doi.org/10.5281/zenodo.20549. The software is released under a MIT license.

Author contributions

ARD conceived the study and designed the experiments. ARD and MSS carried out the simulations and the analysis. ARD and MSS were involved in all of the versions of the manuscript and have agreed to the final content.

Competing interests

No competing interests were disclosed.

Grant information

The author(s) declared that no grants were involved in supporting the work.

Faculty Opinions recommended

References

Alberty R: Physical Chemistry. John Wiley and Sons: New York. 1987. Reference Source
Caves LS, Evanseck JD, Karplus M: Locally accessible conformations of proteins: multiple molecular dynamics simulations of crambin. Protein Sci. 1998; 7(3): 649–666. PubMed Abstract | Publisher Full Text | Free Full Text
Daggett V: Molecular dynamics simulations of the protein unfolding/folding reaction. Acc Chem Res. 2002; 35(6): 422–429. PubMed Abstract | Publisher Full Text
Day R, Bennion BJ, Ham S, et al.: Increasing temperature accelerates protein unfolding without changing the pathway of unfolding. J Mol Biol. 2002; 322(1): 189–203. PubMed Abstract | Publisher Full Text
Dodge C, Schneider R, Sander C: The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res. 1998; 26(1): 313–315. PubMed Abstract | Publisher Full Text | Free Full Text
Hess B, Kutzner C, Van Der Spoel D, et al.: GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput. 2008; 4(3): 435–447. Publisher Full Text
Huwe A, Kremer F, Behrens P, et al.: Molecular Dynamics in Confining Space: From the Single Molecule to the Liquid State. Phys Rev Lett. 1999; 82(11): 2338. Publisher Full Text
IBM_Corp: IBM SPSS Statistics for Windows, Version 22.0. Armonk, NY: IBM Corp. 2013. Reference Source
Lindorff-Larsen K, Trbovic N, Maragakis P, et al.: Structure and dynamics of an unfolded protein examined by molecular dynamics simulation. J Am Chem Soc. 2012; 134(8): 3787–3791. PubMed Abstract | Publisher Full Text
Peng Q, Li H: Atomic force microscopy reveals parallel mechanical unfolding pathways of T4 lysozyme: evidence for a kinetic partitioning mechanism. Proc Natl Acad Sci U S A. 2008; 105(6): 1885–1890. PubMed Abstract | Publisher Full Text | Free Full Text
Rose PW, Beran B, Bi C, et al.: The RCSB Protein Data Bank: redesigned web site and web services. Nucleic Acids Res. 2011; 39(Database issue): D392–D401. PubMed Abstract | Publisher Full Text | Free Full Text
Scheraga HA, Khalili M, Liwo A: Protein-folding dynamics: overview of molecular simulation techniques. Annu Rev Phys Chem. 2007; 58: 57–83. PubMed Abstract | Publisher Full Text
Schmidt A, Teeter M, Weckert E, et al.: Crystal structure of small protein crambin at 0.48 Å resolution. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2011; 67(Pt 4): 424–428. PubMed Abstract | Publisher Full Text | Free Full Text
Shamsir MS, Dalby AR: One gene, two diseases and three conformations: molecular dynamics simulations of mutants of human prion protein at room temperature and elevated temperatures. Proteins. 2005; 59(2): 275–290. PubMed Abstract | Publisher Full Text
Shamsir MS, Dalby AR: Beta-Sheet containment by flanking prolines: Molecular dynamic simulations of the inhibition of beta-sheet elongation by proline residues in human prion protein. Biophys J. 2007; 92(6): 2080–2089. PubMed Abstract | Publisher Full Text | Free Full Text
Snow CD, Sorin EJ, Rhee YM, et al.: How well can simulation predict protein folding kinetics and thermodynamics? Annu Rev Biophys Biomol Struct. 2005; 34: 43–69. PubMed Abstract | Publisher Full Text
van der Kamp MW, Daggett V: Pathogenic mutations in the hydrophobic core of the human prion protein can promote structural instability and misfolding. J Mol Biol. 2010; 404(4): 732–748. PubMed Abstract | Publisher Full Text | Free Full Text
van der Kamp MW, Daggett V: Molecular dynamics as an approach to study prion protein misfolding and the effect of pathogenic mutations. Top Curr Chem. 2011; 305: 169–197. PubMed Abstract | Publisher Full Text
Vasquez M, Nemethy G, Scheraga HA: Conformational energy calculations on polypeptides and proteins. Chem Rev. 1994; 94(8): 2183–2239. Publisher Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 20 Aug 2015

Author details Author details

¹ Faculty of Science and Technology, University of Westminster, London, W1W 6UW, UK
² Faculty of Biosciences and Medical Engineering, Universiti Teknologi Malaysia, Skudai, Johor, 81310, Malaysia

Competing interests

No competing interests were disclosed.

Grant information

The author(s) declared that no grants were involved in supporting this work.

Article Versions (1)

version 1

Published: 20 Aug 2015, 4:589

https://doi.org/10.12688/f1000research.6831.1

© 2015 Dalby AR and Shamsir MS. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Dalby AR and Shamsir MS. Molecular Dynamics Simulations of the Temperature Induced Unfolding of Crambin Follow the Arrhenius Equation. [version 1; peer review: 1 approved, 2 approved with reservations]. F1000Research 2015, 4:589 (https://doi.org/10.12688/f1000research.6831.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 1

VERSION 1

PUBLISHED 20 Aug 2015

Views

Reviewer Report 28 Oct 2015

Adrian J. Mulholland, Centre for Computational Chemistry, University of Bristol, Bristol, UK

Approved with Reservations

https://doi.org/10.5256/f1000research.7345.r10967

CITE

Report a concern

Respond or Comment

Views

Reviewer Report 14 Oct 2015

Dmitry Nerukh, Institute of Systems Analytics, Aston University, Birmingham, UK

Approved with Reservations

https://doi.org/10.5256/f1000research.7345.r10438

I have two main critical points:

The conceptual one is that the loss of the beta sheet or the bend could not be, strictly speaking, classified as "unfolding". There is very little in the text that describes how specifically the authors

I have two main critical points:

The conceptual one is that the loss of the beta sheet or the bend could not be, strictly speaking, classified as "unfolding". There is very little in the text that describes how specifically the authors quantified the moment of this loss of structure. Looking at Fig. 1 it can be seen that both motifs come back for some periods of time. Is this a loss of structure or just equilibrium fluctuations? If the latter, then there are no observable unfolding event in the data.
As authors rightfully admit, the uncertainty of these loss of structure times is very high, especially at low temperatures. This is most likely because the length of simulations, 1-10ns, is too short (as again, the authors mention this in the text) and also because the algorithm of determining these times is not statistically robust. For proteins, even small ones like crambin, typical time scales for structural rearrangements is of the order of tens of nanoseconds. Even dialanin takes hundreds of nanoseconds to accumulate statistically sound number of its very simple conformational transitions.

Taking into account these points, the main conclusion of the authors based on data shown on Fig. 2-4 appears unsubstantiated to me.

Overall, the paper is interesting and worth indexing if the authors more rigorously calculate the "loss of structure" times. I would consider building a statistical network on the major conformational states, in the spirit of Markov States Model. Then, the average transition times would faithfully represent the "unfolding" events.

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Respond or Comment

Views

Reviewer Report 28 Sep 2015

Melchor Sanchez-Martinez, Mind the Byte, Barcelona, Spain

Approved

https://doi.org/10.5256/f1000research.7345.r10439

The research article entitled 'Molecular Dynamics Simulations of the Temperature Induced Unfolding of Crambin Follow the Arrhenius Equation' by Dalby and Shamshir shows how the Molecular Dynamics simulations of crambin unfolding do follow the Arrhenius equation. It is addressing an interesting point as that Molecular Dynamics simulations of protein folding is used to exhibiting an non-Arrhenius behaviour and protein unfolding follow an Arrhenius equation, it is widely accepted. Testing if it is true is a really nice and interesting work.

In the article, there is a comprehensive explanation of study design, methods and analysis. The conclusions are well explained and justified on the basis of the results, Furthermore, the authors have made the data fully available, including the scripts to reproduce their work, being able to be revised by the author. That point makes the work more trustable and robust.

As a consequence of that, the manuscript is recommended to be approved. However there are some minor changes and comments that the authors may consider to improving the study.

The authors stated that "The Arrhenius equation has been used in solid-state chemistry calculations but currently no studies have tested whether it is valid in MD simulations of protein folding". This phrase should be rewritten and clarified, because as it is written, it seems that this is the first time that the Arrhenius equation is used in protein folding simulations and that's not true because its usage is widely established in protein folding.

To gain statistical significance, the authors performed 10 replicas, which is a good number. However more replicas changing the force-field, for instance using Amberff99SB-ILDN, would be a good option to gain more statistics and at the same time ensure to avoid the force-field dependence of the results, as in folding/unfolding processes it is very relevant (see for instance Lindorff-Larsen et al., 2012). Simulations with one or two more force-fields, performing 10 replicas for each one, would be appreciated.

To explore if the protein is unfolded or not, three different metrics were employed. One of them is the increase of the root mean squared deviation (RMSD) to 0.4nm from the crystal structure. Why 0.4 and not another value? There are some references to that in the literature or it is just an observed trend in the simulations? If it is an observed trend, would change varying the force-field. In Shimada et al., 2001, they identified important structural regions of Crambin. Some of them are used by the authors as unfolding indicators, although the aminoacid numbers are not exactly the same. Thus, I wondering if helix 1 (residues 6-18,sequence: SIVARSNFNVCRL) could be also a good indicator of Crambin unfolding with the employed PDB structure, and if it is, if there is a reason to not be used.

As the authors stated longer simulations of 40ns or even longer should show a more realistic and picture of Crambin unfolding. It is true that it constitutes a considerable additional computational time, but will probably result in stronger and more sound results. Thus I encourage the authors to do it, at least, in the future.

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 01 Oct 2015

Andrew Dalby, Faculty of Science and Technology, University of Westminster, London, W1W 6UW, UK

01 Oct 2015

Author Response

I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly ... Continue reading I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly been on short segments of protein or peptides rather than a complete protein. In writing the introduction I was surprised by the lack of literature on using Arrhenius in a biological simulation context, except for enzyme simulations. I had searched for Arrhenius behaviour in molecular dynamics and as stated I only found references to its use in simulations of materials. It seems that the problem was I should have searched just for Arrhenius and in using behaviour it needs to be the US spelling, behavior to return the protein modelling literature! The work of Baker, Karplus and Kuriyan, and Pande are particularly important in establishing the field. The more recent work by Best and Mittal is another excellent example which incorporates the effects of forcefields.

I agree with the point about the forcefield but this adds a confounding variables and also increases the variance of the simulations, which increases the uncertainty in the gradient and has a negative impact on the confidence intervals for predicted energy barriers. An alternative is to use the technique to calculate relative differences in energy barriers between protein variants using the same forcefield. As you are calculating a difference the forcefield effects can be considered to cancel out. This was the idea behind free energy perturbation calculations.
I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly been on short segments of protein or peptides rather than a complete protein. In writing the introduction I was surprised by the lack of literature on using Arrhenius in a biological simulation context, except for enzyme simulations. I had searched for Arrhenius behaviour in molecular dynamics and as stated I only found references to its use in simulations of materials. It seems that the problem was I should have searched just for Arrhenius and in using behaviour it needs to be the US spelling, behavior to return the protein modelling literature! The work of Baker, Karplus and Kuriyan, and Pande are particularly important in establishing the field. The more recent work by Best and Mittal is another excellent example which incorporates the effects of forcefields.

I agree with the point about the forcefield but this adds a confounding variables and also increases the variance of the simulations, which increases the uncertainty in the gradient and has a negative impact on the confidence intervals for predicted energy barriers. An alternative is to use the technique to calculate relative differences in energy barriers between protein variants using the same forcefield. As you are calculating a difference the forcefield effects can be considered to cancel out. This was the idea behind free energy perturbation calculations.
Competing Interests: I am the author Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 01 Oct 2015

Andrew Dalby, Faculty of Science and Technology, University of Westminster, London, W1W 6UW, UK

01 Oct 2015

Author Response

I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly ... Continue reading I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly been on short segments of protein or peptides rather than a complete protein. In writing the introduction I was surprised by the lack of literature on using Arrhenius in a biological simulation context, except for enzyme simulations. I had searched for Arrhenius behaviour in molecular dynamics and as stated I only found references to its use in simulations of materials. It seems that the problem was I should have searched just for Arrhenius and in using behaviour it needs to be the US spelling, behavior to return the protein modelling literature! The work of Baker, Karplus and Kuriyan, and Pande are particularly important in establishing the field. The more recent work by Best and Mittal is another excellent example which incorporates the effects of forcefields.

I agree with the point about the forcefield but this adds a confounding variables and also increases the variance of the simulations, which increases the uncertainty in the gradient and has a negative impact on the confidence intervals for predicted energy barriers. An alternative is to use the technique to calculate relative differences in energy barriers between protein variants using the same forcefield. As you are calculating a difference the forcefield effects can be considered to cancel out. This was the idea behind free energy perturbation calculations.
I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly been on short segments of protein or peptides rather than a complete protein. In writing the introduction I was surprised by the lack of literature on using Arrhenius in a biological simulation context, except for enzyme simulations. I had searched for Arrhenius behaviour in molecular dynamics and as stated I only found references to its use in simulations of materials. It seems that the problem was I should have searched just for Arrhenius and in using behaviour it needs to be the US spelling, behavior to return the protein modelling literature! The work of Baker, Karplus and Kuriyan, and Pande are particularly important in establishing the field. The more recent work by Best and Mittal is another excellent example which incorporates the effects of forcefields.

I agree with the point about the forcefield but this adds a confounding variables and also increases the variance of the simulations, which increases the uncertainty in the gradient and has a negative impact on the confidence intervals for predicted energy barriers. An alternative is to use the technique to calculate relative differences in energy barriers between protein variants using the same forcefield. As you are calculating a difference the forcefield effects can be considered to cancel out. This was the idea behind free energy perturbation calculations.
Competing Interests: I am the author Close
Report a concern

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 20 Aug 2015

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2	3
Version 1 20 Aug 15	read	read	read

Melchor Sanchez-Martinez, Mind the Byte, Barcelona, Spain
Dmitry Nerukh, Aston University, Birmingham, UK
Adrian J. Mulholland, University of Bristol, Bristol, UK

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

26 Views

28 Oct 2015 | for Version 1

Adrian J. Mulholland, Centre for Computational Chemistry, University of Bristol, Bristol, UK

26 Views Cite this report Responses(0)

Approved With Reservations

I've looked at this paper and have some comments and concerns, some of which reflect the other reviewer's comments.

I believe that there have been previous demonstrations of Arrhenius behaviour in unfolding simulations, including for crambin (see e.g. Ferrara et al. J. Phys. Chem. B 2000, 104, 5000-5010) and other proteins (e.g. Piana et al. PNAS 109 17845–17850, doi: 10.1073/pnas.1201811109).

Consideration of protein stability is crucial here (see Scalley and Baker Proc Natl Acad Sci U S A. 1997 Sep 30; 94(20): 10636–10640) and as the current simulations are run at high temperatures where the protein is unstable, this is an issue. The dielectric behaviour of the water model at these high temperatures is also a concern for me.

Competing Interests

No competing interests were disclosed.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

36 Views

14 Oct 2015 | for Version 1

Dmitry Nerukh, Institute of Systems Analytics, Aston University, Birmingham, UK

36 Views Cite this report Responses(0)

Approved With Reservations

I have two main critical points:

The conceptual one is that the loss of the beta sheet or the bend could not be, strictly speaking, classified as "unfolding". There is very little in the text that describes how specifically the authors quantified the moment of this loss of structure. Looking at Fig. 1 it can be seen that both motifs come back for some periods of time. Is this a loss of structure or just equilibrium fluctuations? If the latter, then there are no observable unfolding event in the data.
As authors rightfully admit, the uncertainty of these loss of structure times is very high, especially at low temperatures. This is most likely because the length of simulations, 1-10ns, is too short (as again, the authors mention this in the text) and also because the algorithm of determining these times is not statistically robust. For proteins, even small ones like crambin, typical time scales for structural rearrangements is of the order of tens of nanoseconds. Even dialanin takes hundreds of nanoseconds to accumulate statistically sound number of its very simple conformational transitions.

Competing Interests

No competing interests were disclosed.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

33 Views

28 Sep 2015 | for Version 1

Melchor Sanchez-Martinez, Mind the Byte, Barcelona, Spain

33 Views Cite this report Responses(1)

Approved

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

01 Oct 2015

Andrew Dalby, Faculty of Science and Technology, University of Westminster, London, W1W 6UW, UK

I would like to thank the referee for his helpful comments.

We definitely need to correct that omission in referring to other work using Arrhenius on proteins, although it has mostly been on short segments of protein or peptides rather than a complete protein. In writing the introduction I was surprised by the lack of literature on using Arrhenius in a biological simulation context, except for enzyme simulations. I had searched for Arrhenius behaviour in molecular dynamics and as stated I only found references to its use in simulations of materials. It seems that the problem was I should have searched just for Arrhenius and in using behaviour it needs to be the US spelling, behavior to return the protein modelling literature! The work of Baker, Karplus and Kuriyan, and Pande are particularly important in establishing the field. The more recent work by Best and Mittal is another excellent example which incorporates the effects of forcefields.

I agree with the point about the forcefield but this adds a confounding variables and also increases the variance of the simulations, which increases the uncertainty in the gradient and has a negative impact on the confidence intervals for predicted energy barriers. An alternative is to use the technique to calculate relative differences in energy barriers between protein variants using the same forcefield. As you are calculating a difference the forcefield effects can be considered to cancel out. This was the idea behind free energy perturbation calculations.

View more View less

Competing Interests

I am the author

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] Alberty R: Physical Chemistry. John Wiley and Sons: New York. 1987. Reference Source

[2] Caves LS, Evanseck JD, Karplus M: Locally accessible conformations of proteins: multiple molecular dynamics simulations of crambin. Protein Sci. 1998; 7(3): 649–666. PubMed Abstract | Publisher Full Text | Free Full Text

[3] Daggett V: Molecular dynamics simulations of the protein unfolding/folding reaction. Acc Chem Res. 2002; 35(6): 422–429. PubMed Abstract | Publisher Full Text

[4] Day R, Bennion BJ, Ham S, et al.: Increasing temperature accelerates protein unfolding without changing the pathway of unfolding. J Mol Biol. 2002; 322(1): 189–203. PubMed Abstract | Publisher Full Text

[5] Dodge C, Schneider R, Sander C: The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res. 1998; 26(1): 313–315. PubMed Abstract | Publisher Full Text | Free Full Text

[6] Hess B, Kutzner C, Van Der Spoel D, et al.: GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput. 2008; 4(3): 435–447. Publisher Full Text

[7] Huwe A, Kremer F, Behrens P, et al.: Molecular Dynamics in Confining Space: From the Single Molecule to the Liquid State. Phys Rev Lett. 1999; 82(11): 2338. Publisher Full Text

[8] IBM_Corp: IBM SPSS Statistics for Windows, Version 22.0. Armonk, NY: IBM Corp. 2013. Reference Source

[9] Lindorff-Larsen K, Trbovic N, Maragakis P, et al.: Structure and dynamics of an unfolded protein examined by molecular dynamics simulation. J Am Chem Soc. 2012; 134(8): 3787–3791. PubMed Abstract | Publisher Full Text

[10] Peng Q, Li H: Atomic force microscopy reveals parallel mechanical unfolding pathways of T4 lysozyme: evidence for a kinetic partitioning mechanism. Proc Natl Acad Sci U S A. 2008; 105(6): 1885–1890. PubMed Abstract | Publisher Full Text | Free Full Text

[11] Rose PW, Beran B, Bi C, et al.: The RCSB Protein Data Bank: redesigned web site and web services. Nucleic Acids Res. 2011; 39(Database issue): D392–D401. PubMed Abstract | Publisher Full Text | Free Full Text

[12] Scheraga HA, Khalili M, Liwo A: Protein-folding dynamics: overview of molecular simulation techniques. Annu Rev Phys Chem. 2007; 58: 57–83. PubMed Abstract | Publisher Full Text

[13] Schmidt A, Teeter M, Weckert E, et al.: Crystal structure of small protein crambin at 0.48 Å resolution. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2011; 67(Pt 4): 424–428. PubMed Abstract | Publisher Full Text | Free Full Text

[14] Shamsir MS, Dalby AR: One gene, two diseases and three conformations: molecular dynamics simulations of mutants of human prion protein at room temperature and elevated temperatures. Proteins. 2005; 59(2): 275–290. PubMed Abstract | Publisher Full Text

[15] Shamsir MS, Dalby AR: Beta-Sheet containment by flanking prolines: Molecular dynamic simulations of the inhibition of beta-sheet elongation by proline residues in human prion protein. Biophys J. 2007; 92(6): 2080–2089. PubMed Abstract | Publisher Full Text | Free Full Text

[16] Snow CD, Sorin EJ, Rhee YM, et al.: How well can simulation predict protein folding kinetics and thermodynamics? Annu Rev Biophys Biomol Struct. 2005; 34: 43–69. PubMed Abstract | Publisher Full Text

[17] van der Kamp MW, Daggett V: Pathogenic mutations in the hydrophobic core of the human prion protein can promote structural instability and misfolding. J Mol Biol. 2010; 404(4): 732–748. PubMed Abstract | Publisher Full Text | Free Full Text

[18] van der Kamp MW, Daggett V: Molecular dynamics as an approach to study prion protein misfolding and the effect of pathogenic mutations. Top Curr Chem. 2011; 305: 169–197. PubMed Abstract | Publisher Full Text

[19] Vasquez M, Nemethy G, Scheraga HA: Conformational energy calculations on polypeptides and proteins. Chem Rev. 1994; 94(8): 2183–2239. Publisher Full Text

Molecular Dynamics Simulations of the Temperature Induced Unfolding of Crambin Follow the Arrhenius Equation.

Abstract

Keywords

Introduction

Materials and methods

Results

Figure 1. An Example DSSP Secondary Structure Plot (from 520K run 1).

Figure 2.

Table 1. Times until the loss of the bend from residue 36-38 in the protein in picoseconds.

Table 2. Times for loss of the beta sheet from the protein in picoseconds.

Table 3. Time for the RMSD to go above 0.4nm from the initial crystal structure in picoseconds.

Figure 3.

Figure 4.

Table 4. Lines of best fit for the Arrhenius equation using all of the data.

Table 5. Lines of best fit for the Arrhenius equation using the mean rates.

Discussion

Data and software availability

Author contributions

Competing interests

Grant information

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated