Team:UNSW Australia/Model/mathematical

$$\require{mhchem}$$

Why mathematical modelling

Mathematical modelling is a powerful tool for verifying and evaluating synthetic biology solutions. Since the ultimate target chassis Symbiodinium is hard to engineer, in silico verification comes in as an alternative to the common in vivo strategy. In this mathematical model, we will consider the effect of the introduction of HSP22E, HSP22F and Glutathione into the system. We believe this model was able to facilitate our understanding of how HSP22E, HSP22F and glutathione can alleviate the cell condition quantitatively. However, as with all modelling, care must be taken to select values and parameters that reflect the real world.

Aim

The aim was to produce a model that shows the effectiveness of the solution and provides insight into how varying the quantities of HSP22E, HSP22F and Glutathione would affect the cell's response under various temperatures. We further aimed to determine the optimal conditions for a cell to trigger the thermal protective response that we designed for the cell.

Implementation

Contrary to the fact that many papers have studied the structure of the heat shock protein families, studies of mathematical model on the heat shock activity remain sparse. However, we were able to find a paper by Carole J. Proctor (1) which models the activity of chaperones on misfolded proteins that are caused by the high level of ROS.

Computational Language and tool

From the original paper, the model was written base on the SBML(System Biology Markup Language), which is a language developed specifically for system biology base on XML(4). However, due to the unfamiliarity of the language, we decided to switch another software package PySB developed by members of the Lopez Lab at Vanderbilt University and the Sorger Lab at Harvard Medical School(3). We found this python package very user-friendly while at the same time provides powerful solvers for the system both deterministic and stochastic. One thing to note is that the PySB model is written in a domain-specific language that somewhat abuses regular python style.

Base of the Model

  • The baseline of the model was inspired by a model which modelled the activity of charperone in aging(1), where we adopted most of their models with a few modifications added.
  • We omitted the dimerization of the protein in our process since it does not contribute much to the effectiveness of the holding activity in the model while at the same time increasing the complexity of the model.
  • HSP22E/F will only perform their function while in the mitochondria/chloroplast(this is a simplification of the model)

Note: in the section below HSP22E/F will be referred to as sHSP for simplicity purpose.

Assumptions and reaction of the model

Below is the network diagram of the model which was drawn using iPad since other graphic design applications are not convenient in producing a schematic diagram like this. Figure 1 is a diagram illustrating the reaction happening between the major proteins involved in the folding and unfolding process. Figure 2 shows the regulation of Hsp90, which is the native regulation in many eukaryotic cells. Moreover, it also includes the regulation of sHSP and Glutathione Synthetase, which are controlled by OxyR transcription factor.

Figure 1: Protein turnover
Figure 2: Autoregulation of Hsp90 and OxyR controlled expression

The details of the reaction and parameter values in the diagram above are specified in the following table.


Reaction Name Reaction Parameter Default value Assumption
Protein Synthesis \(\ce{\varnothing ->[k_{1}] NatP}\) \(k_{1}\) \(10.0\) Half-life of 6–7days(1)
Misfolding \(\ce{NatP + ROS ->[k_{2}] MisP + ROS}\) \(k_{2}\) \(0.00002\) Ratio of native:misfolded proteins is 19:1 under normal conditions(1)
Binding and dissociation of misfoldedprotein with Hsp90 \(\ce{MisP + {Hsp{90}} <=>[k_{3}][k_{4}] MCom}\) \(k_{3}, k_{4}\) \(50.0\) The binding affinity of misfolded protein to Hsp90 isless than that of HSF1. The rate of unsuccessful refolding is low compared to refolding under normal conditions(1)
Protein Refolding \(\ce{MisP + ATP ->[k_{5}] NatP + Hsp{90}+ADP}\) \(k_{5}\) \(4.0\times 10^{-6}\) Rapid reaction when bound to Hsp90 if ATP levels are high.(1)
Protein degradations \(\ce{Proteins + ATP ->[k_{6}] ADP}\) \(k_{6}\) \(6.0\times 10^{-7}\) Half-life of 6–7days(1)
Protein aggregation \(\ce{2MisP ->[k_{7}] AggP}\) \(k_{7}\) \(1.0\times 10^{-7}\) This is a slow reaction unless high levels of misfolded protein(1)
Binding of HSF1 and HSP90 and dissociation \(\ce{Hsp{90} +HSF{1} <=>[k_{8}][k_{9}] HCom}\) \(k_{8}, k_{9}\) \(500, 1.0\) The affinity of HSF1for Hsp90 is 10 times stronger than that of misfolded proteins. Under normal conditions most of HSF1 is complexed to Hsp90(1)
Dimerisation of HSF1 and dissociation \(\ce{2HSF{1} <=>[k_{10}][k_{13}] DiH}\) \(k_{10}, k_{13}\) \(0.01, 0.5\) This reaction is rapid only when levels of unbound HSF1 are high. \(k_{13}\) represents a slow reaction(1)
Trimerisation of HSF1 and dissociation \(\ce{DiH + HSF{1} <=>[k_{11}][k_{12}] TriH}\) \(k_{11}, k_{12}\) \(100 , 0.5\) This is a fast reaction once dimers are formed. This is a slow reaction(1)
Binding of HSE andHSF1-trimers and dissociation \(\ce{TriH + HSE <=>[k_{14}][k_{15}] HSETriH}\) \(k_{14}, k_{15}\) \(0.05\) This reaction only proceeds when trimers are available. If all HSF1 forms trimers, the ratio of the forward to reverse reaction is about 1000:1(1)
Hsp90 transcription & translation \(\ce{HSETriH ->[k_{16}] HSETriH + Hsp{90}}\) \(k_{16}\) \(1000.0\) This is fast when HSE is bound(1)
Hsp90 degradation \(\ce{Hsp{90} + ATP ->[k_{17}] ADP}\) \(k_{17}\) \(8.02\times 10^{-9}\) Half-life of 1 day(1)
ATP formation and expenditure \(\ce{ADP <=>[k_{18}][k_{19}] ADP}\) \(k_{18}, k_{19}\) \(12.0, 0.02\) Assume that ratio of ATP:ADP is 10:1 under normal conditions(1)
ROS production and base removal \(\ce{\varnothing <=>[k_{20}][k_{21}] ROS}\) \(k_{20}, k_{21}\) \(1, 0.001\) Assume constant production level of ROS
ROS reduction \(\ce{ROS + Reduced Glutathione ->[k_{22}] Oxidised Glutathione}\) \(k_{22}\) \(20.0\) Assuming this is a fast reaction
Glutathione reduction \(\ce{Oxidised Glutathione ->[k_{34}] Reduced Glutathione}\) \(k_{34}\) \(0.1\) Assuming this is related to the Glutathione concentration
sHSP binding \(\ce{MitosHsp + MisP <=>[k_{23}][k_{36}] MisPsHsp}\) \(k_{23}, k_{36}\) \(50.0, 0.2\) the rate of unsuccessful binding is low compared to binding
sHSP fail to hold \(\ce{MitosHsp + MisP <=>[k_{23}][k_{36}] MisPsHsp}\) \(k_{24}\) \(1.0\) sHSP can be found in large protein aggregates(2)
HSP90 binding on MisPsHSP \(\ce{Hsp{90} + MisPsHsp <=>[k_{25}][k_{26}] HspMisPsHsp}\) \(k_{25}, k_{26}\) \(50.0, 5.0\) the rate of unsuccessful binding is low compared to binding
Refolding with sHSP \(\ce{HspMisPsHsp + ATP ->[k_{27}] Hsp{90} + MisP + sHsp + ADP}\) \(k_{27}\) \(10.0\) relatively slow since it needs energy(Double check this assumption)
Activation of ROS by OxyR \(\ce{inactivate OxyR + ROS ->[k_{28}] active OxyR + ROS}\) \(k_{28}\) \(20.0\) Relatively fast process since it need to respond relatively fast
Active OxyR binding DNA \(\ce{activate OxyR + sHspGlu <=>[k_{32}][k_{33}] OxyRsHspGlu}\) \(k_{32}, k_{33}\) \(20.0, 5.0\) the rate of unsuccessful binding is low compared to binding(Think about how to represent that part of the DNA)
sHSP synthesis \(\ce{OxyRsHspGlu ->[k_{29}] NonMitosHsp + active OxyR + sHspGlu}\) \(k_{29}\) \(10.0\) This is a fast reaction when OxyR binds to it
sHSP transfer \(\ce{MitosHsp ->[k_{35}] NonMitosHsp}\) \(k_{35}\) \(10.0\) This is a fast reaction
Glutathione Synthetase production Synthesis \(\ce{OxyRsHspGlu ->[k_{30}] active OxyR + sHspGlu}\) \(k_{30}\) \(10.0\) This is a fast reaction when OxyR binds to it
Glutathione Production \(\ce{OxyRsHspGlu ->[k_{31}] active OxyR + sHspGlu}\) \(k_{31}\) \(5.0\) This is a relatively fast reaction
Species initial value (number of molecules)
Native protein 6000000
Hsp90–HSF1 complex 5900
Hsp90 300000
HSF1 100
ROS 100
ATP 10000
ADP 1000
HSE 1
sHSP 200
Glutathione 100
sHSPGlu 1
OxyR 10

Result and Analysis

Since we cannot formalize a equation where the temperature relates to the rate constant also it is not applicable to use the Arrhenius equation which we need to assume the pre-exponential factor \(A\) and activation energy \(E_{a}\) of the reaction, hence we decided to do the simulation at different temperature in a qualitative manner. $$\textit{k} = \textit{A}e^{\frac{-E_{a}}{RT}}\textit{}$$ Given the equation, if we assume the activation energy is \(100kJmol^{-1}\), \(k_{293}, k_{303}\) is the rate constant at \(293, 303\) kelvin respectively, we get, \begin{align} \frac{k_{303}}{k_{293}} & = \frac{\textit{A}e^{\frac{-E_{a}}{RT_{303}}}}{\textit{A}e^{\frac{-E_{a}}{RT_{293}}}} \\ & = \frac{e^{\frac{100000}{8.314\times 303}}}{e^{\frac{100000}{8.314\times 293}}}\\ & = 3.874. \end{align} Hence, with the Arrhenius equation as guidance we decided to have a rate constant relationship as follow, \begin{align} 1 \leq \frac{k_{elevatedT}}{k_{normalT}} \leq 10. \end{align} Moreover, we realized the model would deviate from the expected output if we assume it strictly follows the Arrhenius equation. Unlike chemical reaction, biochemical reaction is extensively controlled by the enzyme, given temperature increment in the environment which would result in different levels of protein conformational change hence disobeying the Arrhenius equation.

Therefore, we abstract the temperature change to the alternation of a few parameters which relates significantly to temperature changes. (\(k_{1}\)) goes down if temperature goes up, (\(k_{2}, k_{6}, k_{20}, k_{29}, k_{30}, \)) go up if temperature rises.

Figure 3: graph output under normal temperature with the baseline model
Figure 4: graph output under higher temperature with the baseline model

This group of graph is the comparision of the baseline condition under different temperature. It is evident that under higher temperature the level of natural proteins goes down quite swiftly in 100 unit time. This can be seen as a baseline of the cellular responce to temperature elevation.

After comparing the baseline model at different temperatures, we want to see how the model with sHSP and Glutathione behave which we will be referring to as the sHSP with Glutathione model afterwards.

Figure 5: graph output under nomral temperature with the sHSP and Glutathione model
Figure 6: graph output under high temperature with the sHSP and Glutathione model

As you can see by comparing the baseline and add on model at higher temp, we can see that the Natural Protein in Figure 4 is delepting at a significantly lower rate than the Natural Protein in Figure 5. This comparison showed a promising result, however, it is still unclear whether sHSP or Glutathione contribute more to the alleviation of the heat stress. Therefore, graphs with only sHSP or Glutathione were plotted below.

Figure 7: graph output under high temperature with only sHSP added
Figure 8: graph output under high temperature with only Glutathione added

Comparing Figure 7 and Figure 8, we can notice that the Natural Proteins of the model with only sHSP decreased to 0, while in Figure 8 the Natural Protein is still maintained at a relatively healthy level. Hence, we can conclude that Glutathione is the main helper as expected since its main function is to reduce the ROS level inside the cell which acts as the main cause of protein misfolding.

After knowing that Glutathione is the main helper, we would also like to evaluate if frontloading the amount of Glutathione before the actual ROS level surge would be helpful for the cell to increase its survivability.

Figure 9: graph output under high temperature with Glutathione frontload
Figure 10: graph output under high temperature without Glutathione frontload

The graph above suggested that 10 times initial frontloading of the Glutathione is not improving the state of cell further. This is largely due to the setting of the model, where a small amount of Glutathione is sufficient to oppress the ROS level inside the cell.

Discussion

Overall, our mathematical modelling shed some insights into how our PROTECC coral solution is effective in maintaining the functionality of the cell. From the analysis of the model, we know that this is a feasible solution to combat cellular heat stress for Symbiodinium cells. Moreover, we figured out that Glutathione is the primary force in lifting the cell from extreme thermality. However, on a more negative note that most of the value of the parameter is chosen at our best estimation since reliable data were hard to obtain. Thus the model is only a rough estimation of the in vitro environment. Furthermore, we did not reach our original goal of evaluating the optimal condition to activate the cellular response due to the limit of time and also the difficulties in adding the quantitative measure onto the model. Additionally, we would like to perform sensitivity analysis in the future to evaluate further the robustness of our model and how it responds to various parameter tweaking.

Limitations and difficulties

We experienced some technical difficulties in incorporating the temperature feature in the model where we used Expression in the PySB package where it seems to have some bug. Thanks to Rodrigo Santibáñez an active member in the PySB community helped us with the debugging and further pointed us to the Kappa package. Due to time constraint we choose to not use the Expression feature in PySB, instead we try to control the parameter values outside the model to represent the model behaviour at different temperatures.

Evaluation and advice to future teams

Initially we attempted to use the Simbiology package of Matlab but found it difficult to collaborate on due to binary file format. Therefore, we reimplement our baseline model using the PySB package where we find it easier to collaborate. PySB has a fantastic community(PySB Gitter) for support where you can have your questions resolved.

Initially we began by using the MATLAB SimBiology package. This proved to be obtuse to operate and difficult to collaborate on, prompting us to move to PySB (Python Systems Biology). PySB provides a good overview of the type of functionality and methods present in mathematical modelling tools, allowing us to quickly replicate the work done previously in SimBiology. PySB proved to be a great learning experience due to the open code and community nature of the project.

Code

Link to code

REFERENCES

  1. Proctor CJ, Sőti C, Boys RJ, Gillespie CS, Shanley DP, Wilkinson DJ, et al. Modelling the actions of chaperones and their role in ageing. Mechanisms of Ageing and Development. 2005;126(1):119–31. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0047637404002337
  2. Rütgers M, Muranaka LS, Mühlhaus T, Sommer F, Thoms S, Schurig J, et al. Substrates of the chloroplast small heat shock proteins 22E/F point to thermolability as a regulative switch for heat acclimation in Chlamydomonas reinhardtii. Plant Mol Biol. 2017 95(6):579–91. Available from: http://link.springer.com/10.1007/s11103-017-0672-y
  3. Lopez CF, Muhlich JL, Bachman JA, Sorger PK. Programming biological models in Python using PySB. Mol Syst Biol. 2013; 9(1):646. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1038/msb.2013.1
  4. Hucka M, Bergmann FT, Hoops S, Keating SM, Sahle S, Schaff JC, et al. The Systems Biology Markup Language (SBML): Language Specification for Level 3 Version 1 Core. Journal of Integrative Bioinformatics. 2015 Jun 1;12(2):382–549. Available from: https://www.degruyter.com/view/journals/jib/12/2/article-p382.xml