Difference between revisions of "Team:Virginia/Modeling2"

Line 3: Line 3:
 
     <meta charset="utf-8"/>
 
     <meta charset="utf-8"/>
 
     <meta content="width=device-width" name="viewport"/>
 
     <meta content="width=device-width" name="viewport"/>
     <title>Manifold</title>
+
     <title>Modeling</title>
 
     <link href="images/thickIconBlack.svg" rel="icon" type="image/svg"/>
 
     <link href="images/thickIconBlack.svg" rel="icon" type="image/svg"/>
 +
  
 
     <!-- Font Declarations -->
 
     <!-- Font Declarations -->
Line 19: Line 20:
 
     <link href="https://2020.igem.org/Template:Virginia/styles/mainCSS?action=raw&amp;ctype=text/css" rel="stylesheet" type="text/css"/>
 
     <link href="https://2020.igem.org/Template:Virginia/styles/mainCSS?action=raw&amp;ctype=text/css" rel="stylesheet" type="text/css"/>
 
     <link href="https://2020.igem.org/Template:Virginia/styles/genpageCSS?action=raw&amp;ctype=text/css" rel="stylesheet" type="text/css"/>
 
     <link href="https://2020.igem.org/Template:Virginia/styles/genpageCSS?action=raw&amp;ctype=text/css" rel="stylesheet" type="text/css"/>
 +
    <link href="https://2020.igem.org/Template:Virginia/styles/modelingCSS?action=raw&amp;ctype=text/css" rel="stylesheet" type="text/css"/>
 
     <link href="https://2020.igem.org/Template:Virginia/styles/hoverCSS?action=raw&amp;ctype=text/css" media="all" rel="stylesheet"/>
 
     <link href="https://2020.igem.org/Template:Virginia/styles/hoverCSS?action=raw&amp;ctype=text/css" media="all" rel="stylesheet"/>
  
Line 118: Line 120:
 
       <div class="contentHolder">
 
       <div class="contentHolder">
 
           <div class="genTitle">
 
           <div class="genTitle">
             <div>General Template Page</div>
+
             <div>Modeling</div>
 
           </div>
 
           </div>
           <div class="sectionTitle" id="Section 1">Section 1</div>
+
           <div class="sectionTitle" id="Section 1">Overview</div>
 
           <div class="bar"></div>
 
           <div class="bar"></div>
 
           <div class="section">
 
           <div class="section">
 
             <div class="paragraph">
 
             <div class="paragraph">
                 The lack of a versatile and reliable way to improve metabolic flux channeling, pathway orthogonality, and product yields is a major impediment to the expanded utilization of biosynthesis for the production of drugs and industrially valuable chemicals. Manifold, a platform technology that addresses this problem, consists of <div class="dict">bacterial microcompartments<span><img src="https://upload.wikimedia.org/wikipedia/commons/thumb/2/25/Carboxysome_and_bacterial_microcompartments.jpg/800px-Carboxysome_and_bacterial_microcompartments.jpg"/>Bacterial microcompartments (BMCs) are organelle-like structures, consisting of a protein shell that encloses enzymes and other proteins. BMCs are typically about 40–200 nanometers in diameter and are entirely made of proteins. The shell functions like a membrane, as it is selectively permeable.</span></div> (BMCs) with encapsulated dsDNA scaffolds <div class="ref">[1]<span>Elbaz, J., Yin, P., &amp; Voigt, C. A. (2016). Genetic encoding of DNA nanostructures and their self-assembly in living bacteria. Nature communications, 7(1), 1-11.</span></div> that sequester and spatially organize, at fixed concentrations, biosynthetic enzymes presented as zinc-finger fusion proteins. Here we deliver the designs for an E. coli cell capable of synthesizing resveratrol using the Manifold platform. The Manifold platform will help lower costs and expand the applications of chemical biosynthesis. The lack of a versatile and reliable way to improve metabolic flux channeling, pathway orthogonality, and product yields is a major impediment to the expanded utilization of biosynthesis for the production of drugs and industrially valuable chemicals. Manifold, a platform technology that addresses this problem, consists of bacterial microcompartments (BMCs) with encapsulated dsDNA scaffolds that sequester and spatially organize, at fixed concentrations, biosynthetic enzymes presented as zinc-finger fusion proteins. Here we deliver the designs for an E. coli cell capable of synthesizing resveratrol using the Manifold platform. The Manifold platform will help lower costs and expand the applications of chemical biosynthesis.
+
                 Our scaffold-incorporated bacterial microcompartment (BMC) system, called MANIFOLD, incorporates targeted enzyme fusions to the inner BMC shell proteins to spatially orient select enzymes to minimize metabolic flux leakage. For the Summer of 2020, the team thoroughly developed a proof-of-concept for the production of model molecule trans-resveratrol for its compact size and compatibility.  
 +
            </div>
 +
            <div class="paragraph">
 +
            With the onset of the global pandemic and need for quick project adaptations to a virtual environment, the modeling team sought to put a positive twist on the unprecedented circumstances and took the opportunity to flesh out a thorough multi-scale model to support experimental design in wet lab and explore various other facets of resveratrol metabolism optimization. We utilized a multitude of mathematical models in both Matlab and Python to understand the viability of our BMC system. We chose to embrace the complexities that modeling could envelop and built a multi-scale model balancing assumptions with the realistic conclusions we were to make.  
 +
            </div>
 +
            <div class="paragraph">
 +
              From our endeavors, we determined the optimized expression of our 14 different parts utilizing cataloged parts from the iGEM Parts Registry and the expected increase of biosynthesized trans-resveratrol when utilizing the MANIFOLD system. Our silico chronology began with a considerable amount of literature review to familiarize ourselves with recent advances in BMC research. From this, we were able to construct an optimization model for ideal part expression used in future wet lab practices. We then delved into the reaction kinetics of our system to quantify the improvement of MANIFOLD on current methods of trans-resveratrol biosynthesis.  
 
             </div>
 
             </div>
 
           </div>
 
           </div>
           <div class="sectionTitle" id="Section 2">Section 2</div>
+
           <div class="sectionTitle" id="Section 2">Research-Driven Analysis</div>
 
           <div class="bar"></div>
 
           <div class="bar"></div>
 
           <div class="section">
 
           <div class="section">
 
             <div class="paragraph">
 
             <div class="paragraph">
              The invention consists of a protein shell comprising one or more proteins, one or more nucleic acid scaffolds of which there can be multiple copies, anabolic and/or catabolic enzymes specific to the desired biosynthesis pathway each containing a nucleic acid binding domain, recognition sequences for the utilized nucleic acid binding domains, nucleic acid spacers, and a linkage between the nucleic acid scaffolds and the protein shell. The protein shell (10) can take the form of any closed or open surface that comprises one or more repeating protein units (12). Examples of valid shells include bacterial microcompartments such as the Pdu, Eut, and carboxysome microcompartments, as well as modified,  but not necessarily closed, surfaces composed of mutated versions of these microcompartment shell proteins. The nucleic acid scaffolds (18) comprise multiple recognition sequences (22) and spacers (32) and can be made from any form of nucleic acid, including: deoxyribonucleic acid, ribonucleic acid, and synthetic nucleic acids such as xeno nucleic acids and peptide nucleic acids among others. These scaffolds are attached to the protein shell. The pathway enzymes are biological proteins whose exact sequences are dependent on the given use case of the invention, but which all contain a nucleic acid binding domain either internal to their structure, or at their N or C terminus.  
+
                The in silico investigation of MANIFOLD was guided by these questions relating to the improved assembly and efficiency of our system. In order for our design to present a significant breakthrough in metabolic engineering, we needed quantitative answers. To first construct these questions, the modeling team conducted research driven analysis to determine what parts of our system may already be characterized and which may require further modeling efforts.
 
             </div>
 
             </div>
 +
            <div class="paragraph">
 +
              The bacterial microcompartment shell became the first topic of conversation. The [21-gene] Pdu operon (named for its natural 1,2-propanediol metabolism pathway) derived from citrobacter freundii has been only recently elucidated in literature. The shell is known to be composed of certain pdu proteins with pdu enzymes naturally localized to the interior of the compartment. The proteinaceous shell ideally forms an icosahedral shell with a mean face-to-face diameter of 117.9919 nanometers (Figure 1).[1](Apparent size and morphology of bacterial microcompartments varies with technique) Our study utilized seven pdu proteins (pduABJKNUT) which are necessary for the formation of this nanocarrier and therefore lead to BMCs of the same size.
 +
            </div>
 +
            <div class="figureHolder">
 +
              <img src="https://static.igem.org/mediawiki/2020/d/d6/T--Virginia--images--icosahedral.JPG"/>
 +
              <div>
 +
              <b>Fig 1.</b> 3D rendering of the icosahedral shape of an assembled BMC. Dimensions formulated from findings in Kennedy et. al.[1] to be 117.992 nanometers with a side length of 68.123 nanometers. Note: Images represent the idealized BMC structure based on hypothesized methods of  pdu protein assembly.
 +
              </div>
 +
            </div>
 +
            <div class="paragraph">
 +
            Our selection of this synthetic pdu operon [2] (Synthesis of Empty Bacterial Microcompartments) was bolstered by a recent study elucidating the stoichiometric composition of these proteinaceous enclosures. [3] (Decoding the stoich) This resource quantified the ratio between pdu proteins apparent in BMC enclosures (Figure 2) which solidified the quantitative backing we needed for progression of part expression analysis. With twelve parts required for full assembly of the MANIFOLD system, the challenge of correct, relative expression became a pressing issue. With the realization of no immediate availability of in-lab practices to help in guiding these judgements through fluorescence or scanning electron microscopy methods, our attention shifted towards expanding upon methods used in literature to create an optimization model in silico.
 +
            </div>
 +
          </div>
 +
          <div class="sectionTitle" id="Section 3">Relative Part Expression</div>
 +
          <div class="bar"></div>
 +
          <div class="section">
 +
            <h3> BMC Background
 +
              </h3>
 +
            <div class="paragraph">
 +
              The pduJ protein is the most abundant protein (5212 monomers)[3] of the BMC shell. Therefore, any MANIFOLD system with correct construction of the BMC would mandate the expression of at least 5212 pduJ proteins. We therefore used the expression of pduJ as a relative lower bounded baseline for computing idealized expression of all other parts of the MANIFOLD system. These predicted relativities in part expression were deduced from theoretical relationships within the bacterial microcompartment.
 +
            </div>
 +
            <div class="paragraph">
 +
              One of these key stoichiometric relationships is mediated through the pduA shell protein. The most well-researched process of inner-luminal binding is orchestrated through the pduA protein. The N-terminal of the pduD protein is hypothesized to bind with the C-terminal (luminal-facing) residues of pduA.[4] (Exploring Bacterial Organelle Interactomes: A Model of the Protein-Protein Interaction Network in the Pdu Microcompartment) MANIFOLD utilizes the first eighteen amino acids of pduD as a tag for interior localization to the BMC as has been performed before.[5] (The N-Terminal Region of the Medium Subunit (PduD) Packages Adenosylcobalamin-Dependent Diol Dehydratase (PduCDE) into the Pdu Microcompartment) This peptide sequence was combined with the zinc finger domain to enable DNA scaffold localization to the interior of the BMC for subsequent enzyme attachment. This connectivity is significant in establishing the quantified relationships between MANIFOLD components.
 +
              </div>
 +
              <h3> Model Activation
 +
              </h3>
 +
              <div class="paragraph">
 +
                Correct stoichiometry between MANIFOLD parts required a method of quantification for both transcriptional and translational parameters. Due to the large number of required components, part expression would have to be carefully modulated to be in high enough quantity for a high-output system without killing the E. coli host. Transcriptional relativity had already been documented through the Anderson Promoter Library. The Anderson RBS Library has introduced several RBS combinations but the relativity in these sequences has yet to be fully elucidated. We then looked at the proclaimed RBS Calculator by the Salis Lab (https://salislab.net/software/). This software calculates the translation initiation rate (TIR) based on a thermodynamic model of free energy on and around the start codon.(https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2782888/) Using 21 Anderson RBS sequences and two others found naturally in acquired vectors, we found the calculated TIR values for our eight parts requiring an RBS. These values were used in conjunction with the relative transcriptional values of 19 Anderson Promoters to find optimized sequence combinations to achieve correct relative part expression. Due to the sheer number of combinations, the process yielded computational ratios with minimal error compared to ideal, theoretical ratios (Figure 3). Important assumptions for this model are listed below.
 +
              </div>
 +
              <h3> Assumptions
 +
              </h3>
 +
              <ul>
 +
                <li>Assume steady state with consistency in other central dogma factors (allows for transcriptional and translational rates to be multiplied)</li>
 +
                <li>Assume ACC and HIV RT expressions may be calculated through subunit with lowest translation initiation rate</li>
 +
                <li>Assume ribosome binding only occurs at primary stop codon less than 20 base pairs downstream of selected RBS sequence</li>
 +
              </ul>
 +
             
 
             <div class="figureHolder">
 
             <div class="figureHolder">
               <img src="https://static.igem.org/mediawiki/2019/3/31/T--NCKU_Tainan--CBMB-Amplification.png"/>
+
               <img src="https://i.ytimg.com/vi/NjlzcriYc8o/maxresdefault.jpg"/>
 
               <div>
 
               <div>
               <b>Fig 1.</b> Figure taken from iGEM Tainan 2019 for demo purposes. Notice how the figure is much longer than it is wide, and two images are coupled together to achive this. Try to do that as well so it looks good.
+
              <!-- Comparison and theoretical part expression-->
 +
             
 +
 +
  <table border="1" cellpadding="1" cellspacing="1" class="PromCalcTable">
 +
  <tbody><tr>
 +
    <!--<TD Colspan = "2" Align = "Center">Time Table</TH>-->
 +
  </tr>
 +
  <tr>
 +
    <td>Part</td><td># of molecules</td><td>Ideal Ratio</td><td>RBS</td><td>Promoter</td><td>Computational Ratio</td><td>Error</td>
 +
  </tr>
 +
  <tr>
 +
    <td>pduD fusion</td><td>1841</td><td>0.353</td><td>BBa_J61107</td><td>BBa_J23101</td><td>0.357</td><td>0.00335</td>
 +
    </tr>
 +
    <tr>
 +
      <td>GFP fusion</td><td>3682</td><td>0.706</td><td>BBa_J61111</td><td>BBa_J23106</td><td>0.711</td><td>0.00500</td>
 +
      </tr>
 +
      <tr>
 +
    <td>pduJAKBNUT</td><td>5212</td><td>1.000</td><td>Warren RBS</td><td>BBa_J23114</td><td>1.000</td><td>0</td>
 +
    </tr>
 +
  </tbody></table>
 +
               <b>Fig 3.</b> Results from the optimization ‘PromoterCalc’ model which detail the success in customizing  Anderson Promoter and RBS sequences for individual part expression relative to pduJ. Top: The graph represents the differences between ideal and computationally-derived ratios for each part. Ideal ratios are derived from stoichiometric comparisons between parts in the idealized system. Computational ratios are derived from the model output using the quantification of Anderson promoters and translation initiation rates of Anderson RBS sequences. Bottom: The table presents sample calculations for finding both ideal and computational ratios along with the corresponding error. Note: All r_oligo parts are uniformly represented as one value and pduA’BJKNUT is omitted for simplicity.
 
               </div>
 
               </div>
 
             </div>
 
             </div>
 +
            </div>
 +
            <div class="sectionTitle" id="Section 4">Exploring Model Validity</div>
 +
            <div class="bar"></div>
 +
          <div class="section">
 
             <div class="paragraph">
 
             <div class="paragraph">
              Additionally, protein linkers are usually present between this nucleic acid binding domain and the enzyme structure to prevent inhibition of enzyme activity. However the exact linker(s) used, if any, is(are) also dependent on the specific use case of the invention. These pathway enzymes are attached to the nucleic acid scaffolds via their nucleic acid binding domains. The nucleic acid recognition sequences (22) are unique or semi-unique sequences of nucleic acid monomers on the nucleic acid scaffolds to which the utilized nucleic acid binding domains have some degree of molecular complementarity. These nucleic recognition sequences comprise most of the scaffold and mark the locations to which the DNA binding domains of the pathway enzymes attach to the scaffolds. The nucleic acid spacers (32) are relatively short sequences of nucleic acid monomers that are also present on the nucleic acid scaffolds, between the recognition sequences. The linkage between the nucleic acid scaffolds (18) and protein shell (10) provides a means by which the nucleic acid scaffolds are bound to the protein shell through direct or multi-molecule complementarity. This linkage is found between the nucleic acid scaffolds and the protein shell. One example is through the addition of a nucleic acid binding domain (24) to one or more of the shell proteins forming a nucleic acid binding domain, shell protein fusion (14). Like the pathway enzymes, this nucleic acid binding-domain can be either internal to the shell protein structure or at its N or C terminus, where the exact placement depends on the shell protein being utilized. Alternatively, one or more intermediate proteins can be used to adhere the nucleic acid scaffolds to the shell, where the region of the protein interacting with the shell binds the shell via protein-protein complementarity (28) with a given shell protein, and the region of the protein interacting with the nucleic acid scaffold binds another recognition sequence on the nucleic acid scaffold through another nucleic acid binding domain (30). This forms a shell protein binding, nucleic acid domain fusion (26).<br/><br/><br/>
+
            A supplementary model was created in complement to the optimization model to verify its findings. Specifically, the kinetics of central dogma theory in creation of our DNA Scaffolds was further investigated. The relative ratios of both Reverse Transcriptases compared to pduJ are observably low (Figure 3). Efficient functioning of the MANIFOLD system requires a sufficient amount of DNA Scaffolds to be produced and encapsulated by the BMC before the compartment becomes fully enclosed.
 +
            </div>
 +
            <h3> Model Constraints
 +
            </h3>
 +
            <div class="paragraph">
 +
              Research in BMC assembly remains primarily speculative. Therefore, we used information on viral capsid assembly (a similar proteinaceous compartment) for timeline comparison. Analysis of MS2 bacteriophage viral capsid assembly details the protein shell formation required around 280 seconds for nucleation, growth, and lag time.(https://www.pnas.org/content/116/45/22485) These assembled particles are around thirty nanometers in diameter upon completion. Assuming assembly time only increases with particle size, we used this timestamp as a lower bound for our BMC formation.  In addition to the expected assembly time, it is important to know the amount of DNA Scaffold we may expect to be encapsulated within the BMC upon formation. The pduD localization peptide has a measured Incorporation Efficiency of 24.7% to the lumen of the BMC.(https://onlinelibrary.wiley.com/doi/full/10.1002/mbo3.1010#mbo31010-fig-0016) Using these parameters, our system would require the presence of 37,267 (or 4.571 on the log10 scale) molecules of DNA Scaffolds before t = 280 seconds.
 +
            </div>
 +
            <div class="paragraph">
 +
              We defined our relationships according to the schematic below (Figure 4). This process develops the connections between the input transcriptional/translational controlling factors (Promoter and RBS) and the output number of DNA Scaffold molecules. Our model used a numerical solver of the differential equations below to test this theory (Table I). Our assumptions are also listed below.
 +
            </div>
 +
            <div class="figureHolder">
 +
              <img src="https://wallpapercave.com/wp/wp1864652.jpg"/>
 +
              <!--Central Dogma Schematic-->
 +
              <div>
 +
              <b>Fig 4.</b> Central Dogma Schematic. Where ORI represents the plasmid copy number; kr represents the transcription rate; kd represents the degradation rate; kl represents the translation rate; krHIVRT represents the HIV-RT transcription rate; and ka represents the annealing rate.
 +
              </div>
 +
            </div>
 +
            <div class="paragraph">
 +
              last paragraph
 +
              <br/><br/><br/>
 
             </div>
 
             </div>
 
           </div>
 
           </div>

Revision as of 19:48, 21 October 2020

Modeling

Index:
Modeling
Overview
Our scaffold-incorporated bacterial microcompartment (BMC) system, called MANIFOLD, incorporates targeted enzyme fusions to the inner BMC shell proteins to spatially orient select enzymes to minimize metabolic flux leakage. For the Summer of 2020, the team thoroughly developed a proof-of-concept for the production of model molecule trans-resveratrol for its compact size and compatibility.
With the onset of the global pandemic and need for quick project adaptations to a virtual environment, the modeling team sought to put a positive twist on the unprecedented circumstances and took the opportunity to flesh out a thorough multi-scale model to support experimental design in wet lab and explore various other facets of resveratrol metabolism optimization. We utilized a multitude of mathematical models in both Matlab and Python to understand the viability of our BMC system. We chose to embrace the complexities that modeling could envelop and built a multi-scale model balancing assumptions with the realistic conclusions we were to make.
From our endeavors, we determined the optimized expression of our 14 different parts utilizing cataloged parts from the iGEM Parts Registry and the expected increase of biosynthesized trans-resveratrol when utilizing the MANIFOLD system. Our silico chronology began with a considerable amount of literature review to familiarize ourselves with recent advances in BMC research. From this, we were able to construct an optimization model for ideal part expression used in future wet lab practices. We then delved into the reaction kinetics of our system to quantify the improvement of MANIFOLD on current methods of trans-resveratrol biosynthesis.
Research-Driven Analysis
The in silico investigation of MANIFOLD was guided by these questions relating to the improved assembly and efficiency of our system. In order for our design to present a significant breakthrough in metabolic engineering, we needed quantitative answers. To first construct these questions, the modeling team conducted research driven analysis to determine what parts of our system may already be characterized and which may require further modeling efforts.
The bacterial microcompartment shell became the first topic of conversation. The [21-gene] Pdu operon (named for its natural 1,2-propanediol metabolism pathway) derived from citrobacter freundii has been only recently elucidated in literature. The shell is known to be composed of certain pdu proteins with pdu enzymes naturally localized to the interior of the compartment. The proteinaceous shell ideally forms an icosahedral shell with a mean face-to-face diameter of 117.9919 nanometers (Figure 1).[1](Apparent size and morphology of bacterial microcompartments varies with technique) Our study utilized seven pdu proteins (pduABJKNUT) which are necessary for the formation of this nanocarrier and therefore lead to BMCs of the same size.
Fig 1. 3D rendering of the icosahedral shape of an assembled BMC. Dimensions formulated from findings in Kennedy et. al.[1] to be 117.992 nanometers with a side length of 68.123 nanometers. Note: Images represent the idealized BMC structure based on hypothesized methods of pdu protein assembly.
Our selection of this synthetic pdu operon [2] (Synthesis of Empty Bacterial Microcompartments) was bolstered by a recent study elucidating the stoichiometric composition of these proteinaceous enclosures. [3] (Decoding the stoich) This resource quantified the ratio between pdu proteins apparent in BMC enclosures (Figure 2) which solidified the quantitative backing we needed for progression of part expression analysis. With twelve parts required for full assembly of the MANIFOLD system, the challenge of correct, relative expression became a pressing issue. With the realization of no immediate availability of in-lab practices to help in guiding these judgements through fluorescence or scanning electron microscopy methods, our attention shifted towards expanding upon methods used in literature to create an optimization model in silico.
Relative Part Expression

BMC Background

The pduJ protein is the most abundant protein (5212 monomers)[3] of the BMC shell. Therefore, any MANIFOLD system with correct construction of the BMC would mandate the expression of at least 5212 pduJ proteins. We therefore used the expression of pduJ as a relative lower bounded baseline for computing idealized expression of all other parts of the MANIFOLD system. These predicted relativities in part expression were deduced from theoretical relationships within the bacterial microcompartment.
One of these key stoichiometric relationships is mediated through the pduA shell protein. The most well-researched process of inner-luminal binding is orchestrated through the pduA protein. The N-terminal of the pduD protein is hypothesized to bind with the C-terminal (luminal-facing) residues of pduA.[4] (Exploring Bacterial Organelle Interactomes: A Model of the Protein-Protein Interaction Network in the Pdu Microcompartment) MANIFOLD utilizes the first eighteen amino acids of pduD as a tag for interior localization to the BMC as has been performed before.[5] (The N-Terminal Region of the Medium Subunit (PduD) Packages Adenosylcobalamin-Dependent Diol Dehydratase (PduCDE) into the Pdu Microcompartment) This peptide sequence was combined with the zinc finger domain to enable DNA scaffold localization to the interior of the BMC for subsequent enzyme attachment. This connectivity is significant in establishing the quantified relationships between MANIFOLD components.

Model Activation

Correct stoichiometry between MANIFOLD parts required a method of quantification for both transcriptional and translational parameters. Due to the large number of required components, part expression would have to be carefully modulated to be in high enough quantity for a high-output system without killing the E. coli host. Transcriptional relativity had already been documented through the Anderson Promoter Library. The Anderson RBS Library has introduced several RBS combinations but the relativity in these sequences has yet to be fully elucidated. We then looked at the proclaimed RBS Calculator by the Salis Lab (https://salislab.net/software/). This software calculates the translation initiation rate (TIR) based on a thermodynamic model of free energy on and around the start codon.(https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2782888/) Using 21 Anderson RBS sequences and two others found naturally in acquired vectors, we found the calculated TIR values for our eight parts requiring an RBS. These values were used in conjunction with the relative transcriptional values of 19 Anderson Promoters to find optimized sequence combinations to achieve correct relative part expression. Due to the sheer number of combinations, the process yielded computational ratios with minimal error compared to ideal, theoretical ratios (Figure 3). Important assumptions for this model are listed below.

Assumptions

  • Assume steady state with consistency in other central dogma factors (allows for transcriptional and translational rates to be multiplied)
  • Assume ACC and HIV RT expressions may be calculated through subunit with lowest translation initiation rate
  • Assume ribosome binding only occurs at primary stop codon less than 20 base pairs downstream of selected RBS sequence
Part# of moleculesIdeal RatioRBSPromoterComputational RatioError
pduD fusion18410.353BBa_J61107BBa_J231010.3570.00335
GFP fusion36820.706BBa_J61111BBa_J231060.7110.00500
pduJAKBNUT52121.000Warren RBSBBa_J231141.0000
Fig 3. Results from the optimization ‘PromoterCalc’ model which detail the success in customizing Anderson Promoter and RBS sequences for individual part expression relative to pduJ. Top: The graph represents the differences between ideal and computationally-derived ratios for each part. Ideal ratios are derived from stoichiometric comparisons between parts in the idealized system. Computational ratios are derived from the model output using the quantification of Anderson promoters and translation initiation rates of Anderson RBS sequences. Bottom: The table presents sample calculations for finding both ideal and computational ratios along with the corresponding error. Note: All r_oligo parts are uniformly represented as one value and pduA’BJKNUT is omitted for simplicity.
Exploring Model Validity
A supplementary model was created in complement to the optimization model to verify its findings. Specifically, the kinetics of central dogma theory in creation of our DNA Scaffolds was further investigated. The relative ratios of both Reverse Transcriptases compared to pduJ are observably low (Figure 3). Efficient functioning of the MANIFOLD system requires a sufficient amount of DNA Scaffolds to be produced and encapsulated by the BMC before the compartment becomes fully enclosed.

Model Constraints

Research in BMC assembly remains primarily speculative. Therefore, we used information on viral capsid assembly (a similar proteinaceous compartment) for timeline comparison. Analysis of MS2 bacteriophage viral capsid assembly details the protein shell formation required around 280 seconds for nucleation, growth, and lag time.(https://www.pnas.org/content/116/45/22485) These assembled particles are around thirty nanometers in diameter upon completion. Assuming assembly time only increases with particle size, we used this timestamp as a lower bound for our BMC formation. In addition to the expected assembly time, it is important to know the amount of DNA Scaffold we may expect to be encapsulated within the BMC upon formation. The pduD localization peptide has a measured Incorporation Efficiency of 24.7% to the lumen of the BMC.(https://onlinelibrary.wiley.com/doi/full/10.1002/mbo3.1010#mbo31010-fig-0016) Using these parameters, our system would require the presence of 37,267 (or 4.571 on the log10 scale) molecules of DNA Scaffolds before t = 280 seconds.
We defined our relationships according to the schematic below (Figure 4). This process develops the connections between the input transcriptional/translational controlling factors (Promoter and RBS) and the output number of DNA Scaffold molecules. Our model used a numerical solver of the differential equations below to test this theory (Table I). Our assumptions are also listed below.
Fig 4. Central Dogma Schematic. Where ORI represents the plasmid copy number; kr represents the transcription rate; kd represents the degradation rate; kl represents the translation rate; krHIVRT represents the HIV-RT transcription rate; and ka represents the annealing rate.
last paragraph