Examples on how to use PyNetMet

Here we show several examples of analysis than can be made using PyNetMet with different metabolic modes. The models used here are:

iSyn811: Model of Synechocystis sp PCC. 6803.
iCM925: Model of Clostridium beijerinckii.

iAK692: Model of Spirulina platensis.

The first model is available here in its OptGene format, for the other two models we present links to the journal where they are published where one can download the models from the additional materials in SBML format.

Let's start with examples on how to create Enzyme objects:

in this example one defines the variable enz1 which contains the reaction whose name is reac1, where one molecule of A combines with two molecules of B to result in one molecule of C and one of D. One should always put spaces surrounding the ":", "->" and "+" signs, otherwise the symbols might get confused with the metabolite names.

The representation of an enzyme object will be its initial string, but with numbers transformed to float type. Note the following examples:

from these examples one should note that it is possible to use spaces in the metabolite names, but the Enzyme class will remove these spaces from the names. One can also use symbols like "+" or "-".

The first step to analyse any metabolic model is to load it as a Metabolism object:

The class Metabolism creates a new pathway for the transport reactions in the model:

The iCM925 model, for instance, is reported to have 938 reactions and 881 metabolites, while the object cbe has 957 reactions and 900 metabolites. The difference (19) is the number of transport reactions in the _TRANSPORT_ pathway.

Each metabolic model has an attribute called net which is the network formed by its metabolites. In the example below we show how to use it to produce the plots for the topological overlap of nodes in each network. For each model we make three plots, the first with the arbitrary order in which the nodes appear in the model, then with the nodes ordered by the Kruskal algorithm and finally with the algorithm contained in the Network object.

These commands should produce the nine plots below. One can clearly see the increase of quality in the clustering from the figures in the left (random ordering) to the right (ordering according to the algorithm implemented in PyNetMet.network class).

Plots for the topological overlap of metabolites. The first, second and third rows refer to the plots obtained by the three analyzed models, iSyn811, iCM925 and iak692, respectively. The plots in the first column are for an arbitrary ordering of the metabolites, in the second column an ordering is obtained via the Kruskal algorithm and in the third column the ordering is obtained by the algorithm implemented in the `plot_nCCs` method of the `Network` class.

The average clustering for each network can be easily obtained:

Another interesting analysis that can be made using the methods from the Network class are the search for disconnected components or the study of paths between the nodes of the metabolic network. Once the method components is called for a network, apart from the disc_comps attribute, it automatically creates two new attributes, dists and paths that contain the shortest distances and paths in between any two nodes of the network.

The attribute disc_comps is a list with the list of nodes in each disconnected component of the network. In the above example we printed the number of nodes in each component, which shows us the giant component (976 metabolites) that comprises the metabolism, and 5 other components which are the result of reactions disconnected from the main metabolism and that could be removed from the metabolic model. The metabolism method bad_reacs removes these reactions and also reactions where one product and one substrate only appear once in the whole metabolism, indicating that these reactions are also poorly connected to the main component.

For the other networks:

In the above examples the network under study is the one composed only by the metabolites in each metabolic model. One can chose to work with the bipartite network formed by metabolites and reactions. In the following examples we build this network in order to study paths between metabolites.

In this example we calculated the shortest path from glucose to pyruvate: it goes through reaction 2.7.1.2b, which has ADP as product and ADP is substrate in reaction 2.7.1.40a that produces pyruvate. Note that if one prints paths[ipyr] the numbers that one sees are [4, 990, 3, 1003, 21]. The numbers 3, 4 and 21 correspond to the positions of ADP, alpha-D-glucose and pyruvate in the list syf.metabol, but the numbers that correspond to reactions 2.7.1.2b and 2.7.1.40a in the list syf.enzymes are not 990 and 1003, but instead 1 (990-syf.nmets) and 14 (1003-syf.nmets).

Metabolites that could not be reached from glucose are marked with the symbol "X" in the dists list:

Here we see that 221 metabolites could not be reached from glucose. From those that could be reached, the average shortest path is around 5.788 and the furthest metabolite reached by glucose is Astxbm which is 20 nodes away.

Having a Metabolism object definined with a metabolic model, one only has to call the FBA in order to perform a FBA analysis:

The FBA objects have the method __sub__ defined, which allows a comparison between two realizations of a FBA. As an example on how it works, let's compare the metabolism of iSyn811 when optimizing its growth and when optimizing hydrogen production for a fixed value of growth.

In this series of commands we create the first FBA where the growth (reaction named "_Growth") of the Synechocistis is optimized. We then use the metabolic model to create a second FBA where the growth is fixed to 95% of its optimized value and then optimize the production of hydrogen (reaction named "_H2"). The last command will create the file called diff.txt where one can see the comparison between this two states of the metabolism. This file shows four columns, the first one is the name of each reaction, in the second and third one can find the values for the fluxes in each FBA, respectively. The fourth column shows the absolute value of the relative change in percentage (100%|(ν₁-ν₂)/ν₁|$). If the original flux was zero (ν₁=0) it will return "NA" in this column. By default it sorts the reaction by its difference value, so the first reactions listed on the file will have no difference in their flux and the reactions in the end of it will be the affected ones. One can clearly see that the most affected reactions are those related with the Synechocistis hydrogenase.

In the next examples, we analyse essential genes in the models:

This tells us that the transport of carbonic acid is not essential for the growth of the cell, but its removal reduces the growth by 50\%. We can also count the total number of essential reactions in each model:

So, the iCM925 model has 166 essential reactions, the iSyn811 221 and the iAK692 has 249. One also sees that the FBA class recognizes two problematic reactions in iAK692 model, in this case, where a metabolite is connected to itself by the reaction.

Last, we show examples with the shadow and max_min methods.

As we saw before, the transport of carbonic acid can be removed at cost of reducing by a factor two the growth in the iSyn811 model. So, calculating its maximal and minimal flux if we fix the growth to half its maximum value, we find that we don't need the reaction (one is able to minimize it to zero). But, if we fix the growth to 60% of its maximal value, the flux in the transport of carbonic acid must be at least equal to 0.34 and in this case the reaction cannot occur in the reversed direction (which is indicated by the X's in the second list).

Back to main page.