Gene Regulatory Network Inference

“A gene regulatory network is a set of genes, or parts of genes, that interact with each other to control a specific cell function. Gene regulatory networks are important in development, differentiation and responding to environmental cues.” Nature

Structure of a gene regulatory network – Wikipedia

Cancer is commonly described as a disease of the genes, and a great deal of effort has gone into identifying the genes responsible for different cancers. Attempts to control or cure cancer on this basis, however, have had limited success. What sets cancer apart from other genetic diseases is the role that interactions between genes play in their regulation: how the upregulation of gene A leads to the upregulation or downregulation of gene B. In this project, instead of following the prevalent reductionist methods, we take a Complex Systems approach and study the collective behavior of the genes. We infer the regulatory interactions between genes and, by representing each gene as a node and each pairwise interaction as a link, construct the interaction network. The network is weighted and signed, reflecting that the interactions between genes differ in strength and sign.

Assuming the interactions are pairwise (as in a spin-glass system), the Principle of Maximum Entropy lets us obtain the gene network from the means and correlations of the experimental data set. We work with data sets for normal and cancerous genes and infer a network for each of the two groups. Our concern is whether the upregulation of a gene is influenced by the upregulation or downregulation of other genes.
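The pairwise maximum-entropy construction can be written out as follows. This is a sketch under assumptions: the symbols $x_i$ (state of gene $i$), $m_i$, $c_{ij}$, $h_i$, $J_{ij}$ and $Z$ are notation introduced here for illustration and do not come from the project itself.

```latex
% Maximize the entropy subject to normalization and the measured moments
\max_{P}\; S[P] = -\sum_{x} P(x)\,\ln P(x)
\quad\text{subject to}\quad
\sum_{x} P(x) = 1,\qquad
\langle x_i \rangle = m_i,\qquad
\langle x_i x_j \rangle = c_{ij}.

% Introducing one Lagrange multiplier per constraint gives the pairwise form
P(x) = \frac{1}{Z}\exp\!\Big(\sum_{i} h_i x_i + \sum_{i<j} J_{ij}\, x_i x_j\Big),
\qquad
Z = \sum_{x}\exp\!\Big(\sum_{i} h_i x_i + \sum_{i<j} J_{ij}\, x_i x_j\Big).

% Equivalent energy (spin-glass) form, P(x) = e^{-E(x)} / Z
E(x) = -\sum_{i} h_i x_i - \sum_{i<j} J_{ij}\, x_i x_j .
```

Here the multipliers $J_{ij}$ play the role of the signed, weighted links of the network. If the gene states are treated as continuous and the variances are constrained as well, the same construction yields a multivariate Gaussian whose off-diagonal couplings are the negated entries of the inverse covariance matrix.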

Analysis by Pearson’s correlation yields interactions associating all three compounds A, B, and C, in contrast to the partial correlation approach which omits the “false” link between A and C. REF.

We want to build a probability distribution function (PDF) that describes the whole of a biological data set. A PDF is defined by its parameters. Following the Principle of Maximum Entropy, the goal is to find, among all PDFs consistent with the data, the one that maximizes entropy. Its parameters can be obtained with the method of Lagrange multipliers.

We are interested in whether, and to what extent, each pair of genes is related. The correlation coefficient alone gives misleading results when a third, confounding gene is related to both genes of interest. This can be avoided by controlling for the confounding genes, which is done by computing the partial correlation coefficient. The figure illustrates this for three genes labeled A, B, and C.
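The difference between the two coefficients can be seen in a minimal sketch on synthetic data. Everything here is an illustrative assumption rather than the project's own pipeline: the chain A → B → C, the noise levels, and the helper names `pearson` and `partial_corr` are made up for the example, and the first-order formula below controls for a single confounder only.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, z):
    """First-order partial correlation of x and y, controlling for z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Synthetic chain A -> B -> C: A and C are related only through B.
random.seed(0)
n_samples = 5000
gene_a = [random.gauss(0, 1) for _ in range(n_samples)]
gene_b = [a + random.gauss(0, 0.5) for a in gene_a]
gene_c = [b + random.gauss(0, 0.5) for b in gene_b]

r_ac = pearson(gene_a, gene_c)                       # large: suggests an A-C link
r_ac_given_b = partial_corr(gene_a, gene_c, gene_b)  # near zero: link is indirect

print(f"Pearson r(A, C)        = {r_ac:.3f}")
print(f"Partial r(A, C | B)    = {r_ac_given_b:.3f}")
```

With more than three genes, the partial correlation between two genes given all the others is usually read off the inverse of the covariance matrix instead of applying the three-variable formula repeatedly.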

Once the networks have been inferred, further topics such as the dynamics of the network can be studied. According to balance theory, frustrated triangles can arise when three genes are all connected to each other. Suppose A, B, and C are connected and form a triangle: if A upregulates and B downregulates, what happens to C?
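Balanced and frustrated triangles can be identified from the signs of the links alone. The sketch below assumes the usual balance-theory convention that a triangle is balanced when the product of its three link signs is positive and frustrated when it is negative. The toy network, the function name `classify_triangles`, and the normalized triangle energy are illustrative assumptions, not the project's data or definitions.

```python
from itertools import combinations


def classify_triangles(nodes, sign):
    """Split fully connected triples into balanced and frustrated triangles.

    `sign` maps frozenset({u, v}) to +1 or -1 for each link in the network.
    """
    balanced, frustrated = [], []
    for i, j, k in combinations(nodes, 3):
        edges = [frozenset((i, j)), frozenset((j, k)), frozenset((i, k))]
        if not all(edge in sign for edge in edges):
            continue  # the three genes do not form a closed triangle
        product = sign[edges[0]] * sign[edges[1]] * sign[edges[2]]
        (balanced if product > 0 else frustrated).append((i, j, k))
    return balanced, frustrated


# Hypothetical signed network on four genes (the A-D link is absent).
nodes = ["A", "B", "C", "D"]
sign = {
    frozenset(("A", "B")): +1,
    frozenset(("B", "C")): -1,
    frozenset(("A", "C")): -1,
    frozenset(("B", "D")): +1,
    frozenset(("C", "D")): +1,
}

balanced, frustrated = classify_triangles(nodes, sign)

# Assumed energy convention: each balanced triangle contributes -1 and each
# frustrated triangle +1, averaged over all triangles.
n_triangles = len(balanced) + len(frustrated)
energy = (len(frustrated) - len(balanced)) / n_triangles

print("balanced:", balanced)
print("frustrated:", frustrated)
print("energy:", energy)
```

In this toy network, triangle A, B, C has two negative links and is balanced, while triangle B, C, D has a single negative link and is frustrated.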

The results can be interpreted as follows: the normal (healthy) network is more dynamic than the cancerous one. Healthy genes adapt readily to changes in the cell, whereas cancerous genes remain in a fixed state. The healthy network lowers its energy to reach the global minimum of the system, while the cancerous network becomes stuck in local minima; we take this to be the cause of abnormal cell growth and division.

Other interesting and open questions include comparing the dynamics of the network across the different stages of cancer, and studying the gene network as a directed network in order to extract more information about the interactions between genes.

