Graphic Model-based Gene Regulatory Network Reconstruction using RNA Sequencing Count Data
Interactions between genes, such as regulations are best represented by gene regulatory networks (GRN). These are often constructed based on gene expression data. Few methods for the construction of GRN exist for RNA sequencing count data. One of the most used methods for microarray data is based on graphical Gaussian networks. Considering that count data have different distributions, a method assuming RNA sequencing counts distribute Poisson has been proposed recently. Nevertheless, it has been argued that the most likely distribution of RNA sequencing counts is not Poisson due to overdispersion. Therefore, the negative binomial distribution is much more likely. For this distribution, no model-based method for the construction of GRN has been proposed until now. Here, we present a graphical, model-based method for the construction of GRN assuming a negative binomial distribution of the RNA sequencing count data. The R code is available under request. We used the method proposed both on simulated RNA sequencing count data and on real data. The graph is showed, and its descriptive measurements were assessed. They were found some interesting biological conclusions. We confirm that using negative binomial distribution for fitting the model is suitable because RNA sequencing data present overdispersion.
Allen GI, Liu Z. A local poisson graphical model for inferring networks from sequencing data. IEEE Trans Nanobioscience 2013;12:189–98.
Karlebach G, Shamir R. Modelling and analysis of gene regulatory networks. Nat Rev Mol Cell Biol 2008;9:770–80.
Luo F, Yang Y, Zhong J, Gao H, Khan L, Thompson DK, et al. Constructing gene co-expression networks and predicting functions of unknown genes by random matrix theory. BMC Bioinformatics 2007;8:299.
Liang F, Song Q, Qiu P. An Equivalent Measure of Partial Correlation Coefficients for High-Dimensional Gaussian Graphical Models. J Am Stat Assoc 2015;110:1248–65.
Højsgaard S, Edwards D, Lauritzen S. Graphical models with R. Springer Science & Business Media; 2012.
Liu Z-P. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data. Curr Genomics 2015;16:3–22. doi:10.2174/1389202915666141110210634.
Jia B, Xu S, Xiao G, Lamba V, Liang F. Learning gene regulatory networks from next generation sequencing data. Biometrics 2017;73:1221–30.
Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol 2010;11:R106.
Karbalayghareh A, Hu T. Inference of Sparse Gene Regulatory Network from RNA-Seq Time Series Data. IEEE Glob Conf Signal Inf Process 2015:967–71. doi:10.1186/s12859-018-2125-2.
Thorne T. Approximate inference of gene regulatory network models from RNA-Seq time series data. BMC Bioinformatics 2018;19:1–12. doi:10.1186/s12859-018-2125-2.
Yang E, Allen G, Liu Z, Ravikumar PK. Graphical models via generalized linear models. Adv. Neural Inf. Process. Syst., 2012, p. 1358–66.
Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B (Statistical Methodol 2005;67:301–20. :57–68.
Copyright (c) 2019 Liliana Lopez-Kleine, Cristian Andres Gonzalez-Prieto
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain the copyright of their manuscripts, and all Open Access articles are distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided that the original work is properly cited.