Optimal Sparsity Criteria for Network Inference
2013 (English)In: Journal of Computational Biology, ISSN 1066-5277, E-ISSN 1557-8666, Vol. 20, no 5, 398-408 p.Article in journal (Refereed) Published
Gene regulatory network inference (that is, determination of the regulatory interactions between a set of genes) provides mechanistic insights of central importance to research in systems biology. Most contemporary network inference methods rely on a sparsity/regularization coefficient, which we call zeta (zeta), to determine the degree of sparsity of the network estimates, that is, the total number of links between the nodes. However, they offer little or no advice on how to select this sparsity coefficient, in particular, for biological data with few samples. We show that an empty network is more accurate than estimates obtained for a poor choice of zeta. In order to avoid such poor choices, we propose a method for optimization of zeta, which maximizes the accuracy of the inferred network for any sparsity-dependent inference method and data set. Our procedure is based on leave-one-out cross-optimization and selection of the zeta value that minimizes the prediction error. We also illustrate the adverse effects of noise, few samples, and uninformative experiments on network inference as well as our method for optimization of zeta. We demonstrate that our zeta optimization method for two widely used inference algorithms-Glmnet and NIR-gives accurate and informative estimates of the network structure, given that the data is informative enough.
Place, publisher, year, edition, pages
2013. Vol. 20, no 5, 398-408 p.
algorithms, gene networks, linear algebra
Biochemistry and Molecular Biology
IdentifiersURN: urn:nbn:se:kth:diva-124050DOI: 10.1089/cmb.2012.0268ISI: 000318854500004ScopusID: 2-s2.0-84881569817OAI: oai:DiVA.org:kth-124050DiVA: diva2:633465
QC 201306272013-06-272013-06-252013-06-27Bibliographically approved