Marshall, S. and Yu, L. and Xiao, Y. and Dougherty, E. (2007) Inference of a probabilistic Boolean network from a single observed temporal sequence. EURASIP Journal on Bioinformatics and Systems Biology, 2007 (1). pp. 116. ISSN 16874153

PDF
boolean.pdf  Published Version Available under License Creative Commons Attribution. Download (2MB)  Preview 
Abstract
The inference of gene regulatory networks is a key issue for genomic signal processing. This paper addresses the inference of probabilistic Boolean networks (PBNs) from observed temporal sequences of network states. Since a PBN is composed of a finite number of Boolean networks, a basic observation is that the characteristics of a single Boolean network without perturbation may be determined by its pairwise transitions. Because the network function is fixed and there are no perturbations, a given state will always be followed by a unique state at the succeeding time point. Thus, a transition counting matrix compiled over a data sequence will be sparse and contain only one entry per line. If the network also has perturbations, with small perturbation probability, then the transition counting matrix would have some insignificant nonzero entries replacing some (or all) of the zeros. If a data sequence is sufficiently long to adequately populate the matrix, then determination of the functions and inputs underlying the model is straightforward. The difficulty comes when the transition counting matrix consists of data derived from more than one Boolean network. We address the PBN inference procedure in several steps: (1) separate the data sequence into "pure" subsequences corresponding to constituent Boolean networks; (2) given a subsequence, infer a Boolean network; and (3) infer the probabilities of perturbation, the probability of there being a switch between constituent Boolean networks, and the selection probabilities governing which network is to be selected given a switch. Capturing the full dynamic behavior of probabilistic Boolean networks, be they binary or multivalued, will require the use of temporal data, and a great deal of it. This should not be surprising given the complexity of the model and the number of parameters, both transitional and static, that must be estimated. In addition to providing an inference algorithm, this paper demonstrates that the data requirement is much smaller if one does not wish to infer the switching, perturbation, and selection probabilities, and that constituentnetwork connectivity can be discovered with decent accuracy for relatively small timecourse sequences.
Item type:  Article 

ID code:  11603 
Keywords:  gene regularity networks, genomic signal processing, probabilistic Boolean networks, Electrical engineering. Electronics Nuclear engineering, Signal Processing, General, Statistics and Probability, Computer Science(all), Medicine(all) 
Subjects:  Technology > Electrical engineering. Electronics Nuclear engineering 
Department:  Faculty of Engineering > Electronic and Electrical Engineering 
Depositing user:  Strathprints Administrator 
Date Deposited:  23 Nov 2011 11:14 
Last modified:  26 Mar 2015 16:56 
URI:  http://strathprints.strath.ac.uk/id/eprint/11603 
Actions (login required)
View Item 