Parallel discovery of direct causal relations and Markov boundaries with applications to gene networks

Olga Nikolova; Srinivas Aluru

doi:10.1109/ICPP.2011.49

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

9 Scopus citations

Abstract

Bayesian networks enable formal probabilistic reasoning on a set of interacting variables of a domain and have been shown to have broad applicability. In bioinformatics in particular, Bayesian networks are used to model gene interactions. Learning the structure of a Bayesian network is an NP-hard problem, making it necessary to employ heuristics for solving large-scale problems. In this paper, we present parallel algorithms for two problems that arise in connection with network structure learning and analysis: (i) the discovery of all direct causal relations for each variable, i.e., the set of parents and children of each node in the corresponding Bayesian network, and (ii) the computation of the Markov boundary of each variable, defined as the minimal set of variables that shield the target variable from all other variables in the domain. Our parallel algorithms are based on state-of-the-art constraint-based heuristic optimization methods. They are shown to be work-optimal and communication-efficient, and exhibit nearly perfect scaling.
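The two quantities named in the abstract can be illustrated on a small, already-known graph. The sketch below is not the paper's method: the paper's algorithms learn these sets from data with constraint-based heuristics, whereas this example only reads them off a given DAG. It relies on the standard result that, when the distribution is faithful to the DAG, the Markov boundary of a node consists of its parents, its children, and the other parents of its children. The gene names, the `CHILDREN` dictionary, and all function names are hypothetical.

```python
from typing import Dict, Set

# Hypothetical toy DAG: each node maps to the set of its children.
# Gene names are illustrative only and do not come from the paper.
CHILDREN: Dict[str, Set[str]] = {
    "geneA": {"geneC"},
    "geneB": {"geneC"},
    "geneC": {"geneD"},
    "geneD": set(),
    "geneE": {"geneD"},
}


def parents(dag: Dict[str, Set[str]], node: str) -> Set[str]:
    """Return every node that has an edge pointing into `node`."""
    return {p for p, kids in dag.items() if node in kids}


def parents_children(dag: Dict[str, Set[str]], node: str) -> Set[str]:
    """Direct causal relations of `node`: its parents and its children."""
    return parents(dag, node) | dag[node]


def markov_boundary(dag: Dict[str, Set[str]], node: str) -> Set[str]:
    """Parents, children, and co-parents of children (assumes faithfulness)."""
    boundary = parents_children(dag, node)
    for child in dag[node]:
        boundary |= parents(dag, child)  # adds the other parents of each child
    boundary.discard(node)  # the node is a parent of its own children
    return boundary


if __name__ == "__main__":
    for gene in sorted(CHILDREN):
        print(gene, sorted(parents_children(CHILDREN, gene)),
              sorted(markov_boundary(CHILDREN, gene)))
```

In this toy graph, `geneE` is not directly connected to `geneC` but still belongs to its Markov boundary, because both are parents of `geneD`.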

Original language	English (US)
Title of host publication	Proceedings - 2011 International Conference on Parallel Processing, ICPP 2011
Pages	512-521
Number of pages	10
DOIs	https://doi.org/10.1109/ICPP.2011.49
State	Published - 2011
Externally published	Yes
Event	40th International Conference on Parallel Processing, ICPP 2011 - Taipei City, Taiwan, Province of China
Duration	Sep 13 2011 → Sep 16 2011

Publication series

Name	Proceedings of the International Conference on Parallel Processing
ISSN (Print)	0190-3918

Conference

Conference	40th International Conference on Parallel Processing, ICPP 2011
Country/Territory	Taiwan, Province of China
City	Taipei City
Period	9/13/11 → 9/16/11

Keywords

Bayesian networks
Causal relations
Constraint-based learning
Markov boundaries

ASJC Scopus subject areas

Software
General Mathematics
Hardware and Architecture

Access to Document

10.1109/ICPP.2011.49

Cite this

Parallel discovery of direct causal relations and Markov boundaries with applications to gene networks. / Nikolova, Olga; Aluru, Srinivas.
Proceedings - 2011 International Conference on Parallel Processing, ICPP 2011. 2011. p. 512-521, 6047219 (Proceedings of the International Conference on Parallel Processing).

Nikolova, O & Aluru, S 2011, Parallel discovery of direct causal relations and Markov boundaries with applications to gene networks. in Proceedings - 2011 International Conference on Parallel Processing, ICPP 2011, 6047219, Proceedings of the International Conference on Parallel Processing, pp. 512-521, 40th International Conference on Parallel Processing, ICPP 2011, Taipei City, Taiwan, Province of China, 9/13/11. https://doi.org/10.1109/ICPP.2011.49

@inproceedings{bc24254f6d934339aa28d44d5846951e,
title = "Parallel discovery of direct causal relations and Markov boundaries with applications to gene networks",
abstract = "Bayesian networks enable formal probabilistic reasoning on a set of interacting variables of a domain and have been shown to have broad applicability. In bioinformatics in particular, Bayesian networks are used to model gene interactions. Learning the structure of a Bayesian network is an NP-hard problem, making it necessary to employ heuristics for solving large-scale problems. In this paper, we present parallel algorithms for two problems that arise in connection with network structure learning and analysis: (i) the discovery of all direct causal relations for each variable, i.e., the set of parents and children of each node in the corresponding Bayesian network, and (ii) the computation of the Markov boundary of each variable, defined as the minimal set of variables that shield the target variable from all other variables in the domain. Our parallel algorithms are based on state-of-the-art constraint-based heuristic optimization methods. They are shown to be work-optimal and communication-efficient, and exhibit nearly perfect scaling.",
keywords = "Bayesian networks, Causal relations, Constraint-based learning, Markov boundaries",
author = "Olga Nikolova and Srinivas Aluru",
year = "2011",
doi = "10.1109/ICPP.2011.49",
language = "English (US)",
isbn = "9780769545103",
series = "Proceedings of the International Conference on Parallel Processing",
pages = "512--521",
booktitle = "Proceedings - 2011 International Conference on Parallel Processing, ICPP 2011",
}

TY - GEN
T1 - Parallel discovery of direct causal relations and Markov boundaries with applications to gene networks
AU - Nikolova, Olga
AU - Aluru, Srinivas
PY - 2011
Y1 - 2011
AB - Bayesian networks enable formal probabilistic reasoning on a set of interacting variables of a domain and have been shown to have broad applicability. In bioinformatics in particular, Bayesian networks are used to model gene interactions. Learning the structure of a Bayesian network is an NP-hard problem, making it necessary to employ heuristics for solving large-scale problems. In this paper, we present parallel algorithms for two problems that arise in connection with network structure learning and analysis: (i) the discovery of all direct causal relations for each variable, i.e., the set of parents and children of each node in the corresponding Bayesian network, and (ii) the computation of the Markov boundary of each variable, defined as the minimal set of variables that shield the target variable from all other variables in the domain. Our parallel algorithms are based on state-of-the-art constraint-based heuristic optimization methods. They are shown to be work-optimal and communication-efficient, and exhibit nearly perfect scaling.
KW - Bayesian networks
KW - Causal relations
KW - Constraint-based learning
KW - Markov boundaries
UR - http://www.scopus.com/inward/record.url?scp=80155187592&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80155187592&partnerID=8YFLogxK
U2 - 10.1109/ICPP.2011.49
DO - 10.1109/ICPP.2011.49
M3 - Conference contribution
AN - SCOPUS:80155187592
SN - 9780769545103
T3 - Proceedings of the International Conference on Parallel Processing
SP - 512
EP - 521
BT - Proceedings - 2011 International Conference on Parallel Processing, ICPP 2011
T2 - 40th International Conference on Parallel Processing, ICPP 2011
Y2 - 13 September 2011 through 16 September 2011
ER -
