BIC-based node order learning for improving Bayesian network structure learning

Lv, Yali; Miao, Junzhong; Liang, Jiye; Chen, Ling; Qian, Yuhua

doi:10.1007/s11704-020-0268-6

BIC-based node order learning for improving Bayesian network structure learning

Research Article
Published: 01 September 2021

Volume 15, article number 156337, (2021)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

Yali Lv^1,2,
Junzhong Miao¹,
Jiye Liang²,
Ling Chen³ &
…
Yuhua Qian^2,4

95 Accesses
5 Citations
1 Altmetric
Explore all metrics

Abstract

Node order is one of the most important factors in learning the structure of a Bayesian network (BN) for probabilistic reasoning. To improve the BN structure learning, we propose a node order learning algorithm based on the frequently used Bayesian information criterion (BIC) score function. The algorithm dramatically reduces the space of node order and makes the results of BN learning more stable and effective. Specifically, we first find the most dependent node for each individual node, prove analytically that the dependencies are undirected, and then construct undirected subgraphs U_G. Secondly, the U_G is examined and connected into a single undirected graph U_GC. The relation between the subgraph number and the node number is analyzed. Thirdly, we provide the rules of orienting directions for all edges in U_GC, which converts it into a directed acyclic graph (DAG). Further, we rank the DAG’s topology order and describe the BIC-based node order learning algorithm. Its complexity analysis shows that the algorithm can be conducted in linear time with respect to the number of samples, and in polynomial time with respect to the number of variables. Finally, experimental results demonstrate significant performance improvement by comparing with other methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Min-BDeu and Max-BDeu Scores for Learning Bayesian Networks

Structure Learning of Bayesian Network from the Data

KTOBS: An Approach of Bayesian Network Learning Based on K-tree Optimizing Ordering-Based Search

References

Judea P. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers, San Mateo, California, 1988
MATH Google Scholar
Friedman N. Inferring cellular networks using probabilistic graphical models. Science, 2004, 303(5659): 799–805
Article Google Scholar
Raval A, Ghahramani Z, Wild D L. A Bayesian network model for protein fold and remote homologue recognition. Bioinformatics, 2002, 18(6): 788–801
Article Google Scholar
Chen W, Zhu B, Zhang H. BN-mapping: visual analysis of geospatial data with Bayesian network. Chinese Journal of Computers, 2016, 39(7): 1281–1293
Google Scholar
Peng P, Tian Y, Wang Y, Li J, Huang T. Robust multiple cameras pedestrian detection with multi-view Bayesian network. Pattern Recognition, 2015, 48(5): 1760–1772
Article Google Scholar
Liu L, Wang S, Su G, Huang Z, Liu M. Towards complex activity recognition using a Bayesian network-based probabilistic generative framework. Pattern Recognition, 2017, 68: 295–309
Article Google Scholar
Oatley G, Ewart B. Crimes analysis software: ‘pins in maps’, clustering and Bayes net prediction. Expert Systems with Applications, 2003, 25(4): 569–588
Article Google Scholar
Chickering D M, Heckerman D, Meek C. Learning Bayesian networks is NP-hard. Technical Report, MSR-TR-94-17, Microsoft Research, Microsoft Corporation, 1994
Chickering D M, Heckerman D, Meek C. Large-sample learning of Bayesian networks is NP-hard. Journal of Machine Learning Research, 2004, 5: 1287–1330
MATH Google Scholar
Bouhamed H, Masmoudi A, Lecroq T, Rebai A. Structure space of Bayesian networks is dramatically reduced by subdividing it in subnetworks. Journal of Computational and Applied Mathematics, 2015, 287: 48–62
Article MATH Google Scholar
Heckerman D, Geiger D, Chickering D M. Learning Bayesian networks: the combination of knowledge and statistical data. Machine Learning, 1995, 20: 197–243
Article MATH Google Scholar
Lam W, Bacchus F. Learning Bayesian belief networks: an approach based on the MDL principle. Computational Intelligence, 1994, 10(3): 269–293
Article Google Scholar
Scanagatta M, Corani G, De Campos C P, Zaffalon M. Approximate structure learning for large Bayesian networks. Machine Learning, 2018, 107: 1209–1227
Article MATH Google Scholar
De Campos C P, Scanagatta M, Corani G, Zaffalon M. Entropy-based pruning for learning Bayesian networks using BIC. Artificial Intelligence, 2018, 260: 42–50
Article MATH Google Scholar
Cooper G F, Herskovits E H. A Bayesian method for the induction of probabilistic networks from data. Machine Learning, 1992, 9: 309–347
Article MATH Google Scholar
Scanagatta M, Corani G, Zaffalon M, Yoo J, Kang U. Efficient learning of bounded-treewidth Bayesian networks from complete and incomplete data sets. International Journal of Approximate Reasoning, 2018, 95: 152–166
Article MATH Google Scholar
Nie S, De Campos C P, Ji Q. Learning Bayesian networks with bounded treewidth via guided search. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016, 3294–3300
Parviainen P, Farahani H S, Lagergren J. Learning bounded treewidth Bayesian networks using integer linear programming. In: Proceedings of the 17th International Conference on Artificial Intelligence and Statistics. 2014, 751–759
Elidan G, Gould S. Learning bounded treewidth Bayesian networks. Journal of Machine Learning Research, 2008, 9: 2699–2731
MATH Google Scholar
Niinimaki T, Parviainen P, Koivisto M. Structure discovery in Bayesian networks by sampling partial orders. Journal of Machine Learning Research, 2016, 17(1): 2002–2048
MATH Google Scholar
Teyssier M, Koller D. Ordering-based search: a simple and effective algorithm for learning Bayesian networks. In: Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence. 2005, 584–590
Scanagatta M, De Campos C P, Corani G, Zaffalon M. Learning Bayesian networks with thousands of variables. Neural Information Processing Systems, 2015, 28: 1855–1863
Google Scholar
Chen X, Anantha G, Lin X. Improving Bayesian network structure learning with mutual information-based node ordering in the K2 algorithm. IEEE Transactions on Knowledge and Data Engineering, 2008, 20(5): 1–13
Google Scholar
Ko S, Kim D. An efficient node ordering method using the conditional frequency for the K2 algorithm. Pattern Recognition Letters, 2014, 40: 80–87
Article Google Scholar
Hsu W H, Guo H, Perry B B, Stilson J A. A permutation genetic algorithm for variable ordering in learning Bayesian networks from data. In: Proceedings of the Genetic and Evolutionary Computation Conference. 2002, 383–390
Park Y W, Klabjan D. Bayesian network learning via topological order. Journal of Machine Learning Research, 2017, 18: 1–32
MATH Google Scholar
Zhang L, Guo H. Introduction to Bayesian Networks. Science Press, 2006
Zhang N L, Yan L. Independence of causal influence and clique tree propagation. International Journal of Approximate Reasoning, 1998, 19(3–4): 335–349
Article MATH Google Scholar
Mateescu R, Kask K, Gogate V, Dechter R. Join-graph propagation algorithms. Journal of Artificial Intelligence Research, 2010, 37: 279–328
Article MATH Google Scholar
Goudie R J, Mukherjee S. A Gibbs sampler for learning DAGs. Journal of Machine Learning Research, 2016, 17: 1–39
MATH Google Scholar
Benjumeda M, Bielza C, Larranaga P. Learning tractable Bayesian networks in the space of elimination orders. Artificial Intelligence, 2019, 274: 66–90
Article MATH Google Scholar
Benjumeda M, Luengosanchez S, Larranaga P, Bielza C. Tractable learning of Bayesian networks from partially observed data. Pattern Recognition, 2019, 91: 190–199
Article Google Scholar
Tsamardinos I, Brown L E, Aliferis C F. The max-min hill-climbing Bayesian network structure learning algorithm. Machine Learning, 2006, 65: 31–78
Article MATH Google Scholar
Lv Y, Wu J, Liang J, Qian Y. Random search learning algorithm of BN based on super-structure. Journal of Computer Research and Development, 2017, 54(11): 2558–2566
Google Scholar
Qi X, Fan X, Gao Y, Liu Y. Learning Bayesian network structures using weakest mutual-information-first strategy. International Journal of Approximate Reasoning, 2019, 114: 84–98
Article MATH Google Scholar
Talvitie T, Eggeling R, Koivisto M. Learning Bayesian networks with local structure, mixed variables, and exact algorithms. International Journal of Approximate Reasoning, 2019, 115: 69–95
Article MATH Google Scholar
Scutari M, Graafland C E, Gutiérrez J M. Who learns better Bayesian network structures: accuracy and speed of structure learning algorithms. International Journal of Approximate Reasoning, 2019, 115: 235–253
Article MATH Google Scholar
Ye Q L, Amini A A, Zhou Q. Optimizing regularized cholesky score for order-based learning of Bayesian networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, DOI: https://doi.org/10.1109/TPAMI.2020.2990820
Lee S, Kim S B. Parallel simulated annealing with a greedy algorithm for Bayesian network structure learning. IEEE Transactions on Knowledge and Data Engineering, 2020, 32(6): 1157–1166
Article Google Scholar
Yao T, Choi A, Darwiche A. Learning Bayesian network parameters under equivalence constraints. Artificial Intelligence, 2017, 244: 239–257
Article MATH Google Scholar
Riggelsen C. Learning parameters of Bayesian networks from incomplete data via importance sampling. International Journal of Approximate Reasoning, 2006, 42: 69–83
Article MATH Google Scholar
Niculescu R S, Mitchell T M, Rao R B. Bayesian network learning with parameter constraints. Journal of Machine Learning Research, 2006, 7: 1357–1383
MATH Google Scholar
Yang Y, Gao X, Guo Z, Chen D. Learning Bayesian networks using the constrained maximum a posteriori probability method. Pattern Recognition, 2019, 91: 123–134
Article Google Scholar
Benjumeda M, Bielza C, Larranaga P. Tractability of most probable explanations in multidimensional Bayesian network classifiers. International Journal of Approximate Reasoning, 2018, 93: 74–87
Article MATH Google Scholar
Madsen A L, Jensen F, Salmeron A, Langseth H, Nielsen T D. A parallel algorithm for Bayesian network structure learning from large data sets. Knowledge Based Systems, 2017, 117: 46–55
Article Google Scholar
Arnborg S, Corneil D G, Proskurowski A. Complexity of finding embeddings in a k-tree. SIAM Journal on Algebraic Discrete Methods, 1987, 8(2): 277–284
Article MATH Google Scholar
Nie S, Maua D D, De Campos C P, Ji Q. Advances in learning Bayesian networks of bounded treewidth. Advances in Neural Information Processing Systems, 2014, 27: 2285–2293
Google Scholar
Liao W, Ji Q. Learning Bayesian network parameters under incomplete data with domain knowledge. Pattern Recognition, 2009, 42: 3046–3056
Article MATH Google Scholar
Lv Y, Wu J, Jing T. Pqisem: BN’s structure learning based on partial qualitative influences and SEM algorithm from missing data. International Journal of Wireless and Mobile Computing, 2018, 14(4): 348–357
Article Google Scholar
Masegosa A R, Feelders A, Der Gaag L C. Learning from incomplete data in Bayesian networks with qualitative influences. International Journal of Approximate Reasoning, 2016, 69: 18–34
Article MATH Google Scholar

Download references

Acknowledgements

The work partially supported by the National Natural Science Foundation of China (Grant Nos. 61432011, U1435212, 61322211 and 61672332), the Postdoctoral Science Foundation of China (2016M591409), the Natural Science Foundation of Shanxi Province, China (201801D121115 and 2013011016-4) and Research Project Supported by Shanxi Scholarship Council of China (2020-095).

Author information

Authors and Affiliations

Shanxi University of Finance & Economics, Taiyuan, 030031, China
Yali Lv & Junzhong Miao
Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan, 030006, China
Yali Lv, Jiye Liang & Yuhua Qian
The Center for Artificial Intelligence, University of Technology Sydney, Sydney, New South Wales, 2007, Australia
Ling Chen
Institute of Big Data Science & Industry, Shanxi University, Taiyuan, 030006, China
Yuhua Qian

Authors

Yali Lv
View author publications
You can also search for this author in PubMed Google Scholar
Junzhong Miao
View author publications
You can also search for this author in PubMed Google Scholar
Jiye Liang
View author publications
You can also search for this author in PubMed Google Scholar
Ling Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yuhua Qian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuhua Qian.

Electronic supplementary material