Understanding the Effects of Search Constraints on Structure Learning
Hay, M., A. Fast and D. Jensen (2007). Understanding the effects of search constraints on structure learning. University of Massachusetts Technical Report 07-21.
- Abstract
Recently, Tsamardinos et al. [2006] presented an algorithm for Bayesian network structure learning
that outperforms many state-of-the-art algorithms in terms of efficiency, structure similarity and
likelihood. The Max-Min Hill Climbing algorithm is a hybrid of constraint-based and search-and-score
techniques, using greedy hill climbing to search a constrained space of possible network structures. The
constraints correspond to assertions of conditional independence that must hold in the network from
which the data were sampled. One would expect that constraining the space would make search both
faster and more accurate, focusing search on the “right” part of the space. The published results indicate,
however, that the resulting structures are less accurate when search is constrained. We reproduce these
results and explain why they occur. At small samples, the statistical test of conditional independence
has low power, which causes the algorithm to exclude edges between dependent variables. Also, the constraints
make search relatively harder, leading to errors in edge orientation. In an unconstrained space,
search can “repair” these errors by adding in extra edges. We conclude by proposing and evaluating an
improved algorithm.
-
- Text
- A PDF version of this paper is available.