J. Whittaker, Graphical models in applied multivariate statistics, 1990.

J. Cheng, D. Bell, and W. Liu, Learning belief networks from data: an information theory based approach, the 6th ACM International Conference on Information and Knowledge Management, pp.207-216, 1997.

J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988.

R. G. Cowell, A. P. Dawid, S. L. Lauritzen, and D. J. Spiegelhalter, Probabilistic networks and expert systems. Statistics for engineering and information science, 1999.

G. F. Cooper and E. Herskovits, A bayesian method for the induction of probabilistic networks from data, Machine Learning, vol.9, issue.4, pp.309-347, 1992.

D. J. Spiegelhalter, A. P. Dawid, S. L. Lauritzen, and R. G. Cowell, Bayesian analysis in expert systems, Statistical Science, vol.8, pp.219-282, 1993.

W. Lam and F. Bacchus, Learning bayesian belief networks: An approach based on the mdl principle, Computational Intelligence, vol.10, pp.269-293, 1994.

D. Heckerman, D. Geiger, and D. M. Chickering, Learning bayesian networks: The combination of knowledge and statistical data, Machine Learning, vol.20, issue.3, pp.197-243, 1995.

C. Chow and C. Liu, Approximating discrete probability distributions with dependence trees, IEEE Transactions on Information Theory, vol.14, pp.462-467, 1968.

J. Pearl and T. S. Verma, A theory of inferred causation, Principles of Knowledge Representation and Reasoning (KR'91, pp.441-452, 1991.

P. Spirtes, C. Glymour, and R. Scheines, Causation, Prediction, and Search, Lecture Notes in Statistics, 1993.

P. Spirtes and C. Meek, Learning bayesian networks with discrete variables from data, 1st International Conference on Knowledge Discovery and Data Mining (KDD'95, 1995.

D. Heckerman, A tutorial on learning with bayesian networks, the NATO Advanced Study Institute on Learning in graphical models, pp.301-354, 1998.

S. L. Lauritzen, The em algorithm for graphical association models with missing data, Computational Statistics and Data Analysis, vol.19, pp.191-201, 1995.

A. P. Dempster, N. M. Laid, and D. B. Rubin, Maximum likelihood from incomplete data via the em algorithm, Journal of the Royal Statistical Society, vol.39, issue.1, pp.1-38, 1977.

D. M. Chickering and D. Heckerman, Efficient approximations for the marginal likelihood of bayesian networks with hidden variables, Machine Learning, vol.29, issue.2-3, pp.181-212, 1997.

R. J. Little and D. B. Rubin, Statistical analysis with missing data, 1987.

N. Friedman, Learning belief networks in the presence of missing values and hidden variables, 14th International Conference on Machine Learning, pp.125-133, 1997.

N. Friedman, The bayesian structural em algorithm, 14th Conference on Uncertainty in Artificial Intelligence, pp.129-138, 1998.

P. Leray and O. François, Bayesian network structural learning and incomplete data, International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR'05, pp.33-40, 2005.

J. W. Myers, K. B. Laskey, and T. S. Levitt, Learning bayesian networks from incomplete data with stochastic search algorithms, 15th Conference on Uncertainty in Artificial Intelligence (UAI'99), 1999.

J. W. Myers, K. B. Laskey, and K. Dejong, Learning bayesian networks from incomplete data using evolutionary algorithms, Genetic and Evolutionary Computation Conference (GECCO'99), 1999.

R. G. Cowell, Parameter estimation from incomplete data for bayesian networks, International Workshop on Artificial Intelligence and Statistics, pp.193-196, 1999.

M. F. Ramoni and P. Sebastiani, The use of exogenous knowledge to learn bayesian networks from incomplete databases, Second International Symposium on Advances in Intelligent Data Analysis and Reasoning about Data (IDA'97, vol.1280, 1997.

M. F. Ramoni and P. Sebastiani, Parameter estimation in bayesian networks from incomplete databases, Intelligent Data Analysis, vol.2, issue.1, pp.139-160, 1998.

M. F. Ramoni and P. Sebastiani, Learning bayesian networks from incomplete databases, 13th Conference on Uncertainty in Artificial Intelligence (UAI'97, pp.401-408, 1997.

C. Riggelsen and A. J. Feelders, Learning bayesian network models from incomplete data using importance sampling, 10th International Workshop on Artificial Intelligence and Statistics, pp.301-308, 2005.

X. Li, X. He, and S. Yuan, Learning bayesian networks structures from incomplete data: An efficient approach based on extended evolutionary programming, 9th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, pp.474-479, 2005.

X. Li, X. He, and S. Yuan, A new method of learning bayesian networks structures from incomplete data, 15th International Conference on Artificial Neural Networks, (ICANN'05, pp.261-266, 2005.

C. Riggelsen, Learning bayesian networks from incomplete data: An efficient method for generating approximate predictive distributions, 6th SIAM International Conference on Data Mining (SDM'06, 2006.

A. Ragel and B. Cremilleux, Treatment of missing values for association rules, Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.258-270, 1998.

R. Agrawal, T. Imielinski, and A. N. Swami, Mining Association Rules between Sets of Items in Large Databases, the ACM SIGMOD International Conference on Management of Data, pp.207-216, 1993.

D. Poole, A. Mackworth, and R. Goebel, Computational Intelligence, 1998.

S. L. Lauritzen, D. J. Spiegelhalter, I. A. Beinlich, H. J. Suermondt, R. M. Chavez et al., Local computations with probabilities on graphical structures and their application to expert systems, pp.415-448, 1990.