基础心理学

基于贝叶斯网模型的多级计分诊断测验分类及比较研究

  • 喻晓锋 ,
  • 肖遇春 ,
  • 秦春影
展开
  • 1. 江西师范大学心理学院,南昌 330022
    2. 南昌师范学院数学与信息科学学院,南昌 330032
喻晓锋,E-mail: xyu6@jxnu.edu.cn

收稿日期: 2022-01-16

  网络出版日期: 2023-03-23

基金资助

江西省教育科学“十四五”规划2021年度课题(21YB027)

版权

《心理与行为研究》编辑部, 2023, 版权所有,未经授权,不得转载、摘编本刊文章,不得使用本刊的版式设计。

A Comprehensive Comparison of Bayesian Networks Classification Model and Sequential Polytomous Scoring Diagnosis Model

  • Xiaofeng YU ,
  • Yuchun XIAO ,
  • Chunying QIN
Expand
  • 1. School of Psychology, Jiangxi Normal University, Nanchang 330022
    2. School of Mathematics and Information Science, Nanchang Normal University, Nanchang 330032

Received date: 2022-01-16

  Online published: 2023-03-23

Copyright

, 2023, Copyright reserved © 2023.

摘要

贝叶斯网模型提供了一种方便和直观的框架结构来表示变量间的关系,非常适合在诊断测验中对教育评估的内容进行建模。本研究将两种贝叶斯网分类模型与序列多级计分诊断模型S-GDINA进行综合比较。考察两种贝叶斯网分类模型与S-GDINA在Q矩阵正确界定和包含一定比例(25%、30%)的错误时,两者对被试的分类性能;并将贝叶斯网分类模型应用到实证数据中,展示贝叶斯网分类模型在实证数据中的分类过程和分类性能。研究结果表明:当Q矩阵由专家正确界定时,朴素贝叶斯分类模型的分类效果与S-GDINA模型相差不大,同样可以达到很好的分类效果,树增广的朴素贝叶斯分类模型的分类性能也能达到良好。实证结果进一步表明,将贝叶斯网分类模型应用于教育测量领域中的诊断分类工具是有其优势和可行的,尤其是当测验数据对于所选用诊断模型的拟合较差、测验的Q矩阵中包含错误或测验数据中包含较多的噪音时。

本文引用格式

喻晓锋 , 肖遇春 , 秦春影 . 基于贝叶斯网模型的多级计分诊断测验分类及比较研究[J]. 心理与行为研究, 2023 , 21(1) : 49 -57 . DOI: 10.12139/j.1672-0628.2023.01.008

Abstract

In recent years, Bayesian networks have been widely used in the field of artificial intelligence, but have received relatively little attention in the field of psychology. Most of the existing studies have applied Bayesian networks to binary items. This research combines Bayesian networks with polytomous items, which has important theoretical and practical implications. Two Bayesian network classification models were comprehensively compared with the sequential-GDINA (S-GDINA). The results showed that the classification performance of the naive Bayesian classifier (NBC) was comparable to that of the S-GDINA model, which could achieve equally good classification performance. Also, the classification performance of the tree augmented naive Bayesian classifier (TAN) could achieve good classification performance when the percentage of errors contained in the Q matrix was 0% (the Q matrix was correctly defined by the expert). Both the NBC and the TAN had better classification results in the polytomous scored test. The response classification consistency of the NBC and TAN reached 84% and 71%, respectively, indicating that Bayesian networks can be well applied to polytomous scored items.

参考文献

马奕帆. (2021). 基于三种结构学习算法的属性层级结构估计研究与应用(硕士学位论文). 江西师范大学, 南昌.
  肖遇春. (2021). 基于贝叶斯网模型的多级计分诊断测验分类研究(硕士学位论文). 江西师范大学, 南昌.
  喻晓锋, 丁树良, 秦春影, 陆云娜 贝叶斯网在认知诊断属性层级结构确定中的应用. 心理学报, 2011, 43 (3): 338- 346.
  喻晓锋, 罗照盛, 高椿雷, 秦春影 Q矩阵包含错误的诊断测验分类准确性比较. 心理科学, 2014, 37 (6): 1478- 1484.
  Almond, R. G., DiBello, L. V., Moulder, B., & Zapata-Rivera, J. Modeling diagnostic assessments with Bayesian networks. Journal of Educational Measurement, 2007, 44 (4): 341- 359.
  Béland, A., & Mislevy, R. J. Probability-based inference in a domain of proportional reasoning tasks. Journal of Educational Measurement, 1996, 33 (1): 3- 27.
  Chen, P., Xin, T., Wang, C., & Chang, H. H. Online calibration methods for the DINA model with independent attributes in CD-CAT. Psychometrika, 2012, 77 (2): 201- 222.
  de la Torre, J. The generalized DINA model framework. Psychometrika, 2011, 76 (2): 179- 199.
  Friedman, N., Geiger, D., & Goldszmidt, M. Bayesian network classifiers. Machine Learning, 1997, 29 (2): 131- 163.
  Gitomer, D. H., Steinberg, L. S., & Mislevy, R. J. (1995). Diagnostic assessment of troubleshooting skill in an intelligent tutoring system. In P. D. Nichols, S. F. Chipman, & R. L. Brennan (Eds.), Cognitively diagnostic assessment (pp. 73–101). Hillsdale, NJ: Erlbaum.
  Lee, J. (2003). Diagnosis of bugs in multi-column subtraction using Bayesian networks (Unpublished doctorial dissertation). Columbia University, New York.
  Lee, J. Y., & Corter, J. E. Diagnosis of subtraction bugs using Bayesian networks. Applied Psychological Measurement, 2011, 35 (1): 27- 47.
  Lee, Y. S., Park, Y. S., & Taylan, D. A cognitive diagnostic modeling of attribute mastery in Massachusetts, Minnesota, and the U.S. national sample using the TIMSS 2007. International Journal of Testing, 2011, 11 (2): 144- 177.
  Leighton, J. P., & Gierl, M. J. (2007). Cognitive diagnostic assessment for education. New York: Cambridge University Press.
  Ma, W. C., & de la Torre, J. A sequential cognitive diagnosis model for polytomous responses. British Journal of Mathematical and Statistical Psychology, 2016, 69 (3): 253- 275.
  Ma, W. C., & de la Torre, J. GDINA: An R package for cognitive diagnosis modeling. Journal of Statistical Software, 2020, 93 (14): 1- 26.
  Mihaljevic?, B., Bielza, C., Larra?aga, P. bnclassify: Learning Bayesian network classifiers. The R Journal, 2018, 10 (2): 455- 468.
  Mislevy, R. J. (1995). Probability-based inference in cognitive diagnosis. In P. D. Nichols, S. F. Chipman, & R. L. Brennan (Eds), Cognitively diagnostic assessment (pp. 43–71). Hillsdale, NJ: Erlbaum.
  Mislevy, R. J., Steinberg, L. S., Breyer, F. J., Almond, R. G., & Johnson, L. Making sense of data from complex assessments. Applied Measurement in Education, 2002, 15 (4): 363- 389.
  Neapolitan, R. E. (2003). Learning Bayesian networks. Harlow, United Kingdom: Prentice Hall.
  Nichols, P. D. A framework for developing cognitively diagnostic assessments. Review of Educational Research, 1994, 64 (4): 575- 603.
  Pearl, J. (1985). Bayesian networks: A model of self-activated memory for evidential reasoning. In Proceedings of the 7th Conference of the Cognitive Science Society (pp. 329–334). Irvine, CA.
  Rupp, A. A., & Templin, J. L. Unique characteristics of diagnostic classification models: A comprehensive review of the current state-of-the-art. Measurement: Interdisciplinary Research and Perspective, 2008, 6 (4): 219- 262.
  Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores Psychometrika, 34 (1), 1–97.
  Vomlel, J. (2004). Bayesian networks in educational testing. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 12(S1), 83–100.
  von Davier, M., & Lee, Y. S. (2019). Handbook of diagnostic classification models. New York: Springer.
  Wang, L. L., Xin, T., & Liu, Y. L. An improved parameter-estimating method in Bayesian networks applied for cognitive diagnosis assessment. Frontiers in Psychology, 2021, 12, 665441.
  Yadav, S., & Shukla, S. (2016). Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification. In 2016 IEEE 6th International Conference on Advanced Computing (pp. 78–83). Bhimavaram, India.
  Yu, X. F., & Cheng, Y. Data-driven Q-matrix validation using a residual-based statistic in cognitive diagnostic assessment. British Journal of Mathematical and Statistical Psychology, 2020, 73 (S1): 145- 179.
文章导航

/


版权所有 © 《心理与行为研究》编辑部
地址:天津市西青区宾水西道393号,天津师范大学106#邮箱 邮编:300387
电话:022-23540231, 23541213 E-mail:psybeh@126.com
本系统由北京玛格泰克科技发展有限公司设计开发