心理与行为研究 ›› 2023, Vol. 21 ›› Issue (1): 49-57.DOI: 10.12139/j.1672-0628.2023.01.008

• 基础心理学 • 上一篇    下一篇

基于贝叶斯网模型的多级计分诊断测验分类及比较研究

喻晓锋1,*(), 肖遇春1, 秦春影1,2   

  1. 1. 江西师范大学心理学院,南昌 330022
    2. 南昌师范学院数学与信息科学学院,南昌 330032
  • 收稿日期:2022-01-16 出版日期:2023-01-20 发布日期:2023-01-20
  • 通讯作者: 喻晓锋
  • 基金资助:
    江西省教育科学“十四五”规划2021年度课题(21YB027)

A Comprehensive Comparison of Bayesian Networks Classification Model and Sequential Polytomous Scoring Diagnosis Model

Xiaofeng YU1,*(), Yuchun XIAO1, Chunying QIN1,2   

  1. 1. School of Psychology, Jiangxi Normal University, Nanchang 330022
    2. School of Mathematics and Information Science, Nanchang Normal University, Nanchang 330032
  • Received:2022-01-16 Online:2023-01-20 Published:2023-01-20
  • Contact: Xiaofeng YU

摘要:

贝叶斯网模型提供了一种方便和直观的框架结构来表示变量间的关系,非常适合在诊断测验中对教育评估的内容进行建模。本研究将两种贝叶斯网分类模型与序列多级计分诊断模型S-GDINA进行综合比较。考察两种贝叶斯网分类模型与S-GDINA在Q矩阵正确界定和包含一定比例(25%、30%)的错误时,两者对被试的分类性能;并将贝叶斯网分类模型应用到实证数据中,展示贝叶斯网分类模型在实证数据中的分类过程和分类性能。研究结果表明:当Q矩阵由专家正确界定时,朴素贝叶斯分类模型的分类效果与S-GDINA模型相差不大,同样可以达到很好的分类效果,树增广的朴素贝叶斯分类模型的分类性能也能达到良好。实证结果进一步表明,将贝叶斯网分类模型应用于教育测量领域中的诊断分类工具是有其优势和可行的,尤其是当测验数据对于所选用诊断模型的拟合较差、测验的Q矩阵中包含错误或测验数据中包含较多的噪音时。

关键词: 认知诊断, 序列多级计分诊断模型, 贝叶斯网络

Abstract:

In recent years, Bayesian networks have been widely used in the field of artificial intelligence, but have received relatively little attention in the field of psychology. Most of the existing studies have applied Bayesian networks to binary items. This research combines Bayesian networks with polytomous items, which has important theoretical and practical implications. Two Bayesian network classification models were comprehensively compared with the sequential-GDINA (S-GDINA). The results showed that the classification performance of the naive Bayesian classifier (NBC) was comparable to that of the S-GDINA model, which could achieve equally good classification performance. Also, the classification performance of the tree augmented naive Bayesian classifier (TAN) could achieve good classification performance when the percentage of errors contained in the Q matrix was 0% (the Q matrix was correctly defined by the expert). Both the NBC and the TAN had better classification results in the polytomous scored test. The response classification consistency of the NBC and TAN reached 84% and 71%, respectively, indicating that Bayesian networks can be well applied to polytomous scored items.

Key words: cognitive diagnosis, S-GDINA, Bayesian networks

中图分类号: