Studies of Psychology and Behavior, 2023, Vol. 21, Issue (2): 253-259. DOI: 10.12139/j.1672-0628.2023.02.015


The Evaluation of the Quality of Postgraduate Entrance Examination Based on IRT

SONG Xueling1, LIANG Zhengyan2   

  1. National Education Examinations Authority, Beijing 100084;
    2. School of Psychology, South China Normal University, Guangzhou 510631
  • Received: 2022-11-09 Online: 2023-03-20 Published: 2023-03-20

Evaluation of Item-Writing Quality in the Postgraduate Entrance Examination Based on Item Response Theory

宋学玲1, 梁正妍2   

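  1. National Education Examinations Authority, Ministry of Education, Beijing 100084;
    2. School of Psychology, South China Normal University, Guangzhou 510631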
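  • Corresponding author: LIANG Zhengyan, E-mail: 2020010220@m.scnu.edu.cn
  • Funding:
    General Project of the National Education Examinations Research Program (GJK2021049); Key Project of the National Education Examinations Research Program (GJK2021020)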

Abstract: Evaluating item-writing quality is an important part of managing the postgraduate entrance examination. Based on 22,126 randomly sampled response records from the foundational subject for the psychology major in the 2022 national postgraduate entrance examination, the quality of the test paper was analyzed under item response theory (IRT) using SPSS 21.0 and R, with a two-parameter logistic model and a generalized partial credit model. The results showed that 1) the structure of the test paper was fully consistent with the requirements of the examination syllabus; 2) all item types emphasized basic knowledge; 3) examinees' abilities were widely distributed; and 4) the test information basically met the requirements. In terms of difficulty, the test paper was moderate to easy. In terms of discrimination, some items were poor, and the proportion of mediocre items was also relatively high. From the perspective of the information function, 1) the maximum information of most items was higher than the expected information; 2) Experimental Psychology and Psychological Statistics and Measurement provided a larger share of the maximum information than they were expected to provide; and 3) the test information function curve lay toward the left overall, so discrimination among examinees of higher ability needs improvement.

Key words: item response theory (IRT), postgraduate entrance examination, evaluation

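Abstract: Evaluating item-writing quality is an important part of examination management for postgraduate admissions. For the foundational subject for the psychology major in the 2022 national master's entrance examination, 22,126 response records were randomly sampled, and item-writing quality was evaluated with a two-parameter logistic model and a generalized partial credit model using SPSS 21.0 and R. The results showed that the structure of the test paper was fully consistent with the requirements of the examination syllabus; all item types emphasized basic knowledge; examinees' abilities were widely distributed; and the test information basically met the requirements. In terms of difficulty, the test paper was moderate to easy. In terms of discrimination, some items were still poor, and the proportion of mediocre items was also relatively high. In terms of the information function, the maximum information of the vast majority of items was higher than the expected information; Experimental Psychology and Psychological Statistics and Measurement provided a larger share of the maximum information than they should have provided; and the test information function curve lay toward the left overall, so discrimination among examinees of higher ability needs improvement.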

Key words: item response theory, postgraduate entrance examination, evaluation
