Studies of Psychology and Behavior ›› 2026, Vol. 24 ›› Issue (2): 151-160.DOI: 10.12139/j.1672-0628.2026.02.002

Previous Articles     Next Articles

The Application of the Comparative Judgment in Chinese Text Difficulty Assessment

YANG Guandoudou1, TAN Jingwen1,2, LIU Miaomiao1,3, LI Hong1   

  1. 1. Beijing Key Laboratory of Applied Experimental Psychology, National Demonstration Center for Experimental Psychology Education(Beijing Normal University), Institute of Children’s Reading and Learning, Faculty of Psychology, Beijing Normal University, Beijing 100875;
    2. Shenzhen Hongling Primary School, Shenzhen 518000;
    3. Faculty of Education, Henan Normal University, Xinxiang 453007
  • Received:2025-03-18 Online:2026-03-20 Published:2026-03-20

两两比较在汉语文本难度评估中的应用

杨官豆豆1, 谭静文1,2, 刘苗苗1,3, 李虹1   

  1. 1. 北京师范大学心理学部,应用实验心理北京市重点实验室,心理学国家级实验教学示范中心(北京师范大学),儿童阅读与学习研究院,北京 100875;
    2. 深圳市红岭小学,深圳 518000;
    3. 河南师范大学教育学部,新乡 453007
  • 通讯作者: 李虹
  • 基金资助:
    国家语委“十四五”科研规划项目(WT45-41);国家社会科学基金教育学一般项目“AI赋能因材施教:基于智能体的学习困难的多层级评估与个体化干预体系研究”(BBA250057)。

Abstract: One effective way to obtain a trustworthy assessment of text difficulty is to employ comparative judgment, a holistic assessment method that has not yet been utilized in Chinese research. The current study gathered 80 evaluators and evaluated the difficulty of 80 texts using the comparative judgment approach. The results showed that the results obtained from comparative judgment had high reliability and were significantly correlated with the number of volumes and readability scores. The reliability and validity tended to be stable with the increase in the number of comparative judgments. No effect of the evaluator characteristics on the comparative judgment results was found, which implies that comparative judgment is also reliable in the assessment of the difficulty of Chinese texts.

Key words: Chinese leveled reading, text difficulty, comparative judgment, elementary-school-level Chinese language and literature textbooks

摘要: 两两比较作为一种相对整体并能够高效获得可靠结果的文本难度评估手段,在汉语文本评估中的效果还有待探索。本研究通过80名评估者对80篇文本的两两比较,探讨了比较次数对两两比较信效度的影响。结果表明,两两比较获得的结果具有较高的信度,且与文本册数和可读性分数显著相关,随着比较次数的增加,信效度逐渐增加且趋于稳定,同时未发现评估者特征对比较结果的影响,这意味着两两比较在汉语文本难度评估中具备一定可靠性。

关键词: 汉语分级阅读, 文本难度, 两两比较, 小学语文教材

CLC Number: