|
陈茹玲, 蔡鑫廷, 宋曜廷, 李宜宪. 文本适读性分级架构之建立研究. 教育科学研究期刊, 2015, 60 (1): 1- 32.
|
|
刘苗苗, 李燕, 王欣萌, 甘琳琳, 李虹. 分级阅读初探: 基于小学教材的汉语可读性公式研究. 语言文字应用, 2021 (2): 116- 126.
DOI
|
|
杨慊, 贺文洁, 王海龙. 单参数单维度Rasch模型的优势与意义. 心理科学, 2021, 44 (6): 1491- 1498.
|
|
中国新闻出版研究院. (2022). 第十九次全国国民阅读调查结果. 2022-11-30取自https://society.huanqiu.com/article/47ix20UIt5x
|
|
Bartholomew, S. R., Ruesch, E. Y., Hartell, E., & Strimel, G. J. Identifying design values across countries through adaptive comparative judgment. International Journal of Technology and Design Education, 2020, 30 (2): 321- 347.
DOI
|
|
Bloxham, S. Marking and moderation in the UK: False assumptions and wasted resources. Assessment & Evaluation in Higher Education, 2009, 34 (2): 209- 220.
DOI
|
|
Bradley, R. A., & Terry, M. E. Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika, 1952, 39 (3–4): 324- 345.
DOI
|
|
Bramley, T. (2007). Paired comparison methods. In P. Newton, J. A. Baird, H. Goldstein, H. Patrick, & P. Tymms (Eds.), Techniques for monitoring the comparability of examination standards (pp. 246–300). London: Qualifications and Curriculum Authority.
|
|
Bramley, T. (2015). Investigating the reliability of adaptive comparative judgment. Cambridge: Cambridge University Press & Assessment.
|
|
Bramley, T., & Vitello, S. (2019). The effect of adaptivity on the reliability coefficient in adaptive comparative judgement. Assessment in Education: Principles, Policy & Practice, 26(1), 43–58.
|
|
Chall, J. S., & Conard, S. S. (1991). Should textbooks challenge students? :The case for easier or harder books. New York: Teachers College Press.
|
|
Chen, S. Y., & Fang, S. P. Developing a Chinese version of an author recognition test for college students in Taiwan. Journal of Research in Reading, 2015, 38 (4): 344- 360.
DOI
|
|
Coertjens, L., Lesterhuis, M., Verhavert, S., van Gasse, R., & De Maeyer, S. Judging texts with rubrics and comparative judgement: Taking into account reliability and time investment. Pedagogische Studien, 2017, 94 (4): 283- 303.
|
|
Crompvoets, E. A. V., Béguin, A. A., & Sijtsma, K. Adaptive pairwise comparison for educational measurement. Journal of Educational and Behavioral Statistics, 2020, 45 (3): 316- 338.
DOI
|
|
Crossley, S., Heintz, A., Choi, J. S., Batchelor, J., Karimi, M., & Malatinszky, A. A large-scaled corpus for assessing text readability. Behavior Research Methods, 2023, 55 (2): 491- 507.
DOI
|
|
Crossley, S. A., Skalicky, S., & Dascalu, M. Moving beyond classic readability formulas: New methods and new models. Journal of Research in Reading, 2019, 42 (3–4): 541- 561.
DOI
|
|
Dale, E., & Chall, J. S. The concept of readability. Elementary English, 1949, 26 (1): 19- 26.
|
|
Fountas, I. C., & Pinnell, G. S. Guided reading: The romance and the reality. The Reading Teacher, 2012, 66 (4): 268- 284.
DOI
|
|
Fry, E. Readability versus leveling. The Reading Teacher, 2002, 56 (3): 286- 291.
|
|
Jones, I., & Inglis, M. The problem of assessing problem solving: Can comparative judgement help?. Educational Studies in Mathematics, 2015, 89 (3): 337- 355.
DOI
|
|
Jones, I., Swan, M., & Pollitt, A. Assessing mathematical problem solving using comparative judgement. International Journal of Science and Mathematics Education, 2015, 13 (1): 151- 177.
DOI
|
|
Kuhn, M. R., Schwanenflugel, P. J., & Meisinger, E. B. Aligning theory and assessment of reading fluency: Automaticity, prosody, and definitions of fluency. Reading Research Quarterly, 2010, 45 (2): 230- 251.
DOI
|
|
Landrieu, Y., De Smedt, F., van Keer, H., & De Wever, B. Assessing the quality of argumentative texts: Examining the general agreement between different rating procedures and exploring inferences of (dis)agreement cases. Frontiers in Education, 2022, 7, 784261.
DOI
|
|
Lesterhuis, M., Bouwer, R., van Daal, T., Donche, V., & De Maeyer, S. Validity of comparative judgment scores: How assessors evaluate aspects of text quality when comparing argumentative texts. Frontiers in Education, 2022, 7, 823895.
DOI
|
|
Lesterhuis, M., van Daal, T., van Gasse, R., Coertjens, L., Donche, V., & De Maeyer, S. When teachers compare argumentative texts: Decisions informed by multiple complex aspects of text quality. L1-Educational Studies in Language and Literature, 2018, 18 (1): 1- 22.
DOI
|
|
Liu, M. M., Li, Y. X., Su, Y. Q., & Li, H. Text complexity of Chinese elementary school textbooks: Analysis of text linguistic features using machine learning algorithms. Scientific Studies of Reading, 2024, 28 (3): 235- 255.
DOI
|
|
Luce, R. D. (1959). Individual choice behavior: A theoretical analysis. New York: John Wiley & Sons, Inc.
|
|
Meng, X. L., Rosenthal, R., & Rubin, D. B. Comparing correlated correlation coefficients. Psychological Bulletin, 1992, 111 (1): 172- 175.
DOI
|
|
Paquot, M., Rubin, R., & Vandeweerd, N. Crowdsourced adaptive comparative judgment: A community-based solution for proficiency rating. Language Learning, 2022, 72 (3): 853- 885.
DOI
|
|
Pollitt, A. (2012). The method of adaptive comparative judgement. Assessment in Education: Principles, Policy & Practice, 19(3), 281–300.
|
|
Pollitt, A., & Murray, N. L. (1996). What raters really pay attention to. In M. Milanovic & N. Saville (Eds.), Studies in language testing 3: Performance testing, cognition and assessment (pp. 74–91). Cambridge: Cambridge University Press.
|
|
Renaissance. (2022). What kids are reading report 2022. Retrieved November 30, 2022, from https://www.renaissance.com/2022/03/01/news-renaissance-shares-findings-of-worlds-largest-annual-k12-reading-survey/
|
|
Sheehan, K. M., Kostin, I., Napolitano, D., & Flor, M. The TextEvaluator tool: Helping teachers and test developers select texts for use in instruction and assessment. The Elementary School Journal, 2014, 115 (2): 184- 209.
DOI
|
|
Smith, D. R., Stenner, A. J., Horabin, I., & Smith, M. (1989). The lexile scale in theory and practice: Final report for NIH Grant HD-19448. Bethesda, MD: National Institutes of Health.
|
|
Thurstone, L. L. A law of comparative judgment. Psychological Review, 1927, 34 (4): 273- 286.
|
|
Thwaites, P., Kollias, C., & Paquot, M. Is CJ a valid, reliable form of L2 writing assessment when texts are long, homogeneous in proficiency, and feature heterogeneous prompts?. Assessing Writing, 2024, 60, 100843.
DOI
|
|
Verhavert, S., Bouwer, R., Donche, V., & De Maeyer, S. (2019). A meta-analysis on the reliability of comparative judgement. Assessment in Education: Principles, Policy & Practice, 26(5), 541–562.
|
|
Verhavert, S., De Maeyer, S., Donche, V., & Coertjens, L. Scale separation reliability: What does it mean in the context of comparative judgment?. Applied Psychological Measurement, 2018, 42 (6): 428- 445.
DOI
|
|
Wheadon, C., Barmby, P., Christodoulou, D., & Henderson, B. (2020). A comparative judgement approach to the large-scale assessment of primary writing in England. Assessment in Education: Principles, Policy & Practice, 27(1), 46–64.
|