Materials Similar to Investigation of the Inter-Rater Reliability between Large Language Models and Human Raters in Qualitative Analysis
- 30%: Machine learning for automated content analysis: characteristics of training data impact reliability
- 30%: Assessing a combined human coding and natural language processing method for qualitative analysis in physics education research
- 29%: Analysis of student essays in an introductory physics course using natural language processing
- 29%: Rasch model based analysis of the Force Concept Inventory
- 29%: Exploring Large Language Models as Formative Feedback Tools in Physics
- 25%: Model analysis: Representing and assessing the dynamics of student learning
- 25%: Identifying a preinstruction to postinstruction factor model for the Force Concept Inventory within a multitrait item response theory framework
- 25%: Students' Perceptions to a Large Language Model's Generated Feedback and Scores of Argumentation Essays
- 24%: Evaluation of high school Cambodian students’ comprehension of the projectile trajectory using the model analysis technique
- 24%: Exploratory factor analysis of a Force Concept Inventory data set
- 23%: Item response theory analysis of the mechanics baseline test
- 23%: Assessing the assessment: Mutual information between response choices and factor scores
- 22%: Approaches to data analysis of multiple-choice questions
- 22%: Modernizing use of regression models in physics education research: A review of hierarchical linear modeling
- 22%: Associations between learning assistants, passing introductory physics, and equity: A quantitative critical race theory investigation
- 22%: Investigating institutional influence on graduate program admissions by modeling physics Graduate Record Examination cutoff scores
- 21%: Model analysis of fine structures of student models: An example with Newton's third law
- 21%: Applying machine learning models in multi-institutional studies can generate bias




