home - login - register

PERC 2025 Abstract Detail Page

Previous Page  |  New Search  |  Browse All

Abstract Title: Exploring Query-Driven Centroiding and Embedding Strategies for Automated Thematic Analysis in Physics Education
Abstract Type: Contributed Poster Presentation
Abstract: Understanding how Physics Education Research (PER) has changed and matured helps researchers and materially improve physics teaching and learning. The advent of large language models (LLMs) and embedding-based NLP techniques opens new avenues for this work by enabling the analysis of large corpora with minimal manual coding. We explore the use of text embeddings and retrieval-augmented generation (RAG)-like methods without generative AI to analyze themes across a random sample of 94 articles from major PER journals. Specifically, we examine how two methodological choices affect topic modeling accuracy: (1) representing articles using a single embedding versus multiple sentence-level embeddings, and (2) deriving topic centroids from representative texts versus researcher-defined queries. All results are evaluated against a human-coded dataset focused on four overarching thematic categories: teacher-centered, student-centered, physics content, and journal business. Our findings inform best practices for researchers seeking scalable, interpretable, and low-barrier approaches to literature analysis in PER.
Session Time: Poster Session A
Poster Number: A-9
Contributed Paper Record: Contributed Paper Information
Contributed Paper Download: Download Contributed Paper

Author/Organizer Information

Primary Contact: Michael Mingyar
Montana State University
Bozeman, MT 59715
Phone: 8149324710
Co-Author(s)
and Co-Presenter(s)
Tor Ole B. Odden, Center for Computing in Science Education, Department of Physics, University of Oslo, 0316 Oslo, Norway

Shannon Willoughby, Department of Physics, Montana State University,
Bozeman, MT 59717

Contributed Poster

Contributed Poster: Download the Contributed Poster