Exploring Data Augmentation and Resampling Strategies for Transformer-Based Models to Address Class Imbalance in AI Scoring of Scientific Explanations in NGSS Classroom
arXiv cs.LG / 4/23/2026
💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper studies how data augmentation can improve transformer-based (SciBERT) text classification for automated rubric scoring of NGSS-aligned scientific explanations, where class imbalance is especially severe for advanced-reasoning categories.
- Using a dataset of 1,466 high-school responses labeled across 11 binary-coded analytic rubric categories, the authors compare multiple augmentation methods against fine-tuning alone and a traditional oversampling baseline (SMOTE).
- GPT-4–generated synthetic responses boost both precision and recall, ALP achieves perfect precision/recall/F1 for the most severely imbalanced categories, and EASE improves alignment with human scoring across both correct scientific ideas and inaccurate ideas.
- Overall results suggest that targeted augmentation can mitigate severe imbalance without overfitting and while preserving coverage needed for learning-progression-aligned automated scoring at scale in science education.
Related Articles

Black Hat USA
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

10 AI Tools Every Developer Should Try in 2026
Dev.to

Why use an AI gateway at all?
Dev.to