Enhancing Science Classroom Discourse Analysis through Joint Multi-Task Learning for Reasoning-Component Classification
arXiv cs.CL / 4/24/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- The paper introduces ADAS, an automated system for analyzing science classroom discourse by jointly classifying utterance type and reasoning components for both teacher and student speech.
- To handle strong label imbalance, the authors use stratified re-splitting of the dataset, LLM-based synthetic data augmentation focused on minority classes, and a dual-probe RoBERTa-base classifier.
- They report that a zero-shot GPT-5.4 baseline reaches macro-F1 around 0.47 for utterance type (UT) and 0.476 for reasoning components (RC), providing upper bounds for prompt-only methods and motivation for fine-tuning.
- Beyond classification, the study performs several discourse analyses (e.g., UTxRC co-occurrence, cognitive complexity, lag-sequential, and IRF chain analyses) and finds teacher “Feedback-with-Question (Fq)” moves are the most consistent antecedents of students’ inferential reasoning (SR-I).
- The results suggest LLM augmentation improves minority-class recognition for UT, while the RC task’s structural simplicity makes it more tractable even for lexical baselines.
Related Articles

Black Hat USA
AI Business

The 67th Attempt: When Your "Knowledge Management" System Becomes a Self-Fulfilling Prophecy of Excellence
Dev.to

Context Engineering for Developers: A Practical Guide (2026)
Dev.to

GPT-5.5 is here. So is DeepSeek V4. And honestly, I am tired of version numbers.
Dev.to
AI Visibility Tracking Exploded in 2026: 6 Tools Every Brand Needs Now
Dev.to