Can AI be a Teaching Partner? Evaluating ChatGPT, Gemini, and DeepSeek across Three Teaching Strategies
arXiv cs.AI / 3/31/2026
Opinion | Ideas & Deep Analysis | Models & Research
Key Points
- The study compares ChatGPT, Gemini, and DeepSeek as “teaching agents” using an evaluation protocol focused on three pedagogical strategies for beginner C programming learners: Examples, Explanations & Analogies, and the Socratic Method.
- Across Examples and Explanations/Analogies, the models show broadly similar interaction patterns, suggesting comparable effectiveness for those teaching approaches.
- With the Socratic Method, model behavior depends more heavily on both the chosen strategy and the initial prompt, so performance is less consistent without careful prompting.
- Human judges rated ChatGPT and Gemini higher overall, while DeepSeek scored lower across evaluation criteria, reflecting measurable differences in pedagogical quality among LLMs.
- The paper addresses a gap in empirical evidence about LLM pedagogical skills by using systematic human evaluation rather than relying on general claims about AI tutoring.