Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs
arXiv cs.AI / 5/1/2026
📰 NewsModels & Research
Key Points
- The paper introduces MEDS (Math Education Digital Shadows), a new dataset designed to measure how LLMs reason about and report math under both human-like and AI-assistant-like conditions.
- MEDS is built from 28,000 “personas” across 14 LLMs (including Mistral, Qwen, DeepSeek, Granite, Phi, and Grok), with each shadow containing math prompts plus psychological and sociodemographic persona metadata.
- The dataset goes beyond traditional math benchmarks by including task types and measures tied to self-efficacy, math anxiety, cognitive networks/attitudes, and confidence—not just math accuracy.
- Validation results indicate schema integrity and consistent persona behavior, while also revealing family-specific patterns such as human-like negative math attitudes, logical fallacies, and overconfidence.
- MEDS is intended to support learning analytics, cognitive science research, and the development of safer math tutoring AI systems.
Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works
Dev.to

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model
THE DECODER

Qualcomm teases ‘dedicated CPU for agentic experiences’ and ‘agentic smartphones’
The Register
Finetuning Dataset: Claude Opus 4.6/4.7 - 8.7k Chats
Reddit r/LocalLLaMA
![Phosphene local video and audio generation for Apple Silicon open source (LTX 2.3) [P]](/_next/image?url=https%3A%2F%2Fpreview.redd.it%2Fvutakjb0vgyg1.png%3Fwidth%3D140%26height%3D59%26auto%3Dwebp%26s%3D08ecb95fd65ade25c924988f1992e9abe3d79f62&w=3840&q=75)
Phosphene local video and audio generation for Apple Silicon open source (LTX 2.3) [P]
Reddit r/MachineLearning