Deployment-Oriented Session-wise Meta-Calibration for Landmark-Based Webcam Gaze Tracking
arXiv cs.CV / 3/16/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- EMC-Gaze is a lightweight landmark-only gaze-tracking method that enables session-wise adaptation by using a shared geometric encoder and a small calibration set per session, with meta-training to differentiate through the ridge calibrator.
- It leverages an E(3)-equivariant landmark-graph encoder, local eye geometry, binocular emphasis, auxiliary 3D gaze supervision, and a differentiable closed-form ridge calibrator to achieve robust performance with reduced pose leakage via a two-view canonicalization consistency loss.
- In evaluations, EMC-Gaze achieves 5.79 ± 1.81 deg RMSE after 9-point calibration on fixation-style data (better than Elastic Net at 6.68 ± 2.34 deg) and shows larger gains for still-head queries; it maintains advantage across subject holdouts and performs well on MPIIFaceGaze with few-shot calibration.
- The exported eye-focused encoder has 944,423 parameters (about 4.76 MB in ONNX) and enables calibrated browser prediction in about 12.58 ms per sample (mean/median/p90) in Chromium with ONNX Runtime Web, demonstrating deployment practicality and a deployment-oriented operating point.
Related Articles

Astral to Join OpenAI
Dev.to

I Built a MITM Proxy to See What Claude Code Actually Sends to Anthropic
Dev.to

Your AI coding agent is installing vulnerable packages. I built the fix.
Dev.to

ChatGPT Prompt Engineering for Freelancers: Unlocking Efficient Client Communication
Dev.to

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.
Reddit r/LocalLLaMA