Decision-Level Ordinal Modeling for Multimodal Essay Scoring with Large Language Models
arXiv cs.CL / 3/17/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes Decision-Level Ordinal Modeling (DLOM) that treats AES scoring as explicit ordinal decisions by using the language model head to extract score-wise logits for predefined score tokens, addressing limitations of autoregressive generation in multimodal AES.
- It adds DLOM-GF for multimodal AES, a gated fusion module that adaptively combines textual and visual score logits, and DLOM-DA for text-only AES with a distance-aware regularization term to reflect ordinal distances.
- Experiments on the multimodal EssayJudge dataset show DLOM improves over a generation-based SFT baseline across traits, with DLOM-GF providing further gains when modality relevance is heterogeneous; on ASAP/ASAP++ benchmarks, DLOM remains effective without visuals, and DLOM-DA further improves performance and outperforms strong baselines.
- The work enables direct optimization in score space, offering a more interpretable and robust framework for ordinal rubric scoring in LLM-based AES across both multimodal and text-only settings.
Related Articles
I Was Wrong About AI Coding Assistants. Here's What Changed My Mind (and What I Built About It).
Dev.to

Interesting loop
Reddit r/LocalLLaMA
Qwen3.5-122B-A10B Uncensored (Aggressive) — GGUF Release + new K_P Quants
Reddit r/LocalLLaMA
A supervisor or "manager" Al agent is the wrong way to control Al
Reddit r/artificial
FeatherOps: Fast fp8 matmul on RDNA3 without native fp8
Reddit r/LocalLLaMA