Decision-Level Ordinal Modeling for Multimodal Essay Scoring with Large Language Models
arXiv cs.CL / 3/17/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes Decision-Level Ordinal Modeling (DLOM) that treats AES scoring as explicit ordinal decisions by using the language model head to extract score-wise logits for predefined score tokens, addressing limitations of autoregressive generation in multimodal AES.
- It adds DLOM-GF for multimodal AES, a gated fusion module that adaptively combines textual and visual score logits, and DLOM-DA for text-only AES with a distance-aware regularization term to reflect ordinal distances.
- Experiments on the multimodal EssayJudge dataset show DLOM improves over a generation-based SFT baseline across traits, with DLOM-GF providing further gains when modality relevance is heterogeneous; on ASAP/ASAP++ benchmarks, DLOM remains effective without visuals, and DLOM-DA further improves performance and outperforms strong baselines.
- The work enables direct optimization in score space, offering a more interpretable and robust framework for ordinal rubric scoring in LLM-based AES across both multimodal and text-only settings.
Related Articles
Hey dev.to community – sharing my journey with Prompt Builder, Insta Posts, and practical SEO
Dev.to
How to Build Passive Income with AI in 2026: A Developer's Practical Guide
Dev.to
The Research That Doesn't Exist
Dev.to
ベテランの若手育成負担を減らせ、PLC制御の「ラダー図」をAIで生成
日経XTECH
Jeff Bezos reportedly wants $100 billion to buy and transform old manufacturing firms with AI
TechCrunch