Agentar-Fin-OCR
arXiv cs.CV / 3/12/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- Agentar-Fin-OCR is introduced as a document parsing system tailored to financial-domain documents that converts ultra-long PDFs into semantically consistent, structured outputs with auditing-grade provenance.
- It combines Cross-page Contents Consolidation and Document-level Heading Hierarchy Reconstruction to restore continuity across pages and build a globally consistent TOC for structure-aware retrieval, along with a difficulty-adaptive curriculum learning strategy and a CellBBoxRegressor to localize table cells from decoder states without external detectors.
- The work introduces FinDocBench, a benchmark with six financial document categories and metrics like TocEDS, cross-page TEDS, and Table Cell IoU to evaluate table parsing across finance documents.
- Experiments show state-of-the-art models on FinDocBench and position Agentar-Fin-OCR as a practical foundation for reliable downstream financial document applications.
Related Articles
How to Enforce LLM Spend Limits Per Team Without Slowing Down Your Engineers
Dev.to
v1.82.6.rc.1
LiteLLM Releases
How political censorship actually works inside Qwen, DeepSeek, GLM, and Yi: Ablation and behavioral results across 9 models
Reddit r/LocalLLaMA
Reduce errores y costos de tokens en agentes con seleccion semantica de herramientas
Dev.to
How I Built Enterprise Monitoring Software in 6 Weeks Using Structured AI Development
Dev.to