HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings
arXiv cs.CL / 3/20/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- Introduces the HiFi-KPI dataset, a large-scale resource for hierarchical KPI extraction from earnings filings, comprising 1.65M paragraphs and 198k hierarchical labels linked to iXBRL taxonomies.
- Defines three evaluation tasks (KPI classification, KPI extraction, and structured KPI extraction) and releases HiFi-KPI-Lite, a manually curated 8K-paragraph subset.
- Reports strong baselines: encoder-based models reach over 0.906 macro-F1 on classification, while LLMs achieve about 0.440 F1 on structured extraction, with most errors tied to date handling.
- Open-sources all code and data at the provided GitHub repository, facilitating reproducibility and further research.
- Aims to improve cross-company transferability of KPI tagging in financial filings and accelerate rapid evaluation for KPI extraction systems.
Related Articles

ベテランの若手育成負担を減らせ、PLC制御の「ラダー図」をAIで生成
日経XTECH

Your AI generated code is "almost right", and that is actually WORSE than it being "wrong".
Dev.to

Lessons from Academic Plagiarism Tools for SaaS Product Development
Dev.to

Windsurf’s New Pricing Explained: Simpler AI Coding or Hidden Trade-Offs?
Dev.to

Building Production RAG Systems with PostgreSQL: Complete Implementation Guide
Dev.to