JFinTEB: Japanese Financial Text Embedding Benchmark
arXiv cs.CL / 4/20/2026
💬 OpinionSignals & Early TrendsTools & Practical UsageModels & Research
Key Points
- The paper introduces JFinTEB, a benchmark dedicated to evaluating Japanese financial text embeddings, focusing on gaps left by existing general-purpose embedding benchmarks.
- It includes both retrieval and classification tasks that mirror realistic financial text processing, such as instruction-following retrieval and sentiment/document categorization.
- The benchmark is evaluated across many embedding models, including Japanese-specific models of different sizes, multilingual models, and commercial embedding services.
- The authors publicly release the JFinTEB datasets and an evaluation framework to standardize assessment and support future research in Japanese financial text mining.
- Overall, JFinTEB provides a domain-specific foundation for advancing research on embeddings for Japanese financial applications.



