Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA
arXiv cs.CL / 4/8/2026
💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper addresses how to improve LLM function calling for online financial question-answering by tailoring API tool use to the financial domain rather than relying on generic function-calling behavior.
- It proposes a data-driven pipeline with periodic dataset construction and updates, using user-query-related samples to better match real online query patterns.
- The method introduces an augmentation strategy called AugFC to explore possible function parameter values, increasing diversity and mitigating out-of-distribution issues between user queries and required API inputs.
- A two-step training approach is used to teach the LLM to correctly invoke financial functions, with experiments on offline datasets and an online deployment scenario showing improved performance.
- The pipeline has been adopted in YuanBao, a large-scale financial chat platform in China, indicating practical value beyond offline evaluation.



