Agentic Jackal: Live Execution and Semantic Value Grounding for Text-to-JQL
arXiv cs.CL / 4/13/2026
📰 NewsSignals & Early TrendsTools & Practical UsageModels & Research
Key Points
- The paper introduces Jackal, the first execution-based benchmark for translating natural language into Jira Query Language (JQL), including 100,000 validated NL–JQL pairs grounded in a live Jira instance with 200,000+ issues.
- It argues that single-pass LLMs struggle with instance-specific categorical values, ambiguous field references, and Boolean predicate generation, because they cannot verify queries against live data.
- To address this, the authors propose Agentic Jackal, a tool-augmented agent that uses the Jira MCP server for live execution and JiraAnchor for semantic retrieval of categorical values via embedding-based similarity.
- Across nine frontier LLMs, single-pass models achieve an average of 43.4% execution accuracy on short queries, and the agentic approach improves 7 of 9 models, with a 9.0% relative gain on the hardest linguistic variant.
- An ablation shows JiraAnchor’s impact is large (categorical-value accuracy rising from 48.7% to 71.7%), and the study finds dominant failure modes are semantic ambiguities like issue-type and text-field selection rather than only value-resolution errors.




