FD-NL2SQL: Feedback-Driven Clinical NL2SQL that Improves with Use

arXiv cs.CL / 4/20/2026


Key Points

FD-NL2SQL is a feedback-driven clinical NL2SQL assistant designed for SQLite-based oncology trial databases, helping clinicians write multi-constraint SQL without deep schema knowledge.
The system uses a schema-aware LLM to decompose a clinician’s natural-language question into predicate-level sub-questions, retrieves semantically similar expert-verified NL2SQL exemplars, and synthesizes executable SQL with validity checks.
It improves over time by learning from clinician-approved edits to generated SQL and by applying lightweight logic-based SQL augmentation that keeps only variants yielding non-empty results.
An additional LLM automatically reconstructs natural-language questions and predicate decompositions for accepted augmented/edited variants, expanding the exemplar bank without further manual annotation.
The demo interface supports interactive refinement by exposing decomposition, retrieval, synthesis, and execution outcomes to users during query building.
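The exemplar-retrieval step in the key points can be pictured with a small sketch. This is not the authors' code: the `embed` function is a toy bag-of-words stand-in for a real sentence-embedding model, and the exemplar bank, table name, and column names are all hypothetical.

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words vector (assumption: stands in for a sentence-embedding model)."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical expert-verified exemplars: (natural-language question, SQL).
EXEMPLARS = [
    ("trials with overall survival endpoint",
     "SELECT trial_id FROM trials WHERE endpoint = 'OS'"),
    ("trials testing pembrolizumab",
     "SELECT trial_id FROM trials WHERE intervention = 'pembrolizumab'"),
    ("trials started after 2020",
     "SELECT trial_id FROM trials WHERE start_year > 2020"),
]

def retrieve(sub_question, bank, k=1):
    """Return the k exemplars whose questions are most similar to sub_question."""
    query_vec = embed(sub_question)
    ranked = sorted(bank, key=lambda ex: cosine(query_vec, embed(ex[0])), reverse=True)
    return ranked[:k]

best_question, best_sql = retrieve("which trials started after 2019", EXEMPLARS)[0]
```

In a real system each predicate-level sub-question would presumably be embedded with a trained sentence encoder, and the retrieved exemplars would be passed to the SQL-synthesis prompt along with the schema.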

Abstract

Clinicians exploring oncology trial repositories often need ad-hoc, multi-constraint queries over biomarkers, endpoints, interventions, and time, yet writing SQL requires schema expertise. We demo FD-NL2SQL, a feedback-driven clinical NL2SQL assistant for SQLite-based oncology databases. Given a natural-language question, a schema-aware LLM decomposes it into predicate-level sub-questions, retrieves semantically similar expert-verified NL2SQL exemplars via sentence embeddings, and synthesizes executable SQL conditioned on the decomposition, retrieved exemplars, and schema, with post-processing validity checks. To improve with use, FD-NL2SQL incorporates two update signals: (i) clinician edits of generated SQL are approved and added to the exemplar bank; and (ii) lightweight logic-based SQL augmentation applies a single atomic mutation (e.g., operator or column change), retaining variants only if they return non-empty results. A second LLM generates the corresponding natural-language question and predicate decomposition for accepted variants, automatically expanding the exemplar bank without additional annotation. The demo interface exposes decomposition, retrieval, synthesis, and execution results to support interactive refinement and continuous improvement.
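As a rough illustration of the second update signal, the sketch below applies one atomic operator mutation at a time to a seed query and keeps only the variants that return at least one row. It is a minimal sketch under stated assumptions, not the paper's implementation: the `trials` table, its columns, the operator swap table, and the whitespace-based tokenization are all hypothetical simplifications.

```python
import sqlite3

# Hypothetical operator swaps; the paper also mentions column changes.
OPERATOR_SWAPS = {">": ">=", "<": "<=", "=": "!="}

def single_operator_mutations(sql):
    """Yield variants of sql that differ from it by exactly one comparison operator.

    Assumes operators are whitespace-separated tokens (a simplification).
    """
    tokens = sql.split()
    for i, tok in enumerate(tokens):
        if tok in OPERATOR_SWAPS:
            yield " ".join(tokens[:i] + [OPERATOR_SWAPS[tok]] + tokens[i + 1:])

def keep_non_empty(conn, variants):
    """Keep only variants that execute successfully and return at least one row."""
    kept = []
    for variant in variants:
        try:
            if conn.execute(variant).fetchone() is not None:
                kept.append(variant)
        except sqlite3.Error:
            continue  # invalid SQL is simply discarded
    return kept

# Tiny in-memory database with made-up rows, for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE trials (trial_id TEXT, phase INTEGER, endpoint TEXT)")
conn.executemany(
    "INSERT INTO trials VALUES (?, ?, ?)",
    [("T1", 2, "OS"), ("T2", 3, "OS")],
)

seed = "SELECT trial_id FROM trials WHERE phase > 2 AND endpoint = 'OS'"
variants = list(single_operator_mutations(seed))
accepted = keep_non_empty(conn, variants)
```

Here the variant that relaxes `phase > 2` to `phase >= 2` returns rows and is kept, while the variant that flips `endpoint = 'OS'` to `endpoint != 'OS'` returns nothing and is dropped. Accepted variants would then be handed to the second LLM to generate a matching question and decomposition.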



