Automating Database-Native Function Code Synthesis with LLMs
arXiv cs.AI / 2026/4/10
💬 オピニオンDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
要点
- The paper highlights that generating database-native (kernel) functions is increasingly urgent and hard because a single function may require multi-unit registration, cross-reference linking, and correct logic implementation.
- It introduces DBCooker, an LLM-based system tailored specifically for database-native function synthesis rather than generic code-generation workflows.
- DBCooker characterizes functions by aggregating declarations, detecting units needing specialized coding, and tracing dependencies across units to provide the right context to the model.
- It uses a pseudo-code plan generator and a hybrid fill-in-the-blank approach to build structured implementation skeletons and integrate reusable internal routines.
- A three-level progressive validation pipeline (syntax, standards compliance, and LLM-guided semantic checks) plus adaptive orchestration with history from similar functions leads to higher accuracy, including synthesizing functions not present in the latest SQLite.