GSI Agent: Domain Knowledge Enhancement for Large Language Models in Green Stormwater Infrastructure
arXiv cs.AI / 3/18/2026
📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces GSI Agent, a domain-enhanced LLM framework designed to improve performance on Green Stormwater Infrastructure tasks by combining supervised fine-tuning on a curated GSI instruction dataset, retrieval-augmented generation over an internal GSI knowledge base built from municipal documents, and an agent-based reasoning pipeline that coordinates retrieval, context integration, and structured response generation.
- It also constructs a new GSI dataset aligned with real-world inspection and maintenance scenarios, and reports that BLEU-4 on the GSI dataset improves from 0.090 to 0.307 while performance on a general knowledge dataset remains stable (0.304 vs. 0.305).
- The approach integrates three strategies—SFT, RAG over municipal documents, and an agent-based reasoning workflow—to adapt general-purpose LLMs for professional infrastructure tasks and reduce domain-specific hallucinations.
- The work demonstrates how systematic domain knowledge enhancement can enable LLMs to perform more reliably in engineering contexts, suggesting broader applicability to similar domain-specific infrastructure applications.




