TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

arXiv cs.LG / 3/27/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper introduces TopoPilot, an agentic framework designed to reliably automate scientific visualization workflows from natural-language prompts, targeting issues like invalid operations, subtle errors, and missing clarifications in underspecified inputs.
  • TopoPilot uses a reliability-centered two-agent architecture: an orchestrator translates prompts into workflows of atomic actions, while a verifier checks workflow structure and semantic consistency before execution.
  • The framework is modular and extensible, aiming to generalize beyond topological data analysis/visualization by allowing new descriptors and domain-specific workflows to be integrated without changing the core system.
  • To improve robustness, the authors define a taxonomy of failure modes and implement targeted guardrails for each category of failure.
  • In simulated evaluations with 1,000 multi-turn conversations across 100 prompts (including adversarial and infeasible requests), TopoPilot reports over 99% success rate versus under 50% for baseline systems lacking comprehensive verification.

Abstract

Recent agentic systems demonstrate that large language models can generate scientific visualizations from natural language. However, reliability remains a major limitation: systems may execute invalid operations, introduce subtle but consequential errors, or fail to request missing information when inputs are underspecified. These issues are amplified in real-world workflows, which often exceed the complexity of standard benchmarks. Ensuring reliability in autonomous visualization pipelines therefore remains an open challenge. We present TopoPilot, a reliable and extensible agentic framework for automating complex scientific visualization workflows. TopoPilot incorporates systematic guardrails and verification mechanisms to ensure reliable operation. While we focus on topological data analysis and visualization as a primary use case, the framework is designed to generalize across visualization domains. TopoPilot adopts a reliability-centered two-agent architecture. An orchestrator agent translates user prompts into workflows composed of atomic backend actions, while a verifier agent evaluates these workflows prior to execution, enforcing structural validity and semantic consistency. This separation of interpretation and verification reduces code-generation errors and enforces correctness guarantees. A modular architecture further improves robustness by isolating components and enabling seamless integration of new descriptors and domain-specific workflows without modifying the core system. To systematically address reliability, we introduce a taxonomy of failure modes and implement targeted safeguards for each class. In evaluations simulating 1,000 multi-turn conversations across 100 prompts, including adversarial and infeasible requests, TopoPilot achieves a success rate exceeding 99%, compared to under 50% for baselines without comprehensive guardrails and checks.