Help Without Being Asked: A Deployed Proactive Agent System for On-Call Support with Continuous Self-Improvement

arXiv cs.AI / 4/14/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical UsageModels & Research

Key Points

  • 大規模クラウドではオンコール対応が大量に発生し、人手の負担が大きい中で、従来のLLM活用の「反応型エージェント」は未解決時に人へエスカレーションされると支援が止まるという課題がある。
  • 提案論文は、オンコールの全ライフサイクルで動作し、人間アナリストが関与している段階でも対話に割り込みながら、ユーザーの明示的呼び出しなしに支援する「Vigil」というプロアクティブなエージェントシステムを紹介している。
  • Vigilは、人間が解決したケースから知識を抽出して能力を自律的に更新する継続的自己改善メカニズムを組み込み、未対応に終わったケースの学習機会も取り込む設計になっている。
  • Volcano Engine(ByteDanceのクラウド)に10か月以上デプロイされ、実運用における有効性と実用性を評価で示している。
  • オープンソース版が公開されており(GitHub)、同様のオンコール/CS運用への導入検討が可能な形で提供されている。

Abstract

In large-scale cloud service platforms, thousands of customer tickets are generated daily and are typically handled through on-call dialogues. This high volume of on-call interactions imposes a substantial workload on human support analysts. Recent studies have explored reactive agents that leverage large language models as a first line of support to interact with customers directly and resolve issues. However, when issues remain unresolved and are escalated to human support, these agents are typically disengaged. As a result, they cannot assist with follow-up inquiries, track resolution progress, or learn from the cases they fail to address. In this paper, we introduce Vigil, a novel proactive agent system designed to operate throughout the entire on-call life-cycle. Unlike reactive agents, Vigil focuses on providing assistance during the phase in which human support is already involved. It integrates into the dialogue between the customer and the analyst, proactively offering assistance without explicit user invocation. Moreover, Vigil incorporates a continuous self-improvement mechanism that extracts knowledge from human-resolved cases to autonomously update its capabilities. Vigil has been deployed on Volcano Engine, ByteDance's cloud platform, for over ten months, and comprehensive evaluations based on this deployment demonstrate its effectiveness and practicality. The open source version of this work is publicly available at https://github.com/volcengine/veaiops.