Grammar of the Wave: Towards Explainable Multivariate Time Series Event Detection via Neuro-Symbolic VLM Agents
arXiv cs.LG / 3/13/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Knowledge-Guided TSED where a model grounds natural-language event descriptions to intervals in multivariate signals with little or no labeled data.
- It proposes Event Logic Tree (ELT) to connect linguistic descriptions with time-series via modeling the intrinsic temporal-logic structures of events.
- It presents a neuro-symbolic VLM agent framework that instantiates primitives from signal visualizations and composes them under ELT constraints, producing detected intervals and explanations.
- It releases a benchmark based on real-world time series data with expert knowledge and annotations, and experiments show the method outperforms supervised fine-tuning baselines and zero-shot LLM/VLM approaches, while mitigating VLM hallucination.




