GO-GenZip: Goal-Oriented Generative Sampling and Hybrid Compression

arXiv cs.LG / 3/23/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper GO-GenZip proposes a GenAI-driven sampling and hybrid compression framework that jointly optimizes what to observe and how to encode network telemetry data.
  • It uses adaptive masking and task relevance to guide selective data collection, reducing data volume while preserving downstream analytical fidelity.
  • It combines traditional lossless coding with GenAI-driven lossy compression to maintain reconstruction accuracy and task performance.
  • Experiments on real network datasets show over 50% reductions in sampling and data transfer costs with comparable reconstruction and analytics fidelity, indicating strong practical impact.

Abstract

Current network data telemetry pipelines consist of massive streams of fine-grained Key Performance Indicators (KPIs) from multiple distributed sources towards central aggregators, making data storage, transmission, and real-time analysis increasingly unsustainable. This work presents a generative AI (GenAI)-driven sampling and hybrid compression framework that redesigns network telemetry from a goal-oriented perspective. Unlike conventional approaches that passively compress fully observed data, our approach jointly optimizes what to observe and how to encode it, guided by the relevance of information to downstream tasks. The framework integrates adaptive sampling policies, using adaptive masking techniques, with generative modeling to identify patterns and preserve critical features across temporal and spatial dimensions. The selectively acquired data are further processed through a hybrid compression scheme that combines traditional lossless coding with GenAI-driven, lossy compression. Experimental results on real network datasets demonstrate over 50\% reductions in sampling and data transfer costs, while maintaining comparable reconstruction accuracy and goal-oriented analytical fidelity in downstream tasks.