From Similarity to Structure: Training-free LLM Context Compression with Hybrid Graph Priors

arXiv cs.CL / 4/28/2026


Key Points

  • The paper proposes a training-free, model-agnostic method to compress long LLM contexts by selecting a small set of sentences under a strict token budget.
  • It builds a sparse hybrid sentence graph that mixes semantic mutual k-NN links with short-range sequential edges, then derives a topic “skeleton” via clustering.
  • Sentences are ranked with an interpretable scoring function that balances task relevance, cluster representativeness, bridge centrality, and a cycle-coverage cue to preserve coherence.
  • A budgeted greedy selection with redundancy suppression picks sentences while keeping them in original order, aiming to maintain readability and coverage.
  • Experiments on four datasets indicate the approach is competitive with strong extractive and abstractive baselines, with the largest gains appearing on long-document benchmarks.
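The hybrid graph described above can be sketched in a few lines. This is an illustrative reconstruction, not the paper's implementation: it assumes precomputed sentence embeddings and combines mutual k-NN semantic edges (kept only when two sentences appear in each other's top-k neighbor lists) with short-range sequential edges between adjacent sentences.

```python
import numpy as np

def hybrid_sentence_graph(embeddings, k=2, window=1):
    """Build a sparse hybrid sentence graph as a set of undirected edges.

    Combines:
      - mutual k-NN semantic edges: (i, j) is kept only if j is among
        i's k nearest neighbors by cosine similarity AND vice versa;
      - short-range sequential edges: (i, i+d) for d in 1..window,
        preserving local discourse order.
    """
    X = np.asarray(embeddings, dtype=float)
    n = len(X)
    X = X / np.linalg.norm(X, axis=1, keepdims=True)  # unit-normalize rows
    sim = X @ X.T                                     # cosine similarity
    np.fill_diagonal(sim, -np.inf)                    # exclude self-matches

    # Top-k neighbor set for each sentence.
    knn = [set(np.argsort(sim[i])[::-1][:k].tolist()) for i in range(n)]

    edges = set()
    # Mutual k-NN semantic edges (kept only if the link is reciprocal).
    for i in range(n):
        for j in knn[i]:
            if i in knn[j]:
                edges.add((int(min(i, j)), int(max(i, j))))
    # Short-range sequential edges between nearby sentences.
    for i in range(n):
        for d in range(1, window + 1):
            if i + d < n:
                edges.add((i, i + d))
    return edges
```

With toy 2-D embeddings where sentences 0/1 and 2/3 are semantically close, `k=1` yields mutual semantic edges (0, 1) and (2, 3), and `window=1` adds the sequential bridge (1, 2) that keeps the graph connected.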

Abstract

Long-context large language models remain computationally expensive to run and often fail to reliably process very long inputs, which makes context compression an important component of many systems. Existing compression approaches typically rely on trained compressors, dense retrieval-style selection, or heuristic trimming, and they often struggle to jointly preserve task relevance, topic coverage, and cross-sentence coherence under a strict token budget. To address this, we propose a training-free and model-agnostic compression framework that selects a compact set of sentences guided by structural graph priors. Our method constructs a sparse hybrid sentence graph that combines mutual k-NN semantic edges with short-range sequential edges, extracts a topic skeleton via clustering, and ranks sentences using an interpretable score that integrates task relevance, cluster representativeness, bridge centrality, and a cycle coverage cue. A budgeted greedy selection with redundancy suppression then produces a readable compressed context in original order. Experimental results on four datasets show that our approach is competitive with strong extractive and abstractive baselines, demonstrating larger gains on long-document benchmarks.
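The interpretable score the abstract describes is a weighted combination of four cues: task relevance, cluster representativeness, bridge centrality, and cycle coverage. A minimal sketch, assuming each cue is already computed per sentence; the weights here are illustrative placeholders, not the paper's values:

```python
import numpy as np

def sentence_scores(relevance, representativeness, bridge, cycle,
                    weights=(0.4, 0.3, 0.2, 0.1)):
    """Combine four per-sentence cues into one interpretable score.

    Each component is min-max normalized to [0, 1] so the (hypothetical)
    weights are comparable across cues of different scales.
    """
    comps = [np.asarray(c, dtype=float)
             for c in (relevance, representativeness, bridge, cycle)]
    normed = []
    for c in comps:
        rng = c.max() - c.min()
        normed.append((c - c.min()) / rng if rng > 0 else np.zeros_like(c))
    return sum(w * c for w, c in zip(weights, normed))
```

Because the final score is a transparent weighted sum, each sentence's rank can be traced back to the cue that drove it, which is what makes the ranking interpretable.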
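The final selection stage can likewise be sketched. This is a simplified rendition of budgeted greedy selection with redundancy suppression: it walks candidates from highest score down, skips any sentence too similar to one already chosen, respects the token budget (approximated here by whitespace tokens), and emits the picks in original document order for readability.

```python
import numpy as np

def select_sentences(sentences, scores, embeddings, token_budget,
                     redundancy_thresh=0.8):
    """Budgeted greedy selection with redundancy suppression.

    Greedily takes the highest-scoring sentence that fits the remaining
    budget, skipping candidates whose cosine similarity to an already
    selected sentence exceeds `redundancy_thresh`. Returns the chosen
    indices sorted back into original document order.
    """
    X = np.asarray(embeddings, dtype=float)
    X = X / np.linalg.norm(X, axis=1, keepdims=True)  # unit-normalize
    order = np.argsort(scores)[::-1]                  # best-scored first
    chosen, used = [], 0
    for i in order:
        cost = len(sentences[i].split())  # crude whitespace-token proxy
        if used + cost > token_budget:
            continue  # does not fit the remaining budget
        if any(float(X[i] @ X[j]) > redundancy_thresh for j in chosen):
            continue  # too similar to something already selected
        chosen.append(int(i))
        used += cost
    return sorted(chosen)  # original order preserves readability
```

For example, given two near-duplicate cat sentences plus two unrelated ones under a 7-token budget, the duplicate is suppressed and the selection covers distinct topics within budget.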