SAP: Segment Any 4K Panorama
arXiv cs.CV / 3/16/2026
Key Points
- SAP is a new foundation model for 4K panoramic instance segmentation, targeting the performance gap that existing segmentation foundation models such as SAM2 show on 360° panoramas.
- It reformulates panoramic segmentation as fixed-trajectory perspective video segmentation, decomposing panoramas into overlapping perspective patches along a spherical traversal to preserve native 4K resolution and smooth viewpoint transitions.
- For large-scale supervision, the authors synthesize 183,440 4K panoramic images with instance segmentation labels using the InfiniGen engine.
- SAP generalizes to real-world 360° images, achieving a +17.2 zero-shot mIoU gain over vanilla SAM2 across its model sizes on a 4K panorama benchmark.
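
The decomposition into overlapping perspective patches along a fixed spherical traversal can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the 90° field of view, 50% overlap, three pitch rows, serpentine ordering, 1024-pixel patches, and 4096x2048 equirectangular layout are all assumptions chosen for the example, and the function names `view_trajectory` and `patch_pixel_to_pano` are hypothetical.

```python
import math


def view_trajectory(fov_deg=90.0, overlap=0.5, pitches_deg=(-45.0, 0.0, 45.0)):
    """Return a fixed list of (yaw, pitch) view centers in degrees.

    Assumed scheme: yaw advances by fov * (1 - overlap) so neighboring views
    share content, and alternate pitch rows are traversed in opposite
    directions so consecutive "frames" stay adjacent on the sphere.
    """
    step = fov_deg * (1.0 - overlap)
    n_yaw = math.ceil(360.0 / step)
    views = []
    for row, pitch in enumerate(pitches_deg):
        yaws = [i * 360.0 / n_yaw - 180.0 for i in range(n_yaw)]
        if row % 2 == 1:
            yaws.reverse()  # serpentine sweep keeps viewpoint changes small
        views.extend((yaw, pitch) for yaw in yaws)
    return views


def patch_pixel_to_pano(u, v, patch_size, fov_deg, yaw_deg, pitch_deg, pano_w, pano_h):
    """Map pixel (u, v) of a square perspective patch to equirectangular (x, y).

    Camera frame: x right, y down, z forward. Sampling the panorama at the
    returned coordinates renders the patch without downscaling the source.
    """
    # Pinhole focal length in pixels for the given field of view.
    f = 0.5 * patch_size / math.tan(math.radians(fov_deg) / 2.0)
    x, y, z = u - 0.5 * patch_size, v - 0.5 * patch_size, f

    # Pitch: rotate about the x-axis (positive pitch looks up).
    p = math.radians(pitch_deg)
    y, z = y * math.cos(p) - z * math.sin(p), y * math.sin(p) + z * math.cos(p)

    # Yaw: rotate about the vertical axis (positive yaw looks right).
    q = math.radians(yaw_deg)
    x, z = x * math.cos(q) + z * math.sin(q), -x * math.sin(q) + z * math.cos(q)

    lon = math.atan2(x, z)                  # longitude in [-pi, pi]
    lat = math.atan2(-y, math.hypot(x, z))  # latitude in [-pi/2, pi/2]

    pano_x = ((lon / (2.0 * math.pi) + 0.5) * pano_w) % pano_w
    pano_y = (0.5 - lat / math.pi) * pano_h
    return pano_x, pano_y


if __name__ == "__main__":
    frames = view_trajectory()
    print(len(frames), "views; first three:", frames[:3])
    print(patch_pixel_to_pano(512, 512, 1024, 90.0, 0.0, 0.0, 4096, 2048))
```

Under these assumptions, the ordered list of views plays the role of video frames: each patch would be rendered by sampling the panorama at the mapped coordinates and then passed, in order, to a video segmentation model (not shown here).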