MemRoPE: Training-Free Infinite Video Generation via Evolving Memory Tokens
arXiv cs.CV / 3/16/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- MemRoPE presents a training-free framework for infinite video generation by maintaining both long-term and short-term memory tokens to prevent fidelity loss and identity drift over long horizons.
- The approach uses Memory Tokens to compress all past keys into dual streams via exponential moving averages, preserving global identity while capturing recent dynamics within a fixed-size cache.
- Online RoPE Indexing caches unrotated keys and applies dynamic positional embeddings at attention time to keep temporal aggregation well-defined without conflicting positional phases.
- The two components are mutually enabling, allowing fixed-size caching to support unbounded generation while maintaining temporal coherence, fidelity, and subject consistency for minute- to hour-scale videos.




