Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion
arXiv cs.CV / 3/20/2026
📰 NewsModels & Research
Key Points
- ColourCrafter proposes token-level fusion of RGB colour tokens and image tokens in latent space to enable region-aware colour edits with improved locality and structural fidelity over global tone-transfer methods.
- It propagates colour information selectively to semantically relevant regions while preserving the underlying structure of the image.
- A perceptual Lab-space loss decouples luminance and chrominance and constrains edits within masked areas to achieve higher pixel-level precision.
- The work introduces ColourfulSet, a large-scale dataset of image pairs with continuous and diverse colour variations, and reports state-of-the-art performance in colour accuracy, controllability, and perceptual fidelity.
Related Articles
Next-Generation LLM Inference Technology: From Flash-MoE to Gemini Flash-Lite, and Local GPU Utilization
Dev.to
The Wave of Open-Source AI and Investment in Security: Trends from Qwen, MS, and Google
Dev.to
Implementing Deep Q-Learning (DQN) from Scratch Using RLax JAX Haiku and Optax to Train a CartPole Reinforcement Learning Agent
MarkTechPost
[D] Training a classifier entirely in SQL (no iterative optimization)
Reddit r/MachineLearning
LLM failure modes map surprisingly well onto ADHD cognitive science. Six parallels from independent research.
Reddit r/artificial