Towards In-Context Tone Style Transfer with A Large-Scale Triplet Dataset
arXiv cs.CV / 4/20/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses limitations in tone style transfer for photo retouching, where a lack of high-quality large-scale triplet datasets has forced models to rely on weaker self-supervised or proxy objectives.
- It introduces TST100K, a new large-scale dataset of 100,000 content–reference–stylized triplets, built via a data construction pipeline that uses a trained tone style scorer to enforce strict stylistic consistency.
- To improve visual and semantic quality, the authors propose ICTone, a diffusion-based in-context tone transfer framework that jointly conditions on both images to better preserve semantics and avoid issues like inappropriate color transfer.
- The approach further uses reward feedback learning driven by the tone style scorer to enhance stylistic fidelity and overall visual aesthetics, achieving state-of-the-art results in both quantitative metrics and human evaluations.
Related Articles

From Theory to Reality: Why Most AI Agent Projects Fail (And How Mine Did Too)
Dev.to

GPT-5.4-Cyber: OpenAI's Game-Changer for AI Security and Defensive AI
Dev.to

Building Digital Souls: The Brutal Reality of Creating AI That Understands You Like Nobody Else
Dev.to
Local LLM Beginner’s Guide (Mac - Apple Silicon)
Reddit r/artificial

Is Your Skill Actually Good? Systematically Validating Agent Skills with Evals
Dev.to