Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaboration

arXiv cs.CV / 4/20/2026

📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research

共有:

Key Points

The paper addresses a key safety challenge for text-to-image generative models: they can generate unsafe or undesirable content due to biases learned from large-scale training data.
It critiques existing concept erasure approaches, noting that text-only methods may not fully remove concepts, while naive image-guided methods can remove unrelated content.
The authors propose TICoE (Text-Image Collaborative Erasing), which uses a continuous convex concept manifold and hierarchical visual representation learning to remove target concepts precisely while preserving other semantics.
They introduce a fidelity-oriented evaluation strategy to objectively assess how usable the outputs remain after erasure, rather than focusing only on removal success.
Experiments across multiple benchmarks show TICoE outperforms prior methods in both concept removal precision and content fidelity, with code released publicly.

Abstract

Text-to-image generative models have achieved impressive fidelity and diversity, but can inadvertently produce unsafe or undesirable content due to implicit biases embedded in large-scale training datasets. Existing concept erasure methods, whether text-only or image-assisted, face trade-offs: textual approaches often fail to fully suppress concepts, while naive image-guided methods risk over-erasing unrelated content. We propose TICoE, a text-image Collaborative Erasing framework that achieves precise and faithful concept removal through a continuous convex concept manifold and hierarchical visual representation learning. TICoE precisely removes target concepts while preserving unrelated semantic and visual content. To objectively assess the quality of erasure, we further introduce a fidelity-oriented evaluation strategy that measures post-erasure usability. Experiments on multiple benchmarks show that TICoE surpasses prior methods in concept removal precision and content fidelity, enabling safer, more controllable text-to-image generation. Our code is available at https://github.com/OpenAscent-L/TICoE.git

Black Hat USA

AI Business

Black Hat Asia

AI Business

Which Version of Qwen 3.6 for M5 Pro 24g

Reddit r/LocalLLaMA

From Theory to Reality: Why Most AI Agent Projects Fail (And How Mine Did Too)

Dev.to

GPT-5.4-Cyber: OpenAI's Game-Changer for AI Security and Defensive AI

Dev.to

Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaboration

Key Points

Abstract

Related Articles

Black Hat USA

Black Hat Asia

Which Version of Qwen 3.6 for M5 Pro 24g

From Theory to Reality: Why Most AI Agent Projects Fail (And How Mine Did Too)

GPT-5.4-Cyber: OpenAI's Game-Changer for AI Security and Defensive AI

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer