Erase Persona, Forget Lore: Benchmarking Multimodal Copyright Unlearning in Large Vision Language Models

arXiv cs.CV / 5/6/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper highlights that large vision-language models (LVLMs) may memorize and reproduce copyrighted visual content, and that machine unlearning could help mitigate this risk after training.
  • Existing evaluation approaches for multimodal (cross-modal) copyright unlearning are described as insufficiently robust and often unable to capture how well concepts are erased across different visual variations.
  • The authors introduce CoVUBench, a benchmark framework specifically built to evaluate copyright-related unlearning in LVLMs.
  • CoVUBench uses procedurally generated, legally safe synthetic data with systematic visual changes (including composition and domain variations) to test how well forgetting generalizes.
  • The evaluation protocol measures both forgetting effectiveness from a copyright-holder perspective and the retention of general utility from a deployer perspective, emphasizing the key trade-off.

Abstract

Large Vision-Language Models (LVLMs), trained on web-scale data, risk memorizing and regenerating copyrighted visual content such as characters and logos, creating significant challenges. Machine unlearning offers a path to mitigate these risks by removing specific content post-training, but evaluating its effectiveness, especially in the complex multimodal setting of LVLMs, remains an open problem. Current evaluation methods often lack robustness or fail to capture the nuances of cross-modal concept erasure. To address this critical gap, we introduce the CoVUBench benchmark, the first framework specifically designed for evaluating copyright content unlearning in LVLMs. CoVUBench utilizes procedurally generated, legally safe synthetic data coupled with systematic visual variations spanning compositional changes and diverse domain manifestations to ensure realistic and robust evaluation of unlearning generalization. Our comprehensive multimodal evaluation protocol assesses both forgetting efficacy from the copyright holder perspective and the preservation of general model utility from the deployer viewpoint. By rigorously measuring this crucial trade-off, CoVUBench provides a standardized tool to advance the development of responsible and effective unlearning methods for LVLMs.