High-Fidelity Mural Restoration via a Unified Hybrid Mask-Aware Transformer

arXiv cs.CV / 4/7/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses digital restoration of ancient murals by reconstructing large missing regions while preserving authentic, undamaged areas under heavy environmental and human-caused degradation.
  • It proposes the Hybrid Mask-Aware Transformer (HMAT), combining mask-aware dynamic filtering for local texture generation with a Transformer bottleneck for long-range structural inference.
  • To handle different degradation morphologies, HMAT includes a mask-conditional style fusion module that guides the generative process using degradation masks.
  • A Teacher-Forcing Decoder with hard-gated skip connections is used to enforce fidelity in valid (undamaged) regions and concentrate reconstruction on missing areas.
  • Experiments on the DHMural dataset and a curated Nine-Colored Deer dataset show competitive results versus state-of-the-art methods, with improved structural coherence and visual faithfulness.

Abstract

Ancient murals are valuable cultural artifacts, but many have suffered severe degradation due to environmental exposure, material aging, and human activity. Restoring these artworks is challenging because it requires both reconstructing large missing structures and strictly preserving authentic, undamaged regions. This paper presents the Hybrid Mask-Aware Transformer (HMAT), a unified framework for high-fidelity mural restoration. HMAT integrates Mask-Aware Dynamic Filtering for robust local texture modeling with a Transformer bottleneck for long-range structural inference. To further address the diverse morphology of degradation, we introduce a mask-conditional style fusion module that dynamically guides the generative process. In addition, a Teacher-Forcing Decoder with hard-gated skip connections is designed to enforce fidelity in valid regions and focus reconstruction on missing areas. We evaluate HMAT on the DHMural dataset and a curated Nine-Colored Deer dataset under varying degradation levels. Experimental results demonstrate that the proposed method achieves competitive performance compared to state-of-the-art approaches, while producing more structurally coherent and visually faithful restorations. These findings suggest that HMAT provides an effective solution for the digital restoration of cultural heritage murals.