Abstract
Most recent extreme image rescaling methods struggle to preserve semantically consistent structures and to produce realistic details, due to the severely ill-posed nature of the low- to high-resolution mapping under scaling factors of $16\times$ or higher. To alleviate these problems, we propose FaithEIR, a diffusion-based framework for extreme image rescaling. Inspired by singular value decomposition, we develop a learnable reversible transformation that enables invertible downscaling and upscaling in the latent space. To compensate for the information loss caused by quantization, we propose an adaptive detail prior, a high-frequency dictionary that captures the empirical average of commonly occurring structures in the training data. Finally, we design a lightweight pixel semantic embedder to provide semantic conditioning for the pretrained diffusion model. Extensive experimental results demonstrate that FaithEIR consistently outperforms state-of-the-art methods, achieving superior reconstruction fidelity and perceptual quality. Our code, model weights, and detailed results are released at https://github.com/cshw2021/FaithEIR.