LatentBurst: A Fast and Efficient Multi Frame Super-Resolution for Hexadeca-Bayer Pattern CIS images

arXiv cs.CV / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research

Key Points

  • The paper presents LatentBurst, a multi-frame super-resolution network designed specifically for burst images from hexadeca-Bayer pattern CIS sensors, performing demosaicing, denoising, fusion, and super-resolution end-to-end.
  • It targets key difficulties of hexadeca-Bayer data, including harder interpolation due to larger pixel spacing between same-color groups and image degradation from motion-induced misalignment (blurring/ghosting).
  • LatentBurst uses a pyramid alignment-and-fusion strategy in latent features to handle large motion more robustly.
  • To meet real-time mobile constraints, it employs an efficient UNet-based architecture along with fine-tuned optical-flow estimation and a two-step knowledge distillation approach to better reduce domain gaps.
  • Experiments across multiple scenarios show improved reconstruction quality versus existing state-of-the-art methods.

Abstract

This paper introduces a novel multi frame super-resolution network (MFSR) for burst hexadeca Bayer pattern Contact Image Sensor (CIS) images, which includes demosaicing, denoising, multi-frame fusion, and super-resolution. Designing a high-quality reconstruction network poses several challenges as follows: 1) Unlike the Bayer color filter array (CFA) pattern, it is hard to interpolate hexadeca-Bayer pattern since the pixel distance between the same color groups increases; 2) Due to large object motion and camera movements, the final fusion result usually suffers the misalignment resulting a blurry image or ghosting artifacts; 3) The proposed network should be fast and efficient enough to operate in real-time on mobile devices. To overcome these challenges, we propose a novel network, called LatentBurst, which contains: 1) a pyramid align and fusion approach in latent feature to deal with large motion scenario; 2) an efficient UNet-based structure which can run efficiently on mobile device; 3) fine-tuned optical flow estimation and two-step knowledge distillation to reduce domain-gap more effectively. Experimental results in various scenarios demonstrate the effectiveness of our proposed method compared with other state-of-the-art methods.