FREE-Switch: Frequency-based Dynamic LoRA Switch for Style Transfer

arXiv cs.CV / 4/14/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces FREE-Switch, a frequency-domain importance-driven dynamic LoRA switching approach to merge multiple LoRA adapters for diffusion-based style transfer without the content drift common in prior merging methods for image generation.
  • It argues that different adapters contribute differently across diffusion steps, so FREE-Switch dynamically selects or weights adapter influence based on frequency-domain step importance rather than using uniform fusion.
  • To prevent detail degradation when switching/combining adapters, the method includes an automatic Generation Alignment mechanism that aligns generation intents at the semantic level across adapters.
  • Experiments reportedly show FREE-Switch can combine adapters for different objects and styles while substantially reducing training cost compared with computationally expensive training-based merging alternatives.

Abstract

With the growing availability of open-sourced adapters trained on the same diffusion backbone for diverse scenes and objects, combining these pretrained weights enables low-cost customized generation. However, most existing model merging methods are designed for classification or text generation, and when applied to image generation, they suffer from content drift due to error accumulation across multiple diffusion steps. For image-oriented methods, training-based approaches are computationally expensive and unsuitable for edge deployment, while training-free ones use uniform fusion strategies that ignore inter-adapter differences, leading to detail degradation. We find that since different adapters are specialized for generating different types of content, the contribution of each diffusion step carries different significance for each adapter. Accordingly, we propose a frequency-domain importance-driven dynamic LoRA switch method. Furthermore, we observe that maintaining semantic consistency across adapters effectively mitigates detail loss; thus, we design an automatic Generation Alignment mechanism to align generation intents at the semantic level. Experiments demonstrate that our FREE-Switch (Frequency-based Efficient and Dynamic LoRA Switch) framework efficiently combines adapters for different objects and styles, substantially reducing the training cost of high-quality customized generation.