Customizing Large Vision Model-Guided Low-Rank Approximation for Ground-Roll Denoise

arXiv cs.CV / 4/2/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper targets ground-roll attenuation in land and vertical seismic profiling, arguing that existing transform-domain, sparse, and deep-learning approaches struggle with adaptability, leakage, or the need for labeled training data under strong signal–noise overlap.
  • It introduces a training-free framework that turns the problem into semantic-guided signal separation by using a promptable large vision model on visualized seismic gathers to localize ground-roll-dominant regions via text or image prompts.
  • The method converts the model’s semantic response into a continuous soft mask and plugs it into a mask-conditioned low-rank inverse formulation to achieve spatially adaptive suppression while preserving reflections.
  • An ADMM-based solver is developed to efficiently solve the resulting inverse problem, aiming for stable, physically consistent recovery without task-specific training or manual annotation.
  • Experiments on synthetic and field VSP datasets show improved ground-roll attenuation with better reflection continuity and waveform fidelity compared with representative filtering and implicit neural representation baselines.

Abstract

Ground-roll is a dominant source of coherent noise in land and vertical seismic profiling (VSP) data, severely masking reflection events and degrading subsequent imaging and interpretation. Conventional attenuation methods, including transform-domain filtering, sparse representation, and deep learning, often suffer from limited adaptability, signal leakage, or dependence on labeled training data, especially under strong signal-noise overlap. To address these challenges, we propose a training-free framework that reformulates ground-roll attenuation as a semantic-guided signal separation problem. Specifically, a promptable large vision model is employed to extract high-level semantic priors by converting seismic gathers into visual representations and localizing ground-roll-dominant regions via text or image prompts. The resulting semantic response is transformed into a continuous soft mask, which is embedded into a mask-conditioned low-rank inverse formulation to enable spatially adaptive suppression and reflection-preserving reconstruction. An efficient alternating direction method of multipliers (ADMM)-based solver is further developed to solve the proposed inverse problem, enabling stable and physically consistent signal recovery without requiring task-specific training or manual annotation. Extensive experiments on both synthetic and field VSP datasets demonstrate that the proposed method achieves superior ground-roll attenuation while preserving reflection continuity and waveform fidelity, consistently outperforming representative transform-domain filtering and implicit neural representation methods.