FSKD: Monocular Forest Structure Inference via LiDAR-to-RGBI Knowledge Distillation

arXiv cs.CV / 4/3/2026


Key Points

  • The paper introduces FSKD, a LiDAR-to-RGBI knowledge-distillation framework that trains an RGBI-only SegFormer student to infer high-resolution forest structure metrics typically derived from costly airborne LiDAR.
  • Using a multimodal teacher that fuses RGBI imagery with LiDAR-derived planar metrics and vertical profiles via cross-attention (a sketch of this fusion pattern follows this list), the student achieves state-of-the-art zero-shot canopy height model (CHM) performance on geographically distinct test tiles.
  • On 384 km² of forest data from Saxony, Germany (20 cm GSD), the method reports large error reductions over HRCHM/DAC baselines (29–46% lower MAE) and jointly outputs CHM, PAI, and foliage height diversity (FHD), extending beyond many monocular CHM-only approaches.
  • Ablations indicate that multimodal fusion boosts accuracy by 10–26% over RGBI-only training, and that asymmetric distillation with sufficient model capacity is critical for best results.
  • The approach tolerates temporal mismatch (e.g., winter LiDAR with summer RGBI), supporting more scalable operational monitoring for digital-twin and national orthophoto-style workflows, though PAI/FHD accuracy remains region-dependent and benefits from local calibration.
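To make the fusion step concrete, here is a minimal PyTorch sketch of the cross-attention pattern described above: RGBI feature tokens act as queries over LiDAR-derived planar-metric and vertical-profile tokens, so image features are enriched with structural context. The module name, dimensions, and pre-norm/FFN layout are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class CrossAttentionFusion(nn.Module):
    """Illustrative fusion of RGBI tokens with LiDAR-derived tokens.

    RGBI tokens are the queries; LiDAR planar-metric and vertical-profile
    tokens are the keys/values. All names and dimensions are assumptions.
    """

    def __init__(self, dim: int = 256, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm_q = nn.LayerNorm(dim)
        self.norm_kv = nn.LayerNorm(dim)
        self.norm_out = nn.LayerNorm(dim)
        self.ffn = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, rgbi_tokens: torch.Tensor,
                lidar_tokens: torch.Tensor) -> torch.Tensor:
        # rgbi_tokens: (B, N_img, dim); lidar_tokens: (B, N_lidar, dim)
        q = self.norm_q(rgbi_tokens)
        kv = self.norm_kv(lidar_tokens)
        fused, _ = self.attn(q, kv, kv)        # image queries attend to LiDAR context
        x = rgbi_tokens + fused                # residual connection
        return x + self.ffn(self.norm_out(x))  # position-wise refinement
```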

Abstract

Very High Resolution (VHR) forest structure data at individual-tree scale is essential for carbon, biodiversity, and ecosystem monitoring. Yet airborne LiDAR, the reference source for forest structure metrics such as the Canopy Height Model (CHM), Plant Area Index (PAI), and Foliage Height Diversity (FHD), remains costly and infrequently acquired. We propose FSKD: a LiDAR-to-RGB-Infrared (RGBI) knowledge distillation (KD) framework in which a multimodal teacher fuses RGBI imagery with LiDAR-derived planar metrics and vertical profiles via cross-attention, and an RGBI-only SegFormer student learns to reproduce its outputs. Trained on 384 km² of forests in Saxony, Germany (20 cm ground sampling distance, GSD) and evaluated on eight geographically distinct test tiles, the student achieves state-of-the-art (SOTA) zero-shot CHM performance (MedAE 4.17 m, R² = 0.51, IoU 0.87), outperforming HRCHM/DAC baselines by 29–46% in MAE (5.81 m vs. 8.14–10.84 m) with stronger correlation coefficients (0.713 vs. 0.166–0.652). Ablations show that multimodal fusion improves performance by 10–26% over RGBI-only training, and that asymmetric distillation with appropriate model capacity is critical. The method jointly predicts CHM, PAI, and FHD, a multi-metric capability not provided by current monocular CHM estimators, although PAI/FHD transfer remains region-dependent and benefits from local calibration. The framework also remains effective under temporal mismatch (winter LiDAR, summer RGBI), removing strict co-acquisition constraints and enabling scalable 20 cm operational monitoring for workflows such as Digital Twin Germany and national Digital Orthophoto programs.
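To illustrate how the asymmetric teacher-student setup can be trained, the sketch below combines per-metric supervision from LiDAR-derived targets with a distillation term toward a frozen teacher's predictions. The L1 losses, the equal treatment of CHM/PAI/FHD, and the `alpha` weighting are assumptions for illustration, not the paper's exact objective.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_out: dict, teacher_out: dict,
                      lidar_targets: dict, alpha: float = 0.5) -> torch.Tensor:
    """Illustrative multi-metric distillation objective (assumed form).

    The RGBI-only student is trained both against LiDAR-derived ground
    truth and against the multimodal teacher's predictions for each
    forest structure metric.
    """
    loss = student_out["chm"].new_zeros(())
    for metric in ("chm", "pai", "fhd"):
        gt_term = F.l1_loss(student_out[metric], lidar_targets[metric])
        kd_term = F.l1_loss(student_out[metric], teacher_out[metric].detach())
        loss = loss + (1 - alpha) * gt_term + alpha * kd_term
    return loss
```

Detaching the teacher's predictions keeps gradients from flowing into the (fixed) multimodal teacher, so only the student is updated; the `alpha` knob trades off ground-truth fidelity against imitation of the teacher.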