A High-Resolution Landscape Dataset for Concept-Based XAI With Application to Species Distribution Models

arXiv cs.CV / 4/16/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper introduces the first concept-based Explainable AI (XAI) approach for species distribution models (SDMs) that aims to keep predictive accuracy while improving ecological interpretability.
  • It uses Robust TCAV (Testing with Concept Activation Vectors) to quantify how curated landscape concepts affect SDM predictions at both patch and landscape levels.
  • To support this methodology, the authors release an open-access high-resolution landscape concept dataset built from multispectral and LiDAR drone imagery, containing 653 concept patches across 15 concepts plus 1,450 random reference patches.
  • Experiments on two aquatic insect species (Plecoptera and Trichoptera) using two CNNs and one Vision Transformer show that concept-based XAI can validate models against expert ecological knowledge and reveal novel associations.
  • The authors make the code and datasets publicly available, positioning the work as a reusable resource for conservation, invasive species management, and hypothesis generation.

Abstract

Mapping the spatial distribution of species is essential for conservation policy and invasive species management. Species distribution models (SDMs) are the primary tools for this task, serving two purposes: achieving robust predictive performance while providing ecological insights into the driving factors of distribution. However, the increasing complexity of deep learning SDMs has made extracting these insights more challenging. To reconcile these objectives, we propose the first implementation of concept-based Explainable AI (XAI) for SDMs. We leverage the Robust TCAV (Testing with Concept Activation Vectors) methodology to quantify the influence of landscape concepts on model predictions. To enable this, we provide a new open-access landscape concept dataset derived from high-resolution multispectral and LiDAR drone imagery. It includes 653 patches across 15 distinct landscape concepts and 1,450 random reference patches, designed to suit a wide range of species. We demonstrate this approach through a case study of two aquatic insects, Plecoptera and Trichoptera, using two Convolutional Neural Networks and one Vision Transformer. Results show that concept-based XAI helps validate SDMs against expert knowledge while uncovering novel associations that generate new ecological hypotheses. Robust TCAV also provides landscape-level information, useful for policy-making and land management. Code and datasets are publicly available.