PhyloSDF: Phylogenetically-Conditioned Neural Generation of 3D Skull Morphology via Residual Flow Matching

arXiv cs.CV / 4/29/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • PhyloSDF is a phylogenetically conditioned neural generative model designed to create novel, biologically plausible 3D skull morphologies while respecting evolutionary (phylogenetic) relationships despite limited data.
  • The method combines a DeepSDF auto-decoder regularized with a Phylogenetic Consistency Loss that strongly aligns latent space structure with evolutionary distances (Pearson r = 0.993).
  • A Residual Conditional Flow Matching (Residual CFM) framework factorizes generation into an analytic species-centroid lookup plus a learned residual predictor, enabling generation with as few as ~4 specimens per species.
  • On 100 micro-CT skulls across Darwin’s finches and relatives (24 species), the model produces new meshes that match real intra-species variation at the code level (88–129%) and is reported to avoid memorization across all 180 generated meshes.
  • Compared with diffusion, standard flow matching, and Gaussian mixture baselines, Residual CFM shows better fidelity and morphometric performance and demonstrates phylogenetic extrapolation via leave-one-species-out experiments, including plausible ancestral reconstructions from smooth latent interpolations.

Abstract

Generating novel, biologically plausible three-dimensional morphological structures is a fundamental challenge in computational evolutionary biology, hampered by extreme data scarcity and the requirement that generated shapes respect phylogenetic relationships among species. In this work, we present PhyloSDF, a phylogenetically-conditioned neural generative model for 3D biological morphology that integrates two innovations: (1) a DeepSDF auto-decoder regularized by a novel Phylogenetic Consistency Loss that structures the latent space to correlate with evolutionary distances (Pearson r=0.993); (2) a Residual Conditional Flow Matching (Residual CFM) architecture that factorizes generation into analytic species-centroid lookup and learned residual prediction, enabling generation from as few as ~4 specimens per species. We evaluate PhyloSDF on 100 micro-CT-scanned skulls of Darwin's Finches and their relatives across 24 species. The model generates novel meshes achieving 88-129% of real intra-species variation at the code level, with all 180 generated meshes verified as non-memorized. Residual CFM surpasses denoising diffusion (which fails entirely at this scale), standard flow matching (which mode-collapses to 3-6% variation), and a Gaussian mixture baseline in both fidelity (Chamfer Distance 0.00181 vs. 0.00190) and morphometric Fr\'{e}chet distance (10,641 vs. 13,322). Leave-one-species-out experiments across 18 species demonstrate phylogenetic extrapolation capability, and smooth latent interpolations produce biologically plausible ancestral skull reconstructions.