Formal verification of tree-based machine learning models for lateral spreading
arXiv cs.LG / 3/19/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper encodes trained tree ensembles as logical formulas for an SMT solver to verify physical specifications across the entire input domain in geotechnical hazard prediction.
- Four geotechnical specifications (water table depth, PGA monotonicity, distance safety, and flat-ground safety) are formalized and checked against XGBoost ensembles and Explainable Boosting Machines trained on the Christchurch earthquake lateral spreading dataset.
- The unconstrained EBM (80.1% accuracy) violates all four specifications, while a fully constrained EBM (67.2% accuracy) satisfies three of four, illustrating a trade-off between accuracy and physical compliance.
- SHAP analysis on counterexamples shows that post-hoc explanations cannot substitute formal verification, supporting a verify-fix-verify loop and formal certification for deploying physically consistent ML models in safety-critical contexts.




