A Systematic Evaluation Protocol of Graph-Derived Signals for Tabular Machine Learning
arXiv cs.AI / 3/17/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that current studies on graph-derived signals in tabular learning rely on limited experimental setups and lack reliability analyses, and introduces a taxonomy-driven empirical analysis approach.
- It presents a unified, reproducible evaluation protocol to assess which categories of graph-derived signals yield statistically significant and robust improvements, with an extensible setup for integrating signals into tabular learning pipelines and features like automated hyperparameter optimization, multi-seed evaluation, formal significance testing, and robustness under graph perturbations.
- The protocol is demonstrated through a large-scale, imbalanced cryptocurrency fraud detection case study, identifying signal categories that provide consistently reliable gains and offering interpretable insights into fraud-discriminative structural patterns.
- Robustness analyses show pronounced differences in how various signals handle missing or corrupted relational data, underscoring practical utility for fraud detection and applicability to other domains.




