Leveraging Natural Language Processing and Machine Learning for Evidence-Based Food Security Policy Decision-Making in Data-Scarce Making

arXiv cs.AI / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces ZeroHungerAI, an integrated NLP and ML framework for evidence-based food security policy modeling in data-scarce, text-fragmented regions.
  • It uses transfer learning with a DistilBERT-based approach to combine structured socio-economic indicators with contextual policy text embeddings.
  • On a 1,200-sample hybrid dataset covering 25 districts, the method achieves strong results under class imbalance, including 91% classification accuracy, 0.89 precision, 0.85 recall, and 0.86 F1.
  • Comparative experiments show 13% improvement over SVM and 17% over Logistic Regression, with precision-recall metrics indicating reliable minority-class detection.
  • A fairness-aware optimization step reduces demographic parity differences to 3%, supporting more equitable rural–urban policy inference.

Abstract

Food security policy formulation in data-scarce regions remains a critical challenge due to limited structured datasets, fragmented textual reports, and demographic bias in decision-making systems. This study proposes ZeroHungerAI, an integrated Natural Language Processing (NLP) and Machine Learning (ML) framework designed for evidence-based food security policy modeling under extreme data scarcity. The system combines structured socio-economic indicators with contextual policy text embeddings using a transfer learning based DistilBERT architecture. Experimental evaluation on a 1200-sample hybrid dataset across 25 districts demonstrates superior predictive performance, achieving 91 percent classification accuracy, 0.89 precision, 0.85 recall, and an F1 score of 0.86 under imbalanced conditions. Comparative analysis shows a 13 percent performance improvement over classical SVM and 17 percent over Logistic Regression models. Precision Recall evaluation confirms robust minority class detection (average precision around 0.88). Fairness aware optimization reduces demographic parity difference to 3 percent, ensuring equitable rural urban policy inference. The results validate that transformer based contextual learning significantly enhances policy intelligence in low resource governance environments, enabling scalable and bias aware hunger prediction systems.