Benchmarking Scientific Machine Learning Models for Air Quality Data

arXiv cs.LG / 2026/3/24

💬 オピニオンIdeas & Deep AnalysisModels & Research

共有:

要点

The study benchmarks multiple classical, machine-learning, and deep-learning approaches for multi-horizon AQI forecasting (PM2.5 and O3) in North Texas using EPA daily observations from 2022–2024.
It builds standardized, city-level lag-wise forecasting datasets with forecasting horizons using LAG in {1, 7, 14, 30} days and evaluates models with chronological train-test splits.
Deep-learning models (MLP and LSTM) outperform simpler baselines (linear regression and SARIMAX) across evaluated error metrics such as MAE and RMSE.
Physics-guided variants (MLP+Physics, LSTM+Physics) incorporate EPA breakpoint-based AQI formulation as a weighted-loss consistency constraint, improving stability and producing physically consistent pollutant–AQI relationships.
The largest gains from physics guidance appear for short-horizon predictions and for specific pollutants (notably PM2.5 and O3), yielding a region-specific guideline for model selection.

Abstract

Accurate air quality index (AQI) forecasting is essential for the protecting public health in rapidly growing urban regions, and the practical model evaluation and selection are often challenged by the lack of rigorous, region-specific benchmarking on standardized datasets. Physics-guided machine learning and deep learning models could be a good and effective solution to resolve such issues with more accurate and efficient AQI forecasting. This research study presents an explainable and comprehensive benchmark that enables a guideline and proposed physics-guided best model by benchmarking classical time-series, machine-learning, and deep-learning approaches for multi-horizon AQI forecasting in North Texas (Dallas County). Using publicly available U.S. Environmental Protection Agency (EPA) daily observations of air quality data from 2022 to 2024, we curate city-level time series for PM2.5 and O3 by aggregating station measurements and constructing lag-wise forecasting datasets for LAG in {1,7,14,30} days. For benchmarking the best model, linear regression (LR), SARIMAX, multilayer perceptrons (MLP), and LSTM networks are evaluated with the proposed physics-guided variants (MLP+Physics and LSTM+Physics) that incorporate the EPA breakpoint-based AQI formulation as a consistency constraint through a weighted loss. Experiments using chronological train-test splits and error metrics MAE, RMSE showed that deep-learning models outperform simpler baselines, while physics guidance improves stability and yields physically consistent pollutant with AQI relationships, with the largest benefits observed for short-horizon prediction and for PM2.5 and O3. Overall, the results provide a practical reference for selecting AQI forecasting models in North Texas and clarify when lightweight physics constraints meaningfully improve predictive performance across pollutants and forecast horizons.

人型ロボットを被災建築物の調査に活用、建築研究所などが公開実験

日経XTECH

ホンダEV3車種の開発中止、損失はなぜこれほど膨らんだのか

日経XTECH

日本企業「ブラックボックス戦略」の断末魔、AI時代に慌てても手遅れかもな

日経XTECH

文書の内容を学習なしでLLMに反映、Sakana AIの新技術 RAG代替は可能か

日経XTECH

AI時代のトラフィックはますます予測困難に、NaaSによる柔軟な制御が不可欠

日経XTECH

Benchmarking Scientific Machine Learning Models for Air Quality Data

要点

Abstract

関連記事

人型ロボットを被災建築物の調査に活用、建築研究所などが公開実験

ホンダEV3車種の開発中止、損失はなぜこれほど膨らんだのか

日本企業「ブラックボックス戦略」の断末魔、AI時代に慌てても手遅れかもな

文書の内容を学習なしでLLMに反映、Sakana AIの新技術 RAG代替は可能か

AI時代のトラフィックはますます予測困難に、NaaSによる柔軟な制御が不可欠

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer