AI Navigate

MeTok: An Efficient Meteorological Tokenization with Hyper-Aligned Group Learning for Precipitation Nowcasting

arXiv cs.AI / 3/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces MeTok, a distribution-centric meteorological tokenization that groups similar weather features to better reflect interactions in meteorological systems rather than relying on a position-centric approach.
  • It proposes the Hyper-Aligned Grouping Transformer (HyAGTransformer) with two innovations: Grouping Attention (GA) for self-aligned learning across precipitation patterns and Neighborhood FFN (N-FFN) to aggregate contextual information from adjacent groups.
  • On ERA5 data for 6-hour precipitation nowcasting, the method achieves at least an 8.2% IoU improvement in extreme precipitation predictions and demonstrates scalability with more data and larger models.
  • The approach offers robustness across diverse precipitation patterns and potential efficiency gains for weather forecasting, influencing future deployment in meteorological prediction systems.

Abstract

Recently, Transformer-based architectures have advanced meteorological prediction. However, this position-centric tokenizer conflicts with the core principle of meteorological systems, where the weather phenomena undoubtedly involve synergistic interactions among multiple elements while positional information constitutes merely a component of the boundary conditions. This paper focuses primarily on the task of precipitation nowcasting and develops an efficient distribution-centric Meteorological Tokenization (MeTok) scheme, which spatially sequences to group similar meteorological features. Based on the rearrangement, realigned group learning enhances robustness across precipitation patterns, especially extreme ones. Specifically, we introduce the Hyper-Aligned Grouping Transformer (HyAGTransformer) with two key improvements: 1) The Grouping Attention (GA) mechanism uses MeTok to enable self-aligned learning of features from different precipitation patterns; 2) The Neighborhood Feed-Forward Network (N-FFN) integrates adjacent group features, aggregating contextual information to boost patch embedding discriminability. Experiments on the ERA5 dataset for 6-hour forecasts show our method improves the IoU metric by at least 8.2% in extreme precipitation prediction compared to other methods. Additionally, it gains performance with more training data and increased parameters, demonstrating scalability, stability, and superiority over traditional methods.