AI Navigate

[D] Looking for arXiv endorsement (cs.LG) - PDE-based world model paper

Reddit r/MachineLearning / 3/18/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The post is a request for arXiv endorsement to submit FluidWorld, a PDE-based world model for video prediction, claiming the predictor uses a reaction-diffusion PDE rather than attention.
  • It describes the model as using Laplacian diffusion for spatial propagation, learned reaction terms for nonlinear mixing, and PDE integration to produce predictions, with 867K parameters and O(N) complexity.
  • A parameter-matched comparison against Transformer and ConvLSTM on UCF-101 shows similar single-step metrics but better multi-step rollouts for the PDE approach, as diffusion acts as a spatial regularizer reducing error accumulation.
  • The author provides a link to the FluidWorld paper and an endorsement code, and invites endorsements from researchers working on world models, video prediction, neural PDEs, or efficient architectures.

Hi everyone,

I'm a researcher looking for an arXiv endorsement for cs.LG to submit my first paper. I've been working for about a year on FluidWorld, a world model where the prediction engine is a reaction-difffusion PDE instead of attention. The Laplacian diffusion handles spatial propagation, learned reaction terms do the nonlinear mixing, and the PDE integration itself produces the prediction.

No attention, no KV-cache, O(N) complexity, 867K parameters total. I ran a parameter matched comparison (PDE vs Transformer vs ConvLSTM, all at ~800K params, same encoder/decoder/losses/data on UCF-101) and the interesting finding is that while single-step metrics are nearly identical, the PDE holds together much better on multi-step rollouts -- the diffusion acts as a natural spatial regularizer that prevents error accumulation.

Paper: https://github.com/infinition/FluidWorld/blob/main/paper/Fluidworld.pdf

Endorsement code: 6AB9UP
https://arxiv.org/auth/endorse?x=6AB9UP

If anyone is working on world model, video prediction, neural PDEs, or efficient architectures could endorse me, that would be really appreciated. Happy to answer any questions about the work. Thanks!

submitted by /u/Bright_Warning_8406
[link] [comments]