Training-free Spatially Grounded Geometric Shape Encoding (Technical Report)
arXiv cs.CV / 4/10/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes XShapeEnc, a training-free, general-purpose positional encoding method for spatially grounded 2D geometric shapes that aims to extend beyond standard 1D sequence encodings.
- XShapeEnc decomposes each shape into normalized geometry (in the unit disk) and a pose vector that is transformed into a harmonic pose field, both encoded with orthogonal Zernike bases.
- It includes a frequency-propagation step intended to enrich the representation with high-frequency content for better neural discriminability.
- The authors claim five key properties for the resulting compact encoding, including invertibility, adaptivity, and frequency richness, and they provide theoretical validation plus efficiency/discriminability analysis.
- Experiments across multiple shape-aware tasks, supported by a self-curated XShapeCorpus, are used to demonstrate applicability and to position XShapeEnc as a foundational tool for “2D spatial intelligence” research.



