| Hey guys, I did something hella entertaining. With some black magic and vodoo I was able to extract pretty cool images that are like an MRI from the model. I'm not stating anything, I have some hypothesis about it... It is mostly because it is just so pretty and mind bogging. I stumbled up a way to visualize LM's structure of structure structures in a 3D volume. Here is the Gist Link with a speed run of the idea. Some images: y3i12/Prisma (my research model) At the present moment I'm looking for a place where I can upload the interactive HTML. If you know of something, let me know that I'll link them. It is very much a lot mesmerizing to keep looking at them at different angles. The mediator surface that comes out of this is also pretty interesting: I wonder if this one of many possible interpretations of "loss landscape". [link] [comments] |
[P] Visualizing LM's Architecture and data flow with Q subspace projection
Reddit r/MachineLearning / 3/23/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The post describes an attempt to visualize language models' architecture and data flow using a Q subspace projection, presenting it as an MRI-like glimpse into the model's inner structure.
- It shows a 3D volume visualization of what the author calls the 'structure of structure structures' and provides a gist with a quick speed run of the idea.
- Images from multiple models (Prisma, Qwen3.5-0.8B, HuggingFace SmolLM-360M, RWKV-4-430M, and state-spaces/mamba-370m-hf) are shared to illustrate cross-model architecture views.
- The author is looking for a place to host an interactive HTML visualization and highlights the 'mediator surface' as potentially related to loss landscapes.
- Overall, the post is exploratory and invites input rather than announcing a formal discovery or release.
Related Articles
The Honest Guide to AI Writing Tools in 2026 (What Actually Works)
Dev.to
Next-Generation LLM Inference Technology: From Flash-MoE to Gemini Flash-Lite, and Local GPU Utilization
Dev.to
The Wave of Open-Source AI and Investment in Security: Trends from Qwen, MS, and Google
Dev.to
How I built a 4-product AI income stack in 4 months (the honest version)
Dev.to
I stopped writing AI prompts from scratch. Here is the system I built instead.
Dev.to