Models & Research

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

Reddit r/LocalLLaMA / 3/19/2026

QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!

Reddit r/LocalLLaMA / 3/19/2026

acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan

Reddit r/LocalLLaMA / 3/19/2026

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Hugging Face Blog / 3/19/2026

Newest GPU server in the lab! 72gb ampere vram!

Reddit r/LocalLLaMA / 3/19/2026

Nemotron 3 Super 120b Claude Distilled

Reddit r/LocalLLaMA / 3/19/2026

Attention Residual connections

Reddit r/LocalLLaMA / 3/19/2026

[D] Breaking down MiroThinker H1's verification centric reasoning: why fewer interaction rounds produce better agent performance

Reddit r/MachineLearning / 3/19/2026

I fine-tuned Qwen 0.5B for task automation and wanted to share the results.

Reddit r/LocalLLaMA / 3/19/2026

Benchmarked MiniMax M2.7 through 2 benchmarks. Here's how it did

Reddit r/LocalLLaMA / 3/19/2026

Co-Activation Pattern Detection for Prompt Injection: A Mechanistic Interpretability Approach Using Sparse Autoencoders

Reddit r/LocalLLaMA / 3/19/2026

How to Train Custom Language Models: Fine-Tuning vs Training From Scratch (2026)

Dev.to / 3/19/2026

OpenRouter利用量ランキングTOP10を全解説 — 各モデルの系統・用途・開発元まとめ（2026年3月版）

Zenn / 3/19/2026

KoboldCpp 1.110 - 3 YR Anniversary Edition, native music gen, qwen3tts voice cloning and more

Reddit r/LocalLLaMA / 3/19/2026

Qwen3.5 Knowledge density and performance

Reddit r/LocalLLaMA / 3/19/2026

I think I made the best general use System Prompt for Qwen 3.5 (OpenWebUI + Web search)

Reddit r/LocalLLaMA / 3/19/2026

gemini embedding 2 による資料の類似性は見た目で決まる？内容で決まる？ PDF と画像で検証してみた

Zenn / 3/19/2026

#2 : プロンプト研究講座【第17回】プロンプトの「温度感」と「湿度感」の表現

note / 3/19/2026

菊地康巳「AIとぼくの研究日記」

note / 3/19/2026

🧠 Reiが「自分の推論を監査する」存在になった日——STEP181〜186、二層監査体制完成と統合インターフェイスの誕生

note / 3/19/2026

Title

Dev.to / 3/19/2026

[Boost]

Dev.to / 3/19/2026

Hunter Alpha was a stealth model revealed on March 18th as an early testing version of MiMo-V2-Pro.

Reddit r/LocalLLaMA / 3/19/2026

[Meta-RL] We told an AI agent 'you can fail 3 times.' Accuracy went up 19%.

Dev.to / 3/19/2026

From Language to Action in Arabic: Reliable Structured Tool Calling via Data-Centric Fine-Tuning

arXiv cs.LG / 3/19/2026

Behavior-Centric Extraction of Scenarios from Highway Traffic Data and their Domain-Knowledge-Guided Clustering using CVQ-VAE

arXiv cs.CV / 3/19/2026

Predicting Trajectories of Long COVID in Adult Women: The Critical Role of Causal Disentanglement

arXiv cs.LG / 3/19/2026

TrackDeform3D: Markerless and Autonomous 3D Keypoint Tracking and Dataset Collection for Deformable Objects

arXiv cs.CV / 3/19/2026

Integrating Explainable Machine Learning and Mixed-Integer Optimization for Personalized Sleep Quality Intervention

arXiv cs.LG / 3/19/2026

RHYME-XT: A Neural Operator for Spatiotemporal Control Systems

arXiv cs.LG / 3/19/2026

Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control

arXiv cs.LG / 3/19/2026

GenLie: A Global-Enhanced Lie Detection Network under Sparsity and Semantic Interference

arXiv cs.CV / 3/19/2026

Empirical Recipes for Efficient and Compact Vision-Language Models

arXiv cs.CV / 3/19/2026

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

arXiv cs.LG / 3/19/2026

Attention Sinks Induce Gradient Sinks

arXiv cs.LG / 3/19/2026

Embedding World Knowledge into Tabular Models: Towards Best Practices for Embedding Pipeline Design

arXiv cs.LG / 3/19/2026

Solution for 10th Competition on Ambivalence/Hesitancy (AH) Video Recognition Challenge using Divergence-Based Multimodal Fusion

arXiv cs.CV / 3/19/2026

Identifying Latent Actions and Dynamics from Offline Data via Demonstrator Diversity

arXiv cs.LG / 3/19/2026

Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting

arXiv cs.LG / 3/19/2026

Adaptive Anchor Policies for Efficient 4D Gaussian Streaming

arXiv cs.CV / 3/19/2026

Are a Thousand Words Better Than a Single Picture? Beyond Images -- A Framework for Multi-Modal Knowledge Graph Dataset Enrichment

arXiv cs.CV / 3/19/2026

DesertFormer: Transformer-Based Semantic Segmentation for Off-Road Desert Terrain Classification in Autonomous Navigation Systems

arXiv cs.CV / 3/19/2026

Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback

arXiv cs.LG / 3/19/2026

Symmetry-Reduced Physics-Informed Learning of Tensegrity Dynamics

arXiv cs.LG / 3/19/2026

The Phasor Transformer: Resolving Attention Bottlenecks on the Unit Circle

arXiv cs.LG / 3/19/2026

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

arXiv cs.CV / 3/19/2026

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

arXiv cs.LG / 3/19/2026

Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates

arXiv cs.LG / 3/19/2026

A Multi-Agent System for Building-Age Cohort Mapping to Support Urban Energy Planning

arXiv cs.CV / 3/19/2026

Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis

arXiv cs.CV / 3/19/2026

Models & Research

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!

acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan

**Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding**

Newest GPU server in the lab! 72gb ampere vram!

Nemotron 3 Super 120b Claude Distilled

Attention Residual connections

[D] Breaking down MiroThinker H1's verification centric reasoning: why fewer interaction rounds produce better agent performance

I fine-tuned Qwen 0.5B for task automation and wanted to share the results.

Benchmarked MiniMax M2.7 through 2 benchmarks. Here's how it did

Co-Activation Pattern Detection for Prompt Injection: A Mechanistic Interpretability Approach Using Sparse Autoencoders

How to Train Custom Language Models: Fine-Tuning vs Training From Scratch (2026)

OpenRouter利用量ランキングTOP10を全解説 — 各モデルの系統・用途・開発元まとめ（2026年3月版）

KoboldCpp 1.110 - 3 YR Anniversary Edition, native music gen, qwen3tts voice cloning and more

Qwen3.5 Knowledge density and performance

I think I made the best general use System Prompt for Qwen 3.5 (OpenWebUI + Web search)

gemini embedding 2 による資料の類似性は見た目で決まる？ 内容で決まる？ PDF と画像で検証してみた

#2 : プロンプト研究講座【第17回】プロンプトの「温度感」と「湿度感」の表現

菊地康巳「AIとぼくの研究日記」

🧠 Reiが「自分の推論を監査する」存在になった日——STEP181〜186、二層監査体制完成と統合インターフェイスの誕生

**Title**

[Boost]

Hunter Alpha was a stealth model revealed on March 18th as an early testing version of MiMo-V2-Pro.

[Meta-RL] We told an AI agent 'you can fail 3 times.' Accuracy went up 19%.

From Language to Action in Arabic: Reliable Structured Tool Calling via Data-Centric Fine-Tuning

Behavior-Centric Extraction of Scenarios from Highway Traffic Data and their Domain-Knowledge-Guided Clustering using CVQ-VAE

Predicting Trajectories of Long COVID in Adult Women: The Critical Role of Causal Disentanglement

TrackDeform3D: Markerless and Autonomous 3D Keypoint Tracking and Dataset Collection for Deformable Objects

Integrating Explainable Machine Learning and Mixed-Integer Optimization for Personalized Sleep Quality Intervention

RHYME-XT: A Neural Operator for Spatiotemporal Control Systems

Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control

GenLie: A Global-Enhanced Lie Detection Network under Sparsity and Semantic Interference

Empirical Recipes for Efficient and Compact Vision-Language Models

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Attention Sinks Induce Gradient Sinks

Embedding World Knowledge into Tabular Models: Towards Best Practices for Embedding Pipeline Design

Solution for 10th Competition on Ambivalence/Hesitancy (AH) Video Recognition Challenge using Divergence-Based Multimodal Fusion

Identifying Latent Actions and Dynamics from Offline Data via Demonstrator Diversity

Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting

Adaptive Anchor Policies for Efficient 4D Gaussian Streaming

Are a Thousand Words Better Than a Single Picture? Beyond Images -- A Framework for Multi-Modal Knowledge Graph Dataset Enrichment

DesertFormer: Transformer-Based Semantic Segmentation for Off-Road Desert Terrain Classification in Autonomous Navigation Systems

Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback

Symmetry-Reduced Physics-Informed Learning of Tensegrity Dynamics

The Phasor Transformer: Resolving Attention Bottlenecks on the Unit Circle

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates

A Multi-Agent System for Building-Age Cohort Mapping to Support Urban Energy Planning

Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

gemini embedding 2 による資料の類似性は見た目で決まる？内容で決まる？ PDF と画像で検証してみた

Title