Built this beautiful monstrosity to satisfy my mental illness. Running gpt-oss 120B at 90 t/s and Qwen 3.5 35B A3B at 80 t/s. This node serves as the host for my RPC mesh with the two 64 GB Orin dev kits.
Newest GPU server in the lab! 72 GB of Ampere VRAM!
Reddit r/LocalLLaMA / 3/19/2026
📰 News · Developer Stack & Infrastructure · Models & Research
Key Points
- A new GPU server with 72 GB Ampere VRAM was built in the lab to support large AI models.
- It is reportedly running gpt-oss 120B at 90 t/s and Qwen 3.5 35B A3B at 80 t/s.
- The node serves as the host for an RPC mesh with two 64 GB Orin development kits.
- The post was submitted by /u/braydon125 on Reddit's LocalLLaMA and links to a video.
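The post does not say which software drives the RPC mesh between the host and the two Orin dev kits. A common choice on r/LocalLLaMA is llama.cpp's RPC backend, which lets one machine offload model layers to `rpc-server` instances running on others. As a hedged sketch only — the hostnames, port, and model filename below are assumptions, not details from the post — such a mesh might look like:

```shell
# Hypothetical llama.cpp RPC mesh; hostnames (orin-1, orin-2), the port,
# and the GGUF filename are illustrative assumptions, not from the post.

# On each Jetson Orin dev kit (worker), start the RPC backend
# (llama.cpp built with -DGGML_RPC=ON):
./rpc-server --host 0.0.0.0 --port 50052

# On the 72 GB Ampere host, point the server at both workers so model
# layers can be split across all three machines:
./llama-server -m gpt-oss-120b.gguf \
    --rpc orin-1:50052,orin-2:50052 \
    -ngl 99
```

With this layout the host coordinates inference while the Orin kits contribute their own memory, which is one way a 120B-class model can run across a small cluster of otherwise modest machines.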
Related Articles

5 Dangerous Lies Behind Viral AI Coding Demos That Break in Production
Dev.to
Two bots, one confused server: what Nimbus revealed about AI agent identity
Dev.to

OpenTelemetry just standardized LLM tracing. Here's what it actually looks like in code.
Dev.to

What is MCP?
Dev.to
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance
Dev.to