Mistral-Medium-3.5-128B-Q3_K_M on 3x3090 (72GB VRAM)

Reddit r/LocalLLaMA / 5/4/2026

💬 Opinion · Developer Stack & Infrastructure · Signals & Early Trends · Tools & Practical Usage · Models & Research

Key Points

  • The post demonstrates local inference speed for Mistral Medium 3.5 128B using a Q3_K_M quantization on a multi-GPU setup of three NVIDIA RTX 3090 cards with 72GB of combined VRAM.
  • It includes performance screenshots and output shared in multiple formats, suggesting the author benchmarked the setup and verified end-to-end responsiveness.
  • The 3x3090 configuration reflects a practical recipe for running larger LLMs locally: aggressive quantization (Q3_K_M) combined with multi-GPU weight distribution (a minimal loading sketch follows this list).
  • Overall, the content focuses on real-world throughput/latency behavior rather than describing a new model release or vendor announcement.
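For context, here is a minimal sketch of how a setup like this is typically loaded with llama-cpp-python. The GGUF filename, context size, and even split ratios are illustrative assumptions, not details taken from the post:

```python
from llama_cpp import Llama

# Hypothetical filename. Q3_K_M averages roughly 3.9 bits/weight, so a
# 128B model's weights come to roughly 60 GB, which fits in 72 GB of
# VRAM with headroom left for the KV cache.
llm = Llama(
    model_path="mistral-medium-3.5-128b-Q3_K_M.gguf",
    n_gpu_layers=-1,               # offload every layer to GPU
    tensor_split=[1.0, 1.0, 1.0],  # assumed even split across the three 3090s
    n_ctx=8192,                    # illustrative context window
)

out = llm("Explain KV-cache paging in one paragraph.", max_tokens=128)
print(out["choices"][0]["text"])
```

Note that `tensor_split` takes relative proportions, so `[1.0, 1.0, 1.0]` distributes layers roughly evenly across the three cards; in practice people often skew the split slightly away from the GPU that also holds the context buffers.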