Best local model that fits into 24GB VRAM for classification, summarization, explanation?

Reddit r/LocalLLaMA / 3/23/2026

💬 OpinionSignals & Early TrendsTools & Practical Usage

共有:

Key Points

投稿者は24GB VRAM（必要なら64GB RAM）環境で動作し、分類・要約・説明を行えるローカルLLM/モデルの候補を探しています。
入力としてテキストや画像を扱い、指定したタクソノミ（分類体系）に基づいて分類結果を構造化データとして返すことを想定しています。
要約や長所/短所の提示などはプロンプト側でルール（指示）を追加することで制御したいと述べています。
目標の推論速度として最低20〜40 tokens/secondを求めています。

Looking for suggestions for a model that can fit in 24GB VRAM and 64GB RAM (if needed) that could run at least a 20-40 tokens/second.

I need to take input text or image and classify content based on a provided taxonomy list, summarize the input or explain pros/cons (probably needs another set of rules added to the prompt to follow) and return structured data. Thanks.

submitted by /u/AdaObvlada
[link] [comments]

The Moonwell Oracle Exploit: How AI-Assisted 'Vibe Coding' Turned cbETH Into a $1.12 Token and Cost $1.78M

Dev.to

How CVE-2026-25253 exposed every OpenClaw user to RCE — and how to fix it in one command

Dev.to

Day 10: An AI Agent's Revenue Report — $29, 25 Products, 160 Tweets

Dev.to

What CVE-2026-25253 Taught Me About Building Safe AI Assistants

Dev.to

Vision and Hardware Strategy Shaping the Future of AI: From Apple to AGI and AI Chips

Dev.to

Best local model that fits into 24GB VRAM for classification, summarization, explanation?

Key Points

Related Articles

The Moonwell Oracle Exploit: How AI-Assisted 'Vibe Coding' Turned cbETH Into a $1.12 Token and Cost $1.78M

How CVE-2026-25253 exposed every OpenClaw user to RCE — and how to fix it in one command

Day 10: An AI Agent's Revenue Report — $29, 25 Products, 160 Tweets

What CVE-2026-25253 Taught Me About Building Safe AI Assistants

Vision and Hardware Strategy Shaping the Future of AI: From Apple to AGI and AI Chips

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer