Submitted by /u/Goldkoron
I made a 35% REAP of 397B with potentially usable quality in 96GB GPU
Reddit r/LocalLLaMA / 4/5/2026
💬 Opinion · Signals & Early Trends · Tools & Practical Usage · Models & Research
Key Points
- The post claims the author produced a REAP-compressed version of a 397B-parameter mixture-of-experts model, shrinking it by a reported 35% via expert pruning while maintaining potentially usable quality.
- The resulting model is stated to fit and run on a 96GB GPU setup, making it far more feasible for local/consumer-grade hardware than the full-size 397B model.
- A Hugging Face link is provided to the released artifact (Qwen3.5-397B-A17B-REAP35), enabling others to test, benchmark, and fine-tune the compressed model.
- The focus is on practical viability of weight compression/efficiency techniques (REAP) rather than a new training method or official product announcement.
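The expert-pruning idea behind REAP (Router-weighted Expert Activation Pruning) can be sketched as scoring each expert by its router-weighted activation magnitude over a calibration set, then dropping the lowest-scoring fraction. The NumPy sketch below is purely illustrative: the shapes, saliency formula, and 35% ratio are assumptions for demonstration, not the released model's actual pruning code.

```python
import numpy as np

rng = np.random.default_rng(0)

n_tokens, n_experts, d_model = 1024, 16, 64
prune_frac = 0.35  # drop the 35% least salient experts (hypothetical ratio)

# Hypothetical calibration data: per-token router gate weights
# (normalized over experts) and per-expert output activations.
gates = rng.random((n_tokens, n_experts))
gates /= gates.sum(axis=1, keepdims=True)
expert_out = rng.standard_normal((n_tokens, n_experts, d_model))

# Saliency per expert: router weight times expert-output norm,
# averaged over the calibration tokens.
saliency = (gates * np.linalg.norm(expert_out, axis=2)).mean(axis=0)

# Keep the most salient (1 - prune_frac) experts, drop the rest.
n_keep = int(round(n_experts * (1 - prune_frac)))
keep_ids = np.sort(np.argsort(saliency)[::-1][:n_keep])
print(f"keeping {n_keep}/{n_experts} experts:", keep_ids)
```

In a real MoE checkpoint the kept expert weights would then be re-saved and the router's output dimension reduced accordingly; only total parameter count shrinks, while the per-token active parameter count (the "A17B" in the model name) stays in the same regime.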
Related Articles
- Black Hat USA (AI Business)
- Black Hat Asia (AI Business)
- Who is Xu Rui, the ex-ByteDance executive tapped by Meta to lead AI hardware? (SCMP Tech)
- I Built a Voice AI with Sub-500ms Latency. Here's the Echo Cancellation Problem Nobody Talks About (Dev.to)
- How I Found $1,240/Month in Wasted LLM API Costs (And Built a Tool to Find Yours) (Dev.to)