The previous post was probably automodded or something, so I'll give you the TL;DR and point you to search for the model card yourself. Tbh, it's sad that bot posts and AI-written posts get promoted while human-made ones get banned. I trained an 8B model on 4chan data and it outperformed the base model; I did the same for a 70B model and it also outperformed the base model. This is quite rare. You can read about it in the linked threads (there are links to the Reddit posts in the model cards).
4Chan data can almost certainly improve model capabilities.
Reddit r/LocalLLaMA / 4/7/2026
💬 Opinion · Signals & Early Trends · Ideas & Deep Analysis · Models & Research
Key Points
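- The post claims that training 8B and 70B models on 4chan data improved performance over the respective base models.
- It suggests that public data such as 4chan can be useful for additional training, and describes this kind of improvement as "quite rare."
- It says the evaluation details and results can be found in the linked threads and model cards.
- It also touches on moderation, complaining that bot and AI-generated posts fare better than human-written posts, which get banned.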