Nemotron 3 Superは自分の推論をユーザーのメッセージとして読み取ってしまうのか？

Reddit r/LocalLLaMA / 2026/3/18

💬 オピニオンDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

共有:

要点

投稿は、Nemotron 3 Super が llama-server バックエンドと Aider クライアントを使用した際に自分の推論をループさせてしまい、ユーザー入力の自己分析を繰り返す現象について論じている。
この挙動は、ハードウェア関連フラグの一部をスキップしても持続することが指摘されており、単純なバックエンドのバグというより、モデルまたは設定の問題を示唆している。
長い抜粋では、最終的な回答を生成する前にモデルが自分のテキストを繰り返し省察する様子が示されており、ループの問題を示している。
著者は自己参照的な推論ループを診断・修正するデバッグのアイデアを求めており、長いトークンの消費を防ぐ方法についても検討している。

そう... Nemotron 3 Superを試してみたいと思いました（llama-serverをバックエンドとして、Aiderをクライアントとして使用します）。そして、それは自分自身の推論をユーザーメッセージとして解釈し、それをループしてしまうようです... これは興味深い問題で、他のモデルでは一度も経験したことがありません。多分、私が何か間違っているのでしょうか？何かアイデアはありますか？

重要な点として、llama-serverを以下のフラグで実行しています。ハードウェアにのみ関連するフラグはスキップします： --special（このフラグなしでも同じ動作をしました）、 --jinja

参考として、彼の推論の冒頭部分（および8kトークン分の思考の末の最終回答 xd）:

Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text. Let me break this down. The user starts with: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing." But then they provide a block of text that looks like it's meant to be processed. Looking at the provided text, it begins with: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." Then it goes into a block that says: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." This seems to be repeating. Then it has: "Let me break this down. The user starts with: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing." But then they provide a block of text that looks like it's meant to be processed." This is getting meta and confusing. It appears the user might have pasted some instructions multiple times or there's a loop in the text. The key part I need to focus on is the actual instruction at the very beginning: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." But then the user says: "Let me break this down." and proceeds to describe what they think the user shared. However, the core instruction I should follow is given at the very start of the user's message: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." ... ... <a couple thousands of tokens later> I apologize, but I'm unable to identify a clear query or specific text to process in your input due to repetitive, meta-commentary content and potential formatting issues. Your message appears to contain multiple instances of self-referential instructions about handling queries (e.g., "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing...") without a distinct, actionable request or data to work on. To assist you effectively, please: • Resend your actual question, text for processing, or task in a clear, concise format. • Avoid including meta-instructions or repetitive commentary in your resubmission. • Example of a clear query: "Summarize the key points of this article: [paste text]" or "What is the capital of Japan?" Once you provide a straightforward input, I’ll be happy to help with text processing, analysis, or any other task you need. Thank you for your patience!

submitted by /u/Real_Ebb_7417
[リンク] [コメント]

Translator

Azure OpenAI Service ドキュメント

200人のChatGPTユーザーに聞いた最大の不満。トップ5はすべてChatGPT Toolboxが解決する問題だった。

Reddit r/artificial

すべてのPRをセキュリティバグでレビューするAIを作った — その方法（2026）

Dev.to

[R] アイデンティティ・アンカーと権限階層の組み合わせが abliterated LLMs で 100% の拒否を実現 — システムプロンプトのみ、ファインチューニングなし

Reddit r/MachineLearning

私がリードを見つけ、個別化されたコールドメールを作成するAI SDRエージェントを構築した方法

Dev.to

Nemotron 3 Superは自分の推論をユーザーのメッセージとして読み取ってしまうのか？

要点

関連記事

Translator

200人のChatGPTユーザーに聞いた最大の不満。トップ5はすべてChatGPT Toolboxが解決する問題だった。

すべてのPRをセキュリティバグでレビューするAIを作った — その方法（2026）

[R] アイデンティティ・アンカーと権限階層の組み合わせが abliterated LLMs で 100% の拒否を実現 — システムプロンプトのみ、ファインチューニングなし

私がリードを見つけ、個別化されたコールドメールを作成するAI SDRエージェントを構築した方法

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer