AI Navigate

Nemotron 3 Super reads its own reasoning as a user message?

Reddit r/LocalLLaMA / 3/18/2026

💬 Opinion · Developer Stack & Infrastructure · Tools & Practical Usage · Models & Research

Key Points

  • The post discusses Nemotron 3 Super looping on its own reasoning when used with a llama-server backend and Aider client: the model treats its reasoning text as if it were the user's message and keeps re-analyzing it.
  • The behavior occurs with or without the --special flag (the server is also run with --jinja; hardware-related flags are omitted from the post), which may point to a model or configuration issue rather than a simple backend bug.
  • A lengthy excerpt shows the model repeatedly quoting and reflecting on its own text before producing a final answer, illustrating the looping problem.
  • The author asks for debugging ideas to diagnose and fix the self-referential reasoning loop and stop it from consuming thousands of tokens.

So... I wanted to try Nemotron 3 Super (with llama-server as backend and Aider as client). And it looks like it interprets its own reasoning as a user message and then goes over it in a loop... That's an interesting issue, I've never encountered it with other models. I guess I'm doing something wrong? Do you have any ideas?

As for the relevant details: I run llama-server with these flags (leaving out the ones that are only hardware-related): --special (although the model did the same thing without this flag), --jinja
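
One possible first debugging step is to check whether reasoning text ends up inside the messages the client sends back to the server. Below is a minimal Python sketch of that check. It assumes OpenAI-style chat messages (a list of dicts with "role" and string "content") and that reasoning is wrapped in <think>...</think>. Neither assumption is confirmed by this post, and the function names are hypothetical.

```python
import re

# Assumption: reasoning is delimited by <think>...</think>.
# Adjust the pattern to whatever the model's chat template actually emits.
THINK_BLOCK = re.compile(r"<think>.*?</think>", re.DOTALL)


def strip_reasoning(text: str) -> str:
    """Remove <think>...</think> blocks and trim surrounding whitespace."""
    return THINK_BLOCK.sub("", text).strip()


def find_reasoning_leaks(messages: list[dict]) -> list[int]:
    """Return indexes of user-role messages that contain reasoning markers."""
    leaks = []
    for i, msg in enumerate(messages):
        content = msg.get("content") or ""
        if msg.get("role") == "user" and ("<think>" in content or "</think>" in content):
            leaks.append(i)
    return leaks


def clean_history(messages: list[dict]) -> list[dict]:
    """Return a copy of the history with reasoning removed from assistant turns."""
    cleaned = []
    for msg in messages:
        if msg.get("role") == "assistant":
            # Build a new dict so the caller's history is left untouched.
            msg = {**msg, "content": strip_reasoning(msg.get("content") or "")}
        cleaned.append(msg)
    return cleaned
```

If find_reasoning_leaks returns anything for a logged request, the reasoning is reaching the model as user input, and the next place to look would be the client or the chat template rather than the model weights.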

For reference, here is the beginning of its reasoning (and the final answer after 8k tokens of thinking xd):

Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text. Let me break this down. The user starts with: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing." But then they provide a block of text that looks like it's meant to be processed. Looking at the provided text, it begins with: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." Then it goes into a block that says: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." This seems to be repeating. Then it has: "Let me break this down. The user starts with: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing." But then they provide a block of text that looks like it's meant to be processed." This is getting meta and confusing. It appears the user might have pasted some instructions multiple times or there's a loop in the text. The key part I need to focus on is the actual instruction at the very beginning: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text." 
But then the user says: "Let me break this down." and proceeds to describe what they think the user shared. However, the core instruction I should follow is given at the very start of the user's message: "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing. The main content appears to be a series of instructions about how to handle a query, followed by some repetitive and possibly corrupted text."

... ...

<a couple thousand tokens later>

I apologize, but I'm unable to identify a clear query or specific text to process in your input due to repetitive, meta-commentary content and potential formatting issues. Your message appears to contain multiple instances of self-referential instructions about handling queries (e.g., "Okay, the user has shared a query that seems to be a mix of instructions and possibly a test or example related to text processing...") without a distinct, actionable request or data to work on. To assist you effectively, please:

  • Resend your actual question, text for processing, or task in a clear, concise format.
  • Avoid including meta-instructions or repetitive commentary in your resubmission.
  • Example of a clear query: "Summarize the key points of this article: [paste text]" or "What is the capital of Japan?"

Once you provide a straightforward input, I’ll be happy to help with text processing, analysis, or any other task you need. Thank you for your patience!
submitted by /u/Real_Ebb_7417