Hello everybody,
I am working on a project where the player gives commands to a creature in a structured game world and the creature shall react to the player's prompt in a sensible way.
The world is described as JSON with distances, directions, object type, unique id
The prompt examples are:
- Get the closest stone
- Go to the tree in the north
- Attack the wolf
- Get any stone but avoid the wolf
And the output is (grammar enforced) JSON with action (move, attack, idle, etc) and the target plus a reasoning for debugging.
I tried Qwen 1.5B instruct and reasoning models it works semi well. Like 80% of the time the action is correct and the reasoning, too and the rest is completely random.
I have some general questions when working with this kind of models:
- is JSON input and output a good idea or shall I encode the world state and output using natural language instead? Like "I move to stone_01 at distance 7 in north direction"
- are numeric values for distances good practice or rather a semantic encoding like "adjacent", "close", "near", "far"
- Is there a better model family for my task? in wanna stay below 2B if possible due to generation time and size.
Thanks for any advice.
[link] [comments]



