My system has 32 GB of RAM and 8 GB of VRAM. I tried DeepSeek-R1-Distill-Qwen-7B-Q6_K_L.gguf and it was vastly inadequate for what I wanted, so I'm looking for other suggestions.
I'm working on a procedural text-adventure engine where the world is a strict 2D coordinate grid. The model receives a board state (10x10) and a list of valid actions (up to 50). I’ve found that the 7B model I tried failed at 'spatial IQ': it kept hallucinating and ignoring the grid layout. I'm looking for a model I can split between VRAM and system RAM while staying under 32 GB, so some system memory stays free.
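To make the setup concrete, here is a rough sketch of what one turn could look like. It is only an illustration under assumptions: the function names (`render_board`, `build_prompt`, `pick_action`), the ASCII symbols, and the "reply with a number" convention are all hypothetical and not taken from the actual engine. The idea is to show the grid as text, number the valid actions, and reject any reply that is not one of those numbers.

```python
from typing import List, Optional, Set, Tuple

GRID_SIZE = 10  # the board is 10x10, as described in the post

Coord = Tuple[int, int]  # (x, y), assumed convention: x to the right, y downward


def render_board(player: Coord, walls: Set[Coord]) -> str:
    """Draw the grid as ASCII rows: P = player, # = wall, . = empty (assumed symbols)."""
    rows = []
    for y in range(GRID_SIZE):
        row = ""
        for x in range(GRID_SIZE):
            if (x, y) == player:
                row += "P"
            elif (x, y) in walls:
                row += "#"
            else:
                row += "."
        rows.append(row)
    return "\n".join(rows)


def build_prompt(board: str, actions: List[str]) -> str:
    """Number the valid actions so the model only has to return an index."""
    numbered = "\n".join(f"{i}: {a}" for i, a in enumerate(actions))
    return (
        "Board (x to the right, y downward):\n"
        f"{board}\n\n"
        f"Valid actions:\n{numbered}\n\n"
        "Reply with the number of exactly one action."
    )


def pick_action(reply: str, actions: List[str]) -> Optional[str]:
    """Return the chosen action, or None if the reply is not a valid index."""
    parts = reply.strip().split()
    if not parts:
        return None
    token = parts[0].rstrip(".:,")
    if token.isascii() and token.isdigit() and int(token) < len(actions):
        return actions[int(token)]
    return None  # hallucinated or malformed reply: the caller can re-prompt
```

With a check like `pick_action`, a reply that ignores the action list is caught by the engine instead of being applied to the world state, whichever model ends up being used.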