How many move your favorite LLM model before it's cheat then brain-dead in chess game ?

I try with Gemma 4 E4B via llama-sever to play chess at https://www.chess.com/play/computer (any platform or site you convenient), result quite unexpected for me.

Result: 9 moves before it make cheating move (like try to move a pawn take aside enemy) and brain-dead at 25 moves as it stuck in loop try to switch side, cheat move and create a non-exited piece to win a match.

https://preview.redd.it/01fr72svrgvg1.png?width=1472&format=png&auto=webp&s=dae0624a66c4db9cd489dd116029e893286b9b3a

--swa-full : not much better but waste double of VRam.

Enable Reasoning : not help at all.

--swa-full Reasoning : Waste both tokens and VRam.

System Message : Depend, it could be better, but I got it worse even with rule and how each piece move.

My though before this test is LLM might be loss as it's quite generic on doing thing, but I never thought it didn't even able to reach the end of a match, at best only half way.

submitted by /u/revennest
[link] [comments]

How many move your favorite LLM model before it's cheat then brain-dead in chess game ?

Key Points

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer