So I've tried using Kimi 2.5 in a personal project through AWS Bedrock. For simple tasks it does quite well. But when it comes to tool calling, the model doesn't seem that great: it hallucinates tool calls roughly 5 out of 10 times, from what I've noticed. Claude and OpenAI models, on the other hand, are really reliable at tool calling. Has anyone else faced this issue, or is this a Bedrock problem? I haven't tried the official Kimi API, but the model should be the same under the hood.
Are the Chinese models really as good as we think they are?



