I fine-tuned Gemma 3 4B on a psychotherapy dataset using DPO as part of an experiment to make a local chatbot that can act as a companion (yes, this is absolutely not intendended to give medical advice or be a therapist).
I must thank whoever invented QLoRa and PeFT - I was able to run the finetuning on my RTX 3050Ti laptop. It was slow, and the laptop ran hot - but it worked in the end :D
What testbenches can I run locally on my RTX 3050Ti 4GB to evaluate the improvement (or lack thereof) of my finetuned model vis-a-vis the "stock" Gemma 3 model?
[link] [comments]



