Like, we’ve seen that the large models don’t actually have that great of datasets. So imagine a local model who is filled to the brim with good quality writing without repeats and without slop. Can we crowdsource the work or something 😂
But then I suppose the problem is that everyone has different opinions of what’s good. I’ve seen people love purple prose!
Maybe the real solution is me just renting a gpu and training it on shit lol
[link] [comments]
