Does anyone know? I can’t even find a server gpu <b200 on vast, and for the first time that I’ve ever seen on mithril, at multiple points last week have h100/h200/b200 all been at over $1k an hour, for sustained periods! I don’t know why you wouldn’t just migrate to runpod at that point, even their pricing isn’t that costly.
Seriously, academics can’t afford that, and I’d assume startups would just buy hardware to lock compute prices in. What in gods green Earth is going on?
———
EDIT: this applies to localLlama as I am literally training models / developing projects expressly for the consumption of the community here. I can’t finish my bitnet pipeline until pricing comes back down.
[link] [comments]


