most people don't realize that temperature just adjusts the fan speed on the GPU cluster that's serving you.
gojo · 15 Aug, 23:20
i was literally talking to this "LLM researcher" about setting temperature in LLMs, and i asked, "you know why lowering or raising the temperature makes the outputs more deterministic or more random, right?" and he said, "yeah, it changes the way tokens are represented." boy, wtf. people IN the fucking field have no idea about Boltzmann statistics or even softmax. i'm gonna cry.
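for anyone wondering what the answer should have been: a minimal sketch, assuming the usual setup where the logits are divided by the temperature T before the softmax. that's the same form as a Boltzmann distribution, p_i ∝ exp(-E_i / kT), with the logits playing the role of negative energies. the function name and the toy logits below are made up for illustration, not taken from any particular library.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw logits into probabilities, scaled by a temperature T > 0.

    Hypothetical helper for illustration only.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Dividing by T is the whole trick: small T stretches the gaps
    # between logits, large T shrinks them.
    scaled = [x / temperature for x in logits]
    # Subtract the max before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Toy logits for three candidate tokens (assumed values).
logits = [2.0, 1.0, 0.0]
for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

low T piles the probability mass onto the top token, so sampling looks nearly deterministic. high T flattens the distribution, so sampling looks more random. the token representations don't change at all.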