fant ut hvordan jeg kunne "angre" RL og gjøre gpt-oss tilbake til en basismodell vil slippe vektene i morgen Gn
jack morris
jack morris9. aug., 03:21
curious about the training data of OpenAI's new gpt-oss models? i was too. so i generated 10M examples from gpt-oss-20b, ran some analysis, and the results were... pretty bizarre time for a deep dive 🧵
195,39K