Martineski@lemmy.dbzer0.comtoTechnology@lemmy.zip•OpenAI releases o1, its first model with ‘reasoning’ abilitiesEnglish
3·
7 days agoI’m curious how it will do on the private benchmark that ai explained made. I think it was called simple bench?
Check out !lemmydirectory@lemmy.dbzer0.com. :3
I’m curious how it will do on the private benchmark that ai explained made. I think it was called simple bench?
https://simple-bench.com/index.html I was referring to this benchmark specifically because the point of it is to benchmark the actual reasoning capabilities of LLMs: