Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Kimi 2 is remarkably consistently the best. I wonder if it's somehow been trained specifically on tasks like these. It seems too consistent to be coincidence

Also shocking is how the most common runner up I've seen is DeepSeek



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: