Are the "random-baseline accuracy" numbers correct?

In the "Two circles" test, do the circles really have a 50% chance of overlapping? I think this comes from "Distances between circle perimeters: -0.15 to 0.5 times the diameter", but it doesn't say which distribution they use.



They asked the AI a question with a yes/no response. If the AI chose randomly, it would be correct 50% of the time. That’s what “random baseline accuracy” means.
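To make that concrete, here is a minimal simulation sketch. It assumes a guesser that answers yes or no with a fair coin flip, and the function name and the overlap rates tried are my own illustrative choices. The 0.15 / 0.65 figure is only what you would get if the gap were drawn uniformly from -0.15 to 0.5 diameters, which the source does not state.

```python
import random


def random_guess_accuracy(p_overlap, trials=100_000, seed=0):
    """Estimate the accuracy of a fair coin-flip guesser on a yes/no question.

    p_overlap is the (hypothetical) fraction of images whose true answer is "yes".
    """
    rng = random.Random(seed)
    correct = 0
    for _ in range(trials):
        truth = rng.random() < p_overlap  # true answer for this image
        guess = rng.random() < 0.5        # guesser ignores the image entirely
        correct += (truth == guess)
    return correct / trials


# Assumption for illustration only: gap uniform on [-0.15, 0.5] diameters,
# with overlap whenever the gap is negative.
p_overlap_if_uniform = 0.15 / 0.65

for p in (0.5, p_overlap_if_uniform, 0.9):
    print(f"overlap rate {p:.2f} -> random-guess accuracy {random_guess_accuracy(p):.3f}")
```

The accuracy stays near 0.5 whatever the overlap rate is, because a fair coin matches a "yes" half the time and a "no" half the time. So the 50% baseline does not depend on how the distances were sampled.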



