Are the "random-baseline accuracy" numbers correct? In the "Two circles" test, d... | Hacker News


tantalor on July 10, 2024 | on: Vision language models are blind

Are the "random-baseline accuracy" numbers correct? In the "Two circles" test, do the circles really have a 50% chance of overlapping? I think this comes from "Distances between circle perimeters: -0.15 to 0.5 times the diameter", but the paper doesn't say which distribution they use.

jdlshore on July 10, 2024

They asked the AI a question with a yes/no response. If the AI chose randomly, it would be correct 50% of the time. That’s what “random baseline accuracy” means.
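To make that concrete, here is a minimal sketch, assuming the guesser picks yes or no with equal probability and independently of the image. The function name `random_guess_accuracy` and the `p_overlap` values are hypothetical and chosen only for illustration; they are not taken from the paper. The arithmetic is p * 0.5 + (1 - p) * 0.5 = 0.5, so the expected accuracy stays at 50% no matter how often the circles actually overlap.

```python
import random

def random_guess_accuracy(p_overlap, trials=100_000, seed=0):
    """Estimate the accuracy of a coin-flip guesser on a yes/no question.

    p_overlap is a hypothetical probability that the true answer is "yes"
    (the circles overlap). The paper's actual distribution is not known here.
    """
    rng = random.Random(seed)
    correct = 0
    for _ in range(trials):
        truth = rng.random() < p_overlap   # ground-truth answer for this sample
        guess = rng.random() < 0.5         # uniform random yes/no guess
        correct += (truth == guess)
    return correct / trials

# Accuracy should hover around 0.5 for every class balance.
for p in (0.1, 0.5, 0.9):
    print(p, round(random_guess_accuracy(p), 3))
```

Note this only holds for a uniform guesser. A guesser that always answered with the majority class would score max(p, 1 - p) instead, which is where the distribution of the test data would start to matter.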
