
> Yeah it's surprising that it works from such sparse rewards. I think imagining a lot of scenarios in parallel using the world model does some of the heavy lifting here.

This is such gold. Thanks for sharing. Immediately added to my notes.


