cmonday's comments

I'm excited to share Fuji-Web, an open-source AI web agent designed to automate various web tasks.

I had the idea of using a vision LLM to build a web agent in Nov 2023, when GPT-4V had just been released. I'm proud that this idea has evolved into a state-of-the-art web agent. (You can find benchmarks in the blog post.)

We started this research because we wanted to find out how far we are from having an LLM-based assistant that is capable of navigating the complex real world. It turns out that if you can narrow down the problem and give clear instructions, we are almost there!

Repo: https://github.com/normal-computing/fuji-web

Our blog post: https://blog.normalcomputing.ai/posts/2024-05-22-introducing...

