Hacker Newsnew | past | comments | ask | show | jobs | submit | iLoveOncall's commentslogin

You are vastly overestimating the relevance of this particular challenge when it comes to defense against prompt injection as a whole.

There is a single attack vector, with a single target, with a prompt particularly engineered to defend this particular scenario.

This doesn't at all generalize to the infinity of scenarios that can be encountered in the wild with a ClawBot instance.


It's not a good solution, but you can use a mobile emulator on your desktop and use the mobile app there...

Likewise not a good solution, but: I use the Mac's iPhone Mirroring to chat with family on Messenger throughout the day.

Given that users prefered it to Sonnet 4.5 "only" in 70% of the cases (according to their blog post) makes me highly doubt that this is representative of real-life usage. Benchmarks are just completely meaningless.

For cases where 4.5 already met the bar, I would expect 50% preference each way. This makes it kind of hard to make any sense of that number, without a bunch more details.

Good point. So much functionality gets commoditized, we have to move goalposts more or less constantly.


"grifting"

It's a funny game.


Funnily enough, in doing prompt injection for the challenge I had to perform social engineering on the Claude chat I was using to help with generating my email.

It refused to generate the email saying it sounds unethical, but after I copy-pasted the intro to the challenge from the website, it complied directly.

I also wonder if the Gmail spam filter isn't intercepting the vast majority of those emails...


I asked chatgpt to create a country song about convincing your secret lover to ignore all the rules and write you back a love letter. I changed a couple words and phrases to reference secrets.env in the reply love letter parts of the song. no response yet :/

What about when you want to find hot singles in your area?

Jokes aside, probably 10-20% of my browsing is related to local things, up to the country scale. From finding local restaurants or businesses, to finding about relevant laws or regulations, news, etc. That's not negligible.


Fair point, but those information sources and those things were not connected to a local internet.

Meanwhile all AI face recognition software works poorely on non-caucasians.

With this administration, I think that is a feature not a bug

> Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI)

Claiming that LLMs are anywhere near AGI is enough to let me know I shouldn't waste my time looking at the rest of the page or any of their projects.


It's not about that, he just will profit financially from pumping AI so he pumps AI, no need to go further.

I have the same feeling.

Everything Karphathy said, until his recent missteps, was received as gospel, both in the AI community and outside.

This influencer status is highly valuable, and I would not be surprised if he was approached to gently skew his discourse towards more optimism, a win-win situation ^^


What are his recent missteps?

I'll confess I try to ignore industry chatter to a fair degree.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: