Anthropic and the military had a contract. The military wanted to change the terms of that contract. Anthropic said no, which is their clearly defined contractual right. They got labeled a supply chain risk. How is this anything other than a shakedown? Does contract law mean anything to this administration?
1. The restriction applies to even writing documentation, adding comments, scanning for bugs, or even scanning for security vulnerabilities in systems for fully autonomous weapons. As automated vulnerability discovery gets stronger and stronger it is critical that have the ability to have a strong defense.
2. It is a principled take on that private companies shouldn't be making the decisions what their tools can and can't be used for in such an important sector.
There aren't remote "drivers" in the Philippines, that's not how Fleet Response works. You can see how it works here if you're curious: https://waymo.com/blog/2024/05/fleet-response but the TLDR is that they give the Waymo driver options in confusing situations (things like, you can go use this driveway on the side to pass this blocked traffic).
As for MMLU, is your assertion that these AI labs are not correcting for errors in these exams and then self-reporting scores less than 100%?
As implied by the video, wouldn't it then take 1 intern a week max to fix those errors and allow any AI lab to become the first to consistently 100% the MMLU? I can guarantee Moonshot, DeepSeek, or Alibaba would be all over the opportunity to do just that if it were a real problem.
I mean, Waymo gives a lot of examples of the situations, in their blog post about Fleet Response where they detail this, released May 21, 2024. They're very explicit that the Waymo Driver autonomous system is in control the entire time.
Thanks for this link. I’ve failed to find specifics on this for a while but this is pretty good, particularly the example about which lane to choose when cones are set up.
There are two compilers that can handle the Linux kernel. GCC and LLVM. Both are written in C, not Rust. It's "in distribution" only if you really stretch the meaning of the term. A generic C compiler isn't going to be anywhere near the level of rigour of this one.
There are several C compilers written in Rust from scratch of comparable quality.
We do not know whether Anthropic has a closed source C compiler written in Rust in their training data. We also do not know whether Anthropic validated their models on their ability to implement C compiler from scratch before releasing this experiment.
That language J I proposed does not have any C compiler implemented in it at all. Idiomatic J expertise is scarce and expensive so that it would be a significant expense for Anthropic to have C compiler in J for their training data. Being Turing-complete, J can express all typical compiler tips and tricks from compiler books, albeit in an unusual way.