Apple doesn't have as much of an advantage as it appears. Their Neural Engine are fast and getting better, but modern machine learning models need another resource: fast RAM. Unfortunately for Apple, their whole product stack is segmented based on RAM. There are millions and millions of Apple devices with lightning-fast processors and 8GB or even 4GB RAM. These won't be able to do anything like Copilot+, unless they make calls to a datacenter like Microsoft is doing.
Their existing local models are hit-or-miss already. FaceTime's audio transcription is laughable. Whether this is due to memory requirements, I don't know, but it doesn't bode well for further models.
Their existing local models are hit-or-miss already. FaceTime's audio transcription is laughable. Whether this is due to memory requirements, I don't know, but it doesn't bode well for further models.