Strong vision and reasoning performance, and the 35-a3b model run s pretty ok on a 16gb GPU with some CPU layers.
reply
otherwise openrouter for routing to lots of different providers.
[1]: https://github.com/huggingface/transformers/tree/main/src/tr...
> News
> 2026-02-16: More sizes are coming & Happy Chinese New Year!
Strong vision and reasoning performance, and the 35-a3b model run s pretty ok on a 16gb GPU with some CPU layers.
reply