Was using Gemma-4-A3b-26B for a while for chat (using llama.cpp for backend and Open Web UI for client features). I’ve been using Qwen-3.6-A3B for agents and am currently playing with one of HauHuaCS’s uncensored Qwen models for chat and really liking it.
I also have an agent using Kimi 2.6 as a backend (which is open, but not local) and for some coding tasks as well.
I also have an agent using Kimi 2.6 as a backend (which is open, but not local) and for some coding tasks as well.