Notepad completely froze up on me the other day, from just closing tabs of text files. It's so bloated its a complete joke, it should be nothing more than text editing, get rid of all the nonsense added to it since win11
The BitNet docker image has been updated to support both llama-server and llama-cli in Microsoft's inference framework.
It had been updated to support just the llama-server, but turns out cnv/instructional mode isn't supported in the server only CLI mode, so support for CLI has been reintroduced enabling you to chat with many BitNet processes in parallel with an improved conversational mode (where as server responses were less coherent).
TL;DR: The updated extension simplifies fetching/running the FastAPI-BitNet docker container which enables initializing & then chatting with many local llama BitNet processes (conversational CLI & non-conversational server) from within the VSCode copilot chat panel for free.
I was able to run about 100 BitNet CLI processes before the additional processes started getting moved to SSD page swap file instead of running in RAM. How many do you think you could run on your computer?
This MASSIVELY improves the BitNet model; the prior BitNet models were kinda goofy, but this model is capable of actually outputting code and makes sense!