This is really promising. Are they now going to scale this up to hundreds of billions of parameters? Why stop at 1.5B if they found a potentially SOTA architecture?
Probably constrained by training resources. It's much easier to experiment with a smaller architecture. You may need many training runs to figure out hyperparameters for example. If each run needs multiple GPUs for a week the cost adds up quickly. I think it makes a lot of sense to start small.
This just happened up north, apparently the largest proposed AI data center in Canada (Synapse's $10B 1GW campus in Olds, Alberta) was just put on pause after the Utilities Commission rejected its power application on March 6, 2026 due to noise pollution concerns from 20 gas turbines, 10 steam generators, and up to 600 diesel backups near 800 homes (just 200m away). The assessments failed to model cumulative worst-case noise. The proposal will be revised and resubmitted of course but the concern isn't going to go away.
I recently set up Hammerspoon to surveil my own computer usage actions (active tab/window, typing state, scrolling) to have a next-action predictor. It shows the predicted next action at the top of the screen but I was thinking of using it to improve voice command accuracy.
What I think would actually be useful is a version of what was implemented on /r/ClaudAI which is an official bot which summarizes the discussion (and updates after x number of comments have been added). I think this level of synthesis has a compounding effect on discussion quality and pruning redundant arguments/topics.
I don't spend much time on that subreddit, but I've seen that bot on a couple posts I've read and have been pleasantly surprised by how useful it seemed. I may eat my words on this later, but to me this is exactly the kind of application of AI that I have always thought was the most promising.
I came to the same conclusion from first principles. Whether AI or not, it's useful to have a third party facilitator identify and compress the key topics of a conversation to help experts navigate the crystallization of new knowledge
tmux supports tabs so you can have multiple Claude Code sessions running concurrently. You do need to learn a few tmux keyboard shortcuts to use it effectively (e.g. opening/closing/switching tab).
With hardware and model improvements, the future is bright.
reply