Hacker News | dial9-1's comments

Still waiting for the day I can comfortably run Claude Code with local LLMs on macOS with only 16 GB of RAM.


My super uninformed theory is that local LLMs will trail foundation models by about two years for practical use.

For example, a lot of work right now is going into improving tool calling and agentic workflows, while tool calling only started showing up in local LLMs around the end of 2023.

This is putting aside the standard benchmarks, which get "benchmaxxed" by local LLMs and show impressive numbers, but rarely meet expectations when used with OpenCode. On paper Qwen3.5-397B-A17B should be nearly a Sonnet 4.6-class model, but it is not.


How close is this? It says it needs 32 GB minimum?


You can run Qwen3.5-35B-A3B on 32 GB of RAM, sure. But to get "Claude Code" performance, by which I assume he means Sonnet- or Opus-level models in 2026, is likely still a few years away from being runnable locally on reasonable hardware.
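To put rough numbers on the "fits in 32 GB" claim: a quantized model's weight footprint is roughly parameter count times bits per weight. A back-of-the-envelope sketch, where the ~4.5 bits/weight (Q4_K-style quantization) and the overhead figure are assumptions, not measurements:

```python
# Rough memory estimate for a quantized LLM: weights at ~4.5 bits/weight
# (a typical Q4_K-style figure), plus an assumed few GB for KV cache and
# runtime overhead. These constants are illustrative, not measured.

def model_memory_gb(params_b: float, bits_per_weight: float = 4.5,
                    overhead_gb: float = 4.0) -> float:
    """Approximate resident memory in GB for a quantized model."""
    weights_gb = params_b * bits_per_weight / 8  # params in billions -> GB
    return weights_gb + overhead_gb

print(f"35B  @ ~Q4: ~{model_memory_gb(35):.1f} GB")   # fits in 32 GB, barely
print(f"397B @ ~Q4: ~{model_memory_gb(397):.1f} GB")  # far beyond consumer RAM
```

Which is why a 35B MoE squeezes into a 32 GB machine while the 397B model discussed above is out of reach locally regardless of quality.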


I fully agree. I run that one with Q4 on my MBP, and the performance (including quality of responses) is a letdown.

I wonder how people rave so much about local "small device" LLMs versus what Codex or Claude Code are capable of.

Sadly there is too much hype around local LLMs; they look great for a 5-minute test and that's it.


Just train it better with AGENTS.md


I'm reading "more than 32GB of unified memory" to mean at least a 36 GB model.


Doesn't OpenCode support local models?


You can, but the quality sucks.

Local LLMs don't make sense for most people compared to "cloud" services, even more so for coding.
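For what it's worth, wiring a local model up is the easy part: most coding tools that speak the OpenAI chat API can be pointed at a local server. A minimal setup sketch using llama.cpp's llama-server, where the model path, context size, and port are placeholders:

```shell
# Serve a local GGUF model behind an OpenAI-compatible endpoint (llama.cpp).
# The model file is a placeholder; output quality still depends entirely
# on the model, which is the actual complaint above.
llama-server -m ./models/local-model-q4_k_m.gguf -c 8192 --port 8080

# Any OpenAI-compatible client can then target http://localhost:8080/v1:
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages":[{"role":"user","content":"Write a hello-world in Go"}]}'
```

So "supports local models" and "local models are good enough" are separate questions; the thread is really about the second.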


The OP probably isn't telling the whole story and must have some kind of drug addiction sucking up all of his wealth, because how do you even end up in a van when you split rent with a girlfriend/roommates? Also the part where he refuses non-vegan food. Yikes. And skateboarding as a hobby sounds great when you are uninsured.


> and skateboarding as a hobby sounds great when you are uninsured

OK, THAT is clearly a failure of the system. Broken and sick people deserve top-quality medical care; it's what a high-functioning society would do.


scale of complexity


That's a tempting answer. I see why you proffer it. But I have to say no.

Complexity is neither an immanent feature nor an inevitability. Behind unruly complexity lies our failure to manage it, and indeed a love of complexity, a fetish that seduces us into ever more of it.

To defeat complexity we have to embrace and engage with it. We have to see which parts of the technology that got us to where we are must now be justifiably rejected.

All I see right now, especially with regard to "AI" and the new wave of techno-populism, is a retreat from complexity and a further embrace of "magic".


modules just do it better


buy an ad


I don’t own lunch money so it’s just a user’s opinion. But your cynicism has added real value to this thread. Thanks!


simply because your wage will not keep up with inflation, forcing most people to take a second job


it's catalá in catalá


It's actually "català" not "catalá". Source: I'm català.


diversity is good


you can accomplish that with just modules and functions


I'd say that everything (modern-style) OOP does for organizing code comes from copying earlier module systems. There is really nothing else there.


so it's just the "drag & drop .jar file into production" reinvented

