Is your team measuring how much of your code is written with Claude and comparing across the team to see what works best in your codebase? How are you learning from each other?
I’m making a team version of my buildermark.dev open source project and trying to learn about how teams would like to use it.
Different teams are using it in very different ways so it can be tough to compare meaningfully.
Backends handling tens to hundreds of thousands of messages per second, with extremely high correctness and resilience requirements, necessarily take a different approach than less critical services powering ancillary sites and pages, or than front-end web apps.
That said, there's a lot of very open discussion around tooling, "skills", MCP, harnesses, and other approaches, and plenty of sharing and cross-pollination of techniques.
It would be great to find ways to better quantify the actual value added by LLMs and by the various ways of using them, but our experience so far is that the landscape, in terms of both model capability and tooling, is shifting so fast that it's quite hard to do.
Thanks for the feedback. I agree that it’s changing very fast, which is why my thesis is that this tooling will be needed to help everyone on the team keep up.
To help you decide if you should keep your Claude subscription, you can see how much of your code is written by Claude Code with my project (open source, local): https://github.com/gelatinousdevelopment/buildermark
I've been using a VM for Claude Code (and would probably keep doing that, since I like how much control it gives me), but this is definitely a useful tool; I'll happily use it in the future.
I’m impressed, really neat work! Why did you opt for closed source?
edit: I don’t have a problem with closed source, but when software is expected to be accountable for my security I get a little paranoid, so I was curious about the safety guarantees here. The UX and everything else looks great.
Yeah, that’s understandable. Many open source macOS-only apps seem to get abandoned, so I’m trying to build something sustainable.
It uses only 3 dependencies that are very well known and widely used, so supply chain risk is minimal. That leaves me, the developer, as the main point of trust.
I like this! I built something similar for sandboxing CLI agents, and in the repo have a collection of minimal profiles for sandbox-exec to use - https://agent-safehouse.dev/
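For anyone who hasn't seen one, a minimal sandbox-exec profile in that deny-by-default style looks roughly like this (the paths here are illustrative, not taken from the repo above):

```scheme
(version 1)
(deny default)
; let the process load system libraries so it can start at all
(allow file-read* (subpath "/usr/lib") (subpath "/System"))
; confine writes to the project directory (illustrative path)
(allow file-write* (subpath "/Users/me/project"))
```

You'd run it with something like `sandbox-exec -f profile.sb some-command`. Apple marks sandbox-exec as deprecated, but it still works, which is why tools keep building on it.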
Yeah, they all do sometimes, but there the agent decides what to allow and can choose not to use it at all. This gives the user full control of the sandbox, so you can run the agent in YOLO mode.
No, I run separate URL detection to make links clickable. However, SwiftTerm added link detection just a few days ago and I haven’t had time to look into theirs yet.
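For what it's worth, the regex side of that kind of URL detection is simple; here's a rough sketch of the idea (my own, not the app's actual implementation):

```python
import re

# Simplistic pattern: http(s) scheme followed by non-whitespace,
# excluding a few characters that commonly delimit links in text.
URL_RE = re.compile(r"https?://[^\s<>\"]+")

def find_urls(text: str) -> list[str]:
    """Return URLs found in a chunk of terminal output."""
    urls = []
    for match in URL_RE.finditer(text):
        # Trim punctuation that tends to trail a URL in prose.
        urls.append(match.group(0).rstrip(".,;:!?)"))
    return urls

print(find_urls("See https://news.ycombinator.com/item?id=46874139, then reload."))
# → ['https://news.ycombinator.com/item?id=46874139']
```

The hard part in a terminal isn't the regex, it's handling URLs that wrap across lines and mapping matches back to on-screen cell positions, which is presumably what SwiftTerm's built-in detection handles.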
The same concept is possible on Linux, but I don't think anyone has created a nice UI for it yet. There was a post yesterday about doing it on the command line on Linux:
https://news.ycombinator.com/item?id=46874139
One of the nice things in Multitui is that it monitors what is blocked and gives you a way to add a read/write rule from the UI.