Insincere answer that will probably be attempted sincerely nonetheless: throw even more agents at the problem by having them do code review as well. The solution to problems caused by AI is always more AI.
Technically that's known as "LLM-as-judge" and it's all over the literature. The intuition is that the ability to choose between two candidates doesn't fully overlap with the ability to generate either one from scratch - evaluating is often easier than generating. It's a bit like the discriminator half of a generative adversarial network.
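To make that concrete, here's a minimal sketch of a pairwise LLM-as-judge pass. Everything here is illustrative: `complete` is a stub standing in for whichever model API you'd actually call, and the prompt wording is just one plausible shape, not a canonical template.

```python
# Sketch of an LLM-as-judge pairwise review. `complete(prompt) -> str`
# is a stand-in for a real model call; all names are illustrative.

def build_judge_prompt(diff_a: str, diff_b: str, criteria: str) -> str:
    """Ask the judge model to pick between two candidate patches."""
    return (
        "You are reviewing two candidate patches for the same task.\n"
        f"Criteria: {criteria}\n\n"
        f"Patch A:\n{diff_a}\n\n"
        f"Patch B:\n{diff_b}\n\n"
        "Answer with exactly 'A' or 'B', then one sentence of justification."
    )

def parse_verdict(reply: str) -> str:
    """Extract the judge's choice; fall back to 'A' on malformed output."""
    return "B" if reply.strip().upper().startswith("B") else "A"

# Stub so the sketch runs as-is; replace with a real API call.
def complete(prompt: str) -> str:
    return "B - tighter error handling."

prompt = build_judge_prompt("...", "...", "tests pass, idiomatic style")
print(parse_verdict(complete(prompt)))  # with the stub above, prints B
```

One known wrinkle worth handling in practice: judges show position bias, so it's common to run the comparison twice with A and B swapped and only accept a verdict when both runs agree.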
Simple, just ask an(other) AI! But seriously, different models are better/worse at different tasks, so if you can figure out which model is best at evaluating changes, use that for the review.
Surely humans are the ones initiating the agent though, no? Just do that at a measured pace. And set up comprehensive prompts/mechanisms to make sure the agent satisfies your criteria for tests, style, etc. - there are plenty of prompts and tools around the Cline/Roo community for doing stuff like that.
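For instance, Cline reads project-scoped instructions from a `.clinerules` file (Roo Code has an analogous rules mechanism). The contents below are an illustrative sketch of the kind of criteria people encode, not a canonical template:

```text
# .clinerules (illustrative example)
- Run the full test suite before proposing any commit; never weaken
  or delete a failing test to make it pass.
- Follow the existing code style; run the project formatter/linter
  and fix every warning it reports.
- Keep diffs minimal: no drive-by refactors outside the task scope.
- Stop and ask before adding any new dependency.
```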