Lines of code got a better publicist

This weird trend reached an apex in a Feb 2026 OpenAI blog post [1], recently on the front page [2], which describes the process for building... something... written 100% by agents.

There is no description of what the thing is, no indication of what value it provides its users. The closest it gets is "the product has been used by hundreds of users internally, including daily internal power users".

But the fact that the thing has a million lines of code is repeated twice in the first few hundred words.

[1] https://openai.com/index/harness-engineering/

[2] https://news.ycombinator.com/item?id=48416264

I'm constantly thinking about that Microsoft guy who posted something like "we want 1 million LoC per engineer per month", which basically read as satire to most engineers I talked to, except apparently it was not satire at all, and indeed seemed to reflect the position of many CEOs etc when it comes to LLM code generation.

I do think that over the past few months, it feels like the hype around producing unmaintainable amounts of LoC has started dying down. More pragmatic and realistic takes are seemingly shared more openly, and are maybe even getting through to top leadership at some tech companies. Maybe not all is lost yet.

> When a company says “AI made everyone more productive, so we need fewer people”, I want to see the evidence - and I don’t believe it exists today.

Because they're bullshitting and using AI as an excuse to correct from their covid era over-hiring while simultaneously making themselves look good to investors by showing they're embracing the hip new technologies to become a more streamlined and cost-efficient operation than ever.

It is endlessly... amusing (?) to me, that we as a community spent decades trying to make it clear that our productivity is not easily measured because what we're doing is complicated and long running, only for AI to come along and suddenly LoC, Nx multipliers, tickets / week etc are held up as useful if not objective measurements.

The reasons we rejected LoC and other measurements have not changed (broadly: code output isn't important, quality output is). AI has all the same problems people do. But for whatever reason we are throwing what we've learnt away. It's kind of embarrassing.

If your A+ senior developer spends 8 months working on a feature that ultimately doesn’t get shipped or an MVP that gets killed, then you wasted that A+ senior developer and their productivity was the same as the other two B+ engineers that also worked on the project. This is actually a very common issue and usually ignored when it comes to things like hiring or assigning resources to a project. AI won’t change that in a meaningful way: your team may just finish their tasks a lot faster, but the bureaucratic layer above will likely remain the same, which will make any AI coding gains negligible. Companies would have to be rebuilt from the top down for AI and that’s very unlikely to happen.

Weird baseless push for AI on the end, with no reasoning, no goal, no claim of gain. "Just go and use AI, people, developers must adopt new things."

It's not the first article I've read recently that is an ad for AI after a short context pretending to criticize it, with nothing connecting them.

>The difference this time is pace: you could delay adopting “the cloud” for a couple of years and survive. With AI you might get a few months.

It is weird that the author seems to understand that the pro-AI claims made by AI companies about the product’s necessity are not falsifiable, but then backtracks with “woah woah woah but don’t think I’m anti-AI.”

How is the assertion above any more rigorous than the productivity claims the author is criticizing throughout the rest of the article? That you won’t “survive” if you don’t adopt AI within a few months?

It is not true when the AI CEO says it, and it is not true when the person calling BS on the AI CEO… for some reason also says it…

>When a company says “AI made everyone more productive, so we need fewer people”,

They are implicitly saying that as a company, they don't want to be more productive. They want the same productivity while paying fewer, more productive people.

Why is there an imbalance between what an employer gets paid for a unit of production and what an employee gets paid for a unit of production?

I largely think that we engineers are to blame for LoC still being perceived as an asset rather than a liability. We are proud of stuff we create, but it turns out that you can't describe how "big" something is without some metric, and so we fall back on the metric that is easiest to compute.

Suggestion: we should all shift our terminology, and in particular make heavy use of the phrase "...and it cost N lines of code". And say what we spent those LoC on.

"I implemented new feature X, and it only cost 200 lines!"

"That bug was brutal to figure out, but in the end it only cost 6 lines of code."

"It was doing something in case X that it didn't do in case Y, and it turns out that the distinction wasn't even needed. So I fixed the problem and saved 20 lines of code at the same time!"

Lines of code are a price you pay. We don't go around bragging about how we spent $200 without any mention of what we purchased with that money. Why do we do that with LoC? "I had to pay an extra $200 because I signed up late" and "I only paid $200 for my hand-painted artisanal pottery lamp hanger. Factory-made ones cost upward of $1200 on Amazon!" are two very different statements, and map to exactly the same distinction in code.

I don't see LOC as that different from number of hours in the office. They'd always say pre-pandemic "If they're not in the office, how will I know they're working?" Simple, use the output metrics that you use to evaluate all of your workers to see what they contribute to the business.

Anecdotally, coworkers are writing a LOT more unit tests. By which I mean NOBODY is writing unit tests, but Claude is generating a ton of em. We’re talkin 300 lines of unit tests for 20 line changes that are already covered by other, better kinds of tests. Huge JSON objects of test data that we already have generators for, etc

I kinda feel as if this was the money quote:

> If you got a free headcount increase essentially overnight, why wouldn’t you use it to deliver more value to your customers, faster?

That shows that, in reality, it's short-sighted profit-taking. Boss just wants another lambo in the garage, and doesn't really plan to be around, when it's time to pay the piper.

By this argument, I guess we should write our code slower too. Maybe we should only use a single finger as we type. And add an extra day between email responses.

Because by the inverse of their argument, slower MUST be better, right?

This is silly. LLMs are a diesel powered keyboard. If I've got a backlog of features, why shouldn't I ship all of them? If they're bad ideas, I can also just remove them.

Nothing about SDLC best practices has changed EXCEPT the ability to increase volume.

> I’ve watched this industry absorb higher-level languages, IDEs, autocomplete, agile and devops, and there were always crusty hold-outs reminiscing about the good old days before X came along and ruined everything.

But all of those things were consciously built, deterministic and transparent tools. LLMs/AI are something fundamentally different.

More that LoC is a simple metric that has always been a problem.

Non-Functional requirements is a vestigial term from ‘function point analysis’ which is from the late 70s, and which also ended up being a proxy for LoC.

The entire industry is so focused on measuring right now, and incentives are so skewed toward the short term, that lagging indicators like maintainability are a non-starter in many organizations. That will make it challenging to fix this time.

This is already changing again now that CEOs have wised up to the fact that they're paying for code by the line but these lines don't translate to profit.

Not enough people read The Goal.

Ugh. Just imagine the following on a normal curve:

Pre-AI: The goal is to make more money.

With-AI: The goal is to ship more code.

Post-AI: The goal is to make more money.

Can't wait to see how we get there...

It seems to naturally follow that a company that sells lines of code would want to measure success in lines of code.

What happened was the audience changed. Before, the audience for things about writing code was mostly software developers. Now it's the employing class. The collective wisdom of engineers? It has little impact on the conversation because the audience doesn't care. In fact, it's more than indifference; many are eager to no longer be "burdened" by it.

Not only do they not want to pay our salaries, which is an expense, they're eager not to have to depend on our expertise or judgement as well. That judgement and expertise is a locus of control that resides outside their own hierarchy.

I think a better metric these days is what percentage of code is not reviewed / understood by humans. That is the real bottleneck. Until we can stop looking at the code, AI barely matters - you are just trading quality for quantity.

That's why it is so amazing for speed runs and prototypes. Here it is legitimately > 10X faster.

We're still in the FA phase of FAFO when it comes to LLM code generation, aren't we?

The paradigm used to be: create good enough abstractions that you can express what you need in a few dozen lines or whatever. Those lines will be clearer and more precise than English for describing what the code does.

I wonder if we'll ever get back to that? If it's still relevant?

It is pretty funny how this whole industry, in a very short amount of time and with tons of experience and knowledge to lean on, reverted to dubious measurements of productivity. If you track LoC and tokens used as productivity measurements, developers are going to max their LoC and token usage! It's so predictable that we have a Law named after this phenomenon! The fallout was so predictable I feel like I should have been positioning myself for all of the potential consulting work that's about to be needed.

> When a company says “AI made everyone more productive, so we need fewer people”, I want to see the evidence - and I don’t believe it exists today.

I think these companies doing "AI layoffs" do actually see improvement, though it is a placebo and not caused by the AI usage. Haven't we known for a long time already that leaner software teams perform more efficiently?

> don’t read any of this as anti-AI

I am not afraid to say I am anti-AI. It surfaces a rot in this industry: marketing ideals and anecdotes have more impact than measured performance, and many people still find it very hard to estimate a developer's performance and impact.

AI is shit, doesn't speed up my work. Only 10-20% of programming is typing, and AI can do that fast, but it doesn't speed up the whole process. If you disagree, show me a proper study where actual improvement is measured.
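To put rough numbers on that argument, here is a minimal sketch using Amdahl's law, which is my framing rather than something the comment names. The 10-20% typing share comes from the comment; the speedup factors and the function name `overall_speedup` are hypothetical, picked only for illustration.

```python
def overall_speedup(fraction_accelerated: float, factor: float) -> float:
    """Amdahl's law: whole-process speedup when only part of the work gets faster.

    fraction_accelerated: share of total time spent on the accelerated part (0..1).
    factor: how many times faster that part becomes (must be > 0).
    """
    if not 0.0 <= fraction_accelerated <= 1.0:
        raise ValueError("fraction_accelerated must be between 0 and 1")
    if factor <= 0:
        raise ValueError("factor must be positive")
    remaining = 1.0 - fraction_accelerated          # work that is not sped up
    accelerated = fraction_accelerated / factor     # work that is sped up
    return 1.0 / (remaining + accelerated)


if __name__ == "__main__":
    # Typing shares taken from the comment; factors are hypothetical.
    for share in (0.10, 0.20):
        for factor in (2, 10, 1000):
            gain = overall_speedup(share, factor)
            print(f"typing share {share:.0%}, typing {factor}x faster "
                  f"-> {gain:.2f}x overall")
```

Under those assumptions, even near-instant typing caps the overall gain at 1 / (1 - share), so roughly 1.11x for a 10% share and 1.25x for a 20% share.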

You could probably get a cheaper and more consistent improvement by making sure the developers are properly trained in the IDEs and environments they are already using. For example, give everyone a Unix programming course and a course in their preferred IDE.

I hope the shortcut from A to B will be faster than ever in 2026! Thanks, all.

> Measuring programming progress by lines of code is like measuring aircraft building progress by weight.

https://www.goodreads.com/quotes/536587-measuring-programmin...

If developers burn through thousands in AI tokens a day, does it really matter, and is it a good spend? Are the outputs actually checked for sanity, fitness, qa/qc, security etc. How much rework is coming out because of lack of validation, or too much automation in the soup.

The more I read, the more I feel that 1 dev plus 1 AI agent, with the dev as a gatekeeper, is probably the most appropriate workflow. You treat the single dev + AI as a team in terms of planning and cost analysis. The dev now needs to take on the PM and QA roles too, and you get about 1.2-1.3x the throughput of a traditional team of 3-5 devs with partial PM and partial QA.

The output should include more/better testing, examples, demos etc... since the bus factor is now 1, but AI is expected to be able to do the heavy lift.

Not a better publicist, but:

A) a newly-receptive audience - engineers who have discovered that they very much enjoy and appreciate the tradeoff of proximity to the code for amplified velocity and impact, now that it's possible to achieve without being a manager of messy human teams.

B) an ecosystem in which it's grown nearly impossible to connect a functional description of something to how much bespoke construction and effort was involved, partially because of marketing and partially because of how much software already exists to be built on top of. It's impossible to tell from a few paragraphs of functional description whether something was built in a weekend or took a team 4 years to ship, so volume of code is the natural fallback for describing complexity.

When I read recent news on HN, I feel it is a fable about Goodhart's Law. The law says: 'When a measure becomes a target, it ceases to be a good measure.' The dog should wag its tail. But the tail is wagging the dog.

LoC by itself is useless and so is AI LoC; it doesn't really show anything on its own.

But if you pair AI LoC over a time range with tasks completed over the same range, and then compare that with historical data over a similar range without AI, then you have something tangible.

You also need to look at defect reports to understand the full picture of whether AI is being helpful.

So, we do need to measure AI LoC and AI PR counts, but we also need to make sure we are using other metrics to help paint the full picture.
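A rough sketch of what that pairing could look like, assuming you can pull lines changed, tasks completed, and defect reports for two comparable periods. The `PeriodStats` type, the `compare` helper, and every number below are hypothetical, made up purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PeriodStats:
    """Raw counts for one reporting period (all values here are hypothetical)."""
    label: str
    lines_changed: int
    tasks_completed: int
    defects_reported: int

    def __post_init__(self) -> None:
        if self.tasks_completed <= 0:
            raise ValueError("need at least one completed task to compute ratios")

    @property
    def loc_per_task(self) -> float:
        return self.lines_changed / self.tasks_completed

    @property
    def defects_per_task(self) -> float:
        return self.defects_reported / self.tasks_completed


def compare(baseline: PeriodStats, with_ai: PeriodStats) -> dict[str, float]:
    """Ratios of the AI period to the baseline (1.0 means no change).

    Assumes the baseline had at least one defect and one changed line,
    otherwise the ratios are undefined.
    """
    return {
        "tasks_ratio": with_ai.tasks_completed / baseline.tasks_completed,
        "loc_per_task_ratio": with_ai.loc_per_task / baseline.loc_per_task,
        "defects_per_task_ratio": with_ai.defects_per_task / baseline.defects_per_task,
    }


if __name__ == "__main__":
    # Invented numbers: three times the code, slightly more tasks, more defects.
    baseline = PeriodStats("Q1, no AI", 40_000, 100, 20)
    with_ai = PeriodStats("Q2, with AI", 120_000, 110, 33)
    for name, ratio in compare(baseline, with_ai).items():
        print(f"{name}: {ratio:.2f}")
```

In this made-up case the LoC number alone looks like a 3x win, while the paired view shows tasks up only about 10% and defects per task up 50%.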

Large enterprise-class systems are notoriously difficult to work on with AI. It’s the context window limitation. Assuming 10 tokens per LoC, the best models today cannot wrangle 100k distributed LOC across multiple repos. It’s great for building new and maintaining smallish codebases. All this code being written is fantastic, but maintaining it efficiently over the code lifecycle is tricky.
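The arithmetic behind that claim, as a back-of-the-envelope sketch. The 10 tokens per LoC and the 100k LOC come from the comment; the 200,000-token window is a hypothetical figure chosen only to make the comparison concrete, not a claim about any particular model.

```python
TOKENS_PER_LOC = 10  # the comment's assumption, not a measured value


def estimated_tokens(lines_of_code: int, tokens_per_loc: int = TOKENS_PER_LOC) -> int:
    """Rough token count for a codebase of the given size."""
    return lines_of_code * tokens_per_loc


def fits_in_context(lines_of_code: int, context_window_tokens: int,
                    tokens_per_loc: int = TOKENS_PER_LOC) -> bool:
    """True if the whole codebase would fit in one context window."""
    return estimated_tokens(lines_of_code, tokens_per_loc) <= context_window_tokens


if __name__ == "__main__":
    hypothetical_window = 200_000  # assumed window size, for illustration only
    for loc in (5_000, 20_000, 100_000):
        tokens = estimated_tokens(loc)
        verdict = "fits" if fits_in_context(loc, hypothetical_window) else "does not fit"
        print(f"{loc:>7} LoC ~ {tokens:>9} tokens: {verdict}")
```

At 10 tokens per line, 100k LOC is about a million tokens before any prompt, tool output, or conversation history is counted.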

Converting the production database to Prolog to ship LOC.

My old CTO has a spiritual metric that always resonated with me: Revenue / Lines of Code. The higher the number the better.

>The difference this time is pace: you could delay adopting “the cloud” for a couple of years and survive. With AI you might get a few months.

I don't think so. Take a good company A (with a good product and a good pace of good features) of today. Take the extreme case they decide not to use AI at all. Well, they will still be shipping good features at their current pace.

No amount of AI will make a bad company ship a better product than A's. If anything, bad/mediocre companies will be pushing crap faster than they did before, but that's it.

AI can make good companies better, but cannot make bad companies good. Why does company A need to worry about shitty companies using AI? Sure, other good competitors could be using AI, but all in all, shipping "faster" is not the "mark" of good quality.

I too made a post like this on here not that long ago. The point I was trying to make was to be AI-knowledge-first. The tools are here and someone has spent lots of money (foolishly, yes! But) creating these tools, so why not use them to your advantage? The ones that wait too long find themselves playing catch-up on the next big thing that comes of it. This is a phase that will blow over, and in that process there is something cooking. Think of it as the stepping stone to the next era. The future isn't created on its own; we must push for it. Hope doesn't do much in creating innovative advancements.

Before the smartphones we have today, there were touch-pad LCD products all fighting to be what we now call smartphones, and they came with heavy innovative techniques to achieve a goal. The goal was neither evident nor clear at the time.

This will lead to something else that is far more useful and could be harmful in many ways, not just to employment. Economies are changing like never before; transformational forces are changing our lives as we speak: housing prices, wages, technology, political power, warfare, ideologies, the list goes on. AI is the starting point in this new era. As for silly phrases like "this is not how we used to do things": no shit, we also used to ride horses and sacrifice virgins for it to rain. I don't know when people will ever get that the world is ever evolving; that is what humans do.

Having a say and some control over what is harmful and what is helpful is just as important.

Reporting on the percentage of AI-generated lines of code is very different from reporting total lines of code. Yes, I know both of them miss the value delivered, but the latter assumes the value is the number of lines, while the former assumes value is at least the same but delivered faster.

So what has actually shipped? I'm already using many more AI-coded projects in my daily life than I was a few months ago.

It's dying down now because unlike before, companies are paying out the ass for those tokens, where many companies now have token budgets enforced rather than the previous tokenmaxxing.

> Call it AI-first, AI-proficient, whatever you like

Can we just call it an AI assistant, since that is really what it is? Just call a spade a spade and call it a day.

Nvidia boss Jensen Huang referred to AI as a teammate in his recent COMPUTEX presentation, but it's disingenuous to call it that since it's just a tool, albeit a very potent one. He's obviously biased to a fault, and he's literally banking his company on AI now, but for the rest of us "AI assistant" should do more than fine.

Calling it teammate, workmate or friend is also rather childish. It's like having an imaginary friend that can lead young people to do silly things and this risk probably can be extended to junior developers [1],[2].

[1] Chatbots Can Be Dangerous For Kids:

https://www.psychiatrictimes.com/view/chatbots-can-be-danger...

[2] Why AI companions and young people can make for a dangerous mix:

https://med.stanford.edu/news/insights/2025/08/ai-chatbots-k...

Confusing skeptic and sceptic will never not be funny to me (edit: I now live in shame)

I agree with the main point but this misses something crucial:

> why wouldn’t you use it to deliver more value to your customers, faster? That should show up as MAU, conversion, revenue

Most roadmaps are full of garbage and would be better off being deleted. You get very few truly useful new features in a year.

To paraphrase ESR: the value to your customers is in them being able to know that they can rely on your product still operating next year, not in those 20 new features.

Or to think about it another way, maybe Block will be better off with fewer developers, but only if they produce sufficiently FEWER features so that they’re forced to prioritize.

> But! Hold my beer… in February 2026 METR effectively walked it back : their follow-up estimates flipped to a speedup (with error bars wide enough to ride a Moto Guzzi, with panniers, through!), and they abandoned the study design entirely - because developers now refuse to work without AI, and can’t reliably self-report time on agentic work. Their latest position: AI probably speeds developers up in 2026, and we can no longer cleanly measure by how much.

This may be true, but they followed in May with this [0]:

> Importantly, survey results are not necessarily grounded in reality. There are reasons to be skeptical of people’s responses to counterfactual questions such as about AI’s effect on productivity — for instance, our study in early 2025 found that people overestimated AI’s effect on their time spent on tasks by 40 percentage points on average.

[0] https://metr.org/blog/2026-05-11-ai-usage-survey/#productivi...

When will performance or lack of bugs become a metric again?

It’s worth looking at sectors where LLM code generation hasn’t been very visible, such as certification-accredited flight-control, braking, train-control, medical, or nuclear-control source code involving real-time embedded operating systems. This sector relies on assurance: deterministic scheduling requirements, detailed commit traceability, tool qualification, configuration management, independent verification, etc.

Since this is an area where failure can lead not to Instagram accounts getting hacked, but planes falling out of the sky and nuclear reactors spewing radioactive elements, it’s worth a close look. Some of the most visible companies in this sector include: QNX, Wind River, SYSGO, Lynx, Green Hills, Siemens Embedded, etc. None of them seem to have much if any adoption of LLMs for source code generation based on public statements.

Research in this area agrees with this view:

“In this paper, I have conducted a comparative analysis of the C++ code generated by popular LLMs including: OpenAI ChatGPT, Google Gemini, DeepSeek, Meta AI, and Microsoft Copilot for compliance with MISRA C++. The study revealed that none of the evaluated LLMs generated MISRA-compliant code despite clear prompts, with DeepSeek showing the fewest violations and Meta AI the most.”

https://arxiv.org/abs/2506.23535

Many mid size to large companies are hilariously inefficient and the executives have no idea how work actually gets done. This means you can (in theory) fire a decent percentage of your workforce without affecting your output. You can then claim they’ve been replaced by AI without anyone ever challenging that assertion. I’m not making a pro or anti AI claim here, just saying that you won’t know whether you were wrong about it until/unless things start to go really badly.

I don't get how, if productivity is barely moving, a decrease in comprehension will improve anything.

beautifully put, AI is changing the way we make software but the way we measure productivity is still in our hands. L'chaim!

So, on what basis will the company be evaluating the students?

> But adoption is the starting line, not the scoreboard.

Yes yes, shout it from the rooftops! Over the next few years I think we're going to see that companies that get this point will keep doing meaningful things, and stand a chance of weathering this transition period.

I think we're going to see a bunch of companies that went all in on AI for AI's sake go under because they've lost their mission, lost their implementation, and won't have a way to get those back in a reasonable timeframe and at a reasonable cost.

Writing. Code. Is. No. Longer. The. Bottleneck.

Deciding what to build. Reviewing Code. And testing code. Are the new bottleneck.

So of course we don't see massive productivity gains. Because these parts of the SDLC were always bottlenecked, but their capacity matched the throughput. We fired all the dedicated QAs years ago. Sr+ engineers that do all the code review are limited.

Teams have not re-organized to match the new code-input velocity.
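The bottleneck argument can be sketched as a toy pipeline where end-to-end throughput is capped by the slowest stage. The stage names follow the comment; the capacities (changes per week) and the `pipeline_throughput` helper are hypothetical, made up for illustration.

```python
def pipeline_throughput(capacities: dict[str, float]) -> tuple[str, float]:
    """Return (bottleneck stage, throughput) for a serial pipeline.

    Throughput of stages run in series is the minimum stage capacity.
    On ties, the first stage in insertion order is reported.
    """
    if not capacities:
        raise ValueError("need at least one stage")
    bottleneck = min(capacities, key=lambda stage: capacities[stage])
    return bottleneck, capacities[bottleneck]


if __name__ == "__main__":
    # Hypothetical capacities in changes per week.
    before = {"decide what to build": 12, "write code": 10,
              "review code": 12, "test code": 12}
    # Same team, but writing code is now assumed to be 10x faster.
    after = {**before, "write code": 100}

    for label, stages in (("before", before), ("after", after)):
        stage, rate = pipeline_throughput(stages)
        print(f"{label}: bottleneck is '{stage}' at {rate} changes/week")
```

With these invented numbers a 10x faster writing stage only lifts the whole pipeline from 10 to 12 changes per week, because the other stages were sized to match the old writing speed.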

Engineers don't want to do QA because it's "beneath them", and most engineers either don't like performing extensive, high-quality code review or are not Sr enough to do it.