> 500k lines of code Isn't it a simple REPL with some tools and integrations, wr...

samusiam · 2026-04-01T12:34:10 1775046850

I just checked competitors' codebases:

- Opencode (anomalyco/opencode) is about 670k LOC

- Codex (openai/codex) is about 720k LOC

- Gemini (google-gemini/gemini-cli) is about 570k LOC

Claude Code's 500k LOC doesn't seem out of the ordinary.

lelanthran · 2026-04-01T13:45:22 1775051122

> Claude Code's 500k LOC doesn't seem out of the ordinary.

Aren't all the other products also vibe-coded? "All vibe-coded products look like this" doesn't really seem to answer the question "Why is it so damn large?"

It's a repl, that calls out to a blackbox/endpoint for data, and does basic parsing and matching of state with specific actions.

I feel the bulk of those lines should be actions that are performed. Either this is correct or this is not:

1. If the bulk of those lines implement specific and simple actions, why is it so large compared to other software that implements single actions (coreutils, etc)

2. If the actions constitute only a small part of the codebase, wtf is the rest of it doing?

samusiam · 2026-04-01T13:55:34 1775051734

You're complaining about vibe coding while also complaining about how you "feel" about the code. Do you see the irony in that?

lelanthran · 2026-04-01T14:00:59 1775052059

>> I feel the bulk of those lines should be actions that are performed. Either this is correct or this is not:

> You're complaining about vibe coding while also complaining about how you "feel" about the code. Do you see the irony in that?

Where did I complain about how I feel about the actual code? I have feelings, negative ones, about the size of the code given the simple functionality it has, but I have no feelings on the code because I did not look at the code.

arandomhuman · 2026-04-01T14:17:50 1775053070

Are you ESL by any chance? You’re missing the forest for the trees.

johnisgood · 2026-04-01T12:55:14 1775048114

All of them are really, REALLY bad.

surajrmal · 2026-04-01T13:32:59 1775050379

Bad by whose definition? They work really well in my experience. They aren't perfect but the amount of hand holding has gone down dramatically and you can fix any glaring problems with a code review at the end. I work on a multimillion line code base which does not use any popular frameworks and it does a great job. I may be benefiting from the fact that the codebase is open source and all models have obviously been trained on it.

bdhtu · 2026-04-01T14:30:23 1775053823

It takes 10 seconds for Gemini CLI to load. 10 seconds to show an input field. This is for a CLI program.

For comparison, it takes me less time to load Chrome and go to gemini.google.com.

causal · 2026-04-01T14:40:42 1775054442

> They work really well in my experience.

Yeahhh strong disagree there, I find Codex and CC to be buggy as hell. Desktop CC is very bad and web version is nigh unusable.

oblio · 2026-04-01T13:45:08 1775051108

At least Gemini and Claude constantly break down with scrolling in various Linux terminals, something which was solved by countless TUIs decades ago.

I think a lot of the people prasing Claude & co are on Macs.

johnisgood · 2026-04-01T13:46:13 1775051173

Most of their issues have been solved a long time ago, with 1000x less code. It is depressing at this point. I really had no clue IT was in the shitters this much. I knew it was theatrical but I had no idea that it was by this much.

geodel · 2026-04-01T14:53:32 1775055212

All these AI tools teams have most valid excuse "We are just a bunch of people who only know Javascript/typescript/NodeJS. Please bear with us while we resolve 10,000 open issues."

samusiam · 2026-04-01T13:53:32 1775051612

I haven't seen the scrolling glitch in months, where previously it was happening multiple times a day. Also haven't seen anyone complain about it in quite some time. Pretty sure they have resolved that.

msully4321 · 2026-04-01T15:27:37 1775057257

They have not! If I am scrolled up while more output is produced, the scrollback jumps to the top pretty consistently.

oblio · 2026-04-01T16:55:30 1775062530

I'll try again but lately I've been using strictly the VS Code terminal. Gnome Terminal and Termux in Ubuntu 24.04 were unusable even with 1000 hacks.

causal · 2026-04-01T14:41:40 1775054500

I'm on a mac! And I still find bugs on a regular basis...

spiderfarmer · 2026-04-01T10:37:22 1775039842

I don't know if you're mindlessly repeating the HN trope that JS/typescript/Electron is bad and that all bloat can easily prevented, but if you're truly interested in answers to your questions: RTFA.

carterschonwald · 2026-04-01T10:58:28 1775041108

yeah its honestly full of vibe fixes to vibe hacks with no overarching desig. . some great little empirical observations though!i think the only clever bit relative to my own designs is just tracking time since last cache ht to check ttl. idk why i hadnt thought of that, but makes perfect sense

ale · 2026-04-01T14:45:57 1775054757

There’s probably a subconscious incentive to make a tool that’s “complex” because the underlying LLM also is complex.

fragmede · 2026-04-01T10:34:46 1775039686

How many LoC should it be, for that kind of program?

forgotpwd16 · 2026-04-01T11:29:36 1775042976

Other notable agents' LOC: Codex (Rust) ~519K, Gemini (TS) ~445K, OpenCode (TS) ~254K, Pi (TS) ~113K LOC. Pi's modular structure makes it simple to see where most of code is. Respectively core, unified API, coding agent CLI, TUI have ~3K, ~35K, ~60K, ~15K LOC. Interestingly, the just uploaded claw-code's Rust version is currently at only 28K.

edit: Claude is actually (TS) 395K. So Gemini is more bloat. Codex is arguable since is written in lower-level language.

ernst_klim · 2026-04-01T11:58:06 1775044686

Well FFmpeg is roughly 1500k, but it's C+Asm and it's dozens of codecs and pretty complex features. SBCL is around 500k I guess.

I'm not saying that this is necessarily too much, I'm genuinely asking if this is a bloat or if it's justified.

troupo · 2026-04-01T10:53:37 1775040817

It's a TUI API wrapper with a few commands bolted on.

I doubt it needs to be more than 20-50kloc.

You can create a full 3D game with a custom 3D engine in 500k lines. What the hell is Claude Code doing?

neurostimulant · 2026-04-01T11:49:27 1775044167

Just check the leaked code yourself. Two biggest areas seem to be the `utils` module, which is a kitchen sink that covers a lot of functionality from sandboxing, git support, sessions, etc, and `components` module, which contains the react ui. You could certainly build a cli agent with much smaller codebase, with leaner ui code without react, but probably not with this truckload of functionality.

cogman10 · 2026-04-01T14:07:47 1775052467

They are doing some strange "reinvent the wheel" stuff.

For example, I found an implementation of a PRNG, mulberry32 [1], in one of the files. That's pretty strange considering TS and Javascript have decent PRNGs built into the language and this thing is being used as literally just a shuffle.

[1] https://github.com/AprilNEA/claude-code-source/blob/main/src...

hombre_fatal · 2026-04-01T16:04:42 1775059482

mulberry32 is one of the smallest seedable prngs. Math.random() is not seedable.

If you search mulberry32 in the code, you'll see they use it for a deterministic random. They use your user ID to always pick the same random buddy. Just like you might use someone's user ID to always generate the same random avatar.

So that's 10 lines of code accounted for. Any other examples?

neurostimulant · 2026-04-01T14:42:25 1775054545

Well, at least that confirms they weren't lying when they said all recent updates to claude code were made by claude. You certainly won't do this stuff if you were writing the code yourself.

hombre_fatal · 2026-04-01T11:55:14 1775044514

Software doesn’t end at the 20k loc proof of concept though.

What every developer learns during their “psh i could build that” weekendware attempt is that there is infinite polish to be had, and that their 20k loc PoC was <1% of the work.

That said, doesn't TFA show you what they use their loc for?

mpalmer · 2026-04-01T13:01:46 1775048506

Check out `print.ts` to see how "more LOC" doesn't mean "more polished"

hombre_fatal · 2026-04-01T13:07:29 1775048849

Okay, I'm looking at it. Now what?

This file is exactly what I'm talking about.

Take the loadInitialMessage function: It's encumbered with real world incremental requirements. You can see exactly the bolted-on conditionals where they added features like --teleport, --fork-session, etc.

The runHeadlessStreaming function is a more extreme version of that where a bunch of incremental, lateral subsystems are wired together, not an example of superfluous loc.

mpalmer · 2026-04-01T17:09:59 1775063399

The file is more than 5000 lines of code. The main function is 3000. Code comments make reference to (and depend on guarantees in connection with) the specific behavior of code in other files. Do I need to explain why that's bad?

hombre_fatal · 2026-04-01T17:22:37 1775064157

By real-world polish, I don't mean refining the code quality but rather everything that exists in the delta between proof of concept vs real world solution with actual users.

You don't have to explain why there might be better ways to write some code because the claim is about lines of code. It could be the case that perfectly organizing and abstracting the code would result in even more loc.

sarchertech · 2026-04-01T12:28:37 1775046517

I think that’s why the author was comparing to to a finished 3D game.

hombre_fatal · 2026-04-01T12:43:51 1775047431

I guess because you see 3D stuff in a 3D game instead of text, people assume that it must be the most complex thing in software? Or because you solve hard math problems in 3D, those functions are gonna be the most loc?

It's a completely different domain, e.g. very different integration surface area and abstractions.

Claude Code's source is dumped online so there's probably a more concrete analysis to be had than "that sounds like too many loc".

sarchertech · 2026-04-01T12:50:39 1775047839

It is a different domain but that wasn’t your argument. Your argument was that someone was comparing it to a POC when in fact they were comparing to a finished product.

Also a AAA game (with the engine) with physics, networking, and rendering code is up there in terms of the most complex pieces of software.

hombre_fatal · 2026-04-01T12:57:35 1775048255

They just claimed that you can build a 3D game in 500k loc, thus Claude Code shouldn't use so many loc. They/you didn't render the argument for that.

For example, without looking at the code, the superstition also works in the opposite direction: Claude Code is an interface to using AI to do any computer task while a 3D game just lets you shoot some bad guys, so surely the 3D game must be done in fewer loc. That's equally unsatisfying.

You'd have to be more concrete than "sounds like a lot".

troupo · 2026-04-01T13:47:36 1775051256

> Claude Code is an interface to using AI to do any computer task

Claude Code is quite literally a wrapper around a few APIs. At one point it needed 68GB of RAM to run and requires 11ms to "lay a scene graph" to display a few hundred characters on screen. All links here: https://news.ycombinator.com/item?id=47598488

> while a 3D game just lets you shoot some bad guys, so surely the 3D game must be done in fewer loc.

Yes, most games should be done in fewer loc

anthk · 2026-04-01T15:42:16 1775058136

I could run a text adventure with a Zmachine emulator under a 6502 based machine and 48k of RAM, with Ozmoo you can play games like Tristam Island. On a Commodore 64, or an Apple II for you US commenters. I repeat the game it's being emulated in a simple computer with barely more processing power than a current keyboard controller.

As the ZMachine interpreter (V3 games at least, enough for the mentioned example), even a Game Boy used to play Pokemon Red/Blue -and Crystal/Sylver/Blue, just slightly better specs than the OG GB- can run Tristam Island with keypad based input picking both selected words from the text or letter by letter as when you name a character in an RPG. A damn Game Boy, a pocket console from 1989. Not straightly running a game, again. Emulating a simple text computer -the virtual machine- to play it. No slowdowns, no-nothing, and you can save the game (the interpreter status) in a battery backed cartridge, such as the Everdrive. Everything under... 128k.

Claude Code and the rest of 'examples' it's what happens when trade programmers call themselves 'engineers' without even a CS degree.

hombre_fatal · 2026-04-01T16:14:41 1775060081

Your claim was that they could implement the same app in 50k lines of code.

A cursory glance at the codebase shows that it's not just a wrapper around a few APIs.

mpalmer · 2026-04-01T17:14:41 1775063681

Yes, because they've vibed it into phenomenally unnecessary complexity. The mistake you continually make in this thread is to look at complexity and see something that is de facto praiseworthy and impressive. It is not.

lelanthran · 2026-04-01T13:48:34 1775051314

> Claude Code is an interface to using AI to do any computer task

Shouldn't interfaces be smaller than the implementation?

hombre_fatal · 2026-04-01T16:08:45 1775059725

No. We aren't talking about .h vs .c files nor PL interfaces.

A GUI/client can be arbitrarily more or less complex than the things it's GUI'ing.

lelanthran · 2026-04-02T10:29:16 1775125756

> A GUI/client can be arbitrarily more or less complex than the things it's GUI'ing.

If it's an interface to ffmpeg, then sure, the GUI could be extremely complicated code.

But that's not what we are talking about, is it? We are talking about an interface to a chatbot that can accept and return chats, accept and return files, and run a selection of internal commands (which include invoking itself recursively).

The interface to this chatbot that has a settings entry for "personality" is still only going to map that to one of a small number of chatbot inputs. Same with basically anything else (read the skills file, etc).

I dunno... maybe 500kSloC for a fancy IRC client is the going rate, but the last time I wrote an interface to a chat client, it was barely 10k lines, not counting the lib*.so that the the program called to interact with the chatbot, with said chatbot supporting file uploads and '/' commands.

fragmede · 2026-04-02T13:45:17 1775137517

Did your IRC client have a sandbox that let other users run commands on your box? I don't think there's enough LoC in the world before I'd let that happen!

spiderfarmer · 2026-04-01T11:07:07 1775041627

Comments like these remind me of the football spectators that shout "Even I could have scored that one" when they see a failed attempt.

Sure. You could have. But you're not the one playing football in the Champions League.

There were many roads that could have gotten you to the Champions League. But now you're in no position to judge the people who got there in the end and how they did it.

Or you can, but whatever.

boomskats · 2026-04-01T11:15:35 1775042135

I don't think this is warranted given that the comment you're criticising is simply expressing an opinion explicitly solicited by the comment it's responding to.

sarchertech · 2026-04-01T12:31:18 1775046678

It’s more like “Player A is better than Player B” coming from a professional player in a smaller league who is certainly qualified to have that opinion.

troupo · 2026-04-01T11:57:09 1775044629

> Sure. You could have. But you're not the one playing football in the Champions League.

The only reason people are using Claude Code is because it's the only way to use their (heavily subsidized) subscription plans. People who are okay with using and paying for their APIs often opt out for other, better, tools.

Also, analogies don't work. As we know for a fact that Claude Code is a bloated mess that these "champions league-level engineers" can't fix. They literally talk about it themselves: https://news.ycombinator.com/item?id=47598488 (they had to bring in actual Champions League engineers from bun to fix some of their mess).

spiderfarmer · 2026-04-01T14:38:55 1775054335

"Even I would have scored that goal" == "I would never ever have created a bloated mess like Anthropic"

You just repeat the same statement.

That bloated mess is what got them to the Champions League. They did what was necessary to get them here. And they succeeded so far.

But hey, according to some it can be replicated in 50k lines of wrapper code around a terminal command, so for Anthropic it's just one afternoon of vibe coding to get rid of this mess. So what's the problem? /s

troupo · 2026-04-01T15:01:34 1775055694

> Even I would have scored that goal" == "I would never ever have created a bloated mess like Anthropic"

Since you keep putting words in my mouth that I never said, and keep being deliberately obtuse, this particular branch is over.

Go enjoy Win11 written by same level of champions or something.

Adieu.

boomskats · 2026-04-02T01:20:16 1775092816

Ah, Winning Eleven.

Not what you were referring to.

vardalab · 2026-04-01T14:48:47 1775054927

Yes, exactly. I like this analogy. I am surprised the level of pearl clutching in these discussions on Hacker News. Everybody wants to be an attention sharecropper, lol.

criley2 · 2026-04-01T11:08:17 1775041697

Honest question: Why does it matter? They got the product shipped and got millions of paying customers and totally revolutionized their business and our industry.

Engineers using LOC as a measure of quality is the inverse of managers using LOC as a measure of productivity.

dandellion · 2026-04-01T11:27:23 1775042843

More code means more entropy, more room for bugs, harder to find issues, more time to fix, more attack surface, more memory used, more duplication, more inconsistencies... I bet you at some point we'll get someone reporting how AI performance deteriorates as the code base grows, and some blog post about how their team improved the success of their AI by trimming the code base down to less than 100k LOC or something like that.

The principles of good software don't suddenly vanish just because now it's a machine writing the code instead of a human, they still have to deal with the issues humans have for more than half a century. The history of programming is new developers coming up with a new paradigm, then rediscovering all the issues that the previous generation had figured out before them.

criley2 · 2026-04-01T11:33:10 1775043190

The history of programming is also each generation writing far less performant code than the one before it. The history of programming is each generation bemoaning the abstractions, waste and lack of performance of the code of the next generation.

It turns out that there is a tradeoff in code between velocity and quality that smart businesses consider relative to hardware cost/quality. The businesses that are outcompeting others are rarely those who have the highest quality code, but rather those that are shipping quickly at a quality level that is satisfactory for current hardware.

sarchertech · 2026-04-01T11:39:47 1775043587

> far less performant code than the one before it.

That worked because of rapid advancements in CPU performance. We’ve left that era.

It’s about more than performance. Code is and always has been a liability. Even with agents, you start seeing massive slowdowns with code base size.

It’s why I can nearly one shot a simple game for my kid in 20 minutes with Claude, but using it at work on our massive legacy codebase is only marginally faster than doing it by hand.

dandellion · 2026-04-01T12:20:12 1775046012

You asked why the size of the code matters, I gave you the answer. If you want to ramble about the non technical aspects of software development talk to someone else, I'm not interested.

criley2 · 2026-04-01T12:34:17 1775046857

I asked a rhetorical question to get the reader to think about a topic. I was not looking for a rote recitation of a well-known textbook answer. Maybe you should not be on the comment section of an engineering website if you find discussion so offensive.

dandellion · 2026-04-03T14:10:35 1775225435

I'm pretty sure I can post however I want, so I'll be ignoring your suggestion. Also I didn't say I find it offensive, just that I'm not interested.

raincole · 2026-04-01T11:18:06 1775042286

It doesn't. LoC is only meaningful when you use it to belittle others' code.

ulbu · 2026-04-01T11:43:44 1775043824

hehe, belittle (to make smaller)

sarchertech · 2026-04-01T11:34:47 1775043287

The reason it’s not useful as a measure of productivity is because it’s measure of complexity (not directly, but it’s correlated). But it tells you nothing about whether that complexity was necessary for the functionality it provides.

But given that we know the functionality of Claude Code, we can guess how much complexity should be required. We could also be wrong.

>Why does it matter?

If there’s massively more code than there needs to be that does matter to the end user because it’s harder to maintain and has more surface area for bugs and security problems. Even with agents.

GuB-42 · 2026-04-01T13:35:39 1775050539

Among the hundreds of thousands of lines of code that Anthropic produced was one that leaked the source code. It is likely to be a config file, not part of the Claude Code software itself, but it still something to track.

The more lines of code you have the more likely there is for one of them to be wrong and go unnoticed. It results in bugs, vulnerabilities,... and leaks.

viktorcode · 2026-04-01T12:44:26 1775047466

More bugs. More costly maintenance.

blantonl · 2026-04-01T11:40:31 1775043631

Exactly. Imagine if Claude Code was a PHP script. Some folks would lose their damn minds

troupo · 2026-04-01T11:29:23 1775042963

> Honest question: Why does it matter?

Because it's unmaintainable slop that they themselves don't know how to fix when something happens? https://news.ycombinator.com/item?id=47598488

sumtechguy · 2026-04-01T13:06:20 1775048780

It will be exactly that. But that is a 'them' problem. I can look at it a go 'that looks like a bad idea' but they are the ones who have to live with it.

At some point someone will probably take their LLM code and repoint it at the LLM and say 'hey lets refactor this so it uses less code is easier to read but does the same thing' and let it chrun.

One project I worked on I saw one engineer delete 20k lines of code one day. He replaced it with a few lines of stored procedure. That 20k lines of code was in production for years. No one wanted to do anything with it but it was a crucial part of the way the thing worked. It just takes someone going 'hey this isnt right' and sit down and fix it.

troupo · 2026-04-01T14:04:31 1775052271

> But that is a 'them' problem. I

When a TUI requires 68 GB of RAM to run, or when they spend a week not being able to find a bug that causes multiple people to immediately run out of tokens, it's not a "them" problem.