LE: Someone said this is how the tiers are now counted:
"Essentially if old plus is 1x then new limits are: Plus - 0.3x Pro $100 - 1.5x Pro $200 - 6x (unchanged)"
5x=$100 20x=$200
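Taken together, the two quoted scalings are actually consistent with each other. A quick sketch, treating the quoted multipliers as the commenters' claims rather than official figures:

```python
# Reconciling the two quoted scalings. All multipliers are the
# commenters' claims, not official figures. Baseline: old Plus = 1.0x.
NEW_PLUS_VS_OLD_PLUS = 0.3  # claimed: new Plus is 0.3x the old Plus

# Each tier expressed relative to the *new* Plus ("5x=$100 20x=$200")
tiers_vs_new_plus = {"Plus": 1, "Pro $100": 5, "Pro $200": 20}

# Convert to old-Plus units; this reproduces
# "Plus - 0.3x, Pro $100 - 1.5x, Pro $200 - 6x"
tiers_vs_old_plus = {
    name: mult * NEW_PLUS_VS_OLD_PLUS
    for name, mult in tiers_vs_new_plus.items()
}
print(tiers_vs_old_plus)  # {'Plus': 0.3, 'Pro $100': 1.5, 'Pro $200': 6.0}
```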
>Our existing $200 Pro tier still remains our highest usage option.
And that includes usage of the API with any agent without risking being banned. OpenAI is also very supportive of open source software.
I've been using GPT-5.4 with Swival (https://swival.dev) for a while, alongside local models, and it's absolutely fantastic.
For my money, on the code side at least, GitHub Copilot on VSCode is still the most cost effective option, 10 bucks for 300 requests gets me all I need, especially when I use OpenAI models which are counted as 1x vs Opus which is 3x. I've stopped using all other tools like Claude Code etc.
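The per-request math behind that claim can be sketched as follows; the plan price and multipliers are as quoted above, so check GitHub's current Copilot pricing for real figures:

```python
# Back-of-envelope Copilot request math from the comment above.
# Plan price and multipliers are as quoted there, not verified pricing.
PLAN_PRICE = 10.00      # dollars per month
PREMIUM_REQUESTS = 300  # included premium requests per month

def effective_requests(multiplier: float) -> float:
    """Requests you actually get when each call burns `multiplier` credits."""
    return PREMIUM_REQUESTS / multiplier

def cost_per_request(multiplier: float) -> float:
    return PLAN_PRICE / effective_requests(multiplier)

# OpenAI models at 1x: 300 requests; Opus at 3x: only 100 requests
print(effective_requests(1.0), round(cost_per_request(1.0), 3))  # 300.0 0.033
print(effective_requests(3.0), round(cost_per_request(3.0), 3))  # 100.0 0.1
```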
xhigh will gather all the necessary context; low gathers the minimum necessary context.
That doesn't work as well for me with Opus. Even at max effort it'll overlook files necessary for understanding implementations. It's really annoying when you point that out and get hit with a "you're absolutely right".
Codex isn’t the greatest one shot horse in the race but, once you figure out how to harness it, it’s hard to go back to other models.
Same for the $200 plan, it's still 2x its normal usage until that date.
For me, they are just a means to an end and disposable.
The population on Hacker News is heavily skewed towards tech workers, so I wouldn't draw a conclusion from that.
I wouldn't mistake this for any kind of capability plateau. There is a massive push towards making transformers the engine of humanoid (and other kinds of) robotics, we just haven't reached the hype moment for those yet.
And everyone serious uses the API rate billing anyway.
Opus 4.6 is the L5 new hire SWE keen to prove their chops and quickly turn out totally reasonable code with putatively defensible reasons for doing it that way (that are sometimes tragically wrong) and then catch an after-work yoga class with you.
Regarding speed, I don't use xhigh that often, and surprisingly for me GPT 5.4 high is faster than Claude 4.6 Opus high (unless you enable fast mode for Opus).
Of course I still use Opus for frontend, for some small scripts, and for criticizing GPT's code style, especially in Python (getattr).
I use both OpenAI and Anthropic models, though for different purposes, what surprises me is how underrated GPT still feels (or, alternatively, how overhyped Anthropic models can be) given how capable it is in these scenarios. There also seems to be relatively little recognition of this in the broader community (like your recent YouTube video). My guess is that demand skews toward general codegen rather than the kind of deep debugging and systems work where these differences really show.
Claude on the other hand can be creative. It understands that examples are for reference purposes only. But there are times it decides to go off on a tangent on its own and not follow instructions closely. I find it useful for bouncing ideas around or testing something new.
The other thing I notice is Claude has slightly better UI design sensibilities even if you don’t give instructions. GPT on the other hand needs instructions otherwise every UI element will be so huge you need to double scroll to find buttons.
Not to mention a billion times more usage than you get with claude, dollar for dollar.
GPT doesn't know how to get creative, you need to tell it exactly what to do and what code you want it to write.
For Claude you can be more general and it will look up solutions for you outside of the scope you gave it.
I personally prefer Claude.
Personally, it seems like I have to redirect Opus/Sonnet much less often. GPT felt pretty "dense", it was more likely to ignore earlier instructions in the session, I had to remind it more often, and when I reviewed the code it produced I had to make more corrections that seemed obvious.
Entirely subjective, but I also find I prefer Claude's "personality" to ChatGPT, but I couldn't point to any specific differences.
My guess is that 5.5 will come out soon and be significantly better so you'd want to be using Codex then, but then when Opus 5 comes out probably back to claude code
Also, 5.4 has fast mode and higher usage limits, since it's cheaper.
In general I view VS Code and VS.NET Community + SQL Server free universe as the most effective option :) I think these products are great actually.
On the other hand, the benchmark of Plus usage seems to be all over the place, so it's difficult to say now how the usage compares to the old Pro.
Both Pro plans include the same core capabilities. The main difference is usage allowance: Pro $100 unlocks 5x higher usage than Plus (and 10x Codex usage vs. Plus for a limited time), while Pro $200 unlocks 20x higher usage than Plus.
From their FAQ:
20x more usage than Plus is the $200 plan.
I see this when I try to upgrade my Plus subscription.
With this new VIP membership that comes with 5x or 20x usage: spend $100 and you get 5x; spend $200 and you get 20x, and you get to spin the wheel and use the slot machines unlimited times, even at peak hours, more than most, without any restrictions, 24/7, no waiting for hours, with priority.
So spend more to get more abundance and more simultaneous spins at the wheel.
Except if you're trying to abuse the slot machines themselves, or sharing or reselling your membership to other customers who want a spin at the roulette wheel but were previously banned. [0]
[0] https://help.openai.com/en/articles/9793128-about-chatgpt-pr...
Problem is that the fuel to get this train going relies on investors money. Investors aren't going to be happy with the quote I took from your message.
And that's the real bet really, can the industry turn the spark into fire before the investor money runs out?
This myth about the inferiority of ChatGPT and Codex is becoming a meme.
I have active subscriptions to both. I am throwing at Codex all kinds of data engineering, web development and machine learning problems, have been working on non-tech tasks in the "Karpathy Obsidian Wiki" [1] style before he posted about it.
Not only does Codex crush Claude on cost, it's also significantly better at adherence and overall quality. Claude is there on my Mac, gathering dust, to the point I am thinking of not renewing the sub.
There are plenty of fellow HNers here who feel the same, from what I read in the flamewars. I suspect none of us really has a horse in this race, and many are at least half-competent (in other threads they mention doing things like embedded programming, distributed DL systems, etc.).
I'm starting to suspect a vast majority of people pushing the narrative that Claude is vastly better haven't even tried the 5.3 / 5.4 models and are doing it out of sheer tribalism.
[1] https://gist.github.com/karpathy/442a6bf555914893e9891c11519...
That's cute, but do you mean something concrete by this? That is, are there some non-coding prompts you use it for that you're referring to, or is it simply a throwaway line about L5 SWEs (at a FAANG)?
(FWIW, I find myself using ChatGPT rather than Claude for non-coding prompting, like random questions such as whether oil is fungible, for some reason.)
I do turn to Anthropic for ideation and non-tech things. But I find little reason to use it over codex for engineering tasks. Sometimes for planning, but even there, 5.4 is more critical of my questionable ideas, and will often come up with simpler ways to do things (especially when prompted), which I appreciate.
And I don't do hard-tech things! I've chosen a b2b field where I can provide competent products for a niche that is underserved and where long term relationships matter, simply because I'm not some brilliant engineer who can completely reinvent how something is done. I'm not writing kernels or complex ML stacks. So I don't really understand what everyone is building where they don't see the limits of Opus. Maybe small greenfield projects with few users.
Claude is noticeably poor for my use case on this particular issue. That said, I imagine I’m not alone in refusing to continue paying OpenAI. We’re in for a wild ride.
It's probably never been the case that a plurality of views meant anything, since online is a bubble to begin with, filtered by endless biases wherever we happen to be reading, making it an even more fringe bubble; but the advent of AI has pushed it all over the edge, to the point that perceived pluralities are just completely and utterly meaningless. Somewhat depressing for one who enjoys online chat as a pastime, but it's the reality of the world now.
Just curious as I've often heard that Claude was superior for planning/architecture work while ChatGPT was superior for actual implementation and finding bugs.
FWIW it feels like GH Copilot is a cheaper version of OpenRouter but with trade-offs like being locked into VSCode and the Microsoft ecosystem overall. I already use VSCode though and otherwise I don't see much downside to using GH Copilot outside of that.
Cancelled the plan I had with them and happily went back to just coding like normal in VSCode with occasional dips into Copilot when a need arose or for rubber ducking and planning. Feels much better as I'm in full control and not trusting the magic black box to get it right or getting fatigue from reading thousands of lines of generated code.
Anyone who says they're able to effectively review the thousands of lines that Claude might slop out in a day is lying to themselves.
Of course it is. Returns are diminishing, AGI isn't happening with current techniques but it is good enough to sell, so it's time to monetize. I just got an email from OpenAI as well about ads in their free tier (I signed up once out of curiosity).
Codex is closer to my taste, as it is at least a native app and not typescript slop. But the model is just not up to snuff.
Grok makes sense if you want something less censored that is not biased towards woke ideology.
I don't see how this matters for coding though. I only use it to give me a summary of recent news (so I don't have to actually read the bs newspapers and X posts myself).
With an honest evaluation of your own capabilities you are already far above average. Also, it's hard to see the insane amount of work that was often necessary to invent the brilliant stuff, and most people cannot shit that out consistently.
The user's conversation happens at level 0. Any actual tool use is only permitted at stack depths > 0. When the model calls the Return tool at stack depth 0 we end that logical turn of conversation and the argument to the tool is presented to the user. The user can then continue the conversation if desired with all prior top level conversation available in-scope.
It's effectively the exact same experience as ChatGPT, but each time the user types a message an entire depth-first search process kicks off that can take several minutes to complete each time.
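A toy sketch of how such a depth-first turn loop might look. Everything here is a hypothetical stand-in, not a real API: `SCRIPT` plays the model, `"tool"` mimics a tool call, and `"return"` mimics the Return tool described above.

```python
# Toy sketch of the depth-first conversation scheme described above.
# SCRIPT stands in for the model; all names are hypothetical.
SCRIPT = {
    "user question": ("tool", "look something up"),  # depth 0 spawns a sub-task
    "look something up": ("return", "tool result"),  # depth 1 returns its result
    "tool result": ("return", "final answer"),       # depth 0 ends the turn
}

def call_model(prompt: str, depth: int):
    """Assumed stub: yields ('tool', sub_prompt) or ('return', text)."""
    return SCRIPT[prompt]

def run_turn(prompt: str, depth: int = 0, max_depth: int = 4) -> str:
    """Tool calls recurse (depth-first); the Return tool pops one level.

    At depth 0, Return ends the logical turn and its argument is what
    the user sees; actual tool use only happens at depths > 0.
    """
    while True:
        kind, payload = call_model(prompt, depth)
        if kind == "tool" and depth < max_depth:
            # descend one stack level; the sub-turn's Return value
            # becomes the next prompt at this level
            prompt = run_turn(payload, depth + 1, max_depth)
        else:
            return payload

print(run_turn("user question"))  # final answer
```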
Basically the classic prisoner's dilemma: the other devs with fewer scruples can then outperform you.
It could be a valid strategy if you can increase your credibility with this relinquishment.
Do you think mentioning that Grok created CSAM is a holier-than-thou attitude? Do you not think the people who ignore that are worse?
It compensates for most of that during implementation if you make it use TDD, via superpowers et al., or by just telling it to do so.
GPT 5.4 makes more simple plans (compared to superpowers - a plugin from the official claude plugin marketplace - not the plan mode), but can better fill the details while implementing.
Plan mode in Claude Code got much better in the last months, but missing details in the plan cannot be compensated for by the model during implementation.
So my workflow has been:
Make Claude plan with superpowers:brainstorm, review the spec, make updates, give the spec to GPT, usually to witness grave errors found by GPT, spec gets updated, another manual review, (many iterations later), final spec is written, write the plan, GPT finds mind-boggling errors, (many iterations later), Claude agent swarm implements, GPT finds even more errors, I find errors, fix fix fix, manual code review and red tests from me, tests get fixed, (many iterations later) finally something usable with stylistic issues at most (human opinion)!
This happens with the most complex features, the ones that would be a nightmare to implement even for the most experienced programmers, of course. For basic things, most SOTA models can one-shot anyway.
I also wouldn’t say you’re locked into Microsoft’s ecosystem. At work we just have skills that allow for interaction with Bitbucket and other internal tooling. You’re not forced to use GitHub at all.
https://github.blog/changelog/2026-01-16-github-copilot-now-...
You can use GH Copilot with most of Jetbrains IDEs.
It's likely you didn't learn how to use the tool properly, and I'd suggest 'trying again', because soon not using AI will be tantamount to digging holes with shovels instead of using construction equipment. Yes, we still need our 'core skills', but we're not going to be able to live without the leverage of AI.
Yes - AI can generate slop, and probably too many Engineers do that.
Yes - you can 'feel a loss of control' but that's where you have to find your comfort zone.
It's generally a bad idea to produce 'huge amounts of code', unless it's perfectly consistent with a design and the architecture is derived from well-known conventions.
Start by using it as an 'assistant' aka research, fill in all the extra bits, and get your testing going.
You'll probably want to guide the architecture, and at least keep an eye on the test code.
Then it's a matter of how much further 'up' you can go.
There are few situations in which we should be 'accepting' large amounts of code, but some of it can be reviewed quickly.
The AI, already now in 2026, can write better code than you at the algorithmic level: it will be tight, clean, 'by the book', and far less likely to have errors.
It fails at the architectural and modular level still, that will probably change.
The AI 'makes a clean cut' in the wood, tighter to the line than any carpenter could - like a power tool.
A carpenter that does not use power tools is an 'artisanal craftsperson', not really building functional things.
This is the era of motor cars, there is really no option - I don't say that because I'm pro or anti anything, AI is often way over-hyped - that's something else entirely.
It's like the web / cloud etc. it's just 'imminent'.
So try again, experiment, stay open minded.
The amount you can review before burning out is now the reasonable limit, for the same reason that a car is supposed to stay at the speed you can handle and not the max speed of the engine.
Of course, many people are secretly skipping reviews and some dare to publicly advocate for getting rid of them entirely.
The codebase disconnect is real.
We are like blue collar workers that need to hit the gym to maintain the body that our cavemen ancestors could maintain by doing their daily duties.
Codebase gym sessions might become a thing.
It's true AGI is 'not happening' but it doesn't matter.
Demand for AI is explosive, sales are skyrocketing.
We have another 5-8 years of this crazy investment stuff.
Altman will step aside before they turn into a 'normal company'.
Like they did at Uber.
Or perhaps it was a scam in the first place for an IPO.
Its only use case now is when you can walk away for an hour.
And plenty of very wealthy folks see the writing on the wall wrt robotics.
Aren't you saying here that the LLM personality matters to you, too? Being critical of you is a personality attribute, not a capabilities one.
(Of course, strictly speaking, LLMs have neither temperament, "personality", nor intellect, but we understand these terms are used in an analogical or figurative fashion.)
Nerd-sniping as a weapon of oppression
What's a good way to think about this? The billions of dollars at play do cross my mind; at the same time, I'm not a pessimist. I think my middle ground is the usual one: taking things with a grain of salt. I mean, I chose to reply to this comment in good faith that it's human to human, commenter to unpaid/unaffiliated commenter.
I hope I keep that faith. I hope our billions of neighbors on the web enable me to keep that faith over the coming years. Definitely uncertain about the future of the web but want to love it like I've loved it 1990s-today. (Guess I should volunteer w/the EFF while job hunting, try for for-purpose jobs...)
I believe the rule around here is to not assume everyone who disagrees with you or has opinions you don’t understand is a shill. Perhaps there’s a bit of that in the post you replied to, but to me seems mostly about mourning the loss of quality conversations online.
Gotta say, I agree. Not that things were ever great, but it’s really in the crapper now.
Just curious as I'm trying to branch out from using Claude for everything, and I've been following a somewhat similar workflow to yours, except just having Claude review and re-review its plan (sometimes using different roles, e.g. system architect vs SWE vs QA eng) and it will similarly identify issues that it missed originally.
But now I'm curious to try this while weaving in more GPT.
I realized this is the crux of our moment, because a variant of Amdahl's law applies to AI code gen.
{time gained} = {time saved via gen AI} - {time spent in human review}
There's no way that results in a positive number with 100% human review coverage, which means that human review coverage is headed to < 100% (ideally as low as possible).
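A back-of-envelope version of that formula. The rates below are made-up illustrative assumptions; under the comment's premise that careful review is slower than hand-writing, full coverage indeed comes out negative:

```python
# Illustrative model of {time gained}; the rates are made-up assumptions,
# not measurements. Premise (per the comment): careful review of generated
# code is slower than writing the same code by hand.
WRITE_RATE = 10.0   # lines/min you'd write by hand (assumed)
REVIEW_RATE = 8.0   # lines/min you can review carefully (assumed)

def time_gained(lines: float, review_coverage: float) -> float:
    """Minutes gained: hand-writing time avoided minus review time spent."""
    time_saved = lines / WRITE_RATE
    time_reviewing = (lines * review_coverage) / REVIEW_RATE
    return time_saved - time_reviewing

print(time_gained(1000, 1.0))  # -25.0 -> 100% review coverage loses time
print(time_gained(1000, 0.5))  # 37.5  -> skipping half the review "wins"
```

Which is exactly the pressure the comment describes: the only way to make the number positive is to push coverage below 100%.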
As we know with driving, sensible drivers stick to the speed limit most of the time, but there's a good percentage of knuckle draggers who just love speeding, some people get drunk, some they just drive the wrong way down the highway entirely. Either way it's usually the sensible people who end up suffering.
But I write this on a mostly US forum full of FAANG employees and the like, so I don't expect strong agreement.
Not sure why you felt the need to switch the topic to Grok. As for its nudification incident, it seems a bit far-fetched to say that malicious actors bypassing its safety controls was not an accident.
Initially, the image features were restricted to paying subscribers to prevent abuse by anonymous actors; this obviously happened while they were tightening safety controls to stop abuse.
If you're going to bring up that old topic, at least try to get the facts straight.
To use your own analogy, there's plenty of carpenters still around for when someone needs something doing properly and bespoke, even though we can all go to Ikea, or any other flat pack furniture company, to get wobbly furniture cheaply at any time.
I'd rather be the last carpenter charging a liveable wage, working on interesting problems for clients who appreciate a human touch than just pumping out mountains of slop to keep up with the broligarchy. If that makes me ignorant that's fine, but I'll be happily enjoying the craft while you're worrying about your metrics.
- Plus is still the same $20
- 20x Pro is still the same $200
- The 5x tier at $100 is new
https://help.openai.com/en/articles/9793128-what-is-chatgpt-... is probably a better direct comparison of the 3
The $200 Pro plan still exists, and does give access to the pro model.
What is new is a $100 Pro plan that does give access to the pro model, with lower usage limits than the $200 Pro plan.
Plan details:
- 5x more usage than Plus: $120/month
- 20x more usage than Plus: $200/month

To me it seems a LOT of a stretch to think that the people behind Grok believed their safety controls worked, but you can believe that if you wish. Deepfakes of non-consenting adults were trending on X all the time, and Elon even appears to have shared them himself, which is pretty bad even if they're all just adults. And I'm sure you believe that they believed the AI could tell the difference between an underage person and an adult perfectly, although it seems clear they didn't test it very much.
Or, in other words: 'non-existent'.
It is arrogant and luddite to suggest that 'using AI is not doing it properly' or that anyone will care.
They care that it's done well - that's it.
FYI, the code that AI produces is probably better than what you produce, at least at a functional level.
'Artisanality' is worthless in 'code': there are no 'winding staircases' for us to custom build, as a master carpenter would.
Where you can continue to 'write code by hand' is for very arcane things, but even then you're still going to have to use AI for a lot of things in support of that.
So if you want to get into compiler design - sure.
But still - without mastery of AI, you'll be left behind.
At least with horses there's a naturalist component; with 'code', nobody cares at all. There's zero interest in it, there's no 'organic' angle to sell.
In 2005, Tim Bryce wrote that programmers were by and large a lazy, discipline-averse lot who are of average intelligence at best but get very precious about their "craft", not realizing that it's only a small part of a greater whole and it's the business people who drive actual value in a company. AI is proving him 100% correct.
Edit: I wonder if this is actually compute-bound as the impetus
This just adds a $100 plan that's 1/4 the usage of the $200 plan.
Pricing strategy is always a bit of an art, without a perfect optimum for everyone:
- pay-per-token makes every query feel stressful
- a single plan overcharges light users and annoyingly blocks heavy users
- a zillion plans are confusing / annoying to navigate and change
This change mostly just adds a medium-sized plan for people doing medium-sized amounts of work. People were asking for this, and we're happy to deliver.
(I work at OpenAI.)
GPT 5.4 Pro is extremely slow but thorough, so it's not meant for the usual agentic work, rather for research or solving hard bugs/math problems when you provide it all the context.
And do you mean to say that you don't really use GPT 5.4 Pro unless it's for a hard bug? Curious which models you use for system design/architecture/planning vs execution of a plan/design.
TIA! I'm still trying to figure out an optimal system for leveraging all of the LLMs available to us as I've just been throwing 100% of my work at Claude Code in recent months but would like to branch out.
- internally the same best-of-N architecture
- not available in a code harness like Codex, only in the UI (GPT itself has an API)
- GPT-5.4 Pro is extremely expensive: $30.00 input vs $180.00 output
- both DT and Pro are really good at solving math problems
You can't just say because they've added more things the old things are over - the old things actually have to go away first. Eventually they may get there (or not). It may be another few years (or not). Nothing is actually now over though any more than it was now over in 2024.
ChatGPT
See pricing for our individual, business, and enterprise plans.
*Usage must be reasonable and comply with our _policies_
**Enterprise and Business can purchase credits for more access
***ChatGPT manages a shared context window to understand your request, track the conversation, retrieve relevant information, and generate responses. The portion available for user input is smaller than the total window, as space is also used for system instructions (including tools and personality), memories (if enabled), and internal processing (reviewing information, reasoning, and response generation). The reported space for user input is an approximation and may change dynamically based on features in use and any memory content.
The free version of ChatGPT is available to everyone. Paid plans (Go, Plus, Business, and Enterprise) are priced per user per month. We offer monthly plans for Go, Plus and Business and annual plans for Business and Enterprise.