Their "API pricing" is exactly the same as that of providers: https://docs.github.com/en/copilot/reference/copilot-billing...
Seems like folks would be better off with OpenRouter instead.
If there's no discount on credits (in terms of tokens per dollar) over other providers, I'm going to switch to a PAYG provider. If there's a month with little to no coding, I can pocket the $10. What incentive do they give me to stay on this plan?
Just got an email from GitHub saying they'll be raising prices for Copilot.
"To keep up with the way you use Copilot, we're transitioning to usage-based billing, and we want to give you enough time to prepare."
Man, it was fun having my tokens subsidized by Microsoft. If the prices go up too much, I guess I'll try Deepseek again.
Isn't this like saying "The Porsche you rented at $200/mo is now a Honda. But the price hasn't changed!"
Seems a massive loss for Microsoft. Presumably there's a further rugpull to come.
The interesting question is how long it takes enterprises to notice the capability/pricing tradeoff, and whether they respond by limiting access to the strongest models internally.
The part that worries me is that this market is still very early. Most developers and organizations are still learning how to use these tools effectively. Raising the experimentation cost this much may slow down the discovery process that makes the tools valuable in the first place.
On top of that, you've got 2,000 minutes of container runtime, so running cloud agents was included. As was the Anthropic Agent SDK mode via Copilot, which is very comparable with Claude Code - not identical, the Anthropic "modular prompt" is much leaner in the SDK version.
I can't say I'm mad, I got more value than I paid for. That said, going forward I'll probably go back to OpenRouter PAYG rather than a subscription.
I got a free 3 months of the Gemini £19 plan and I've been playing with it quite a bit. 3.1 Pro is a good model, I just find it slow. Flash, I think, I underappreciated until now.
> Plan prices aren’t changing
I was surprised it did not continue with an em dash followed by something profound that is changing.
Plan prices aren't changing -- the value you get out of it is.
I haven't been able to use my subscription much over the busy spring months, but I'm being charged every month.
I'd be tempted to keep the subscription if usage-based billing meant that I'd save money when I had less time.
But today, after hearing this, I cancelled my subscription.
"It" being the end of subsidization of tokens and plans (expected) but while lock-in to foundational models and cloud services is still lacking. Guess investors want their ROI sooner than later, given how big of a wrench the AI boom has thrown into global economics.
The background agents will also depreciate in value because their harness is a black box that isn't optimized for token usage at all. Rolling your own will be a better choice here.
How would that be? They are already charging as much as the underlying providers. They can hardly expect to have any customers if they are charging more.
I thought I was pretty familiar with available options, but no one in my circles ever mentions this product. It doesn’t seem to have much mindshare.
Has anyone used it? What’s your experience?
> In March 2026, Windsurf replaced the credit-based system with a quota-based usage system. Instead of buying and spending credits, your plan now includes a daily and weekly usage allowance that refreshes automatically.
With hindsight, per-request pricing makes no sense at all if an agent can burn a widely varying amount of tokens satisfying that request. These pricing plans were designed before coding agents changed the dynamics of token usage.
Usage-based paying for AI is 1000x crazier because you're not even getting a guarantee of the thing you pay for in the end. You have to keep feeding it prompts and hope it gives you the solution you want. You may end up with no expected result, yet you are paying for it. At least with texting, you got what you paid for.
I wonder how long it'll be before all AI costs are flat unlimited monthly fees or even free across the board, without compromise.
With this kind of pricing (Sonnet 4.6 has a 9x multiplier, previously 1x), it raises the question of why use Copilot to begin with.
You could easily just buy the tokens directly and have a lot more choice as well.
With this pricing change, I see no reason at all to stick with Copilot in principle, but I really need to solve this issue of IDE integration to move on.
> What is the benefit of using the Copilot Pro+ at 39$/month instead of using the Copilot Pro at 10$/month and paying for extra usage?
(I'm a copilot subscriber since 2022)
But what really surprised me most about Copilot is that it would bill you per question, with no regard for tokens. So if I managed to produce a prompt that gave me back an insane amount of tokens for something, which any Claude model would easily accomplish, you were giving me my money's worth at your own expense. The math is not gonna math out forever.
Opus 4.6 3x -> 27x
Opus 4.7 3x -> 27x
GPT 5.4 1x -> 6x
EDIT: only applies to annual plans
This is the project that I am working on: https://github.com/mohsen1/tsz
I would say it's a 1000x increase in price for agentic workflows.
The old plans were $0.033/request for Pro, $0.026/request for Pro+ and $0.04/request for pay-as-you-go. That discount is now gone. They even still advertise "5x the number of requests" for Pro+ over Pro.
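Working backwards from those per-request prices and the $10/$39 plan prices quoted elsewhere in the thread, the implied monthly request allotments line up with that "5x" claim; a quick sanity check using only the numbers already mentioned here:

```python
# Sanity check of the old per-request pricing quoted above, using only figures
# already mentioned in the thread ($10 Pro, $39 Pro+).
pro_price, pro_per_request = 10.00, 0.033
pro_plus_price, pro_plus_per_request = 39.00, 0.026

implied_pro_requests = pro_price / pro_per_request                  # ~300 requests/month
implied_pro_plus_requests = pro_plus_price / pro_plus_per_request   # ~1500 requests/month

print(round(implied_pro_requests), round(implied_pro_plus_requests))   # ~303, 1500
print(implied_pro_plus_requests / implied_pro_requests)                # ~4.95, roughly the advertised 5x
```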
It's a lot of stuff that makes me have to type less into the prompt, since it's already getting so much info from my editor
What we're seeing across the board is every software company tossing AI onto their name or sales pitch and no one understanding what that actually means. But we will spend money on it because of FOMO.
I really question whether we're reaching the end of the hype cycle. I wish I were brave enough to put money on it. It feels like there was a command from up top to 'do something with AI' and leadership is scrambling for resume-building projects instead of doing the hard work they should've done over the past two years at a people and process level.
I once asked it to do a comprehensive security review of our code. It churned for nearly an hour (and then produced 90% false positives). Insane that that usage was charged the same amount as me just saying "Hello".
It also helped build an intuition of what each model could do and which parts it was weaker at, because you could try them almost side by side, especially if one model's output wasn't great.
That said, these were all side projects so nothing truly consequential. OTOH, you might leave some extra perf on the table, but I found the models worked quite well with the Copilot harness.
Other than that Zed has a similar experience which is pretty decent.
* By which I mean the good one, whatever it's called now - the part of Copilot that used to be a plugin and is now part of VS Code, not the thing that has always been part of VS Code.
On my personal account, Copilot Pro+ still only gave me back Opus 4.7, whereas my work's Pro account still lets me use Opus 4.6.
So, my gut says, it's entirely possible that Pro+ will continue to have more segregation on model availability...
FTA
> Last week, we also rolled out temporary changes to Copilot Individual plans, including Free, Pro, Pro+, and Student, and paused self-serve Copilot Business plan purchases. These were reliability and performance measures as we prepare for the broader transition to usage-based billing. We will loosen usage limits once usage-based billing is in effect.
There's enough weasel wording here that I would expect only certain models get re-enabled on Pro.
e.g. lots of people seem to get good enough results from Opus 4.6; personally I prefer it over 4.7 in GH Copilot... locking that down to Pro+ would be, given this salvo of enshittification, a 'logical' move on their part.
> Users on annual Pro or Pro+ plans will remain on their existing plan with premium request-based pricing until their plan expires, however, model multipliers will increase on June 1 (see table).
Before:
- Opus 4.6: each premium request counted as 3 premium requests.
After:
- Opus 4.6: each dollar of usage consumes 27 dollars of Copilot AI Credits.
Given that you'll receive 19 dollars of AI Credits on the Business plan, that means you can probably say one "hi" to Opus per month.
I wouldn't mind a plan between Free and Pro that is just "all I care about is code completion and next edit suggestions".
1. Github could choose to grandfather in those plans and make no changes until those plans expire.
2. Github could offer, or the user could request, a pro-rated refund along with cancellation of the account.
3. Tough luck, those users agreed that Github could unilaterally change the ToS at any time.
But companies do lots of illegal things, and in general nobody takes them to court over it.
Turns out when a request can spawn tens of subagents and use millions of tokens over many turns of tool calls, suddenly GitHub Copilot has a massive financial problem on their hands.
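To put rough numbers on why that breaks, here's a back-of-the-envelope comparison of the old flat per-request price against what one heavy agentic request costs at token rates. The token counts and per-million rates below are illustrative assumptions, not anyone's published prices:

```python
# Back-of-the-envelope only: the token counts and $/million rates are assumptions
# for illustration, not GitHub's or any provider's published numbers.
flat_price_per_request = 0.04                       # old pay-as-you-go premium request price quoted upthread

input_tokens, output_tokens = 2_000_000, 150_000    # a plausible long agentic session
input_rate_per_m, output_rate_per_m = 3.00, 15.00   # assumed $/million tokens

token_cost = (input_tokens / 1e6) * input_rate_per_m + (output_tokens / 1e6) * output_rate_per_m
print(f"token cost ~ ${token_cost:.2f} vs flat ${flat_price_per_request:.2f} per request")
# token cost ~ $8.25 vs flat $0.04 per request
```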
If anything, these new multipliers are more transparent than anything OpenAI or Anthropic have communicated regarding actual costs and give us a more realistic understanding of what it's costing these providers.
The fact that we were able to get such a substantial amount of usage for $20/$100/$200 a month was never meant to last and to think otherwise was perhaps a bit naive.
This feels like a strategy from the ZIRP era of tech growth where companies burned investor capital and gave away their products and services for free (or subsidized them heavily) in order to prioritize user acquisition initially. Then once they'd gained enough traction and stickiness they'd then implement a monetization strategy to capitalize on said user base.
I've been wanting to get off MS more generally and this is good motivation. Will be playing round with OR this week.
I see statements like this as strong indicators that the sales people are wrapping up their work and the accountants are taking over. The land rush is switching to an operational efficiency play.
The only model I even used on Copilot was Sonnet, and now it's got a ridiculous multiplier.
At this point they might as well just charge per Million tokens like every other provider instead of having a subscription.
Also, the multiplier of 27 for Claude Opus 4.6/4.7 is way higher than the increase in API price would suggest.
I wonder why that is.
Provide cheap and unlimited access to Grok for programmers (hence the Cursor partnership/purchase for distribution).
-> This would drive massive revenue right before the IPO announcement, as if the company were growing like crazy
-> At a loss, but don't worry, we need these funds to build the biggest datacenter in the universe.
This announcement would create enough momentum to increase the valuation, and because of the merger of his companies, would save his X/Twitter investors from a tragedy.
-> Would also be a great service to Cursor investors and so on, who are stuck with their VSCode fork
Or if you're a business with multiple seats, these plans may be less efficient than raw API usage billing, since if anyone at your organization fails to use their full $19/$39 allotment each month, that's wasted money, whereas API credits are 100% utilized.
I don't think they've thought through the implications of this. Everyone should cancel and go usage-based billing with caps.
Also, Opus 4.7 seems like a model more intended to save Anthropic money than push the bar.
Not really sure why I would stick with Copilot after this, and increasing Sonnet from 1x to 9x for annual subscribers is highway fucking robbery. Very glad I didn't commit myself to an annual plan.
Will always be grateful for the greed of trillion dollar corporations that subsidized me.
I was using 100M+ tokens per day, $250 per day or so and only paying $160 per month to GitHub.
I cancelled my GHCP sub and switched to Codex last week, so far so good but I miss Gemini 3.1 Pro for UI work.
What's actually better in the CLI?
It's not turning consumption based because there are a ton of these licenses just sitting idle.
Due to data governance it will be difficult to move to a different provider.
At the same time, this price hike is so large that the ROI on copilot will be a net negative.
I think what will ultimately happen is that we will not pay Microsoft more than we currently do and we'll simply end up with less AI usage in the company and a reduction in productivity.
But they can't buy Cursor before their IPO, so that's that?
Perhaps they have too much compute because Musk overpromised and Twitter/Grok doesn't need that much compute after he nerfed the porn stuff?
One provider who was undercutting the market with non-standard billing model moving to a more standard billing and prices doesn't seem like that strong of a signal, other than that Copilot was underpriced.
I don't disagree with your other points though.
I don’t understand if this means they’re providing actual refunds or not. For them to straight up go back on their word this had to have been a major cost they didn’t exactly expect.
Save us Deepseek!
I don’t need the world’s greatest programmer for the types of vibe coding projects I actually build.
However, if compute keeps going up in cost, hiring skilled people who know how to utilize it becomes more important. This might save the tech economy.
Tbh I think it still works, but only because the new allowance will likely get used very quickly within a billing cycle - I'm expecting this change to increase our org's bill significantly, based on how many OpenRouter API credits I consume in a weekend using a single agent in a pairing style.
The pooling will only be useful if you have a bunch of infrequent/low usage users that you still want to have licenses.
I’ve “vibe-coded” some projects and when I start to find issues or go to refactor them I don’t have that memory of why decisions were made, because many decisions were never made.
Absolutely the cheapest way to get a lot of tokens through a solid harness for $10/month. Until now
I don't mind a PAYG model for a simple chat interface. But when it comes to actually producing things, you burn through TONS of tokens creating the wrong output.
Additionally, we got copilot for every user, including those that never write code or use AI tools.
Gosh, imagine getting to do that with your TV/Streaming subscription. Getting to pay one fee to access some set number of hours per month from any of the providers.
GitHub has the full power of Azure with their hosted models but it's not being passed to consumers.
Also heard of more and more people moving to Kilo Code or OpenChamber instead.
If I could run a local model comparable to even Sonnet 4.6 without shelling out $50K in hardware, I'd do it in a heartbeat. But all I have is 32 GB of RAM and an old RTX 4080.
Or am I not up to speed? Are there decent coding models that can run on dev laptops? Not that that's what you were suggesting by recommending a local model, necessarily; just curious.
If you are not on an annual plan, multipliers will be gone completely. You can see the rates that apply instead here: https://docs.github.com/en/copilot/reference/copilot-billing...
They explicitly stated that they won't be doing that: the multipliers go into effect in June for everyone, annual plan or not.
For example, the German Civil Code states:
Section 308 - Prohibited clauses with the possibility of valuation
In standard business terms, the following in particular are ineffective:
[...]
4. (Reservation of the right to modify) the agreement of a right of the user [TL note: this means beneficiary of the terms, e.g. the party or other subject of the contract] to modify the performance promised or deviate from it, unless the agreement of the modification or deviation reasonably can be expected of the other party to the contract when the interests of the user are taken into account;
I think VSCode only supports Copilot for "autocomplete" too
on top of that, you need GitHub Copilot for the PR reviewer functionality in GitHub
There's going to be a limit to how much they can raise prices, because someone can always build out a datacenter and fill it up with open source DeepSeek inference and undercut your prices by 10x while still making a very good ROI--and that's a business model right there. Right now I'm sure there's a lot of people who will protest that they couldn't do their jobs with lesser models, but as time goes on that will get less and less. Already right now the consumers who are using AI for writing presentations, cooking recipe generation and ELI5 answers for common things, aren't going to be missing much from a lesser model. That'll actually only start to get cheaper over time.
Also for business needs, as AI inference costs escalate there comes a point where businesses rediscover human intelligence again, and start hiring/training people to do more work to use lesser models--if that is more productive in the end than shelling out large amounts of cash for inference on the latest models. [Although given how much companies waste on AWS, there's a lot of tolerance for overspending in corporations...]
It has been years now, of cash injections, investors can't keep feeding the beast forever.
If/when it gets to the point where it can replace a skilled worker, the service can be sold for close to the same price as that skilled labour. But the AI can run 24/7, reliably, and scale up/down at a moments notice.
There's not going to be much competition to drive prices down, the barriers to entry are already huge. There'll likely to be one clear winner, becoming a near-monopoly, or maybe we'll get a duopoly at best.
Pretty sure that's what they will eventually do
Does it effectively bypass regional restrictions for you, so you can use something like the Claude API from unsupported regions such as Hong Kong, or does it still enforce the official providers' geo-restrictions?
I'm guessing they did that (and the 'temporary bonus credits') to make the pill easier to swallow for that side of customers.
How so? By all accounts I've read so far it uses more tokens overall for roughly the same results.
Inconsistent design patterns from page to page, half baked features, inconsistent documentation (but BOY is there ever a lot of it!), NIH ui component libraries that don't act like you'd expect. All that fun stuff.
It's like they speedran the worst parts of enterprise apps.
I use Claude Code, but I kept my Copilot subscription around mostly for really cheap usage of other models when I need to try a different one (which appears to be ending, in a sense) and also the autocomplete in Visual Studio Code which was really great across a bunch of files, I could make changes in one file and then just tab through some others.
I wonder what other good autocomplete is out there.
That's already the case if you can self-host an LLM; you don't even need a mythical H200: gamer-grade GeForce cards can get you a long way there (if this page is to be believed: https://www.runpod.io/gpu-compare/rtx-5090-vs-h200 )
...after RAM prices return to normalcy, of course - and then wait another 2 or 3 generations of GPU development for a 96GB HBM card to hit the streets - and also assuming SotA or cloud-only LLMs don't experience lifestyle-inflation, but I assume they must, because OpenAI/Anthropic/Etc's business-model depends on people paying them to access them, so it's in their interests to make it as difficult as possible to run them locally.
Give it 5 years from now and reassess.
And at some point even frontier model costs will hopefully come down (if there is still a meaningful difference between closed and open source models at that point) as all of the compute that's being built out right now comes online.
Yes, a lot of people (not me). Why? Well because that was the whole value proposition of these companies, relentlessly pushed by their PR and most of the media- rememmber it was something something Pocket PhDs, massive unemployment etc?
Based on what exactly? So far every time OpenAI, Anthropic or whatever has released a new top performing model, competitors have caught up quickly. Open source models have greatly improved as well.
I expect AI to be just like cloud computing in general - AWS, Azure, GCP being the main providers, with dozens of smaller competitors offering similar services as well.
Sometimes the multiplier increase is significant like for Claude Opus 4.6 from 3x to 27x (https://docs.github.com/en/copilot/reference/copilot-billing...), meaning using that model will use up a lot more „tokens“ (whatever the new word for it is)
TL;DR: Today, we are announcing that all GitHub Copilot plans will transition to usage-based billing on June 1, 2026.
Instead of counting premium requests, every Copilot plan will include a monthly allotment of GitHub AI Credits, with the option for paid plans to purchase additional usage. Usage will be calculated based on token consumption, including input, output, and cached tokens, using the listed API rates for each model.
This change aligns Copilot pricing with actual usage and is an important step toward a sustainable, reliable Copilot business and experience for all users.
To help customers prepare, we are also launching a preview bill experience in early May, giving users and admins visibility into projected costs before the June 1 transition. This will be available to users via their Billing Overview page when they log in to github.com.
Copilot is not the same product it was a year ago.
It has evolved from an in-editor assistant into an agentic platform capable of running long, multi-step coding sessions, using the latest models, and iterating across entire repositories. Agentic usage is becoming the default, and it brings significantly higher compute and inference demands.
Today, a quick chat question and a multi-hour autonomous coding session can cost the user the same amount. GitHub has absorbed much of the escalating inference cost behind that usage, but the current premium request model is no longer sustainable.
Usage-based billing fixes that. It better aligns pricing with actual usage, helps us maintain long-term service reliability, and reduces the need to gate heavy users.
Starting June 1, premium request units (PRUs) will be replaced by GitHub AI Credits.
Credits will be consumed based on token usage, including input, output, and cached tokens, according to the published API rates for each model.
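As a rough illustration of how that works out, here is a minimal sketch assuming credits map one-to-one to dollars of token usage at per-model API rates; the rates and the cached-token discount in the example are placeholders, not published numbers:

```python
# Minimal sketch of usage-based credit consumption, assuming credits track dollar
# cost at per-model API rates. The rates used below are placeholders, not real prices.
def credits_used(input_tokens, output_tokens, cached_tokens,
                 input_rate, output_rate, cached_rate):
    """All rates are in credits (~dollars) per million tokens."""
    return (input_tokens * input_rate
            + output_tokens * output_rate
            + cached_tokens * cached_rate) / 1_000_000

# Example: one mid-sized agentic task against a hypothetical model priced at
# $3/M input, $15/M output, and $0.30/M for cached reads.
print(credits_used(400_000, 30_000, 1_200_000, 3.00, 15.00, 0.30))  # ~2.01 credits
```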
A few important details:
Last week, we also rolled out temporary changes to Copilot Individual plans, including Free, Pro, Pro+, and Student, and paused self-serve Copilot Business plan purchases. These were reliability and performance measures as we prepare for the broader transition to usage-based billing. We will loosen usage limits once usage-based billing is in effect.
Copilot Pro and Pro+ monthly subscriptions will include monthly AI Credits aligned to their current subscription prices:
Users on a monthly Pro or Pro+ plan will automatically migrate to usage-based billing on June 1, 2026.
Users on annual Pro or Pro+ plans will remain on their existing plan with premium request-based pricing until their plan expires. Model multipliers will increase on June 1 (see table) for annual plan subscribers only. At expiration, they will transition to Copilot Free with the option to upgrade to a paid monthly plan. Alternatively, they may convert to a monthly paid plan before their annual plan expires, and we will provide prorated credits for the remaining value of their annual plan.
Copilot Business and Copilot Enterprise monthly seat pricing remains unchanged:
To support the transition, existing Copilot Business and Copilot Enterprise customers will automatically receive promotional included usage for June, July, and August:
We are also introducing pooled included usage across a business, which helps eliminate stranded capacity. Instead of each user’s unused included usage being isolated, credits can be pooled across the organization.
Admins will also have new budget controls. They will be able to set budgets at the enterprise, cost center, and user levels. When the included pool is exhausted, organizations can choose whether to allow additional usage at published rates or cap spend.
Plan prices aren’t changing. You’ll have full control over what you spend, tools to track your usage, and the option to purchase more AI Credits if and when you need them.
If you have questions, visit our documentation for individuals and for businesses and enterprises, and our FAQ and related discussion.
This is the VSCode autocomplete stuff right? Really enjoy this.
> Copilot code review will also consume GitHub Actions minutes, in addition to GitHub AI Credits. These minutes are billed at the same per-minute rates as other GitHub Actions workflows.
That sucks.
but now, you get literally nothing
In not-too-distant future we're going to be running better models on our phones than we can buy access to today in the cloud. Skate where the puck is going: soak the customers until that day comes.
* with a quota of 138 meters per hour, overage charges may apply
And yes, I need to find a solution for autocomplete. It used to be available in free tier of Copilot. Not sure anymore.
Personally I got CLI fatigue and am happy with Conductor for now, but things are moving fast in this space.
I am in the same boat. I tried looking for tab/auto-complete implementations ~ a year ago and it was pretty disappointing. If that has changed, would love to know!
Can you imagine ten months from now and you're still rolling Sonnet 4.6?
Cancel/refund is looking pretty good. They're doing refunds until May 20.
"To request a refund, go to Settings → Billing and licensing → Licensing, select Manage subscription, then choose Cancel and refund "subscription". (The phrasing varies slightly depending on your subscription ). This option will be available until May 20."
Not sure how it all works out. Currently trillion dollar companies can't make a native app for platforms. Everything is just JS/Electron because economics does not work for them.
And here companies can supposedly build GW data centers running very expensive GPUs and charge 1/10th of current prices. Sounds a little fanciful to me.
Even if SOTA models in the cloud are a few percentage points better, most work can be routed to local models most of the time. That leaves the cloud providers fighting over the most computationally intensive tasks. In the long term, I think models are going to be local-first.
(Unless providers can figure out a network effect that local models can't replicate).
You can pay with crypto though, which seems to be convenient for people under sanctions or with limited access, or if you are in low-tax jurisdiction (e.g. HK)
It still does make one wonder, why have seats at all though? If everyone is just in one big API credit pool - what do the seats/users accomplish?
We're putting other providers through the gauntlet. An M4 Studio or two running the latest Qwen3 or whatever counts for state of the art in open models is also looking a little more viable all the time.
Were you able to see AI-assisted coding savings proportional to the cost increase you're now going to get?
Companies removed people because AI-assisted coding was supposed to be cheaper, and now coding costs are going from a fixed $X to something non-deterministic. The post by Uber a few days back about spending 12 months' worth of money in 4 months tells a lot.
The only path forward seems to be open-source models, and since many companies won't use Chinese ones, that leaves Mistral as basically the only option.
Inference economics are going to be brutal in 2026 H2 when DeepSeek's new infra and model improvements come online, and Kimi launches K3. By brutal, I mean for OpenAI and Anthropic.
I do like the integrations with the IDE however, they are convenient for rapidly reviewing changes. I just need their terminals to actually work!
One of the largest employers publicly engaging in a project which has the outcome of depressing wages. It's easier to "get" if you don't take the trillion dollar gorilla at face value.
I had copilot mainly so I could write issues and throw agents at it, while I went off and did other things. Has been great for contained spot work.
At this point, I'll go ahead and leave it expire, and then consolidate between Codex and JetBrains AI. Especially since Xcode supports Codex with a first-party integration.
Which feels a bit like a kick in the pants for me as a developer that was primarily using Copilot for VS Code ghost text and very rarely used the Chat sidebar much less "agentic" tools.
Copilot Pro sort of made sense for my personal account when amortized across a year, but I don't want to "waste" $10/month on credits I won't use most months.
That said I think few people using openrouter are actually being selective about providers.
It took half a day to get my opencode setup working; it was not friendly. A lot of manual cross-referencing of models and providers. I was actually mainly optimizing for relatively fast providers. It's all super fragile and I'm sure half out of date; I have no idea if these picks are still fast, and no promise they are still the same price (pretty terrifying, honestly).
I'm mostly on coding plans so it doesn't super affect me. But man is it a bother to maintain.
It's a convenience cost, for sure, but it's not valueless in a fast-moving world. Certainly if you're comfortable with one provider and it's cheaper, do that.
But it's a really good UI for agentic coding. Not sure why more people don't use it. I've tried the others and keep coming back to Copilot Chat. It's a really good tool. Which is why the rugpull on pricing is so concerning.
In other words.
The bubble has burst. You're just in denial.
Why? There's an inherent efficiency advantage to scale, while the only real advantage for local models (privacy/secrecy) hasn't proven convincing for broader IT either.
Do inference providers have standardized endpoints, or at least endpoints compatible with Claude Code? Otherwise you're paying 5.5% on all your tokens just so it's slightly easier to swap providers (i.e., changing a few URLs)?
Apple still charges 30%. 5.5 seems pretty reasonable. /shrug I dunno.
One theory I think Matt Levine posited, is that SpaceX will go public with dual-class stock that gives Elon control even with a minority ownership stake, and will subsequently buy Tesla, which doesn't have dual class stock, making SpaceX the singular "Elon Musk company", with him having operational control despite being public.
So, lets do some honest evaluations:
1. The model itself is a non-deterministic engine of work with an unknown value; its real value is just magic.
2. The business model itself is a non-deterministic engine of profit with a known value; whatever the VCs have put into it _must_ be pulled out. If Ed Zitron's numbers are correct, circa 2030, it's several trillion dollars.
So do some matrix multiplication of non-determinism vs determinism, and realize that the value proposition for _you_ is only going to decrease because #1 can never outpace #2, ensuring enshittification captures a smaller and smaller whale.
We know this. This has been the last two decades of money extraction from software. It was OK when it was some 12 year old's parents' CC. But now it's you, or your business, that's going to either be squeezed for value or squeezed out of the market.
And everyone's squabbling about the color of the cost. ok
Yep, you can plug deepseek/kimi/minimax into claude code just fine. Or run everything through another harness like opencode instead.
Still worth it IMO to be able to switch from Provider A to Provider B if Provider A is having a bad day.
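For anyone weighing the "changing a few URLs" part: several providers expose Anthropic-compatible endpoints, so switching is mostly a base URL and API key swap. Here's a sketch with the Anthropic Python SDK; the base URL and model id below are placeholders, check your provider's docs for the real values. Claude Code itself reads the equivalent settings from the ANTHROPIC_BASE_URL and ANTHROPIC_AUTH_TOKEN environment variables, as far as I know.

```python
# Sketch: pointing the Anthropic SDK at a different, Anthropic-compatible provider.
# base_url and the model id are placeholders; substitute your provider's real values.
import os
from anthropic import Anthropic

client = Anthropic(
    base_url=os.environ.get("ANTHROPIC_BASE_URL", "https://api.example-provider.com/anthropic"),
    api_key=os.environ["ANTHROPIC_API_KEY"],
)

resp = client.messages.create(
    model="provider-model-name",   # placeholder model id
    max_tokens=512,
    messages=[{"role": "user", "content": "Summarize this diff in one sentence: ..."}],
)
print(resp.content[0].text)
```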
Having some open weight deployment or vendor is also a good thing, because you may have domain specific tasks where you can get better results on domain specific problems with a quick finetune.
Unsloth makes it particularly easy. Open weight LLMs are incredibly powerful building blocks.
Considering most of the cost of producing a model is the upfront cost rather than the running one, I kinda still do.
The point was never to produce 4 frontier models per company a year.
That is not my experience. Each model since at least GPT-4 can fill up an entire context window. In fact, more powerful models can solve tasks faster, so their ratio of multiplier to API price should decrease, not increase.
For example, Claude Sonnet 4.6 has a multiplier of 9 and an API price of $15, which is 0.6 multiplier per dollar.
Claude Opus 4.7 has an API price of $25, so it should have a multiplier of 25 * 0.6 = 15 when extrapolating from Sonnet, but the multiplier is 27.
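Same arithmetic as above, just restated in one place:

```python
# Restating the comparison: multiplier per dollar of API price.
sonnet_multiplier, sonnet_api_price = 9, 15    # Claude Sonnet 4.6
opus_multiplier, opus_api_price = 27, 25       # Claude Opus 4.7

per_dollar = sonnet_multiplier / sonnet_api_price   # 0.6 multiplier per API dollar
expected_opus = opus_api_price * per_dollar         # 15, if Opus were priced like Sonnet
print(per_dollar, expected_opus, opus_multiplier)   # 0.6 15.0 27 -> ~1.8x steeper than the extrapolation
```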
> Also, they tend to use more thinking tokens.
That might be it. Is there any data on this somewhere?
We are paying for tens of thousands of those machines, although everyone knows they are stupidly expensive and incredibly slow.
Why does everyone care about gas prices I only ever pay $20 for gas?
When I see how fast Codex max thinking GPT 5.5 eats our enterprise seat credits almost anything else seems cheap (until we switch our live systems from 5.4 api to 5.5 api I guess)... good thing I'm not the one paying for those credits and tokens (which is probably how most of the money is going to be made on AI going forward, borderline free chatbots for normies are done)
Here's the oh-my-posh GH issue[0] in case your problem is similar but not solvable with a simple package update.
[0]: https://github.com/JanDeDobbeleer/oh-my-posh/issues/7029
That, and they have tool use issues.... https://www.reddit.com/r/LocalLLM/comments/1smzw6s/qwen35_a3...
I would check out the model mentioned in that thread, GGUF unsloth/qwen3.5-35b-a3b on Q4_K_M
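In case it saves someone a search, here's a minimal llama-cpp-python sketch for pulling a Q4_K_M GGUF like that straight from Hugging Face. The exact repo and file names are assumptions on my part, so check the actual unsloth listing before running it:

```python
# Minimal local-inference sketch with llama-cpp-python (pip install llama-cpp-python).
# repo_id/filename are guesses based on the model named in the thread; verify on Hugging Face.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="unsloth/qwen3.5-35b-a3b-GGUF",  # assumed repo name
    filename="*Q4_K_M.gguf",                 # glob matching the Q4_K_M quant
    n_ctx=16384,                             # context window; raise it if memory allows
    n_gpu_layers=-1,                         # offload every layer that fits onto the GPU
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a Python function that deduplicates a list while preserving order."}],
    max_tokens=512,
)
print(out["choices"][0]["message"]["content"])
```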
Deepseek API pricing is very low compared to Anthropic/OpenAI API pricing.
For many, the 300% difference in pricing may be difficult to justify, if the quality difference is very small. And there will be many tasks where the most expensive/the best model, is not needed. Currently many people end up using Opus 4.7/GPT 5.5 for many tasks without thinking about it.
That would be, even is, the smart thing to do.
Maybe in a world where these AI companies behaved with some semblance of ethics and user-friendliness they would be on even ground, but for anyone paying attention local models are obviously the future.
That said, it's worth noting I don't see how anything they expose will reliably help orgs plan costs for what is, AFAIK, a big shift in billing and cost planning.
> It forces you to pay at least $20
For better or worse that's public pricing, i.e. if you are coming in also negotiating VS for devs, windows/office licenses for the rest of the business and stuff like Azure Devops... a lot of their stuff gets cheaper if your company's IT procurement group is vaguely competent at negotiating. Not even talking bigcorp here I'm talking 500-1000 employee range.
Of course, very small orgs will suffer, but it does tie in with the theme over the last two weeks; anyone with a personal account is basically subsidizing the credits for the business accounts during the transition period.
Don't you still need to handle tokens with them? Also that's trivial.
> billing
Yes but you'd be paying for billing anyway.
> reliability
They increase reliability?
> middleware
Which you wouldn't need if you paid directly.
I'm not saying they shouldn't get 5.5%, but that list is mostly non-convincing.
> Apple still charges 30%.
3 of the 30 is for billing, with the rest mostly being gatekeeping with a fake justification on top.
Should we be blamed about uber destroying the taxi business, or airbnb the hotel one? Oh sorry, "disrupting".
Uber was dirt cheap, now it is the same price as taxis, and the people working for it (the "partners", not employees) have no social benefits.
Airbnb was cheap and humane, now it is THE cause for housing crises and massive residential property "investment".
The playbook of silicon valley is destructive, not disruptive.
It is by design aimed towards wealth accumulation. The ones with most money can capture the market, and make even more. It really is late stage capitalism.
And the more wealth inequality there is, the more pain, poverty and instability will be as well. AI will only exacerbate this.
Now, I have this high-resolution shiny object that can near instantaneously get any information I want along with _streaming HD video to it_ *anywhere*.
Even 15 years ago feels like the stone age. I can't fathom what it has to feel like for people in their 60s and 70s.
The only sad thing is trying to use tools in a VS developer prompt (and how could this not have been fixed ages ago, it's literally YOUR OWN flagship product). It knows how to launch the .cmd for it, but that's incredibly slow for single commands. Would be nice if I could tell it to just use an open terminal.
That's very close to a normal day in 1996. The biggest difference is I read the news on my phone instead of a physical newspaper. The news was not any more interesting or informative because of that. I guess I can also still do the loop reasonably well, but I'm a lot slower than I was in 1996 when I was a cross-country state champion.
My parents are closing in on 70 and I guess I can't speak for them, but I'm at least aware of the daily routines of their lives, too. Walk the dog, do housework, DIY building projects, visit kids and grandkids. Seems much the same, too, with the biggest difference being they're now teaching my sister's sons to play baseball rather than me, but shit, one of her sons even looks exactly the same way I looked when I was 7! The more things change, the more they stay the same.
5.3-Codex is really good enough, Sonnet 4.6 is good enough.
but surely the issue is on VS Code side, to do things in a way that work with people's shells as they are
other agent harnesses don't have the same problems with my shell
Housework meant no laundry machine, no dishwasher, and possibly no vacuum cleaner. That means hand washing everything, and beating rugs with sticks and brushes to get the dust off of them.
I am just over 50 myself and I agree with your points. Technology has changed, but life is largely very similar to where it was in the 90s. At least day to day. Attitudes are way worse now.
Don't think I'll be renewing though. The usage limits are low enough that I don't think this is worth it. One complex prompt while Americans are awake will wipe out your allotted tokens, it seems.
I'm finding Google's Gemma 4 even better though - seems to hold up the agentic loop better than Qwen.
All will load into 20 GB of VRAM. None are amazing, but they do just about work.
But that gives me a good while to determine if it's worth it or not. I've heard good and bad, so here's hoping for good or close to it.
I wasn't going to fork out $1000 on a chance it might be enough with a rough return strategy.
I agree though, it can't get cheaper than the cost of hardware; it's just that without sufficient documentation of the actual costs to run the cloud models, we can't really know what the "true" cost of each token is. I assume there's an economist out there somewhere who could figure it out though. Certainly, the cost should approach, at a minimum, that of an open-weights model running on a local machine.
I've successfully got Qwen3-coder-next to loop and generate sufficiently competent code, and from what I can tell, the difference between this and the cloud is how quickly the generation happens and perhaps how interactive it has to be.
When using opencode or copilot CLI, the error messages are displayed normally and it's possible to see what's going on. Under Pi, it sometimes just hangs, or Pi crashes with some bun stacktrace and that's it.
Copilot has introduced additional limits for Claude models in past month, and it's rather easy to hit it. Pi often doesn't show anything when this limit hits (although sometimes it shows the error, I guess it depends on Pi version).
Even with overhead and scaling for peak use and a large profit margin, any company with an ounce of competition will be vastly cheaper than self-hosting. And for models you can run yourself, there will be plenty of competition.
If people wouldn't use their services, nothing would happen. They would just go bankrupt.
So yeah, I'd say it's entirely people's fault. Because people just wanted to use their services without thinking what they're causing.
Customers who think only about themselves and no one else.
But - now they are easier - I can read books on an e-ink screen and pretty much instantly find what I want to read next. I get my news on a phone. I used to watch TV/movies broadcast or on tape rentals. Now, I have just about everything I could ever want available - without ADs... those were such a time-waster.
What has changed is that I have access to MORE information than my local (or school) libraries could ever provide - in a variety of more accessible formats. Whatever tools I need to get "work done", I can find a myriad of free and open-source options.
But - the overall days and household family routines are the same - now, instead of reading a paper book while waiting to pickup my kids (or other family members) "back-in-the-day", I can read my device, or connect with my DIY communities online on my phone - or learn something new. I don't have to schedule life around major broadcast events, I can easily do many tasks while I am "out-and-about".
Friction has been reduced.
I always wonder the views of older people. My parents are very technology forward and have been my entire life so it is difficult to gauge how different life is compared to when they were growing up.
It's easy to hear "Oh well I only had 640kb of memory and typed programs out of a magazine I got in the mail!" and see as distinct from having 'unlimited' resources and the internet.
Your insight is good ("The biggest difference is I read the news on my phone instead of a physical newspaper") that life sort of stays the same but the modality changes. People still go to the store like they did in the mid-1800s but now it is by car.
I wonder what our "industrial revolution" will be where the previous generation lived (ie: out in the country on a farm) totally different lives to the current (ie: in the city in a factory). Maybe when space travel and multi-planetary living is normalized?
Since I was there (young, but there), I want to point out that this crosses three eras which all felt quite different:
1978: typed programs in from a magazine or loaded from a cassette (16kB, TRS-80)
1983: loaded programs from a floppy (64kB, Apple ][ and C64 etc)
1988: loaded programs from a hard disk (640kB, IBM PC and Mac).
Exact years vary but these eras were only about 5 years each. Nobody had a floppy in 1978 but almost every computer user did by 1983; nobody had a hard drive in 1983 but almost everyone did by 1988.
When was this ever different? And do you expect it to ever change?
I am not shilling China, this is just what is happening right now.
This is the role of legislation, educated experts creating policies so that you don't have to do business analysis before making a purchase.
Would I pay 10x the price for tokens and be outcompeted by other companies, hoping that OpenAI will go out of business? This is entirely unrealistic.
Even if we argue that we can't require from every human being to understand what they're doing, I'd still argue that there are more people who perfectly understand it and don't care than people who have no idea how such a business operates.
> You cannot expect every consumer to be fully educated and aware of the consequences of their purchasing power.
Huh? I cannot expect that people understand consequences of their actions? What are we, animals? Of course sometimes things aren't simple, and we cannot predict that using some service will create some longterm effects that in the end will be harmful. Some things are hard to predict.
But some things are easy to predict and my point is that this was exactly this case.
I mean, now we all know what Uber and AirBnb did, and we still use them because we don't care (generally speaking, I've used uber maybe 3 times in my life, AirBnb never).
I do NOT want to have to research the business model of companies before I buy their products or services. I would like to outsource that to the government, and spend my time actually enjoying life.
Am I supposed to be invested in every change that happens around me?
What if I am a baker, using chatGPT to experiment with recipes and develop them. Am I supposed to read about LLMs, tokens, and the silicon valley playbook ?
No. I should not have to do any of those things.
I think the Chinese government works differently than the US government. I think China has been subsidizing their electricity grid for decades and leading the world on sustainable electricity, namely solar, while the US has let its infrastructure rot and laughed at government inefficiency for about half that time. The US has data centers running on gas right now while waging wars that blow up gas infrastructure worldwide. It would be comical if it wasn't an environmental disaster. Most of them have no hope of even getting enough power in well-established areas short term.
I realize what I am saying may come off as propaganda because the US holds net negative views on China so here are some links.
https://www.technologyreview.com/2025/07/10/1119941/china-en...
https://www.wired.com/story/data-centers-are-driving-a-us-ga...
I think because OpenAI spent so much money upfront showing how it was possible to do this and laid out a product roadmap, China got to get on board much more cheaply and easily. I see no reason not to believe any of these companies when they say they didn't squander tons of money to do what they did, because I don't know how OpenAI has even spent all the money they have; it's actually ridiculous to think about.
https://the-decoder.com/openai-adds-111-billion-to-its-cash-...
Unlike the US, China's focus is on research and sustainable building. China also has really good infrastructure for energy, etc. It is also to their advantage to drop 5 billion instead of 2 trillion and beat the US while turning a profit.
China's focus in AI is less flashy, and because they are the biggest manufacturing superpower in the world right now, it directly feeds their economy. They aren't looking for applications or to replace thought workers with slop bots; they have natural needs for this technology. US manufacturers can't compete, so they have to keep companies from selling their goods there, see BYD. China sees it as commoditizing their complement; the US is risking its entire economy and its environment and resources, kind of scary.