Information
Active - Virtual Machines and dependent services - Service management issues in multiple regions
Impact statement: As early as 19:46 UTC on 2 February 2026, we are aware of an ongoing issue causing customers to receive error notifications when performing service management operations - such as create, delete, update, scaling, start, stop - for Virtual Machines (VMs) across multiple regions. These issues are also causing impact to services with dependencies on these service management operations - including Azure Arc Enabled Servers, Azure Batch, Azure DevOps, Azure Load Testing, and GitHub. For details on the latter, please see https://www.githubstatus.com.
Current status: We have determined that these issues were caused by a recent configuration change that affected public access to certain Microsoft‑managed storage accounts, used to host extension packages. We are actively working on mitigation, including updating configuration to restore relevant access permissions. We have applied this update in one region so far, and are assessing the extent to which this mitigates customer issues. Our next update will be provided by 22:30 UTC, approximately 60 minutes from now.
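For customers trying to confirm whether a particular VM is affected, one approach is to inspect the VM's instance view and per-extension provisioning statuses, since the root cause involves access to the storage accounts that host extension packages. A minimal sketch, assuming the azure-identity and azure-mgmt-compute Python packages and an authenticated session; the subscription, resource group, and VM names are placeholders:

    # Sketch: check a VM's provisioning and extension statuses to see whether
    # service management / extension operations are failing.
    # Assumes azure-identity and azure-mgmt-compute are installed and the
    # caller is authenticated (e.g. via `az login`). Names are placeholders.
    from azure.identity import DefaultAzureCredential
    from azure.mgmt.compute import ComputeManagementClient

    SUBSCRIPTION_ID = "<subscription-id>"   # placeholder
    RESOURCE_GROUP = "<resource-group>"     # placeholder
    VM_NAME = "<vm-name>"                   # placeholder

    client = ComputeManagementClient(DefaultAzureCredential(), SUBSCRIPTION_ID)

    # The instance view carries the VM's power/provisioning statuses...
    view = client.virtual_machines.instance_view(RESOURCE_GROUP, VM_NAME)
    for status in view.statuses or []:
        print(f"VM status: {status.code} ({status.display_status})")

    # ...and per-extension statuses, which is where failures to fetch
    # extension packages from the affected storage accounts would surface.
    for ext in view.extensions or []:
        for status in ext.statuses or []:
            print(f"Extension {ext.name}: {status.code} ({status.display_status})")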
Since there is no GitHub CEO (Satya isn't bothered anymore) and no human employees are looking, Tay and Zoe are at the helm, ruining GitHub with their broken AI-generated fixes.
Which, again, is even worse.
"Impact statement: As early as 19:46 UTC on 2 February 2026, we are aware of an ongoing issue causing customers to receive error notifications when performing service management operations - such as create, delete, update, scaling, start, stop - for Virtual Machines (VMs) across multiple regions. These issues are also causing impact to services with dependencies on these service management operations - including Azure Arc Enabled Servers, Azure Batch, Azure DevOps, Azure Load Testing, and GitHub. For details on the latter, please see https://www.githubstatus.com."
I don't get how Microsoft views this level of service as acceptable.
Ran into an issue upgrading an AKS cluster last week. The upgrade completely stalled and broke the entire cluster in a way that left our hands tied, since we couldn't see the control plane at all...
I submitted a severity A ticket, and 5 hours later I was told there was a known issue with the latest VM image that would break the control plane, leaving any cluster updated in that window to essentially kill itself and require manual intervention. Did they notify anyone? Nope. Did they stop anyone from killing their own clusters? Nope.
It seems like every time I'm forced to touch the Azure environment, I'm basically playing Russian roulette, hoping that something isn't broken on the backend.
Must be nice to be a monopoly that has most of the businesses in the world as their hostages.
There’s a bunch of hardware, and they can’t run more servers than they have hardware. I don’t see a way around that.
De-risk yourself from Microsoft
Or, if part of a future plan: how?
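On the AKS upgrade story above: one lightweight precaution is to snapshot the cluster and node pool provisioning states before and after an upgrade, so a stalled rollout shows up as a failed or stuck state rather than a silent hang. A minimal sketch, assuming the azure-identity and azure-mgmt-containerservice Python packages; subscription, resource group, and cluster names are placeholders:

    # Sketch: print an AKS cluster's provisioning state (cluster + node pools).
    # Run before and after an upgrade to spot a stalled or failed rollout.
    # Assumes azure-identity and azure-mgmt-containerservice are installed and
    # the caller is authenticated. All names are placeholders.
    from azure.identity import DefaultAzureCredential
    from azure.mgmt.containerservice import ContainerServiceClient

    SUBSCRIPTION_ID = "<subscription-id>"   # placeholder
    RESOURCE_GROUP = "<resource-group>"     # placeholder
    CLUSTER_NAME = "<aks-cluster>"          # placeholder

    client = ContainerServiceClient(DefaultAzureCredential(), SUBSCRIPTION_ID)
    cluster = client.managed_clusters.get(RESOURCE_GROUP, CLUSTER_NAME)

    print(f"Cluster {cluster.name}: provisioning_state={cluster.provisioning_state}, "
          f"kubernetes_version={cluster.kubernetes_version}")
    for pool in cluster.agent_pool_profiles or []:
        print(f"  Node pool {pool.name}: provisioning_state={pool.provisioning_state}, "
              f"orchestrator_version={pool.orchestrator_version}")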
Resolved - On February 2, 2026, between 18:35 UTC and 22:15 UTC, GitHub Actions hosted runners were unavailable, with service degraded until full recovery at 23:10 UTC for standard runners and at February 3, 2026 00:30 UTC for larger runners. During this time, Actions jobs queued and timed out while waiting to acquire a hosted runner. Other GitHub features that leverage this compute infrastructure were similarly impacted, including Copilot Coding Agent, Copilot Code Review, CodeQL, Dependabot, GitHub Enterprise Importer, and Pages. All regions and runner types were impacted. Self-hosted runners on other providers were not impacted.
This outage was caused by a backend storage access policy change in our underlying compute provider that blocked access to critical VM metadata, causing all VM create, delete, reimage, and other operations to fail. More information is available at https://azure.status.microsoft/en-us/status/history/?trackingId=FNJ8-VQZ. This was mitigated by rolling back the policy change, which started at 22:15 UTC. As VMs came back online, our runners worked through the backlog of requests that hadn’t timed out.
We are working with our compute provider to improve our incident response and engagement time, improve early detection of issues before they impact our customers, and ensure safe rollout should similar changes occur in the future. We recognize this was a significant outage for users who rely on GitHub for their workloads, and we apologize for the impact it had.
Feb 3, 00:56 UTC
Update - Actions is operating normally.
Feb 3, 00:56 UTC
Update - Based on our telemetry, most customers should see full recovery from failing GitHub Actions jobs on hosted runners.
We are monitoring closely to confirm complete recovery.
Other GitHub features that rely on GitHub Actions (for example, Copilot Coding Agent and Dependabot) should also see recovery.
Feb 2, 23:50 UTC
Update - Actions is experiencing degraded performance. We are continuing to investigate.
Feb 2, 23:43 UTC
Update - Copilot is operating normally.
Feb 2, 23:42 UTC
Update - Pages is operating normally.
Feb 2, 23:31 UTC
Update - Our upstream provider has applied a mitigation to address queuing and job failures on hosted runners.
Telemetry shows improvement, and we are monitoring closely for full recovery.
Feb 2, 22:53 UTC
Update - We continue to investigate failures impacting GitHub Actions hosted-runner jobs.
We're waiting on our upstream provider to apply the identified mitigations, and we're preparing to resume job processing as safely as possible.
Feb 2, 22:10 UTC
Update - Copilot is experiencing degraded performance. We are continuing to investigate.
Feb 2, 21:27 UTC
Update - We continue to investigate failures impacting GitHub Actions hosted-runner jobs.
We have identified the root cause and are working with our upstream provider to mitigate.
This is also impacting GitHub features that rely on GitHub Actions (for example, Copilot Coding Agent and Dependabot).
Feb 2, 21:13 UTC
Update - The team continues to investigate issues causing GitHub Actions jobs on hosted runners to remain queued for extended periods, with a percentage of jobs failing. We will continue to provide updates as we make progress toward mitigation.
Feb 2, 20:27 UTC
Update - Pages is experiencing degraded performance. We are continuing to investigate.
Feb 2, 19:48 UTC
Update - The team continues to investigate issues causing GitHub Actions jobs on hosted runners to remain queued for extended periods, with a percentage of jobs failing. We will continue to provide updates as we make progress toward mitigation.
Feb 2, 19:44 UTC
Update - Actions is experiencing degraded availability. We are continuing to investigate.
Feb 2, 19:43 UTC
Update - GitHub Actions hosted runners are experiencing high wait times across all labels. Self-hosted runners are not impacted.
Feb 2, 19:07 UTC
Investigating - We are investigating reports of degraded performance for Actions.
Feb 2, 19:03 UTC
Their status page seems to think everything's A-OK.
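If you'd rather not keep refreshing the page by hand, githubstatus.com looks like a standard Statuspage instance, which exposes a JSON status endpoint; a minimal sketch for polling it (the endpoint path is the usual Statuspage one and is an assumption here):

    # Sketch: poll the GitHub status page's JSON endpoint and print the overall
    # indicator. Assumes https://www.githubstatus.com/api/v2/status.json is the
    # standard Statuspage endpoint; only the `requests` package is needed.
    import requests

    STATUS_URL = "https://www.githubstatus.com/api/v2/status.json"

    resp = requests.get(STATUS_URL, timeout=10)
    resp.raise_for_status()
    status = resp.json().get("status", {})

    # indicator is typically one of: none, minor, major, critical
    print(f"{status.get('indicator', 'unknown')}: {status.get('description', '')}")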