In the current climate limiting someone's use of AI might be expected to be about restricting access or restricting what someone can do with it, but the story here ostensibly seems to be about capacity constraints, not any limitation on what models or capabilities Google is giving Meta access to.
Image generation and veo models I’d imagine quite effective for creators; new Instagram accounts with AI content that are garnering millions of followers in spans of weeks are quite common now
Understanding the actual problems we are trying to solve with code and efficiently coming up with solutions (essentially, pre-LLM development) will always be better than wastefully brute forcing solutions with LLMs.
That said, I expect much of the AI bubble to pop. Google Gemini with Antigravity is a good product, as is a Claude Code subscription but I have switched to using DeepSeek v4 Pro with the Claude Code harness and DeepSeek v4 Flash with the OpenCode harness (when I am not using local models with little-coder/pi) and at least for the foreseeable future I don’t think I am going back. Fast APIs at low cost trumps having to spend a little more time to get the same quality of results.
I want to know how impacted Gemini has been by that, because that will reveal a lot about their margins and revenue generating first party demand. Each MSFT earnings report they discuss the balance they’re dealing with between supplying GPUs to Azure customers and first party demand.
My pet theory is that Gemini is “losing” the LLM race because they’re preferentially selling the TPUs to competitors, while keeping just enough for themselves to stay competitive and build their own products.
It's probably the best multimodal model I've worked with (if somebody knows a better one for audio analysis, please let me know!)
Who says they aren't? Could be using all of them for "research".
* Repeated instances of incorrect code insertion that the agent cannot clean up. Sure, version control, but this is often happening in new files that aren't even in version control yet.
* Lost chat history when I close and restart the app.
* Not being able to restore a chat from the history (just saw this last week).
* Overly broad searches that waste time and tokens.
* No vertical scroll bar arrows. WTF?? Doesn't the interface look "flat" enough already? This feels arbitrary and stupid.
* The previous chat prompt takes up a large portion of the vertical space of the chat window, even on a high res display.
When it works Antigravity is excellent. When it doesn't work, it's absolutely horrible. If you check the update history, there are usually just a few items and they're super generic things like "Fixed a bug with text entry.".
I don't see it improving at any kind of reasonable pace, even over the last 6 months As a result, I've mostly relegated Antigravity to a planning tool and it does an excellent job. Or I use it to write prompts that I give to Codex. It definitely can do an excellent job writing code sometimes, but sometimes it also does an absolutely horrible job with not breaking the code when it inserts it. It seems to be terrible at understanding C++ braces. How often? Way too often. I always know it's happening because it prompts me to run Git while it's doing something. LOL, that's how I know that it's broken something.
Codex is definitely way, way, way better. It's not even a contest at this point. Codex never breaks my code. It might not always do what I want, but it's just an order of magnitude better than Antigravity. Antigravity really feels like a comedy of errors at this point. ESPECIALLY from a company with Google's resources.
I HIGHLY doubt that Gemini is overloaded, Google has been bullshitting with their crap models since release. Waste of everyone's time.
OTOH, if they are stressing Google's capacity then it seems it has to be for production use, which would relfect a massive failure on Meta's side given their investment in datacenters and AI. If they can't utilize their own models and datacenters, then maybe they should just rent the excess capacity to Google! :)
And their safety tuning is neither effective nor precise on edge models.
Cloud services like to present the illusion of an infinite amount of compute available at a fixed price per unit, but the reality is if you try to use too much of any service you'll find you have a quota and requests to increase it will fall on deaf ears if the provider doesn't have more of that resource.
Too much of my working life has been spent shoehorning services into less space/compute/ram/spindles or migrations to other data centers to solve such issues.
Llama Meta 70b is 50th or so down the list of popular models.
It has 24.1b tokens used in 7 days vs the top models that have trillions or hundreds of billions of tokens.
So practically dead!
Having said that, I agree with you. You have to request limit increases often and can't scale even in those instances if you don't plan ahead.
There has to be a name for this deceptive marketing tactic where you say something is unlimited and then it is only unlimited as long as you don't use very much.
It would be one thing if you occasionally got a "no more capacity" error when requesting large amounts of resources but it doesn't work that way. They confine you to a relatively small amount of resources the entire time you have an account. If you want more you have to request it.
The tiny blog sure isn't for the cloud, but also it's not the main client of the cloud.
> it's 20% more than you are currently using and you pay 300% more for that.
I'm assuming you are comparing to self hosting. Then you need to account for things that are difficult to put a price like your time maintaining a physical infrastructure and the lessons you will learn with it.
Sounds like I'm defending the big cloud, but there is a valid use that is disconsidered because it's trendy to hate on the cloud.
> They confine you to a relatively small amount of resources the entire time you have an account. If you want more you have to request it.
It's a form of KYC, nothing wrong with that.
Like literally 10x times more expensive to do so, to run CI jobs...
I dont want to imagine the margin AWS has like generally, cause it can easily be a 90% too
I assume you're using your owned server and not a provider like Hetzner? So you did have a substantial delivery time. Although in my city is a recycled that resells used servers, and I could show up there with a truck and get a server within hours if I'm not too picky. Or use some random desktop or laptop off the pile, short-term.
Right now the biggest issue is the vibe coded CI program is not really meant to be a distributed multi-node thing yet, so we're on the biggest machines (there's some newer bigger stuff we could migrate too) and the only issue is on peak hours queue can get a bit slow.. but that was also some other bugs etc making not ideal.
Tbh it works pretty well, we just need now to scale it to more than one node etc (which is not to say that is easy, but still, x10 headroom to work with)
INDIA - 2025/05/13: In this photo illustration, a Meta logo is seen displayed on a smartphone with a Google logo in the background.
Sopa Images | Lightrocket | Getty Images
Google has put limits on Meta's use of its Gemini AI models after the social media company sought more computing capacity than the rival tech group could provide, the Financial Times reported on Sunday.
Google, owned by Alphabet, told Meta around March it could not meet the full Gemini capacity the company had sought to purchase, the newspaper said, adding that the shortfall disrupted and delayed some of Meta's internal AI projects.
Several other Google clients have also been affected, though to a lesser extent, according to the report. Meta has been particularly impacted due to its exceptionally high demand for Google's models, the FT said.
Reuters could not immediately verify the report, which cited people familiar with the matter. Google and Meta did not immediately respond to requests for comment outside business hours.
Due to the restrictions, Meta has encouraged staff to be more efficient with AI tokens, the units that measure AI usage, the FT report said.
Even as companies continue to spend billions on chips and data centers, they are still struggling to secure enough computing power to support the growing demand for AI services.
Revenue at Google Cloud grew to $20 billion in the first quarter ended March, but CEO Sundar Pichai said computing power constraints prevented even higher growth and contributed to the cloud unit's backlog nearly doubling quarter on quarter.