We started the company two and a half years ago, and raised $7.3m in 2024 (announced only almost a year later). We've spent less than half of this amount.
Earlier this week we came to the difficult decision to wind down the project. The open-source repository remains available on GitHub (Apache 2.0) but won't be actively maintained by the team moving forward.
Their website landing page is now also showing the software is no longer maintained. No mention of why they made this decision, my best guess is they burned through their seed money and were unable to attract further investments.
[0]: https://www.tensorzero.com/blog/tensorzero-raises-7-3m-seed-...
It was a simple project in terms of technical complexity. I didn't publish it as I counted several similar projects in the field.
Putting $7.3M into such a project would make sense only in the case of a precise growth plan with already declared customers and an promising sales funnel. There is no technical moat.
"infra is safe" Hmm, but that wasn't a good idea. because if an open source infrastructure project like TensorZero gets shut down this quickly, won't they start to realize that those investment theories are also risky?
The difficult thing about AI infrastructure is that, unlike other industries, it will not become fragmented. It will likely remain tied to specific big tech models. What does this mean? It means that because AI models are not yet standardized, the infrastructure itself is actually riskier. In other words, the privatization of standards is happening.
The challenge with AI infrastructure is that an independent, stable standard layer has not formed, unlike in other software infrastructure markets such as databases, web servers, cloud, and containers. Over time, those ecosystems developed relatively standardized interfaces and operational layers. But the LLM ecosystem is still evolving rapidly. Models themselves change fast, APIs differ, pricing differs, context windows, tool calling, structured output, evaluation, fine tuning, caching, routing, everything keeps changing.
So even if an infrastructure startup tries to build a common abstraction layer across multiple models, before that common layer can stabilize, big model or cloud providers like OpenAI, Anthropic, Google, AWS, or Azure can just absorb the same functionality directly. In the end, AI infrastructure is at high risk of becoming an attached feature of model providers rather than solidifying as an independent layer.
But if a startup that raised 7.3 million dollars fails this quickly, who would trust and invest in such things? That aside, it seems AI startups are all the rage these days. I also want to learn AI and get funded like that. Does anyone here trust me enough to invest? About one hundredth of that would probably be enough
“TensorZero is used by companies ranging from frontier AI startups to the Fortune 10 and fuels ~1% of global LLM API spend today.”
One percent seems like a lot. Anyone on HN use this?
Wasn't GitHub once a place for humans? Now we could rename it SkyHub.
> It was a simple project in terms of technical complexity.
That’s the thing, though. The version I build for myself sheds all the features that get in my way. I don’t share them either because they’re only useful for me.
Perhaps in the future big tech projects will be delivered with a common “core” and the expectation that agents fill in the use-specific stuff.
A better model for VCs is: companies are finding tons of budget to allocate to new AI spend. Besides the labs, who is going to be able to capture some of that spend while they're actively looking to spend it?
Nobody at the seed stage is investing in things they think are "safe". They are investing in things they think have huge upside.
> VCs think, 'Apps are risky, infrastructure is safe,' so they invested in AI infra.
First off, this isn't even infra in the infra sense of the word. Infrastructure implied something physical, a pure software product can almost never be considered 'infra'. A tool maybe, but not 'infra'.
VCs can also be irrational and driven primarily by personal connections rather than reason. I didn't do a deep dive in this project/leadership, but often who you know is some important than what you produced. There's a reason why a lot of VCs go for the old motto of "I'd rather invest in an A team with a C product; than invest in a C team with an A product".
I might publish a long-form reflection when the dust settles.
The other half goes where?
That being said, while I am biased, there is a lot of work around infrastructure so calling it "just a wrapper" massively underestimates the effort - this is purely from my own experience building this space.
Besides, if it is true how come OpenClaw is spending so much money on a open source project. Salaries alone will cost 7 digit sum for a harness and I have first hand experience dealing with companies doing exactly this.
Shameful plug - we are building cbk.ai, better known today as chatbotkit.com.
Very early in my career I used to believe that I or anyone else could be a CEO.
It wasn't until working with tiny teams where the CEO/founders devoted everything in their life to the business -- often at the expense of hobbies, romantic relationships, and any shred of free time -- that I realized true CEOs are a rare breed.
When are you ask things like "what happens if the product fails?" the answer would always be "It won't."
They both relentlessly believe in, and put every ounce of energy toward, their vision because anything less would not suffice
Again as trite as it sounds, I empathize with these people in that to them losing their vision felt like losing something dearest to them
We are returning the remaining capital to investors.
I feel like this is really going to change the software industry moving forwards. Historically it was tedious and time consuming to actually develop tailored dev tools which is why so many organizations relied on third party solutions. When nowadays you can easily half bake something in a few hours and get it working, tailored _specifically_ to your needs.
I suspect so, the headless / "api/cli only" tools like CRM are pretty big right now and I don't think we've seen the end of that trend, probably more like just beginning.
PS: Someone won't become a trillionaire with this attitude.
The ~1% figure might be outdated today but it was a best-effort estimate a couple of months ago. TensorZero powered tens of trillions of inference tokens per month. TensorZero is not widely used but it was used by a couple of extreme-scale users.
Ultimately I found the data model and UI to be both cumbersome and unintuitive. Langfuse ended up being the observability tool I went with instead over the one I built (and still use today).
Familiar with creditors getting divvied in bankruptcies, but not refunds to investors… oh it’s because there’s never any money left when things wind down. (We hear of retail stores where employees discover closures posted on shop doors when reporting to work.)
or you're incompetent
I’d bet on extreme irresponsibility.
> are all the rage these days
Are they? Overall it seems kind of tame compared to 2020-21 since VCs are somewhat risk average outside of a few outliers. Funding looks much more concentrated these days.
I agree that most people misunderstand the concept of a 'moat' and become obsessed with that misunderstanding. People tend to think that only technical 'coding skills' which they can easily understand constitute a moat. But in reality, the moat is the entire workflow across the product's lifecycle, including coing skills. In that sense, infrastructure workflows are nothing more than 'the most easily replaceable consumables.' The essential purpose of infrastructure is to pursue 'standardization,' which paradoxically means a state of 'zero switching costs' where customers (app developers) can switch at any time to a better API or a big tech built in feature. Pure technology that doesn't latch onto the messy real world domains of customers will inevitably be absorbed without resistance by massive capital.
In some ways, customer lock in at the application layer, or even the fan culture around a product, creates emotional lock in. The end user app that provides a specific workflow integrated into users' daily routines can overcome even technical inferiority through 'experience' and 'emotion.' Technology can be copied, but the user identity attached to a tool is what I think a real moat is.(That is also the reason I love Windows.)
The example you gave, Cursor's Composer, is exactly the case I'm talking about. I think Cursor is inferior, and I don't think its Composer model feature is all that great either. But Cursor has a passionate fan base, and users who choose Composer as the best value for money no longer care about absolute technical performance or benchmark scores. They are captivated by the 'speed of experience' of code being completed quickly as they intended, and the 'frictionless workflow' the tool provides.it's not the company that builds the best AI model that wins, but the company that wraps 'good enough technology' in 'great UX' and dominates users' habits. That is how apps dominate infrastructure, and that's the moat you and I are thinking about.
That said, this conclusion is probably too hasty and has many flaws. Still, your thoughts are so similar to mine that I'm leaving this reply. Thanks for the great comment. Have a good day
When after a few months we accepted that it wasn’t going to work, our investor got basically all his money back.
It was pocket change amounts compared to the sums of money that they deal with in Silicon Valley. But the point is the same anyway, the investor got back basically everything.
Early stage startups tend not to have a lot debt to pay off, because there aren’t many places willing to offer them much credit.
That is indeed how the VC funding game is played. If you don't raise another round, you are dead anyway, so you spend down your seed round to try and justify that following round...
(Honestly I don’t think so here, but I predict that will happen eventually)
Honestly, I was close to flagging this story because the title is deliberately manipulative - it makes it sound like the founder did a rug pull. But I was really glad to see the founder come in to these comments and just say we tried, but the market shifted under us. Happens all the time.
The discussion here isn't about funding, it's that there's a presumptively useful community tool which got abandoned because its owners took their toys and went home when the money ran out (instead of making a sincere effort at transitioning to community governance). That's on the IP owners being selfish jerks and/or grifting losers. It's not the VC's fault.
The report says, the CEO and founder, is a Ketamine addicted weirdo, who does Nazi salutes in public, is know to have at least 24 kids, and lives in an isolated farm in Texas, with at least 5 to 7 female partners, and got sued for calling a guy who saved kids a Pedophile.
You in?
The title is misleading unfortunately but that's how social media goes...
TensorZero is an open-source LLMOps platform that unifies:
You can take what you need, adopt incrementally, and complement with other tools. It plays nicely with the OpenAI SDK, OpenTelemetry, and every major LLM provider.
TensorZero is used by companies ranging from frontier AI startups to the Fortune 10 and fuels ~1% of global LLM API spend today.
Website
·
Docs
·
Twitter
·
Slack
·
Discord
Quick Start (5min)
·
Deployment Guide
·
API Reference
·
Configuration Reference
[!NOTE]
🆕 TensorZero Autopilot
TensorZero Autopilot is an automated AI engineer powered by TensorZero that analyzes LLM observability data, sets up evals, optimizes prompts and models, and runs A/B tests.
It dramatically improves the performance of LLM agents across diverse tasks:
![]()
Integrate with TensorZero once and access every major LLM provider.
Anthropic, AWS Bedrock, AWS SageMaker, Azure, DeepSeek, Fireworks, GCP Vertex AI Anthropic, GCP Vertex AI Gemini, Google AI Studio (Gemini API), Groq, Hyperbolic, Mistral, OpenAI, OpenRouter, SGLang, TGI, Together AI, vLLM, and xAI (Grok).
Need something else? TensorZero also supports any OpenAI-compatible API (e.g. Ollama).
You can use TensorZero with any OpenAI SDK (Python, Node, Go, etc.) or OpenAI-compatible client.
base_url and model in your OpenAI-compatible client.from openai import OpenAI
# Point the client to the TensorZero Gateway
client = OpenAI(base_url="http://localhost:3000/openai/v1", api_key="not-used")
response = client.chat.completions.create(
# Call any model provider (or TensorZero function)
model="tensorzero::model_name::anthropic::claude-sonnet-4-6",
messages=[
{
"role": "user",
"content": "Share a fun fact about TensorZero.",
}
],
)
See Quick Start for more information.
Zoom in to debug individual API calls, or zoom out to monitor metrics across models and prompts over time — all using the open-source TensorZero UI.
Send production metrics and human feedback to easily optimize your prompts, models, and inference strategies — using the UI or programmatically.
Compare prompts, models, and inference strategies using evaluations powered by heuristics and LLM judges.
| Evaluation » UI | Evaluation » CLI |
|
Ship with confidence with built-in A/B testing, routing, fallbacks, retries, etc.
Build with an open-source stack well-suited for prototypes but designed from the ground up to support the most complex LLM applications and deployments.
How is TensorZero different from other LLM frameworks?
Can I use TensorZero with ___?
Yes. Every major programming language is supported. It plays nicely with the OpenAI SDK, OpenTelemetry, and every major LLM provider.
Is TensorZero production-ready?
Yes. TensorZero is used by companies ranging from frontier AI startups to the Fortune 10 and powers ~1% of the global LLM API spend today.
Here's a case study: Automating Code Changelogs at a Large Bank with LLMs
How much does TensorZero cost?
TensorZero (LLMOps platform) is 100% self-hosted and open-source.
TensorZero Autopilot (automated AI engineer) is a complementary paid product powered by TensorZero.
Who is building TensorZero?
Our technical team includes a former Rust compiler maintainer, machine learning researchers (Stanford, CMU, Oxford, Columbia) with thousands of citations, and the chief product officer of a decacorn startup. We're backed by the same investors as leading open-source projects (e.g. ClickHouse, CockroachDB) and AI labs (e.g. OpenAI, Anthropic). See our $7.3M seed round announcement and coverage from VentureBeat. We're hiring in NYC.
How do I get started?
You can adopt TensorZero incrementally. Our Quick Start goes from a vanilla OpenAI wrapper to a production-ready LLM application with observability and fine-tuning in just 5 minutes.
Start building today. The Quick Start shows it's easy to set up an LLM application with TensorZero.
Questions? Ask us on Slack or Discord.
Using TensorZero at work? Email us at hello@tensorzero.com to set up a Slack or Teams channel with your team (free).
We are working on a series of complete runnable examples illustrating TensorZero's data & learning flywheel.
Optimizing Data Extraction (NER) with TensorZero
This example shows how to use TensorZero to optimize a data extraction pipeline. We demonstrate techniques like fine-tuning and dynamic in-context learning (DICL). In the end, an optimized GPT-4o Mini model outperforms GPT-4o on this task — at a fraction of the cost and latency — using a small amount of training data.
Agentic RAG — Multi-Hop Question Answering with LLMs
This example shows how to build a multi-hop retrieval agent using TensorZero. The agent iteratively searches Wikipedia to gather information, and decides when it has enough context to answer a complex question.
Writing Haikus to Satisfy a Judge with Hidden Preferences
This example fine-tunes GPT-4o Mini to generate haikus tailored to a specific taste. You'll see TensorZero's "data flywheel in a box" in action: better variants leads to better data, and better data leads to better variants. You'll see progress by fine-tuning the LLM multiple times.
Image Data Extraction — Multimodal (Vision) Fine-tuning
This example shows how to fine-tune multimodal models (VLMs) like GPT-4o to improve their performance on vision-language tasks. Specifically, we'll build a system that categorizes document images (screenshots of computer science research papers).
Improving LLM Chess Ability with Best-of-N Sampling
This example showcases how best-of-N sampling can significantly enhance an LLM's chess-playing abilities by selecting the most promising moves from multiple generated options.
We write about LLM engineering on the TensorZero Blog. Here are some of our favorite posts:
Also the whole project is open source. If you want, you could take it over.
Yes, that's exactly what it means!
There are good and bad ways to extract yourself from maintainership obligations. This is the bad way.
"It's still open source because you can fork it if you really want" is a specious and unhelpful attitude, and it tells me that you, like the owners of this thing, are not to be trusted to manage such a thing.
Also read the link. This is apache 2 licensed. Even in whatever imaginary world where there is such a social contract, there is thankfully a legal contract that includes disclaimer of warranty.