I'm the co-founder and CEO of TensorZero.

We started the company two and a half years ago, and raised $7.3m in 2024 (announced only almost a year later). We've spent less than half of this amount.

Earlier this week we came to the difficult decision to wind down the project. The open-source repository remains available on GitHub (Apache 2.0) but won't be actively maintained by the team moving forward.

The title makes it sound like they just did a seed round, but the seed round was announced in August of last year [0].

Their website landing page now also shows that the software is no longer maintained. There is no mention of why they made this decision; my best guess is they burned through their seed money and were unable to attract further investment.

[0]: https://www.tensorzero.com/blog/tensorzero-raises-7-3m-seed-...

About one year ago, I created an LLM gateway with metrics, provider fallback and switching, tools support, injecting, etc. etc., and unique features like acting as an MCP tools client and server, all streamed, with low latency.

It was a simple project in terms of technical complexity. I didn't publish it as I counted several similar projects in the field.

Putting $7.3M into such a project would make sense only in the case of a precise growth plan with already declared customers and a promising sales funnel. There is no technical moat.
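To make "provider fallback" concrete, here is a minimal sketch of the idea in Python. It is not the commenter's unpublished gateway or any real project's code: `complete_with_fallback`, `flaky_primary`, and `echo_backup` are hypothetical names, and the providers are stand-in functions rather than real API clients.

```python
from typing import Callable, Sequence

# Hypothetical provider type: takes a prompt, returns a completion or raises.
Provider = Callable[[str], str]


class AllProvidersFailed(Exception):
    """Raised when every configured provider has errored."""


def complete_with_fallback(
    prompt: str, providers: Sequence[tuple[str, Provider]]
) -> tuple[str, str]:
    """Try providers in order; return (provider_name, completion) from the first success."""
    errors: list[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real gateway would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


def flaky_primary(prompt: str) -> str:
    # Stand-in for a provider that is currently down.
    raise TimeoutError("upstream timed out")


def echo_backup(prompt: str) -> str:
    # Stand-in for a healthy secondary provider.
    return f"echo: {prompt}"


name, text = complete_with_fallback(
    "hi", [("primary", flaky_primary), ("backup", echo_backup)]
)
print(name, text)  # prints: backup echo: hi
```

The loop itself is short, which fits the comment's point that the core logic is not technically complex; streaming, metrics, and tool support are not covered by this sketch.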

Just use Plexus [1]. The maintainer is not trying to be a hero or raise seed dollars or even really trying to promote it. He's just making an excellent, useful product. (Unaffiliated, just a happy user). It's not a full-on "LLMOps" platform (whatever that is), it's just a proxy that works very well and has some nice features.

[1] https://github.com/mcowger/plexus

VCs think, 'Apps are risky, infrastructure is safe,' so they invested in AI infra.

"Infra is safe." Hmm, but that wasn't a good idea, because if an open source infrastructure project like TensorZero gets shut down this quickly, won't they start to realize that those investment theories are also risky?

The difficult thing about AI infrastructure is that, unlike other industries, it will not become fragmented. It will likely remain tied to specific big tech models. What does this mean? It means that because AI models are not yet standardized, the infrastructure itself is actually riskier. In other words, the privatization of standards is happening.

The challenge with AI infrastructure is that an independent, stable standard layer has not formed, unlike in other software infrastructure markets such as databases, web servers, cloud, and containers. Over time, those ecosystems developed relatively standardized interfaces and operational layers. But the LLM ecosystem is still evolving rapidly. Models themselves change fast, APIs differ, pricing differs, context windows, tool calling, structured output, evaluation, fine tuning, caching, routing, everything keeps changing.

So even if an infrastructure startup tries to build a common abstraction layer across multiple models, before that common layer can stabilize, big model or cloud providers like OpenAI, Anthropic, Google, AWS, or Azure can just absorb the same functionality directly. In the end, AI infrastructure is at high risk of becoming an attached feature of model providers rather than solidifying as an independent layer.

But if a startup that raised 7.3 million dollars fails this quickly, who would trust and invest in such things? That aside, it seems AI startups are all the rage these days. I also want to learn AI and get funded like that. Does anyone here trust me enough to invest? About one hundredth of that would probably be enough

This is the claim in the repo readme that presumably unlocked the VC investment:

“TensorZero is used by companies ranging from frontier AI startups to the Fortune 10 and fuels ~1% of global LLM API spend today.”

One percent seems like a lot. Anyone on HN use this?

Seed was in Aug ‘25 and website simply says the project will no longer be maintained: https://www.tensorzero.com/

There was a high severity advisory last week https://github.com/tensorzero/tensorzero/security/advisories... though these days can't even tell if this is related or just routine

I don’t understand the commenters here who are attacking the entrepreneur. If it's not your money, then he is accountable only to his investors.

For people like myself, who didn't understand the timing of events - raised in august 2025, archived yesterday without any notice.

I'm glad I used litellm in my last project

https://github.com/BerriAI/litellm/

Most VCs avoided the application layer, believing it was too risky and that only a few players would emerge as winners. They dismissed those apps as GPT wrappers (now called harnesses) and poured money into the infra layer. Would love to hear your opinion on how this thesis will turn out going forward.

Open source powers the business of many large corporations and they give essentially nothing back - why would maintainers refuse an offer for money in this environment?

At least it's not a pig butchering scam.

But it is written in Rust™.

Just switch to Bifrost already

Guys - skynet is winning the war. We oldschool humans are left behind here.

Wasn't GitHub once a place for humans? Now we could rename it SkyHub.

LOL, there's a sucker born every minute!

Are there any lessons around the why which may be publicly shared ?

What did you spend the money on?

The other half goes where?

Is there anything you're willing to share on what went into the decision / what you learned about trying to build this kind of product?

Are there any lessons around the why which may be publicly shared ?

There are many factors at play here but if I had to pick one... an open-source company has to find product market fit twice: first for the OSS project and again for a commercial product. The AI market moves very quickly so it's easy to take a step in the wrong direction and fall behind.

I might publish a long-form reflection when the dust settles.

The title makes it sound like they just did a seed round, but the seed round was announced in August of last year [0].

[0]: https://www.tensorzero.com/blog/tensorzero-raises-7-3m-seed-...

The company was started in January 2024, so the seed financing is likely a roll-up of two years of fundraising. $7m for ~30 months of running an AI startup in NYC is not that unusual.

Burning through $7m in 9 months? That's an impressive amount of avocado toast.

The project was probably just built to raise funds, as bait, and after that's done it's dead.

It was a simple project in terms of technical complexity. I didn't publish it as I counted several similar projects in the field.

Putting $7.3M into such a project would make sense only in the case of a precise growth plan with already declared customers and a promising sales funnel. There is no technical moat.

The calculus in “buy or build” has shifted for me over the last six months especially. If I can make an agent build it, I get the version that’s tailored for me.

> It was a simple project in terms of technical complexity.

That’s the thing, though. The version I build for myself sheds all the features that get in my way. I don’t share them either because they’re only useful for me.

Perhaps in the future big tech projects will be delivered with a common “core” and the expectation that agents fill in the use-specific stuff.

That's literally every project around AI. All the agent sandboxes. Hosting cron jobs that just hit ai rest endpoints for model completions etc

VCs think, 'Apps are risky, infrastructure is safe,' so they invested in AI infra.

Tell me you haven't talked to a VC.

A better model for VCs is: companies are finding tons of budget to allocate to new AI spend. Besides the labs, who is going to be able to capture some of that spend while they're actively looking to spend it?

Nobody at the seed stage is investing in things they think are "safe". They are investing in things they think have huge upside.

A few comments.

> VCs think, 'Apps are risky, infrastructure is safe,' so they invested in AI infra.

First off, this isn't even infra in the infra sense of the word. Infrastructure implied something physical, a pure software product can almost never be considered 'infra'. A tool maybe, but not 'infra'.

VCs can also be irrational and driven primarily by personal connections rather than reason. I didn't do a deep dive into this project/leadership, but often who you know is more important than what you produced. There's a reason why a lot of VCs go by the old motto of "I'd rather invest in an A team with a C product than in a C team with an A product".

Burning through $7m in 9 months? That's an impressive amount of avocado toast.

$7m actually isn't a whole lot, especially if they hired a (larger) engineering team. Assuming they're Cali-based, that's easily 150-200k per engineer; a team of 20 easily eats through that. Idk the specifics, but I don't think the organization was fraudulent. It could also be that they're going commercial and no longer want to maintain their OSS stack.

We raised in 2024 and only burned through ~$3m of it, mostly on salaries to support a small team.
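A rough back-of-the-envelope check of these figures, as a sketch only. The inputs are the approximate numbers quoted in this thread ($7.3m raised, ~$3m spent); the 30-month operating period is an assumption based on "two and a half years", and the variable and function names are illustrative.

```python
RAISED_USD = 7_300_000    # seed round size stated in the thread
SPENT_USD = 3_000_000     # "~$3m", an approximate figure from the thread
MONTHS_OPERATING = 30     # assumption: "two and a half years" taken as 30 months


def average_monthly_burn(spent_usd: int, months: int) -> float:
    """Average spend per month over the operating period."""
    return spent_usd / months


def remaining_capital(raised_usd: int, spent_usd: int) -> int:
    """Capital left before any wind-down costs (which the thread does not quantify)."""
    return raised_usd - spent_usd


print(f"${average_monthly_burn(SPENT_USD, MONTHS_OPERATING):,.0f} per month")  # $100,000 per month
print(f"${remaining_capital(RAISED_USD, SPENT_USD):,} remaining")  # $4,300,000 remaining
```

Under these assumptions the average burn is about $100k per month, which is consistent with the "small team" description rather than a 20-engineer payroll.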

Those Claude tokens are not cheap you know /s

The project was probably just built to raise funds, as bait, and after that's done it's dead.

Were the thousands of commits and hundreds of feature branches over the last 9 months just to keep up appearances, then? Were the 850 people who forked it in on the scheme, too?

Sometimes things just fail.

You can call it bait, but where was the VCs' due diligence on this? Most VCs were out there defending their infra-layer investments. Just look at YC batches and see the inflated number of infra startups.

Do you understand that when you raise money it doesn't go into your personal account? It's not like you can move this money into your retirement account and sail into the sunset.

> The calculus in “buy or build” has shifted for me over the last six months especially. If I can make an agent build it, I get the version that’s tailored for me.

I feel like this is really going to change the software industry moving forwards. Historically it was tedious and time-consuming to actually develop tailored dev tools, which is why so many organizations relied on third-party solutions. Nowadays you can easily half-bake something in a few hours and get it working, tailored _specifically_ to your needs.

> Perhaps in the future big tech projects will be delivered with a common “core” and the expectation that agents fill in the use-specific stuff.

I suspect so, the headless / "api/cli only" tools like CRM are pretty big right now and I don't think we've seen the end of that trend, probably more like just beginning.

I believe the same. Many VCs are obsessed with moats, and they clearly got this wrong. To me, the value created at the app layer is so large that it gives apps the flexibility to diversify their infra layers. Good harnesses do not depend on a specific model provider, memory layer, etc., so when one is taken down, like anthropic fable, they have no risk exposure. Many even train their own models after growing, like what Cursor did with Composer. There are many more examples in other verticals, like manus, superhuman, fireflies, lovable, replit, cursor, nouswise, cline, windsurf and kilo, but many are concentrated in coding because, again, I think VCs have preferred this definition of moat.

Infra is perhaps somewhat safe, but realistically it's a really low-margin, capital-intensive business long-term unless you can lock in customers with hundreds of services like AWS. So there's not a lot of room for a huge ROI.

> are all the rage these days

Are they? Overall it seems kind of tame compared to 2020-21 since VCs are somewhat risk averse outside of a few outliers. Funding looks much more concentrated these days.

We raised most of the capital before we had any traction. We raised on a rolling basis and had millions in the bank before we had even published the open-source repository. Ultimately we raised based on the team's background + vision.

The ~1% figure might be outdated today but it was a best-effort estimate a couple of months ago. TensorZero powered tens of trillions of inference tokens per month. TensorZero is not widely used but it was used by a couple of extreme-scale users.

I used it, but only briefly to evaluate it. It had some overlap with a tool I built myself, was curious if any of the extra features would be useful.

Ultimately I found the data model and UI to be both cumbersome and unintuitive. Langfuse ended up being the observability tool I went with instead over the one I built (and still use today).

Generally speaking, every YC company post ~2020 is forced to make pathologically false claims to compete in the (fundraising) market.

Just tell AI to write your copy and that's what you get, overhype-as-a-service.

Rounded to the nearest percent >0

Seed was in '24 actually but we only announced in '25.

Obviously they upgraded, bought a dash, and moved on to https://www.tensor-one.com/

This was coincidental. Someone reported the issue last week, we fixed it, and published the advisory.

Not my experience. I think most VCs thesis is around the application layer - not much around the infrastructure.

That being said, while I am biased, there is a lot of work around infrastructure so calling it "just a wrapper" massively underestimates the effort - this is purely from my own experience building this space.

Besides, if it is true, how come OpenClaw is spending so much money on an open source project? Salaries alone will cost a seven-digit sum for a harness, and I have first-hand experience dealing with companies doing exactly this.

Shameful plug - we are building cbk.ai, better known today as chatbotkit.com.

LOL, there's a sucker born every minute!

https://www.qwantz.com/index.php?comic=4483&mobile=1 couldn't resist.

PS: Someone won't become a trillionaire with this attitude.

I might publish a long-form reflection when the dust settles.

It might come off as trite, but I genuinely am sorry that things didn't pan out for you

Very early in my career I used to believe that I or anyone else could be a CEO.

It wasn't until working with tiny teams where the CEO/founders devoted everything in their life to the business -- often at the expense of hobbies, romantic relationships, and any shred of free time -- that I realized true CEOs are a rare breed.

When you asked things like "what happens if the product fails?" the answer would always be "It won't."

They both relentlessly believe in, and put every ounce of energy toward, their vision because anything less would not suffice

Again as trite as it sounds, I empathize with these people in that to them losing their vision felt like losing something dearest to them

What did you spend the money on?

The other half goes where?

Mostly salaries to support a small team.

We are returning the remaining capital to investors.

Is there anything you're willing to share on what went into the decision / what you learned about trying to build this kind of product?

See my sibling comment

I thought founders usually try to pivot until they run out of money. I wonder if it is good or bad for a serial entrepreneur to shut down instead of pivoting?

Kudos to you and your team for not burning through the rest. Hope you have better luck with your next project.

I’ve never heard of this before. Anyone know if it’s uncommon?

Familiar with creditors getting divvied in bankruptcies, but not refunds to investors… oh it’s because there’s never any money left when things wind down. (We hear of retail stores where employees discover closures posted on shop doors when reporting to work.)

150-200k is also just the employee’s salary, the actual cost to the company is significantly higher, you need to multiply that by something like 1.5 to get the fully loaded cost, people are expensive!
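A quick worked version of this arithmetic, as a sketch. The salary range, the headcount of 20, and the 1.5 multiplier are the commenters' hypothetical figures, not the company's actual numbers (the founder says elsewhere in the thread that the team was much smaller). The function name is illustrative.

```python
def annual_team_cost(headcount: int, salary_usd: float, load_multiplier: float = 1.5) -> float:
    """Fully loaded yearly cost: salary scaled up for benefits, payroll taxes, etc."""
    return headcount * salary_usd * load_multiplier


low = annual_team_cost(20, 150_000)
high = annual_team_cost(20, 200_000)
print(f"${low:,.0f} to ${high:,.0f} per year")  # $4,500,000 to $6,000,000 per year
```

At those assumed figures, a 20-person team would consume most of a $7.3m round in a single year, which is why several replies consider that headcount implausible for a seed-stage company.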

20 engineers would be incredibly aggressive growth for such a young company with that amount of capital, no?

Tokenmaxxing makes startups even leakier if they don't find token traction.

I thought ai was writing all the code. What do they need engineers for?

if you hire 20 engineers with your seed round you are either very confident you'll be able to use them to justify another raise soon

or you're incompetent

We raised in 2024 and only burned through ~$3m of it, mostly on salaries to support a small team.

That would be a lot still. That’s a lot of money.

I’d bet on extreme irresponsibility.

Were the thousands of commits and hundreds of feature branches over the last 9 months just to keep up appearances, then? Were the 850 people who forked it in on the scheme, too?

Well.. if that's it, it's not really much to show for it if you spent $7 million on it.

https://www.qwantz.com/index.php?comic=4483&mobile=1 couldn't resist.

PS: Someone won't become a trillionaire with this attitude.

That was funny, I'm not even mad. Upvote for you.

"I was assigned sucker at birth"

Did Claude make the commits and branches?

(Honestly I don’t think so here, but I predict that will happen eventually)

Right? I’ve been through due diligence and it’s neither a quick nor simple process, even for seed.

A great way to launder money then?

"Failure" is the expected median though. You can't due-diligence your way out of "startup ran out of runway"!

The discussion here isn't about funding, it's that there's a presumptively useful community tool which got abandoned because its owners took their toys and went home when the money ran out (instead of making a sincere effort at transitioning to community governance). That's on the IP owners being selfish jerks and/or grifting losers. It's not the VC's fault.

The due diligence report just came back:

The report says the CEO and founder is a Ketamine addicted weirdo, who does Nazi salutes in public, is known to have at least 24 kids, and lives in an isolated farm in Texas, with at least 5 to 7 female partners, and got sued for calling a guy who saved kids a Pedophile.

You in?

Due to the echo chamber effect, our opinions get reinforced, which can lead to biased conclusions, so it gives me pause. But your comment is so eerily similar to my own thoughts that I'm writing this reply.

I agree that most people misunderstand the concept of a 'moat' and become obsessed with that misunderstanding. People tend to think that only technical 'coding skills' which they can easily understand constitute a moat. But in reality, the moat is the entire workflow across the product's lifecycle, including coding skills. In that sense, infrastructure workflows are nothing more than 'the most easily replaceable consumables.' The essential purpose of infrastructure is to pursue 'standardization,' which paradoxically means a state of 'zero switching costs' where customers (app developers) can switch at any time to a better API or a big tech built-in feature. Pure technology that doesn't latch onto the messy real-world domains of customers will inevitably be absorbed without resistance by massive capital.

In some ways, customer lock in at the application layer, or even the fan culture around a product, creates emotional lock in. The end user app that provides a specific workflow integrated into users' daily routines can overcome even technical inferiority through 'experience' and 'emotion.' Technology can be copied, but the user identity attached to a tool is what I think a real moat is.(That is also the reason I love Windows.)

The example you gave, Cursor's Composer, is exactly the case I'm talking about. I think Cursor is inferior, and I don't think its Composer model feature is all that great either. But Cursor has a passionate fan base, and users who choose Composer as the best value for money no longer care about absolute technical performance or benchmark scores. They are captivated by the 'speed of experience' of code being completed quickly as they intended, and the 'frictionless workflow' the tool provides. It's not the company that builds the best AI model that wins, but the company that wraps 'good enough technology' in 'great UX' and dominates users' habits. That is how apps dominate infrastructure, and that's the moat you and I are thinking about.

That said, this conclusion is probably too hasty and has many flaws. Still, your thoughts are so similar to mine that I'm leaving this reply. Thanks for the great comment. Have a good day

> are all the rage these days

Are they? Overall it seems kind of tame compared to 2020-21 since VCs are somewhat risk averse outside of a few outliers. Funding looks much more concentrated these days.

You're right. Looking at recent indicators, there are more stable investments than I thought. But please understand that, as a human, I haven't achieved ROI in terms of marriage, relationships, a stable job, etc., so my perspective might be mixed with a bit of envy

Obviously they upgraded, bought a dash, and moved on to https://www.tensor-one.com/

And new Github profile too.

https://github.com/TensorOne

This is super common with startups and is usually called an orderly shutdown. You don’t want to wait until you are insolvent, but stop when there is enough money left to pay all outstanding liabilities as well as the people that will shut down the business entity, do a final tax return and so on. Then whatever is left eventually gets paid back to investors, who usually have a liquidation preference requiring this as well. The alternative, running truly out of money, no one shutting down anything, a ghost entity that continues to accumulate taxes and penalties, creditors chasing whoever they can get a hold of, is much worse. Just because everyone quits doesn’t mean the entity ceases to exist.

It's pretty common. If a startup winds down before it runs out of money, it typically returns whatever is left to the investors. We didn't have any debt.

It actually happens a lot. Sometimes founders may pivot when the original thesis isn't working out, but a lot of times the prudent thing to do is to just say that it didn't work out and return investors' money.

Honestly, I was close to flagging this story because the title is deliberately manipulative - it makes it sound like the founder did a rug pull. But I was really glad to see the founder come in to these comments and just say we tried, but the market shifted under us. Happens all the time.

When I was in university I unsuccessfully attempted to start a company with two other students. We had a small amount of capital from a single investor. We did not pay ourselves any salary. We had spent money on incorporating the company and buying a couple of iPads, and had not yet spent money on marketing etc.

When after a few months we accepted that it wasn’t going to work, our investor got basically all his money back.

It was pocket change amounts compared to the sums of money that they deal with in Silicon Valley. But the point is the same anyway, the investor got back basically everything.

It’s not atypical when a startup figures out things aren’t going to work while there’s still money in the bank.

Early stage startups tend not to have a lot of debt to pay off, because there aren’t many places willing to offer them much credit.

The real number is probably closer to ~13 engineers, because it costs a company the worker's salary _again_ for benefits, payroll taxes, etc., etc.

Don’t forget the candles

It's not even clear they spent all the money. Maybe it just wasn't a viable product.

> very confident you'll be able to use them to justify another raise soon

That is indeed how the VC funding game is played. If you don't raise another round, you are dead anyway, so you spend down your seed round to try and justify that following round...

Thanks, that's exactly what happened.

The title is misleading unfortunately but that's how social media goes...

The real number is probably closer to ~13 engineers, because it costs a company the worker's salary _again_ for benefits, payroll taxes, etc., etc.

Our team was much smaller. We didn't spend all the capital.

It's not even clear they spent all the money. Maybe it just wasn't a viable product.

Yeah exactly. We didn't spend the majority of it.

A great way to launder money then?

"Failure" is the expected median though. You can't due-diligence your way out of "startup ran out of runway"!

The due diligence report just came back:

You in?

Which step of “VC firm with millions to invest” and “fresh grads blow millions on AWS bills, sushi delivery and ketamine” is dirty money being washed?

It's not on anyone to set up your favourite "governance" system. If anyone honestly wants to keep maintaining or using it the code is still there.

While most startups fail eventually, failure in less than a year with over 7 million dollars is not the expected median. It’s the exact sort of thing that due diligence is supposed to prevent.

Also the whole project is open source. If you want, you could take it over.

At least the repo is still available. Anyone can fork and carry on, create a community, etc.

Which toys exactly were taken? The repo seems open source, is any component missing?

Only if the CEO is the first man to set foot on Mars. He gets to be immortalized, we get to watch him die living his best life.

Please follow the HN guidelines.

TensorZero

TensorZero is an open-source LLMOps platform that unifies:

Gateway: access every LLM provider through a unified API, built for performance (<1ms p99 latency)
Observability: store inferences and feedback in your database, available programmatically or in the UI
Evaluation: benchmark individual inferences or end-to-end workflows using heuristics, LLM judges, etc.
Optimization: collect metrics and human feedback to optimize prompts, models, and inference strategies
Experimentation: ship with confidence with built-in A/B testing, routing, fallbacks, retries, etc.

You can take what you need, adopt incrementally, and complement with other tools. It plays nicely with the OpenAI SDK, OpenTelemetry, and every major LLM provider.

TensorZero is used by companies ranging from frontier AI startups to the Fortune 10 and fuels ~1% of global LLM API spend today.

Website · Docs · Twitter · Slack · Discord

Quick Start (5min) · Deployment Guide · API Reference · Configuration Reference

Demo

Features

🆕 TensorZero Autopilot

TensorZero Autopilot is an automated AI engineer powered by TensorZero that analyzes LLM observability data, sets up evals, optimizes prompts and models, and runs A/B tests.

It dramatically improves the performance of LLM agents across diverse tasks:

Learn more →

🌐 LLM Gateway

Integrate with TensorZero once and access every major LLM provider.

Call any LLM (API or self-hosted) through a single unified API
Infer with tool use, structured outputs (JSON), batch, embeddings, multimodal (images, files), caching, etc.
Create prompt templates and schemas to enforce a structured interface between your application and the LLMs
Satisfy extreme throughput and latency needs, thanks to 🦀 Rust: <1ms p99 latency overhead at 10k+ QPS
Ensure high availability with routing, retries, fallbacks, load balancing, granular timeouts, etc.
Track usage and cost and enforce custom rate limits with granular scopes (e.g. tags)
Set up auth for TensorZero to allow clients to access models without sharing provider API keys
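
The retry and fallback behavior listed above can be pictured with a minimal sketch. This is not TensorZero's implementation: `call_with_fallbacks`, its parameters, and the provider callables are hypothetical names used only for illustration, and the backoff policy is an assumption.

```python
import time
from typing import Callable, Optional, Sequence


def call_with_fallbacks(
    providers: Sequence[Callable[[str], str]],
    prompt: str,
    max_retries: int = 2,
    backoff_seconds: float = 0.0,
) -> str:
    """Try each provider in order, retrying each one before falling back."""
    last_error: Optional[Exception] = None
    for provider in providers:
        for attempt in range(max_retries + 1):
            try:
                return provider(prompt)
            except Exception as exc:  # a real client would catch narrower errors
                last_error = exc
                if attempt < max_retries:
                    # Exponential backoff before retrying the same provider
                    time.sleep(backoff_seconds * (2 ** attempt))
    raise RuntimeError("all providers failed") from last_error
```

Each entry in `providers` stands in for one model provider call. The first provider is exhausted (one attempt plus `max_retries` retries) before the next one is tried.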

Supported Model Providers

Anthropic, AWS Bedrock, AWS SageMaker, Azure, DeepSeek, Fireworks, GCP Vertex AI Anthropic, GCP Vertex AI Gemini, Google AI Studio (Gemini API), Groq, Hyperbolic, Mistral, OpenAI, OpenRouter, SGLang, TGI, Together AI, vLLM, and xAI (Grok).

Need something else? TensorZero also supports any OpenAI-compatible API (e.g. Ollama).

Usage Example

You can use TensorZero with any OpenAI SDK (Python, Node, Go, etc.) or OpenAI-compatible client.

Deploy the TensorZero Gateway (one Docker container).
Update the base_url and model in your OpenAI-compatible client.
Run inference:

from openai import OpenAI

# Point the client to the TensorZero Gateway
client = OpenAI(base_url="http://localhost:3000/openai/v1", api_key="not-used")

response = client.chat.completions.create(
    # Call any model provider (or TensorZero function)
    model="tensorzero::model_name::anthropic::claude-sonnet-4-6",
    messages=[
        {
            "role": "user",
            "content": "Share a fun fact about TensorZero.",
        }
    ],
)

See Quick Start for more information.

🔍 LLM Observability

Zoom in to debug individual API calls, or zoom out to monitor metrics across models and prompts over time — all using the open-source TensorZero UI.

Store inferences and feedback (metrics, human edits, etc.) in your own database
Dive into individual inferences or high-level aggregate patterns using the TensorZero UI or programmatically
Build datasets for optimization, evaluation, and other workflows
Replay historical inferences with new prompts, models, inference strategies, etc.
Export OpenTelemetry traces (OTLP) and export Prometheus metrics to your favorite application observability tools
Soon: AI-assisted debugging and root cause analysis; AI-assisted data labeling

📈 LLM Optimization

Send production metrics and human feedback to easily optimize your prompts, models, and inference strategies — using the UI or programmatically.

Optimize your models with supervised fine-tuning, RLHF, and other techniques
Optimize your prompts with automated prompt engineering algorithms like GEPA
Optimize your inference strategy with dynamic in-context learning, best/mixture-of-N sampling, etc.
Enable a feedback loop for your LLMs: a data & learning flywheel turning production data into smarter, faster, and cheaper models
Soon: synthetic data generation

📊 LLM Evaluation

Compare prompts, models, and inference strategies using evaluations powered by heuristics and LLM judges.

Evaluate individual inferences with inference evaluations powered by heuristics or LLM judges (≈ unit tests for LLMs)
Evaluate end-to-end workflows with workflow evaluations with complete flexibility (≈ integration tests for LLMs)
Optimize LLM judges just like any other TensorZero function to align them to human preferences
Soon: more built-in evaluators; headless evaluations

Evaluation » CLI

docker compose run --rm evaluations \
  --evaluation-name extract_data \
  --dataset-name hard_test_cases \
  --variant-name gpt_4o \
  --concurrency 5

Run ID: 01961de9-c8a4-7c60-ab8d-15491a9708e4
Number of datapoints: 100
██████████████████████████████████████ 100/100
exact_match: 0.83 ± 0.03 (n=100)
semantic_match: 0.98 ± 0.01 (n=100)
item_count: 7.15 ± 0.39 (n=100)

🧪 LLM Experimentation

Ship with confidence with built-in A/B testing, routing, fallbacks, retries, etc.

Run adaptive A/B tests to ship with confidence and identify the best prompts and models for your use cases.
Enforce principled experiments in complex workflows, including support for multi-turn LLM systems, sequential testing, and more.
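
The text above does not say which algorithm drives the adaptive A/B tests. As one illustration of the general idea, here is a minimal sketch of Beta-Bernoulli Thompson sampling, a common adaptive allocation technique. The `ThompsonSampler` class and its method names are hypothetical, and binary success/failure feedback is an assumption.

```python
import random
from typing import Dict, List


class ThompsonSampler:
    """Beta-Bernoulli Thompson sampling over named variants (illustrative only)."""

    def __init__(self, variants: List[str], seed: int = 0) -> None:
        self._rng = random.Random(seed)
        # Each variant starts from a uniform Beta(1, 1) prior: [alpha, beta]
        self._params: Dict[str, List[int]] = {v: [1, 1] for v in variants}

    def choose(self) -> str:
        # Sample a plausible success rate per variant and pick the highest draw
        draws = {
            v: self._rng.betavariate(a, b) for v, (a, b) in self._params.items()
        }
        return max(draws, key=draws.get)

    def record(self, variant: str, success: bool) -> None:
        # A success increments alpha, a failure increments beta
        self._params[variant][0 if success else 1] += 1
```

Calling `choose()` before each inference and `record()` once feedback arrives shifts traffic toward the variant that performs better, while still occasionally exploring the others.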

& more!

Build with an open-source stack well-suited for prototypes but designed from the ground up to support the most complex LLM applications and deployments.

Build simple applications or massive deployments with GitOps-friendly orchestration
Extend TensorZero with built-in escape hatches, programmatic-first usage, direct database access, and more
Integrate with third-party tools: specialized observability and evaluations, model providers, agent orchestration frameworks, etc.
Iterate quickly by experimenting with prompts interactively using the Playground UI

Frequently Asked Questions

How is TensorZero different from other LLM frameworks?

TensorZero enables you to optimize complex LLM applications based on production metrics and human feedback.
TensorZero supports the needs of industrial-grade LLM applications: low latency, high throughput, type safety, self-hosted, GitOps, customizability, etc.
TensorZero unifies the entire LLMOps stack, creating compounding benefits. For example, LLM evaluations can be used for fine-tuning models alongside AI judges.

Can I use TensorZero with ___?

Yes. Every major programming language is supported. It plays nicely with the OpenAI SDK, OpenTelemetry, and every major LLM provider.

Is TensorZero production-ready?

Yes. TensorZero is used by companies ranging from frontier AI startups to the Fortune 10 and powers ~1% of the global LLM API spend today.

Here's a case study: Automating Code Changelogs at a Large Bank with LLMs

How much does TensorZero cost?

TensorZero (LLMOps platform) is 100% self-hosted and open-source.

TensorZero Autopilot (automated AI engineer) is a complementary paid product powered by TensorZero.

Who is building TensorZero?

Our technical team includes a former Rust compiler maintainer, machine learning researchers (Stanford, CMU, Oxford, Columbia) with thousands of citations, and the chief product officer of a decacorn startup. We're backed by the same investors as leading open-source projects (e.g. ClickHouse, CockroachDB) and AI labs (e.g. OpenAI, Anthropic). See our $7.3M seed round announcement and coverage from VentureBeat. We're hiring in NYC.

How do I get started?

You can adopt TensorZero incrementally. Our Quick Start goes from a vanilla OpenAI wrapper to a production-ready LLM application with observability and fine-tuning in just 5 minutes.

Get Started

Start building today. The Quick Start shows it's easy to set up an LLM application with TensorZero.

Questions? Ask us on Slack or Discord.

Using TensorZero at work? Email us at hello@tensorzero.com to set up a Slack or Teams channel with your team (free).

Examples

We are working on a series of complete runnable examples illustrating TensorZero's data & learning flywheel.

Optimizing Data Extraction (NER) with TensorZero

This example shows how to use TensorZero to optimize a data extraction pipeline. We demonstrate techniques like fine-tuning and dynamic in-context learning (DICL). In the end, an optimized GPT-4o Mini model outperforms GPT-4o on this task — at a fraction of the cost and latency — using a small amount of training data.

Agentic RAG — Multi-Hop Question Answering with LLMs

This example shows how to build a multi-hop retrieval agent using TensorZero. The agent iteratively searches Wikipedia to gather information, and decides when it has enough context to answer a complex question.

Writing Haikus to Satisfy a Judge with Hidden Preferences

This example fine-tunes GPT-4o Mini to generate haikus tailored to a specific taste. You'll see TensorZero's "data flywheel in a box" in action: better variants lead to better data, and better data leads to better variants. You'll see progress by fine-tuning the LLM multiple times.

Image Data Extraction — Multimodal (Vision) Fine-tuning

This example shows how to fine-tune multimodal models (VLMs) like GPT-4o to improve their performance on vision-language tasks. Specifically, we'll build a system that categorizes document images (screenshots of computer science research papers).

Improving LLM Chess Ability with Best-of-N Sampling

This example showcases how best-of-N sampling can significantly enhance an LLM's chess-playing abilities by selecting the most promising moves from multiple generated options.
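
At its core, best-of-N sampling draws several candidates and keeps the one a scorer prefers. Here is a minimal sketch, assuming a hypothetical `generate` callable (standing in for an LLM call) and a hypothetical `score` callable (standing in for a judge or evaluator). Neither name comes from TensorZero.

```python
from typing import Callable, List


def best_of_n(
    generate: Callable[[], str],
    score: Callable[[str], float],
    n: int = 5,
) -> str:
    """Draw n candidates and return the one the scorer rates highest."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates: List[str] = [generate() for _ in range(n)]
    return max(candidates, key=score)
```

For chess, `generate` would propose a move and `score` would rate it. Larger `n` trades extra inference cost for a better chance of a strong candidate.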

Blog Posts

We write about LLM engineering on the TensorZero Blog. Here are some of our favorite posts:

Part of the social contract of putting a free software project up for public use and convincing Microsoft to host it for free (!) is indeed that you're going to maintain it in good faith for the people who consume it, and that if you can't you'll make a good faith effort to help the people who do.

There are good and bad ways to extract yourself from maintainership obligations. This is the bad way.

So either the VCs mistook bot-farmed stars for a moat instead of checking whether it solved a genuine problem, or they invested blindly based on the founders' track record. To me, both are really by-products of the vibe-coding hype and of ChatGPT killing wrappers.

Are there cases when VC investors actually went after founders for fraud or embezzlement or misrepresenting the business or something like that?

In response to sibling: > It's still open source because you can fork it if you really want

Yes, that's exactly what it means!

The project name, its community center and hosting environment, and the active participation and consent of the copyright holders of the software were withdrawn. This is a dead project, we can all see it. If you want to use it and contribute and get help, you have nowhere to go.

"It's still open source because you can fork it if you really want" is a specious and unhelpful attitude, and it tells me that you, like the owners of this thing, are not to be trusted to manage such a thing.

No, there is no social contract here. Microsoft gives free hosting because it's cheap and also provides a path to their paid offerings. People share stuff they work on for fun, to help flesh out their resume, to get help, etc. There's no reason for a maintainer not to drop a project in a heartbeat if it becomes the slightest bit of a burden.

No it's not.

Also read the link. This is Apache 2.0 licensed. Even in whatever imaginary world where there is such a social contract, there is thankfully a legal contract that includes a disclaimer of warranty.

There are no maintainership obligations unless someone pays you for them.

Sorry, but this is an outrageous perspective. At no point in git init / git push am I committing myself to a social contract; in fact, there’s probably a license that states no warranty and no support is to be expected… maintainership obligations gtfo if you’re not here paying for support

AI OSS tool repo goes archived over night after raising $7.3M Seed
