Open source AI must win

This, and distributed LLM inference. We are at a point where no single person can setup a rig to run a SOTA model, it is just too expensive.

So we must build and adopt frameworks that allow individuals to share resources to run SOTA models in a distributed manner. That way they will also be non-censorable by governments.

Also The only way to prevent that one entity weaponizes it, is by giving EVERYONE access to it.

I've been contemplating a decentralized model training system for some time using volunteer machines that we all contribute. But, it is astronomically difficult. The communication speeds are untenable.

And, there is the issue of data poisoning from untrusted nodes. I've almost cracked that last issue with a self-healing checkpointed rollback system that doesn't have to throw out anything that follows the corrupt datum.

But, I'm just one person with an idea and I don't have infinite funds to make this happen. This isn't a small project.

Maybe there would be interest in something like this, now that entire frontier labs are being banned from making further progress.

The total power of all GPUs on the planet dwarf their capabilities, if we had a way to harness them in a distributed way efficiently. We wouldn't be able to train a Fable as fast as them, but eventually having access is better than never having access.

When "open source" means freeware, it's like saying "we want free copies".

What we should be saying is: We want a public, community-ran project that does pretraining and training collectively. This means working on a training corpus in public and somehow coordinating the training work.

This is a complete change of what the term means, It's like how people conflate piracy with theft. Two different things, use different words. Free weights, inference code and chat template is very different from a community-ran LLM project.

It won in my house/my business right from the start. (Well, open weights, at least — which is an uncomfortable nuance.)

I have never understood the willingness to make the functioning of or development of a product so completely dependent on the secret sauce of one of two big unprofitable, inscrutable startups.

It really defies sensible engineering principles to do that. So I was never going to do it. I'm exploring AI now but because I have decided that open weights make it a good use of my time.

It's bad enough that any given business often ends up beholden to a single payment platform and the policies of two US credit card providers.

I guess it is the freelancer in me but I always feel nervous when I am asked to put so much energy into studying or learning someone's product, rather than the underlying technology. I still remember the days when Microsoft was pretty much lobbying academic departments with promises of access to the NT source code. I remember a senior figure in our own saying that Linux was a sideshow and access to NT would make us relevant.

More control over destiny is always necessary, and I remind myself and others that the "state of the art" is behind the "cutting edge". Progress is made at the cutting edge, but there is risk of damage. Engineering should focus on building on the state of the art, not on hitching a ride on someone else's progress.

I would be totally willing to pay $50 per month to support an open source AI lab, rather to get open source models as byproducts of corporations.

Who is going to fund it? Training is unfathomably expensive.

You have either VC funded models looking for a return on investment, or CCP funded models looking to solidify authoritarian "model Chinese society".

Maybe there are some university 4B models, but I doubt those will carry far.

I don't know how open source AI wins. The description is too vague for serious discussions. What I do know is that, once closed source AI groups become anti-you, you should punish them, or help open source groups, or both.

If you really want specific open source {LLM, LMM, research, harness, whatever} groups to win over closed source counterparts, you may show your care by trying open source solutions first when solving problems. And if they're really capable, award them with contributions or something.

With open-weight AI, there might not be an incentive to put large sums of capital towards training / research. There might be a donation fund of some sorts, but it certainly won't reach the level of fundraising that the frontier labs are receiving.

Because of this, I think it might not be possible to have AI *only* open-weight; major players like OpenAI, Anthropic, Google will likely stay for good, with better models than open-source versions.

I think it might look something like Photoshop & GIMP, with Photoshop being a frontier lab, and GIMP being the open-weight model. GIMP is decent for many different image editing workflows, but Photoshop is just better.

I would definitely prefer to have an open-weight model better than frontier labs'. Though I don't think it's possible.

I agree with sentiment and mission, but the goal is inseparable from politics at this point.

Being Open Source (tm) will not protect you from the government/others imposing controls on your silicon or what it is allowed to do, which is already happening around the world.

Even having the models be open source won't fix the regulation or economic incentives. Which is not something you can compress into a couple of paragraphs.

AI is civilizational infrastructure and it needs civilizational solutions. Not just source.

Where does Anthropic or OpenAI winning leave us?

Dependents of an AI-megacorp for our "facts"? Our software? Our work?

It's possible these companies will become everyone's boss, and will dictate to everyone what everyone is allowed to work on, think, say, do, believe, etc.

Before Big Tech springs that trap, we must support and divert resources to open models.

Open-source AI can, by definition, never "win". AI is just hillclimbing today, and closed labs can always absorb everything the open world does and build upon it.

It doesn't really matter for most use cases, because the way AI is working is capability saturation. https://www.delanceyukschoolschesschallenge.com/the-rising-t...

The only exception to this is fields that are inherently adversarial (to nature or others) and an edge relative to competition matters.

Since it's not mentioned in the article, the distinction between open source and open weights is important. Open weights models are almost like a 'first shot is free' entry drug. Without at least the original training data your ability to meaningfully upgrade it is so limited that its utility will quickly fall behind the latest versions of continuously developed models. So much that it'll leave you craving for another release, or have you going back to the provider's API. Even simple things like moving the knowledge cutoff forward will noticeably improve the UX, and that's not to speak of more fundamental improvements like reasoning, quantization-aware training and all the goodness that's yet to come.

Sure, we can do research to bring improvements to open weights models, but it's the same thing: it's either open source or it won't benefit the general public nearly as much.

Not today, may after the next 3-4 breakthroughs. One thing that people don't realize is that the AI they use today is highly highly subsidized bc of the capex that has gone into it. Even if people collaborated together - will not be able to raise billions of dollars that are needed.

These are still very very (and very) early days of the modern AI and there are so many changes that are gonna happen. It's possible that all the frontier labs of today won't exist in a few years.

A question I've got which I've been wondering about, not sure if anyone else has been thinking about it, what actually made Fable so effective?

From what I could tell from the very little time that I had to interact with it, it's instruction following seemed more consistent

The other thing that comes to mind is a lot of people commented on how driven it was, so I'm wondering whether figuring out how to keep existing models looping on task might actually be quite a big shift in capability

I agree but we are dependent mostly on Chinese models at this point to pull it off.

Replace America with „The World“

There is not much open source AI .. there is open weight .. but anyways. Deepseek v4 is pretty much at the same level as the agents we had last year around November and it is an open weight model so I am hopeful.

I think it's also important and heavily overlooked to develop and maintain open source "pro" level models. Those that are able to think for 80 minutes and yield heavy solutions.

I'm not an expert in LLMs so it's hard to understand how much are we lacking, is it just the compute and thinking strategies / parallel chains, or something specific architecturally. But I feel there's value there and I haven't seen anything like it available so far.

That was sama's and elon's original goals before they became trillionaires. Just to keep google/deepmind to take over.

Turned out both assumptions were wrong. You couldn't trust sama to turn this into open source, the Chinese did. Elon never.

And we couldn't see demis take over as expected, probably blocked by Google buerocracy.

I think we need it to prevent slavery happening again.

what is Open Source AI even?

to me Open Source, like Free Software, is something i can run on my own computer. any AI system that runs on a computer that i do not control is by my definition not Open Source.

so how then can Open Source AI win? it can't even compete. even if we collect enough money and create a dedicated Open Source organization to build and run a community owned AI datacenter, how does that help?

so what exactly is the demand here?

It will win - in the sense that AI too will become a freely available resource. You can't stop progress.

My bet is that once cost-efficiency becomes a priority, we will figure out ways to get away from the expensive GPU infrastructure on figure out how to architect models for CPUs. I still remember that Microsoft paper about ternary weights.

This should be the top post. Not Anthropic or OpenAI marketing plots. This is existential.

This is obviously in direct opposition to the very idea of America. So if it happens it'll happen in other countries.

The win I'd really like to see would be for remuneration of training data, and for a provenance of all the data used by a given LLM.

Open source AI will win. It's the same reason why out of all the languages on the web we could have used, Javascript won.

I am really curious how long will it take for the open source models to hit current fable/mythos capabilities, KIMI 2.7 was launched recently and its quiet good for open source models its as good as Opus 4.6 maybe in practical applications not benchmarks so like 6 months to an year behind, after which the next step will be to wait for the day when we will be able to run mythos level intelligence on local hardware, Remember when 5MB storage was the size of a table?

A loooooot of work to be done for the above to happen

There is nothing more surreal in AI chat than entering your own name and being told you are a banned topic. Open source models must win. There is no alternative.

Don't worry, open source AI will win. There's a reason everybody is desperate to IPO fast and get an exit, their competitive advantage is not lasting long.

I'm assuming this is popular because of Fable restrictions. AFAIK, open source is not excluded from ITAR / EAR restrictions (or other export restriction in other countries).

So the real solution you're looking for is technology that can't be arbitrarily gatekept by a sovereign nation.

I think it's enough to use Open Router to encourage competition in the market place.

I have been working on this exact problem, and I suppose now is as good a time as any to talk about it.

To make any agent "good", there are two components: the model and the harness. Very few companies can train models, but anyone can build a harness. How much does the harness matter? Can I build a harness that's good enough that I can use open source models with opus level performance? That's the question I've been trying to answer by building better harnesses. None of the existing frameworks have the functionality I need to build a good harness. The features I need are language-level... and so I started building a language called Agency[0].

It's been six months and its going well. Some of the things Agency can do are wild:

- It can pause and serialize execution at any point, making HITL easy

- It has some neat safety capabilities such as handlers[1] and PFA[2]

- You can bundle up any agent as an HTTP or MCP server[3]

- I'm now working on a built-in optimizer to optimize agents (think DSPy).

Obviously, it's a huge undertaking, but having worked with the Agency for six months, I can't imagine going back to another framework. It makes things so easy. I'm working on its built-in agent now [4]. My goal it to get it to be as good as Claude Code, but using open source models. It's still early days, lots of rough edges, but if this sort of thing interests you, I'd love to have a few more people test it out.

[0] https://agency-lang.com

[1] https://agency-lang.com/guide/handlers.html

[2] https://agency-lang.com/guide/partial-application.html

[3] https://agency-lang.com/cli/serve.html

[4] https://github.com/egonSchiele/agency-lang/blob/main/package...

While it is not at all practical to train an LLM with tens or hundreds of billions of parameters on hobbyists hardware, what if there are other architectures that perform just as well but are easier to train by 1000 volunteers?

I always wondered if 1000 1M parameter models fine-tuned to specific tasks with a small router could perform as well as 100B models.

And I know this is roughly how MoE works, but current MoE models still require training the model as a whole, and big players don’t have an incentive to change that.

But OpenSource community does…

I feel with current government decision to block Fable, this is not a mere opensource issue, considering how US government restrict frontier models, what we need is sovereignty for every country. If not they will release every model with a kill switch in future like F35.

I would also want all conversation with AI to be public, searchable and indexable.

It is only fair, give that LLMs are enabled by human generated content from the Internet, that they give it back!

Truly, big corps have no incentive to invest in open source local AI. I maintain a small effort towards this goal here at https://pocketweb.tools/

For US citizens: counting on Open Source AI is another libertarian fantasy.

Open source AI should and will get better for sure (including better defined first), but the state will have the power over AI never the less.

If you don't like govt's AI policy or the people making those policies, go fix that, don't act like you can avoid them.

For Chinese: saying "Open source AI must win" sounds like singing "L'Internationale, sera le genre humain". The reality is Open Source AI will be over the moment US competitive pressure gone.

For rest of world: there's no real AI for you so far, go work on it or be a citizen of US&A or China.

In the US -- once our nation finishes attacking our own education system -- this is definitely something a group of academic institutions could get together and accomplish. I assume the same is true in other countries. Companies like Nvidia and AMD might even support that effort, as they make money on the hardware and would probably be more than happy for there to be more reasons to use it. There may have not been a compelling enough motivation to achieve this before, but "models" didn't have this level of strategic relevance until relatively recently. Nvidia has been fairly good about releasing open weight models in the last few months.

The article doesn't say what it means by win. I presume we will have the present situation where the cutting edge stuff is closed source developed by profit oriented companies and open source is available two but a year or two behind.

Isn't training material the biggest problem for truly open source LLMs (such that could compete with top tier models)? The computation part can be solved with money, but compiling a comprehensive training set that could be freely shared and free of copyright issues is pretty much impossible.

I think articles this light on content should not be upvoted to front page.

At d5s.tech we are recreating the layers built on top of models, working on dogfooding our own product to run a large chunk of the company.

I feel extremely strongly that a future in which most companies depend on one or two large AI-megacorps is going to lead to excessive rent seeking sooner or later.

I remain positive that the long term steady state will consist of proprietary models, -but- with open source AI models statistically close.

If compute keeps growing the relative cost of training current frontier models will decrease. An open source Fable/Mythos model simply seems inevitable.

The latest US gov meddling in the Fable rollout really put the nail in the coffin. We can't integrate a strategic product that is subject to the capricious behavior of the US

My grim view is that it's just one incident away from some evil freaks to use ablated offline model for some nasty acts to have lawmakers lose their mind and try to regulate open source models and even consumer GPU. Think the latest 3d printers restriction.

we could've been fine with the sole existence of AI if the organizations providing them weren't greedy and rug-pullers. anthropic could've been loved by all if it acted towards the benefit of humanity. as intelligent system continue to become smarter, close or beyond mythos level, what now? with the 'community-driven' mindset we have, is the future really going to be safe? probably not we just need a company that develops, serves, maintains, these models the right way, priced fairly that benefits the user and the company.

As an person whos getting into tech and already developing a game, the fact that laptop prices since 2020 have increased by 20-40% is insane. It's delaying the time to create my game. I researched the reason for the cost spike, and most of it is from the excessive money put in ai Technically, the owners of AI could slow down the amount of GPUs and RAM they buy because AI has almost reached its most usable peak. Everything they add just introduces more bugs, so instead of building more AI centers, they should focus on improving the main AI model with bug fixes. There's no need to give it more unnecessary power. Most people don't care; the entire business is run by a few old men who think AI is everything and invest huge sums of money to show other AI companies they need to improve to get more funding from old people. We just need to find something new and innovative for older investors to focus on, so not everything is about investing in AI like Roblox, OpenAI, Google, etc. The extreme amount of reasoning power given to AI is causing bugs, and the moments when AI had outbursts towards people are related to this.

I think models will be a commodity sooner rather than later. This whole race doesnt matter. First mover advantage is real, but over enough time it wont matter.

Well, the crazy thing I'm working on (100% self-funded thus far): https://trivyn.io. The main idea is moving most of the reasoning to the symbolic layer so the "neuro" piece can be a small model able to be self-hosted on reasonable hardware.

But, I'm just one person with an idea and I don't have infinite funds to make this happen. This isn't a small project.

Maybe there would be interest in something like this, now that entire frontier labs are being banned from making further progress.

As I replied to a child comment - this is a nice idea that just isn't tenable in reality. AI hardware isn't just hilariously faster than consumer GPUs, it's also hilariously more power-efficient and has hilariously better connectivity. Every one of these dimensions kills the idea.

The far, FAR superior power efficiency means that even if you did harness every public GPU or GPU-like device on earth, you'd end up consuming so much excess electricity it would be cheaper on net to simply take the money that would have gone to the power bill and spend it on your own datacenter.

And even if electricity was free, having those GPUs spread over the world with internet-level latency will slow everything down by factors of thousands to millions - if it's feasible at all. Regardless, you're not getting fable-oss this decade, maybe even not this century.

It would be better for governments to buy and own their own datacenters, maybe as a coalition, and dedicate their operation to the public good. I believe that is what we actually have to do.

>But when people think of decentralized training, they don’t first think of gigantic datacenters, owned by the same company, training models across large distances. Instead, they imagine thousands of small datacenters, or individual consumers, pooling their spare compute over the internet to orchestrate a training run larger than any single actor could manage alone. Many companies are pursuing this vision: Pluralis Research, Prime Intellect and Nous Research have already successfully decentrally trained models at scale. But in practice, training decentrally over the internet has lagged far behind more centralized training. Even their largest models (Pluralis’ 8B Protocol Model, Prime Intellect’s INTELLECT-1, and Nous’ Consilience 40B) have been trained with 1,000x less compute than today’s frontier models (such as xAI’s Grok 4). https://epoch.ai/gradient-updates/how-far-can-decentralized-...

>The communication speeds are untenable.

Can it be parallelized or not?

If you take a model, make two copies, and fine-tune each one on different data, what happens when you merge them? Does it work if you freeze different layers?

I think this works if the steps are small enough. And the transfer should become tenable if the steps are big enough. Where's the cutoff?

> The total power of all GPUs on the planet dwarf their capabilities

That just isn't true. It misunderstands exactly how much silicon has gone directly to those companies, and exactly how much more powerful said silicon is compared to consumer grade gear.

This could be of interest to you: https://thealliance.ai/projects/tapestry

there was a project trying to achieve some of those goals a few years ago using p2p: petals https://github.com/bigscience-workshop/petals

their bloom model was also a collaborative effort https://huggingface.co/docs/transformers/en/model_doc/bloom

Could it be done by making a sparse MoE of thousands, or tens of thousands, of smaller experts in very niche domains? Maybe a tree-like structure of experts which can delegate from relatively general but inaccurate to extremely niche but accurate? Also these experts might be plug-and-play, easily swap out an inferior expert with a stronger one in the future without having to redo the whole pile?

Ya that'd be an awesome project, the only issue is how do you verify it's not being poisoned? To actually validate it would require more analysis than the training took to run. It would require a trusted network, not an open one, unless that can get solved somehow.

Well, I suppose it is understandable why you want to attack the most obvious problem with such a scheme: obtaining sufficient compute.

That does mean you are actually neglecting the more difficult issues.

there are some strong open source groups like NOUS research taking the fight https://nousresearch.com/

This, and distributed LLM inference. We are at a point where no single person can setup a rig to run a SOTA model, it is just too expensive.

So we must build and adopt frameworks that allow individuals to share resources to run SOTA models in a distributed manner. That way they will also be non-censorable by governments.

Also The only way to prevent that one entity weaponizes it, is by giving EVERYONE access to it.

I built Teale.com and opensourced it. My domain contribution to society. It powers fully distributed inference on Mac, windows, Linux, android, iOS, hell even harmonyOS.

Opensource/weight models will get better and better and eventually we will have mythos level running on smartphone/eyeglass hardware.

It is stupidly tedious currently to match supply with demand though because physical hardware like a 16gb ram MacBook doesn't mean there's truly 16gb available let alone matching models and all of their settings (kvcache, context limit, temperature, etc) to demand.

Would appreciate any help cus we need ai inference by the people for the people.

> distributed LLM inference

This seems extremely inefficient considering data transfer between model layers if the model is distributed. I found this project called Petals that claim up to 4 tok/s for a 180B model although its repository hasn't been updated in two years.

https://petals.dev/

I propose the TokenTorrent protocol. Hoist yer pirate, erhm, clanker flags!

I wonder if there is way local small LLMs can complement each other in away that the sum-total yields a much more performant LLM

> The only way to prevent that one entity weaponizes it, is by giving EVERYONE access to it

There is a middle way; the policy space also includes government regulating both access and monopoly.

I’m opposed to monopolies of this tech, but I hope the risks of giving everyone jailbroken AGI/ASI are clear.

As a toy example you could imagine a Universal Basic AI where government subcontracts to (n_quorum) labs, everyone gets a token budget, but operating the APIs comes with the safety controls.

If everyone does get to run their own jailbroken AGI, then the only stable societal norm I see is A LOT of surveillance to make sure nobody is building CBRNE threats. This doesn’t seem like a clear win from a civil liberty perspective, though I could see the argument.

yes, it also complements the geohot idea behind the tinybox