Dav2d

'AV2 decoding is roughly five times more complex than AV1 decoding. In practice, that means software running on today’s hardware will struggle to decode AV2 in real time without careful, architecture-specific optimization'

AV1 software decoding is already very intensive so AV2 decoding benchmarks are the next thing that would be really interesting (or mortifying) to see.

A codec spec isn't done until there is at least one decoder developed in the field. So reference + 1. The field implementations often become the de facto spec.

Reading the MPEG1 specs back in the 90s as a child opened my eyes to how to define complex systems. For a media coding standard, they spent most of their time saying how to interpret encoded bytes, which I realized is genius. Be descriptive about decoding and you don't have to be prescriptive about encoding. Encoding is where you can apply all the creativity, but you need to provide a way to have a shared understanding of the encoded bytes.

I'm not quite convinced a 25% reduction in size is worth effectively obsoleting all devices that have hardware decoders for AV1 but will struggle to decode AV2

I thought this was about Dave2D

Seems like the blog succumbed to the HN hug of death (`Actioning this file would cause "jbkempf.com//blog/2026/dav2d/" to exceed the per-day file actions limit of 160000 actions, try again later`), is there a copy available somewhere?

Ouch, looks like the HN hug of death struck again. Gives me error 429.

> The page you have tried to access is not available because the owner of the file you are trying to access has exceeded our short term bandwidth limits. Please try again shortly.

HN hug of death

  ... improvements around 25% compared to AV1

  AV2 decoding is roughly five times more complex than AV1 decoding

I'm not sure what these two lines mean or if we can compare them, any help?

Is there actually a reasonably performing encoder that can compete with the x26* family in real world conditions this time?

The AV2 Video Standard Has Released (Final v1.0 Specification)

https://news.ycombinator.com/item?id=48340910

> Let dav2d be.

This is an odd signoff. Are people having a go at dav2d?

> Make it fast on older desktop, by writing asm for SSSE3+ chips

I guess 5 years ago (around the time when Intel stopped making SSE-only chips) is technically "older", but I wouldn't prioritize avx2 when devices intended for consuming media definitely experience much less pressure to upgrade than workstations…

If decode is becoming so complicated and expensive the hardware can't handle it, why not just go full neural, send latents, and run decode on tensor cores?

The answer is probably the same as for why not AV2 everything; a lot of hardware couldn't support it today. But in 10 years?

It seems we're running up against fundamental limits of human-engineered video codecs at this point. There might be a lesson in there.

I would love to see comparisons with AV1 on very low bitrates.

Is codex working on novel decoders 24/7? I hope

Wait until you try d4vid, it's killer.

When AV1 was first announced, I got the impression the name was chosen partly as a pun/reference/homage to AVI, the classic but outdated format with used to be popular. Then when I saw Dav1d, OK, good way to continue the pun.

But now with AV2 and Dav2d, that completely breaks. Are we eventually going to get AV3/Dav3d and AV4/Dav4d, which will read like Ave/Daved and Ava/Davad? Seems a bit awkward. Was the idea from the start to have the 1 be the version number, and have it specifically be part of the name?

Not to be confused with Da4vid (world-class hacker and owner of the Black sun) or D4vd (rap artist and alleged murderer)

Sorry if this sounds naive, but does it make sense to write a codec library in C/ASM considering how well Rust is progressing, especially when, as the author puts it, AV2 decoding is roughly five times more complex than AV1 decoding?

Ok whose idea was ‘Wiener filtering’

This seems like an interesting case to test AI agents on.

Like we had weird examples like C compilers and Bun. This is a much more interesting example because its highly nontrivial.

AV1 exists, Dav1d exists. Lets see AI take the AV2 spec and Dav1d code and try to make a working high performance AV2 decoder.

How is AV2 expected to avoid the patent-pool issues AV1 ran into?

AV1 was designed as royalty-free, but Sisvel’s pool and the recent Dolby/Snap proved the contrary.

https://accessadvance.com/2026/03/24/access-advance-licensor...

A codec spec isn't done until there is at least one decoder developed in the field. So reference + 1. The field implementations often become the de facto spec.

Ouch, looks like the HN hug of death struck again. Gives me error 429.

> The page you have tried to access is not available because the owner of the file you are trying to access has exceeded our short term bandwidth limits. Please try again shortly.

HN hug of death

The AV2 Video Standard Has Released (Final v1.0 Specification)

https://news.ycombinator.com/item?id=48340910

Should have used AV2

Too much traffic from HN?

``` Too Many Requests The page you have tried to access is not available because the owner of the file you are trying to access has exceeded our short term bandwidth limits. Please try again shortly.

Details: Actioning this file would cause "jbkempf.com//blog/2026/dav2d/" to exceed the per-day file actions limit of 160000 actions, try again later ```

AV1 software decoding is already very intensive so AV2 decoding benchmarks are the next thing that would be really interesting (or mortifying) to see.

> AV1 software decoding is already very intensive

I think you might be misunderestimating how incredible the dav1d AV1 decoder is. Not only does it require less total time than the reference decoder to decode the same video, but it can spread that out over far more threads. I was unable to watch 4k 60fps av1 video on my media center PC (it's from 2019, so predates hardware av1 decoding, and, well, the CPU was a little long in the tooth) until I switched to dav1d. With dav1d I am now able to watch 4k 60fps av1 using software decoding, and my machine uses 10% CPU while doing so. Really amazing piece of software.

With any luck, the dav2d 5x claim will hold true, and 10% CPU usage will scale to 50% CPU usage, meaning I'm still able to watch 4k 60fps video on my media center without a hardware upgrade. (that machine doesn't have hyperthreading, so 50% cpu is actually 50%, not 100% in a fancy suit)

Intel's Arc dGPUs were really compelling for dedicated AV1 encode and decode, especially the small form factor of some cards. You could even fit it as a secondary card in a PC dedicated to recording and encode workflows for OBS.

Hope we get a similar option with future lineups that support AV2, especially given how popular video creation and streaming are now.

I came to post this as well. Until widespread, inexpensive hardware catches up to a 2018 codec, AV# will remain a niche ideal.

> AV1 software decoding is already very intensive so AV2 decoding benchmarks are the next thing that would be really interesting (or mortifying) to see.

Yes, this is going to be fun to watch.

I'm not quite convinced a 25% reduction in size is worth effectively obsoleting all devices that have hardware decoders for AV1 but will struggle to decode AV2

Modern video services perform multiple encodings with different codecs, bitrates and screen dimensions, and serve up the most appropriate format that the client device can decode. Youtube has hundreds of format variants:

https://gist.github.com/MartinEesmaa/2f4b261cb90a47e9c41ba11...

Devices with AV1 hardware decoding - rare as they are - won't be obsoleted for a long time.

When you host videos with near 17 billion views you're going to want to stream those videos in as few bits as possible:

https://www.youtube.com/watch?v=XqZsoesa55w

That extra 25% becomes worth it.

Nothing will become obsolete. AV1 will stick around for a long time. And YouTube still does H.264 encodes to support old devices.

A new codec doesn't obsolete old devices. At least, not right away.

Studios still release new dvds with mpeg2 video. Online videos tend to be available in many codecs. Video conferencing tends to negotiate to best available or has settled on ancient codecs and won't change quickly.

I thought this was about Dave2D

Soon to be decodable with dav2d!

Yeah I suppose it’s named after dav1d but still seems like a pretty unfortunate name collision.

  ... improvements around 25% compared to AV1

  AV2 decoding is roughly five times more complex than AV1 decoding

I'm not sure what these two lines mean or if we can compare them, any help?

I understood it as compression is 25% better : a quality of 10mbps in av1 can be achieved with 8mbps in Av2. But, it needs 5 times more compute power for this 25% gain.

> I'm not sure what these two lines mean or if we can compare them, any help?

AV2 saves 25% bandwidth at the cost of 5x more decoding complexity.

Smaller files but harder to decode

> Make it fast on older desktop, by writing asm for SSSE3+ chips

Almost every Intel CPU released since 2013 has AVX2 support. Some Atom SKUs were longer holdouts, but the fraction of x86 CPUs shipped in the last decade that have AVX2 support is very high.

I would love to see comparisons with AV1 on very low bitrates.

Return of the 8MB Shrek encodes?

Is codex working on novel decoders 24/7? I hope

One would imagine given the name that it would specialize in codecs

I'm pretty sure it is a homage. As for dav1d it's not a reference decoder (although partially funded by AOM iirc) so they might not know that the next iteration will simply be AV2, we have h264, h265, h266 naming though

Tangent but I cannot wait for h269 (or h267 for the younger gen)

I think it's a reasonable decision. The only people who will interact with dav2d by name are codec nerds, and a simple increment makes the lineage more obvious to that audience.

As with all naming schemes in the tech world, I am sure no future scenarios, including successor names, were ever considered

1dav2codecs?

2av2furious?

Da5id could potentially work as a Snow Crash reference.

> experience Dav... Now in 3D!

I’m fascinated by the flurry of downvotes to a simple commentary and question, especially when the replies are normal. If you’re one of the downvotes, please do share what you found offensive about my comment, I am genuinely interested in what you perceived as problematic.

Not to be confused with Da4vid (world-class hacker and owner of the Black sun) or D4vd (rap artist and alleged murderer)

Or Dave2D, popular tech youtuber

> Not to be confused with Da4vid (world-class hacker and owner of the Black sun)

*Da5id

The algorithms deployed in these kind of codecs take into account not only human vision and mathematical laws of information, but also nitty-gritty details of how computers work, which are optimally exploited by directly having humans write detailed assembly rather than a compiler make a best guess and effort.

Because it's 5 times more complex, you need to get the maximum performance available. Therefore more ASM than ever.

Rust does not bring more performance. Just more safety.

Ok whose idea was ‘Wiener filtering’

This seems like an interesting case to test AI agents on.

Like we had weird examples like C compilers and Bun. This is a much more interesting example because its highly nontrivial.

AV1 exists, Dav1d exists. Lets see AI take the AV2 spec and Dav1d code and try to make a working high performance AV2 decoder.

How is AV2 expected to avoid the patent-pool issues AV1 ran into?

AV1 was designed as royalty-free, but Sisvel’s pool and the recent Dolby/Snap proved the contrary.

https://accessadvance.com/2026/03/24/access-advance-licensor...

> AV1 software decoding is already very intensive so AV2 decoding benchmarks are the next thing that would be really interesting (or mortifying) to see.

Yes, this is going to be fun to watch.

A new codec doesn't obsolete old devices. At least, not right away.

When you host videos with near 17 billion views you're going to want to stream those videos in as few bits as possible:

https://www.youtube.com/watch?v=XqZsoesa55w

That extra 25% becomes worth it.

Nothing will become obsolete. AV1 will stick around for a long time. And YouTube still does H.264 encodes to support old devices.

Almost every Intel CPU released since 2013 has AVX2 support. Some Atom SKUs were longer holdouts, but the fraction of x86 CPUs shipped in the last decade that have AVX2 support is very high.

Return of the 8MB Shrek encodes?

https://web.archive.org/web/20210416200451/https://cdn.disco...

Shrek 1 at 8.34MB including audio.. insane

6MB should be enough for everyone!

One would imagine given the name that it would specialize in codecs

Tangent but I cannot wait for h269 (or h267 for the younger gen)

I think it's a reasonable decision. The only people who will interact with dav2d by name are codec nerds, and a simple increment makes the lineage more obvious to that audience.

As with all naming schemes in the tech world, I am sure no future scenarios, including successor names, were ever considered

1dav2codecs?

2av2furious?

And then AV3: Tokyo Drift, and after that AV Episode 1.

Da5id could potentially work as a Snow Crash reference.

> experience Dav... Now in 3D!

Or Dave2D, popular tech youtuber

Or dave, the command to start Dangerous Dave.

I don’t think the chances of confusion between a niche celebrity and a video decoder are very big.

> Not to be confused with Da4vid (world-class hacker and owner of the Black sun)

*Da5id

Because it's 5 times more complex, you need to get the maximum performance available. Therefore more ASM than ever.

Rust does not bring more performance. Just more safety.

The safety can be worth it in certain cases. Like when handling untrusted input. And it's not just Rust: look at WUFFS for example. WUFFS can actually rival handwritten implementations in certain cases.

Encoder and decoder writers frequently need extremely fine grain control over SIMD instructions in order to get good performance.

The way they weave these instructions can be very hard to express with a high level language.

Further, there's a ton of work with arrays and importantly parts of arrays. They can, for example, need to extract every other element up to 1/2 the array. Unfortunately, rust has runtime array bounds checks which make writing that sort of code slower. The compiler can elade those checks, but usually only in simple cases.

The authors would be writing a bunch of unsafe rust to get the performance they want and rust makes that more painful on purpose.

I like rust, but C/ASM really is the right choice here. This is one of the few cases where rust's safety is a major detriment.

The ffmpeg devs have said many times in public that they routinely get speedups of 10x or more over C code. I'm not a reputable source on this myself but I highly recommend looking into their channels, mails, or posts.

https://youtu.be/nepKKz-MzFM&t=7195

If you can stand Lex Friedman for a bit, the VLC authors talk about why you use ASM for a video decoder instead of pure C or rust.

yes it makes sense to use C/ASM here, but if you're curious, there is a rust port of dav1d named rav1d: https://github.com/memorysafety/rav1d

it's not much slower than the original C/ASM implementation (last i checked ~5%?) but that matters here

fyi the Rust port already exists: https://github.com/stukenov/rav2d you keep the hand-written asm via FFI, rest is safe Rust. same trick rav1d uses.

Go ask FFmpeg what they're writing their encoders and decoders in.

I don't know why you've been down-voted. It definitely isn't an optimal decision. A video codec isn't all assembly. There's plenty of plain unsafe C code. E.g. this is the first random file I clicked. It has a ton of raw C pointer stuff just begging to be exploited.

https://code.videolan.org/videolan/dav2d/-/blob/main/src/dat...

There is a project to write an AV1 decoder in Rust: Rav1d (really stretching the name here).

https://github.com/memorysafety/rav1d

They got within 5% of the performance of dav1d and held a contest to close the gap but I think I read somewhere that this wasn't achieved.

https://www.memorysafety.org/blog/rav1d-perf-bounty/

They claimed

> This is enough of a difference to be a problem for potential adopters, and, frankly, it just bothers us.

But in my opinion nobody actually cares about 5% in absolute terms. It's likely just Rust naysayers using that as an excuse.

I think the likely reason for dav2d using C is that they can reuse lots of code and infrastructure from dav1d. But I agree it would be much better if they worked on Rav2d instead (these names!). You can hardly complain about a 5% overhead if you're opting in to 5x more decoding complexity.

Yes? There is 5x more code to optimize the ASM for.

Norbert Wiener in the 1940s. He invented the technique.

It's a semi-common last name.

https://en.wikipedia.org/wiki/Wiener_filter?wprov=sfla1

standard in DSP. Super cool and can be used on stock trades and other predictive markets.

> Lets see AI take the AV2 spec and Dav1d code and try to make a working high performance AV2 decoder.

That sounds like one of these high-risk, high-reward things that are great for people / projects / companies who have nothing to lose, but is not a great baseline strategy for an established market player. AV2 is here with support from aomedia and its members. AV2 will be used, and we need a production-grade decoder regardless of where AI is at, so it makes much more conservative business sense to use established approaches (language: c/asm, devteam: ffmpeg/dav1d) as a starting point. While that's happening, we can dabble in AI and other risky stuff and see if it helps. If so, great, and if not, nothing lost.

They filed a suit, henceforth making a claim of an issue...... They haven't "proved" anything other then they have lawyers on staff that can file some paperwork until the suit is settled in court...

How does that prove anything?

They're claiming that there are patents, but that doesn't mean there are.

Sorry, I have a patent on questioning whether open source codecs are parent encumbered. Venmo me $1000 or you will be speaking to my lawyers

Every single AV2 news here in the last week has seen exactly the same question.

Either go back read the answers there first, or I will assume you are part of a FUD campaign (yes, I know HN guidelines, but again every single AV2 news in the last week has seen the same rhetorical "questions" as top "comments").

No codec can ever avoid patent-pool claims.

Too much traffic from HN?

``` Too Many Requests The page you have tried to access is not available because the owner of the file you are trying to access has exceeded our short term bandwidth limits. Please try again shortly.

Details: Actioning this file would cause "jbkempf.com//blog/2026/dav2d/" to exceed the per-day file actions limit of 160000 actions, try again later ```

I don't know if I'm underestimating HN's reach but I doubt we did that, probably traffic from a much bigger aggregator/forum

i had that too once i used dyndns address my linux apache crashed when some one posted it here

most likely the millions of llm bots that scrape hn and stuff hugged it when it got to frontpage.

I came to post this as well. Until widespread, inexpensive hardware catches up to a 2018 codec, AV# will remain a niche ideal.

Hardly niche. My laptop isn't new and it has hardware AV1 decoding and encoding. My 10 year old iPhone 7 can play 1080p AV1 video in software for over 200 minutes with VLC. The iPhone 7 was released in 2016, a year and a half before AV1. The dav1d decoder is mighty.

Netflix uses AV1: https://netflixtechblog.com/av1-now-powering-30-of-netflix-s...

YouTube uses AV1. It's tough to be more mainstream than that.

Right click on a YouTube video and select Stats for Nerds. If your system is capable of it, chances are it will be playing back in AV1.

Most of the YouTube videos I watch these days are AV1 encodes. Sometimes it's in VP9 and occasionally it's H.264.

Hope we get a similar option with future lineups that support AV2, especially given how popular video creation and streaming are now.

Is there a compelling reason encoding needs to be done locally?

The point of encoding is to reduce downstream bandwidth for the viewer, and upstream bandwidth for the distribution network.

The content creator only needs to upload it once.

https://gist.github.com/MartinEesmaa/2f4b261cb90a47e9c41ba11...

Devices with AV1 hardware decoding - rare as they are - won't be obsoleted for a long time.

Even devices that don't aren't becoming obsolete. My M1 macbook does not have a hardware decoder and youtube seems to choose AV1 whenever it's available. The software decoder does not noticeably warm the laptop.

Is there actually a reasonably performing encoder that can compete with the x26* family in real world conditions this time?

Last time Intel's SVT-AV1 managed to beat both x265 and x265 at various quality/speed tradeoffs.

I assume there will be an SVT-AV2 too which will semi-automatically gain from the SVT foundation for working with lots of cores but will still need specific work to support and then tune AV2 encoding.

Yes? There is 5x more code to optimize the ASM for.

i had that too once i used dyndns address my linux apache crashed when some one posted it here

Yeah I suppose it’s named after dav1d but still seems like a pretty unfortunate name collision.

Smaller files but harder to decode

https://web.archive.org/web/20210416200451/https://cdn.disco...

Shrek 1 at 8.34MB including audio.. insane

Video resolution: 128x72, hahah. Late 90s RealPlayer postage stamp video is back! To its credit, that whole movie is probably smaller than RealPlayer itself was.

There's a 64MB game boy advance cartridge with shrek on it [1]. Looks pretty horrible [2]. But the GBA only has 16KB fast / 256KB slow RAM, and a 16MHz CPU.

[1] https://archive.org/details/Shrek-Video-GBA [2] https://www.youtube.com/watch?v=CyOfPZQl4MI

I love this, hope to see a AV2 version at 8MB

6MB should be enough for everyone!

Codex is actually short for `Code Execution`

And then AV3: Tokyo Drift, and after that AV Episode 1.

This thread made me think of this song/sketch from Kids in the Hall.

https://www.youtube.com/watch?v=dEGCdVADLWY

Or dave, the command to start Dangerous Dave.

It brings tooling that is a LOT easier. Just things like dependency management, test running and so on is so much better in Rust than in C, even if you happen to write the exact same code because you basically write unsafe code and hand rolled assembly for many things. I think this is people using the tool they know rather than the best tool (And if you know a tool well, it might become the best tool for the job because of that). It could be because a huge chunk of existing code can be re-used. But all else being equal (existing code, existing developers don't exist) I refuse to believe a codec should ever be written in C ever again.

> Rust does not bring more performance. Just more safety.

Though more safety can in some cases bring a bit more performance. For instance, with Rust you can often avoid "defensive copies" of objects.

Surely 100% of these low level features are availale in rust too? I understand it is a massive undertaking and builds off the previous codec(s) but writing these things by hand such as inline assembly seems to be as easy if not easier in Rust?

And as soon as you walk into concurrency territory for a complex codec like this then it seems almost impossible for humans to do correctly while retaining safety.

I don't know if I'm underestimating HN's reach but I doubt we did that, probably traffic from a much bigger aggregator/forum

You are underestimating HN's reach, this happens all the time. As someone who has been on the front page of HN it's a pretty big rush in traffic!

Hacker news doesn't generate much traffic, despite what people are saying.

The host here has a limit of 160000 files served each day. That is extremely low. If the site has an icon, css, a js file and a few images it's 10 files each visit. That's will limit it to 16k visits/day. If there are more files loaded it might just handle a few thousand visits, and they have received more than that from HN now.

The HN hug of death is real. If you aren’t prepared, it can set fire to your server room.

Yeah, it's hard to tell thb, just a guess. But potentially the site also misconfigured their server, causing too much cache misses and hitting the server direclty.

Netflix uses AV1: https://netflixtechblog.com/av1-now-powering-30-of-netflix-s...

YouTube uses AV1. It's tough to be more mainstream than that.

Right click on a YouTube video and select Stats for Nerds. If your system is capable of it, chances are it will be playing back in AV1.

Most of the YouTube videos I watch these days are AV1 encodes. Sometimes it's in VP9 and occasionally it's H.264.

Supported is different from doing it well though. You do notice the performance hit even on TVs that playback YouTube videos on AV1.

Even on 1080p videos running on AV1 on 1x, the TV system bogs down and any kind of interaction has a variable 1-3s lag. On some TVs if you do 1.25x the TV automatically "downgrades" the resolution to 480p to avoid dropping frames.

I wish there was an option to still use VP9 / H.264 on those systems (even limited to 1080p).

Yeah I could imagine the AV1 codec sticking around for a very long while, even as a fallback for AV2. There's still hundreds of millions of people out there using old/cheap devices (especially in developing countries) where that battery drain from software decoding is a big problem, so AV2 would be nonviable.

Same. Mostly AV1, sometimes VP9, and rarely h264.

What's missing mostly: live streams which are h264.

Currently, and I say currently, dav1d is so fast, no worries on that side.

Is there a compelling reason encoding needs to be done locally?

The point of encoding is to reduce downstream bandwidth for the viewer, and upstream bandwidth for the distribution network.

The content creator only needs to upload it once.

Yes.

An uncompressed 1080p, 60fps video with 24-bit color depth would need around 3Gbps to be streamed. And even if you don't need to stream it, that would still consume a sizeable portion of the write throughput of the fastest SSDs currently available; if you go up to 4K, you'd actually exceed that by a lot (not to mention, 1tb of storage would last for about 10 minutes of video).

If you don't encode locally as the video is created, you either need to store RAW frames which takes enormous amount of storage, or you use a different format and suffer quality loss by transcending.

Well yes? The platforms only accept certain resolution/bitrates and also most of America isnt running 1gig up. They're running 5-30 mbps up. So yeah they need to encode it.

Video calls & streaming.

> Let dav2d be.

This is an odd signoff. Are people having a go at dav2d?

If decode is becoming so complicated and expensive the hardware can't handle it, why not just go full neural, send latents, and run decode on tensor cores?

The answer is probably the same as for why not AV2 everything; a lot of hardware couldn't support it today. But in 10 years?

It seems we're running up against fundamental limits of human-engineered video codecs at this point. There might be a lesson in there.

Wait until you try d4vid, it's killer.

> I'm not sure what these two lines mean or if we can compare them, any help?

AV2 saves 25% bandwidth at the cost of 5x more decoding complexity.

What does "complexity" mean here? Computation required?

I understood it as compression is 25% better : a quality of 10mbps in av1 can be achieved with 8mbps in Av2. But, it needs 5 times more compute power for this 25% gain.

https://youtu.be/nepKKz-MzFM&t=7195

If you can stand Lex Friedman for a bit, the VLC authors talk about why you use ASM for a video decoder instead of pure C or rust.

fyi the Rust port already exists: https://github.com/stukenov/rav2d you keep the hand-written asm via FFI, rest is safe Rust. same trick rav1d uses.

Norbert Wiener in the 1940s. He invented the technique.

It's a semi-common last name.

https://en.wikipedia.org/wiki/Wiener_filter?wprov=sfla1

They filed a suit, henceforth making a claim of an issue...... They haven't "proved" anything other then they have lawyers on staff that can file some paperwork until the suit is settled in court...

Soon to be decodable with dav2d!

No codec can ever avoid patent-pool claims.

Every single AV2 news here in the last week has seen exactly the same question.

Should have used AV2

standard in DSP. Super cool and can be used on stock trades and other predictive markets.

Sorry, I have a patent on questioning whether open source codecs are parent encumbered. Venmo me $1000 or you will be speaking to my lawyers

I'm French, and so is JB. To me it feels more like a play on "Let there be light", as in French we'd say "Let the light be". I might be wrong but I did not feel this insinuated people were having a go at AV2/dav2d.

What makes you think that would use less resources?

And it's not really hardware hitting limits, it's specifically software decoding on somewhat weaker machines.

This is a software decoder designed to run on general purpose hardware.

Adding custom hardware like tensor cores to the stack would serves a different use case.

I wouldn't want it myself, but, video games already do something like this - AI upscaling. Render the game at lower resolution / quality settings, then use AI to upscale and fill in the blanks.

This could work / works for video too, give it lower resolution / quality images and AI upscale. Its predecessor would calculate intermediate frames for example.

What does "complexity" mean here? Computation required?

Yes, much higher computation required to encode it, and decode it, both.

dav1d is the av1 decoder and it’s an insane feat of engineering. Written in assembly, it even eschews the normal c calling convention to get even better performance.

Almost half of the file is audio, so you're not saving as much.

Video resolution: 128x72, hahah. Late 90s RealPlayer postage stamp video is back! To its credit, that whole movie is probably smaller than RealPlayer itself was.

I once watched an entire movie (around 90 mins) on a Nokia 6303. This reminds me of that

There's a 64MB game boy advance cartridge with shrek on it [1]. Looks pretty horrible [2]. But the GBA only has 16KB fast / 256KB slow RAM, and a 16MHz CPU.

[1] https://archive.org/details/Shrek-Video-GBA [2] https://www.youtube.com/watch?v=CyOfPZQl4MI

I love this, hope to see a AV2 version at 8MB

There is going to be a never ending stream of them isn't there.

Or go the Apple Watch naming scheme route.

Just “AV”

Next, AV Series 1 and 2 (released simultaneously)

Later, AV Edition but it costs $10,000

Already predicting which versions to avoid, huh.

Are video codecs in the present day able to be sandboxed? In my fantasies at least I’d like the worst a malicious video file can do is cause garbage output or cause the codec to crash.

Forgive the ignorance, I have worked entirely in the abstracted layers of the stack, and mostly web.

Encoder and decoder writers frequently need extremely fine grain control over SIMD instructions in order to get good performance.

The way they weave these instructions can be very hard to express with a high level language.

The authors would be writing a bunch of unsafe rust to get the performance they want and rust makes that more painful on purpose.

I like rust, but C/ASM really is the right choice here. This is one of the few cases where rust's safety is a major detriment.

Performance should not be priority #1. Security should be. Why do we slow down all CPUs to prevent SPECTRE attacks yet continue to write in C? As rav1d shows, the perf loss is far less to migrate from C to Rust than it is to apply SPECTRE mitigations, and adding a sandbox around a memory-unsafe codec is going to be way more expensive again than using Rust code to start.

yes it makes sense to use C/ASM here, but if you're curious, there is a rust port of dav1d named rav1d: https://github.com/memorysafety/rav1d

it's not much slower than the original C/ASM implementation (last i checked ~5%?) but that matters here

It's a Rust/ASM port, look there: https://github.com/memorysafety/rav1d/blob/main/src/ext/x86/...

I am not sure if it is that much safer than the C version when raw assembly is still required.

It is much slower than 5%, there were other independent tests that put it around 20%.

there's a rav2d now too fwiw — https://github.com/stukenov/rav2d same playbook: safe Rust + asm kernels via FFI. just shipped 0.1.0.

Go ask FFmpeg what they're writing their encoders and decoders in.

That isn’t particularly helpful to someone asking a question in good faith. What others are using doesn’t clarify why they are using it. Plus, FFmpeg is itself a decade older than Rust. The OP is asking about starting a new project today.

Just don't try reporting a security issue to them.

https://code.videolan.org/videolan/dav2d/-/blob/main/src/dat...

There is a project to write an AV1 decoder in Rust: Rav1d (really stretching the name here).

https://github.com/memorysafety/rav1d

They got within 5% of the performance of dav1d and held a contest to close the gap but I think I read somewhere that this wasn't achieved.

https://www.memorysafety.org/blog/rav1d-perf-bounty/

They claimed

> This is enough of a difference to be a problem for potential adopters, and, frankly, it just bothers us.

But in my opinion nobody actually cares about 5% in absolute terms. It's likely just Rust naysayers using that as an excuse.

funny you mention it — rav2d exists now: https://github.com/stukenov/rav2d full C-to-Rust port, asm kernels still via FFI like rav1d does. early (0.1.0) but passes conformance against dav2d.

Hacker Times

Hacker Times

Dav2d

Discussion

Discussion

dav2d

AV2

From dav1d to dav2d

Current status

Performance work

checkasm

Open source

What comes next