Redis 8.8: New array data structure, rate limiter, performance improvements

> Rate limiting is one of the most common Redis use cases. Traditionally, users implemented rate limiters using server-side Lua scripts combined with client logic. In Redis 8.8, we introduce a window counter rate limiter (by @raffertyyu, together with the Redis team).

I had a look for this and it turns out it's slightly mis-described there - it's not a window counter, it's a "GCRA (Generic Cell Rate Algorithm)" - a leaky bucket algorithm. Code here: https://github.com/redis/redis/blob/unstable/src/gcra.c

The code comments say it was heavily influenced by https://github.com/brandur/redis-cell by Brandur Leach.

It's a neat algorithm (I just learned about it today) - it only needs to store a single integer for each rate-limited key, which is the "Theoretical Arrival Time" when the bucket would next be empty.

While I love Redis as a versatile tool for external data structures, it's still lacking in two areas IMHO:

One, it would be cool to be able to embed it, similar to sqlite, directly into applications.

Two, the HA story is so much more complicated than it should be. I totally acknowledge that concurrency and distributed computing is hard, but it should not require reading heaps of documentation and understanding two entirely separate multi-node approaches only to figure out there are lots of subtle strings attached that make it impractical for many applications.

Where did everyone end up on the Redis/Valkey split? Is there still a reason to use Redis after the license kerfuffle?

There's also a separate blog post that goes into the details of why existing data structures Redis already supported, which could provide array-like behavior, weren't good enough:

https://redis.io/blog/diving-deep-into-rediss-new-array-data...

given his ds4 project, likely collaborated with DeepSeek for this release:

https://github.com/antirez/ds4

window counter rate limiter!

This is awesome!

And arrays look great too. Lots to play with.

And here we see the reason for the sudden AI enthusiasm of Redis authors: array data structures are used in AI. This was clear weeks ago.

The website looks like openclaw's website.

The code comments say it was heavily influenced by https://github.com/brandur/redis-cell by Brandur Leach.

It's a neat algorithm (I just learned about it today) - it only needs to store a single integer for each rate-limited key, which is the "Theoretical Arrival Time" when the bucket would next be empty.

Also, the “cell” in Generic Cell Rate Algorithm is an ATM cell. GCRA is 1990s telecom, the scheduling algorithm ATM switches used to check that 53-byte cells were arriving on the wire at the agreed rate.

window counter rate limiter!

This is awesome!

And arrays look great too. Lots to play with.

And here we see the reason for the sudden AI enthusiasm of Redis authors: array data structures are used in AI. This was clear weeks ago.

The website looks like openclaw's website.

While I love Redis as a versatile tool for external data structures, it's still lacking in two areas IMHO:

One, it would be cool to be able to embed it, similar to sqlite, directly into applications.

What would be the point of embedding Redis into an application? What's the advantage of using Redis over using the builtin (or third party) data structures of the language the application is developed in?

I'm asking as a non-webdev who never quite got what Redis actually does, but would love to learn.

> One, it would be cool to be able to embed it, similar to sqlite, directly into applications.

I've found myself wanting this on several occasions too. I.e. wanting all my rust backend processes (k8s pods) to have some minimal shared state, without having to spin up a Redis cluster. I've talked to Claude about it a couple of times, and it descends into something like, "you gotta use Raft or CRDTs, and pick 2 out of 3 from CAP". Which honestly seems pretty fair, and indicates to me that I'm dreaming for something magical.

Nonetheless, it is nice to hear someone else asking for this. If this is indeed feasible (even if simple/limited), then I'd be interested to try it.

Genuinely interested why we need HA in redis, just not read round robin from multiple non-HA instances? Redis (and memcache) are memory caches and should be treated like that, not like highly consistent distributed session store.

> it's still lacking in two areas

This is entirely different than what Redis is and tries to solve.

Sqlite is embedded. It's not a distributed SQL. Redis is a distributed data structure store and concurrency primitive. These are worlds apart.

> HA story is so much more complicated than it should be

It is precisely as complicated as it needs to be. You don't want data loss.

If you're in the business of high available fault tolerance, you read the manual and learn how to Redis.

Where did everyone end up on the Redis/Valkey split? Is there still a reason to use Redis after the license kerfuffle?

There's also a separate blog post that goes into the details of why existing data structures Redis already supported, which could provide array-like behavior, weren't good enough:

https://redis.io/blog/diving-deep-into-rediss-new-array-data...

@antirez wrote about the development of that data structure last month, which includes how he used LLMs to do it (which was before ds4 for the co-comment mentioning it ;). The PR he linked goes into the motivation.

https://antirez.com/news/164

https://github.com/redis/redis/pull/15162

given his ds4 project, likely collaborated with DeepSeek for this release:

https://github.com/antirez/ds4

Possibly, but the array type code was implemented using GPT/Claude models before DS4 was a thing. I really recommend this write up on how he used LLMs which I think is a more sane/safe way to code with them vs the YOLOing even I'm subject to unfortunately...

https://antirez.com/news/164

The experimental SSD streaming feature (author's demo @ https://x.com/antirez/status/2062536214675067322 - recently merged into the main branch) is great news for that project, allowing for SOTA inference (DeepSeek V4 Flash and Pro!) on RAM-limited machines. Now we need work on large-ish scale batching in order to recover tok/s under the SSD streaming scenario. It's not helpful when running normally (at least not on Apple Silicon) since thermal/power throttling is the constraint in that case, but SSD streaming is a whole other consideration.

For those who may not know, you can cut your costs in AWS by going with Valkey over Redis for about 33% savings.

https://aws.amazon.com/blogs/database/reduce-your-amazon-ela...

We switched to Valkey two years ago. I haven't really looked back. I think both projects have done a lot of nice stuff since the split but it's not really impacting anything I use. The feature set was fine five years ago and I don't think we're using anything in Valkey that wouldn't work in Redis. There are probably a lot of projects that never switched over because they had no real need.

But most of the cloud providers now offer Valkey because of the license changes. Of course, cloud providers not offering Redis was the intention of the license change from the Redis point of view. So mission accomplished for Redis.

But the flip side of course is that if you want to deploy on standard infrastructure rather than self hosting Redis, Valkey is now the easy, low risk path that probably should be the default for most companies that target AWS, Azure, GCP, etc. Same with Elasticsearch vs. Opensearch and a few other products where the community forked because of license changes.

Mentioning Elasticsearch because I know people in both communities and I'm deeply familiar with the stack. A few years on, Opensearch has taken a lot of the momentum from Elasticsearch.

I've switched to Valkey and I'm not really looking back. I'm much more comfortable with those people maintaining the software.

Valkey, because our cloud provider is hosting it and that's obviously what they prefer.

I feel like we're using about 1% of its features at this point - really just as a fast K/V store - so it would be easy to switch if needed, but I can't see a case where we would.

We're a self hosted shop, we went with Valkey. Valkey also has support for RDMA, which we already is running in our infrastructure.

We use almost exclusively Valkey now, mostly because we host on AWS and Render, which both use Valkey. It's faster, cheaper and compatible. I'd consider Garnet too but I believe it doesn't support LUA(or didn't at the time we needed it).

We switched to Valkey after the Redis license kerfuffle happened, discovered we were saving money on our AWS bill, and have no motivation to go back to Redis.

So we’ve stayed with Valkey.

Most people seems to have switched to Valkey, and it's backed by the Linux foundation.

Went with 100% ValKey, if you are solely on AWS it is a no-brainer

We went with DragonFlyDB

> One, it would be cool to be able to embed it, similar to sqlite, directly into applications.

Nonetheless, it is nice to hear someone else asking for this. If this is indeed feasible (even if simple/limited), then I'd be interested to try it.

Mentioning Elasticsearch because I know people in both communities and I'm deeply familiar with the stack. A few years on, Opensearch has taken a lot of the momentum from Elasticsearch.

I've switched to Valkey and I'm not really looking back. I'm much more comfortable with those people maintaining the software.

I'm asking as a non-webdev who never quite got what Redis actually does, but would love to learn.

To me the thing I like about Redis is that it gives you a storage engine very suitable for caches; it handles TTLs and memory pressure, as well as built-in serialization with the ability to get better performance by allowing for some data loss. At the same time, many users will be deploying small programs to individual machines. If you could just have Redis be embedded this would make it very operationally simple: no additional daemons and a single file to backup if you want to.

It would also be useful because of the ability to switch modalities. When running a multi node service, you can use Redis to share data between nodes and use Redis pubsub as a communication bus. If you wanted to support a simple single node configuration too, then it wouldn't need to be a special case, it could just go through the same mechanism but with an embedded Redis instance.

It's pretty similar to SQLite: being able to embed more or less a complete storage engine into your app can be very convenient and powerful.

Probably because Redis gives you a very well-defined/understood set of rich data structures with built-in behavior like TTL, atomic operations, eviction, and persistence. These things are otherwise usually scattered across native types, helper classes, or entirely separate libraries.

A few nice things about doing this in no particular order:

Embedding would make local dev/CI integration testing convenient.

Embedding replicated Redis with each application instance would give you HA benefits while infra-management complexity.

Embedded redis (even via local RPC) is still going to be faster than a lot of languages or frameworks’ built-in data structures. Large array operations in, say, Python are gonna slower than RPCing to Redis (assuming that the data structures are built gradually and not built all at once); to beat Redis you’d have to use numpy or something—-which is definitely preferable, but is extra work if your app already uses Redis for other things.

Just like choosing SQLite over e.g. LMDB or RocksDB, embedded Redis would be a nice future proofing option for small apps during the prototype phase; less would have to be changed to move Redis out of the app than if a different cache or persistence service were chosen.

In practice, mostly scaling sessions and ephemeral data (caching) across multiple intances of a microservice on multiple machines. Seperating the kv store and the application allows upgrading each application while retaining availability and avoiding loss of session data.

For simple cases, it is probably a total overkill to even consider it, but for something heavier, embedding the database gives you a chance to trivially migrate later to a separate database server.

Why would you embed SQLite?

It’s the same use case with a different api.

A typical (meaningful) example might be communication between threads or actors in a single process, or idempotent tests.

As with SQLite, an external xxx that does this for you is certainly better, etc. but it’s convenient sometimes, to have an application that doesn’t go “now before you run this install Postgres…”.

It’s seldom useful for a web app where you control everything.

> Redis (and memcache) are memory caches and should be treated like that

If you haven't come across Kvrocks yet, it may be worth a look: https://github.com/apache/kvrocks https://kvrocks.apache.org/ . It's a database with a Redis-compatible wire protocol, but the database is stored on disk. This means your working set is not limited by RAM and can be a few orders of magnitude larger! On modern SSDs this is still very fast. I think it improves the durability story as well. But the big win is the orders of magnitude larger database space.

As I've been improving my side project https://totalrealreturns.com/ recently I've ended up using both Redis and Kvrocks together. Redis is great for small global state that needs to be super fast. Kvrocks is great for larger bulk data storage (large precomputed datasets), but also supports a lot of the Redis data structures as well as Lua scripts.

Redis is used for plenty of things, not just memory caches.

For example if you use it for session storage, you can't have your application read from a random instance that may or may not contain the session.

For the project I've been working on for more than 15 years, we make extensive use of the pub/sub functionality for distributing live data. Pub/sub scales well across the cluster. Publish to one, and it goes out to subscribers on any of the nodes that they've connected to.

Will millions of users, high availability is critical for this functionality.

Redis doesn't necessarily have to be used as a cache. Streams, for example, make it a great message queue; but a single-node message queue is a single point of failure and thus not viable for many setups.

Years ago I enabled durability on redis & used it as database for an online card game

> it's still lacking in two areas

This is entirely different than what Redis is and tries to solve.

Sqlite is embedded. It's not a distributed SQL. Redis is a distributed data structure store and concurrency primitive. These are worlds apart.

> HA story is so much more complicated than it should be

It is precisely as complicated as it needs to be. You don't want data loss.

If you're in the business of high available fault tolerance, you read the manual and learn how to Redis.

What kind of an answer is that? This software is perfect the way it is, you’re just to inept to hold it right?

A high availability protocol should not leak into the client. It should be able to discover other nodes. It should not land in broken states so easily. It should not limit the number of writers. It should not error during failover.

Are these hard problems? Yes. Should we just accept that things are hard because that’s how the gods have given them to us? No.

For those who may not know, you can cut your costs in AWS by going with Valkey over Redis for about 33% savings.

https://aws.amazon.com/blogs/database/reduce-your-amazon-ela...

But what about Geico?

Valkey, because our cloud provider is hosting it and that's obviously what they prefer.

I feel like we're using about 1% of its features at this point - really just as a fast K/V store - so it would be easy to switch if needed, but I can't see a case where we would.

They prefer it because they don't have to pay to use it.

We went with DragonFlyDB

It's pretty similar to SQLite: being able to embed more or less a complete storage engine into your app can be very convenient and powerful.

A few nice things about doing this in no particular order:

Embedding would make local dev/CI integration testing convenient.

Embedding replicated Redis with each application instance would give you HA benefits while infra-management complexity.

Why would you embed SQLite?

It’s the same use case with a different api.

A typical (meaningful) example might be communication between threads or actors in a single process, or idempotent tests.

It’s seldom useful for a web app where you control everything.

> Redis (and memcache) are memory caches and should be treated like that

We're a self hosted shop, we went with Valkey. Valkey also has support for RDMA, which we already is running in our infrastructure.

We switched to Valkey after the Redis license kerfuffle happened, discovered we were saving money on our AWS bill, and have no motivation to go back to Redis.

So we’ve stayed with Valkey.

Most people seems to have switched to Valkey, and it's backed by the Linux foundation.

Went with 100% ValKey, if you are solely on AWS it is a no-brainer

https://antirez.com/news/164

https://github.com/redis/redis/pull/15162

https://antirez.com/news/164

It doesn’t seem like the right tool for the job, though. Aren’t your own programming language’s constructs much more well-defined / understood ?

For simple cases, it is probably a total overkill to even consider it, but for something heavier, embedding the database gives you a chance to trivially migrate later to a separate database server.

Redis is not a database. It’s a key / value store.

Redis is used for plenty of things, not just memory caches.

For example if you use it for session storage, you can't have your application read from a random instance that may or may not contain the session.

This case is exactly what he talks about. To get HA just setup more than one redis cache - or rebuild the session if it was lost in the redis cache.

Will millions of users, high availability is critical for this functionality.

Years ago I enabled durability on redis & used it as database for an online card game

What kind of an answer is that? This software is perfect the way it is, you’re just to inept to hold it right?

Are these hard problems? Yes. Should we just accept that things are hard because that’s how the gods have given them to us? No.

They prefer it because they don't have to pay to use it.

It doesn’t seem like the right tool for the job, though. Aren’t your own programming language’s constructs much more well-defined / understood ?

Language's own native data-structures are generally much more capable and vast. 99%+ developers use only a very limited set of those capabilities. This approach packages those most used ones into a nice, consistent DSL.

It's similar in effect to what busybox does to shell utilities, though the motives are different.

Redis has some pretty useful primitive that many languages don't:

- HyperLogLog, bloom filter, other probabilistic data structures

- Geospatial operations on stored points and polygons

- Expiring keys, for creating caches

These aren't in most standard libraries, and the Redis implementations tend to be fast, robust and well understood.

I use PHP. None of the language tools or constructs available to me are adequate.

https://blog.codinghorror.com/the-php-singularity/

This case is exactly what he talks about. To get HA just setup more than one redis cache - or rebuild the session if it was lost in the redis cache.

But what about Geico?

Redis is not a database. It’s a key / value store.

It's similar in effect to what busybox does to shell utilities, though the motives are different.

It’s not. Imagine a web app that stores your user information in a session store, mapped by your cookie-provided session ID. Your web app searches redis 1 for the session id, but since that key is on redis 2, the lookup fails and the application thinks there is no such session, and rejects the request.

Now you could solve this specific case by sharding by prefix, or by querying all instances, but then you still do not have high availability: if the instance a specific session is on is down, these users cannot authenticate. At that point you’re better off with a single instance.

That's why you run Redis Sentinel in production

It's so easy a grug brain can do it.

It kind of is a database:

A key-value database, or key-value store, is a data storage paradigm designed for storing, retrieving, and managing associative arrays, a data structure more commonly known today as a dictionary.

https://en.wikipedia.org/wiki/Key–value_database

that's still a database.

it's not a relational database.

Redis has some pretty useful primitive that many languages don't:

- HyperLogLog, bloom filter, other probabilistic data structures

- Geospatial operations on stored points and polygons

- Expiring keys, for creating caches

These aren't in most standard libraries, and the Redis implementations tend to be fast, robust and well understood.

It's so easy a grug brain can do it.

It kind of is a database:

A key-value database, or key-value store, is a data storage paradigm designed for storing, retrieving, and managing associative arrays, a data structure more commonly known today as a dictionary.

https://en.wikipedia.org/wiki/Key–value_database

that's still a database.

it's not a relational database.

I use PHP. None of the language tools or constructs available to me are adequate.

https://blog.codinghorror.com/the-php-singularity/

And you want to embed Redis inside PHP as a solution?? That’s nuts.

I don't think you understand what HA means.

The app would look up in both databases. If it exists in any, there would be a session.

Thisnis strictly different from partitioning which I think you are mixing it up with.

Paritioning is for performance not HA

But that is his point. If you cannot find the session id in redis, you login again. If your Redis server crash, you start a new one and everyone just login again. No data is lost.

That's why you run Redis Sentinel in production

That you do. Until you realise that there is only a single writer in that scenario, it doesn’t address any sharding concerns, you need to use compatible clients that opt into the sentinel protocol, during failover you’ll see client errors… there’s lots of room for improvement on redis HA.

With the amount of problems I had using Redis Sentinel, I really wish there was another way. On multiple occasions, with completely different deployments, it got itself into a non-repairable state where the only option was to drop it and setup the replicas manually. I was hoping someone would do a Patroni-like project for Redis, but I've not found it yet. I've moved all persistent data to PostgreSQL and use a number of Valkeys behind Envoy proxy as a cache.

And you want to embed Redis inside PHP as a solution?? That’s nuts.

Where else could they store their serialized PHP data structures? (just kidding)

I don't think you understand what HA means.

The app would look up in both databases. If it exists in any, there would be a session.

Thisnis strictly different from partitioning which I think you are mixing it up with.

Paritioning is for performance not HA

That’s the precise point I’m making

But that is his point. If you cannot find the session id in redis, you login again. If your Redis server crash, you start a new one and everyone just login again. No data is lost.

Sure the data is lost. A session commonly holds arbitrary state, and even if it’s just the login information. This is ridiculous.

Where else could they store their serialized PHP data structures? (just kidding)

That’s the precise point I’m making

Sure the data is lost. A session commonly holds arbitrary state, and even if it’s just the login information. This is ridiculous.

Obviously these are application decisions.

You, obviously, don't commit important data only to a session that you can loose, if the application does not allow it.

We use redis as infrastructure. To route events and as a cache.

For us redis could go down and we would merely see a degradation of our service with no data loss.

I recommend using redis like that. And then use a database that supports transactions for real data problems.

But we are different. And that's OK.

If you consider it important, you have to store it in a real database. No buts. If you don't consider it important, sharded redis works fine.

Obviously these are application decisions.

You, obviously, don't commit important data only to a session that you can loose, if the application does not allow it.

We use redis as infrastructure. To route events and as a cache.

For us redis could go down and we would merely see a degradation of our service with no data loss.

I recommend using redis like that. And then use a database that supports transactions for real data problems.

But we are different. And that's OK.

If you consider it important, you have to store it in a real database. No buts. If you don't consider it important, sharded redis works fine.

Redis is a real database. If I wasn’t convinced it could retain data I hand it, I wouldn’t use it in the first place.

Just because it works for your use case right now doesn’t mean there isn’t room for improvements to support others too.

Redis is a real database. If I wasn’t convinced it could retain data I hand it, I wouldn’t use it in the first place.

Just because it works for your use case right now doesn’t mean there isn’t room for improvements to support others too.

> Redis is a real database.

Oh good, then you don't need to do any of the stuff that you suggested to do

> Redis is a real database.

Oh good, then you don't need to do any of the stuff that you suggested to do

Redis 8.8 in Redis Open Source is now available, bringing performance improvements alongside a set of powerful new features. Highlights include array - a new general-purpose data structure, a window counter rate limiter, streams message NACKing, subkey notifications for hash fields, explicit control over JSON numeric array storage, multiple aggregators in a single time series query, and a new COUNT aggregator for sorted sets union and intersection.

Summary of performance improvements in 8.8

Redis 8.8 introduces significant end-to-end throughput improvements:

Data type	Operations	End-to-end throughput improvements
Strings	MGET (pipelined, with I/O-threads)	Up to 68%
	MGET (pipelined, single thread)	Up to 50%
	MSET	Up to 8%
Hash	HGETALL	Up to 25% (1K+ fields)
Streams	XREADGROUP	Up to 83% (COUNT 100)
Sorted set	ZADD, ZINCRBY, ZRANGEBYSCORE	Up to 74%
Bitmap	Bitmap operations	Up to 28% (x86)
HyperLogLog	PFCOUNT	Up to 18% (x86)
(several)	SCAN, HSCAN, SSCAN, ZSCAN	Up to 40%

In addition, persistence and replication (full synchronization) is now up to 60% faster.

Summary of new features in 8.8

Redis has always been about choosing the right data structure for the job. In Redis 8.8, we introduce a new general-purpose data structure: array. An array is an index-addressable collection of string values. Each array element is stored at a numeric index, and can be accessed extremely fast. Arrays are dynamic, sparse-friendly, and compute-aware containers, enabling new use cases and better flexibility and efficiency for existing use cases (by @antirez).

Rate limiting is one of the most common Redis use cases. Traditionally, users implemented rate limiters using server-side Lua scripts combined with client logic. In Redis 8.8, we introduce a window counter rate limiter (by @raffertyyu, together with the Redis team).

Our investment in improving Redis Streams continues.

Redis 8.2 simplified message acknowledgment and deletion across multiple consumer groups
Redis 8.4 made it easier for consumers to read both new and idle pending messages
Redis 8.6 introduced idempotent production

Building on this momentum, Redis 8.8 adds support for message NACKing, allowing consumers to explicitly release pending messages so they become immediately available and prioritized for consumption by other consumers.

In Redis 7.4 we introduced hash field expiration – a capability that saw strong adoption. A frequent follow-up request was for field-level notifications, similar to existing key-level notifications. Redis 8.8 delivers this with subkey notifications for hash fields, allowing clients to subscribe to events such as field expiration and deletion. These notifications include the key, the subkey (field name), and the event type.

Retrieving multiple time series aggregators is a common operation. For example, candlestick charts rely on MIN, MAX, FIRST, and LAST aggregations. Prior to Redis 8.8, this required multiple commands. Redis 8.8 now supports multiple aggregators in a single time series command, reducing round trips and simplifying client logic.

Redis 8.4 introduced support for homogeneous numeric arrays in JSON, delivering up to 92% memory reduction – especially valuable for AI workloads. In Redis 8.8, users can now explicitly control how numeric arrays are stored (BF16, FP16, FP32, or FP64), enabling better alignment with source data, vector indexing needs, and memory/precision tradeoffs.

Finally, Redis 8 extends sorted set union and intersection operations with a new COUNT aggregator. This allows the score of each element to reflect either the number of input sets it appears in or the weighted sum across those sets, unlocking new use cases in ranking, scoring, and analytics.

The new features explained

Array: A new general-purpose data structure

Redis has always been about choosing the right data structure for the job. Redis traditionally provides several core data structures, including lists, hashes, sets, and sorted sets. In Redis 8.8, we introduce a new general-purpose data structure: array.

What is an array?

An array is an index-addressable collection of string values. Each element is stored at a numeric index, and can be accessed extremely fast.

Arrays go far beyond basic indexed storage. They are flexible, memory-efficient, and compute-aware. Arrays have some capabilities that enable new use cases and better flexibility and efficiency for many existing use cases:

**An array doesn’t need to have a fixed size
**Arrays grow and shrink dynamically. Elements can be set at any index (0 to 2⁶⁴−1), and the array grows efficiently as needed. Elements can be deleted and the array shrinks accordingly.
**An array can be dense or sparse
**The used indices don’t have to be consecutive, and yet the memory footprint is proportional to the number of elements, and access by index remains extremely fast.
For example: an array index can represent a product ID and the values hold product names or details. Similarly, the index can represent timestamps and the values hold log events.
**An array can be used as a ring buffer (Sliding Window)
**Arrays can act as a bounded rolling buffer: retain the last n elements, maintain insertion order, and automatically overwrite older entries.
Think of a log file or a stream of events, where you want to efficiently keep the last n log entries, events, or measurements, and frequently fetch the last [up to] n values. This is especially useful if you need to feed a rule engine, process a fraud detection window, continuously update a chart, or execute security validations.
**Arrays can aggregate data
**When the values are numeric (for example, real-time sensor reports or stock quotes), arrays support server-side computation over index ranges, including SUM, MIN, and MAX. When the values are binary flags, Boolean aggregators (AND, OR, XOR) are supported as well.
Server-side aggregators are Ideal for sensor data, financial ticks, and real-time metrics. When combined with ring buffer semantics, Arrays enable sliding window analytics such as real-time anomaly detection.
**An array can be searched
**An array can represent a textual file (e.g., .txt, .csv, .log), where each element is indexed by the line-number and holds a single text line. Users can iterate over these lines for analysis, and can search for specific lines using an exact or partial string, a glob-style pattern, or a regular expression.
With ring buffer semantics, arrays can constantly hold the last n log-lines allowing users and agents to contextualize or enrich incoming events based on recent ones.

In summary, an array is a dynamic, flexible, high-performance, index-addressable, compute-aware container that combines aspects of:

List (ordered data)
Time-series (sliding windows)
Sparse map (non-contiguous indices)
Analytical engine (aggregation and search)

Random element access: Array vs list vs hash

Benchmarking arrays against the closest list and hash equivalents under random access at large element counts, the advantages of array show up clearly:

Operation (100K elements; 1 KB values)	Array	List	Hash
Read random element	675K ops/sec	133K ops/sec	626K ops/sec
Write random element	757K ops/sec	137K ops/sec	689K ops/sec
Delete random element	841K ops/sec	—	730K ops/sec

* Redis 8.8, single instance on an Intel Sapphire Rapids m7i.metal-24xl machine

For random-element operations, array provides 8-15% better throughput than Hashes and are at least 5 times faster than Lists.

Memory wise, lists are the most compact. Arrays require ~18% more memory per element, while hashes require 30-46% more memory than lists, depending on the size of the elements:

Element size (100K elements)	Array	List	Hash
100 bytes	122 bytes/element	104 bytes/element	151 bytes/element
1 Kbyte	1290 bytes/element	1035 bytes/element	1337 bytes/element

Ring buffer: Array vs list

A common pattern in Redis is using a list as a bounded ring buffer: clients push new entries with RPUSH and trim list back with LTRIM to keep a constant number of elements. Arrays expose ARRING, which performs the same operation in a single atomic command.

Ring size; element size	Array (ARRING)	List (RPUSH+LTRIM)	Array’s advantage
1K elements; 100 bytes	1.11M inserts/sec	512K inserts/sec	× 2.2
100K elements; 100 bytes	1.12M inserts/sec	528K inserts/sec	× 2.1
1K elements; 1 Kbyte	840K inserts/sec	424K inserts/sec	× 2.0
100K elements; 1 Kbyte	837K inserts/sec	413K inserts/sec	× 2.0

* Redis 8.8, single instance on an Intel Sapphire Rapids m7i.metal-24xl machine

ARRING delivers twice the throughput (inserts/sec) compared to the equivalent RPUSH+LTRIM idiom, independent of ring size. Memory footprint is the same as above: Arrays require ~18% more memory than lists.

When should arrays be used?

Arrays are extremely useful when:

You need extremely fast access by index or by index-range
You need a sliding window over recent data
You need server-side aggregation
You want to search for matching elements

What arrays are not suitable for?

Arrays are not a replacement for other data structures.Use lists if you need push/pop operations, or inserting elements between others.

Use hashes if you need field name-based access instead of numeric indices.

Where can I learn more?

Array documentation: https://redis.io/docs/staging/DOC-6334/develop/data-types/arrays/

Array commands: https://redis.io/docs/latest/commands/?group=array

Diving deep into Redis’s new array data type: https://redis.io/blog/diving-deep-into-rediss-new-array-data-type/

Window counter rate limiter

Window counter rate limiters, including fixed window, fixed window with lazy reset, and sliding window counter variants, use one or more fixed-duration time windows. Each window maintains a counter initialized to 0 when the window is created, along with a maximum capacity representing the number of tokens allowed during that window’s lifetime.

Before Redis 8.8, implementing a Window counter rate limiter required Lua scripting. In 8.8, we introduce a new command for working with window counters:

The idea is simple: each window has a duration (specified via EX or PX) and a token capacity (specified with UBOUND). The number of tokens requested can be specified with BYINT increment (default is 1). INCREX attempts to increment the counter by the requested number of tokens. The key is created if it does not already exist.

To make this command suitable for rate limiter use cases, beyond basic increment semantics, INCREX introduces three new capabilities compared to the existing INCR family of commands:

INCREX returns both the new counter value and the actual increment applied, allowing the caller to immediately determine whether the request should be allowed or rejected.
When ENX is specified, expiration is set only if the key does not already have one. This ensures that the window’s TTL is set only when a window is created and not modified on subsequent requests during its lifetime.
Boundary enforcement: the request is rejected if it would exceed the defined bounds. With SATURATE, the request may be “partially accepted” with the counter clamped to the specified bounds (“saturated”) .

Beyond rate limiting, INCREX can be seen as a generalized form of INCR, INCRBY, INCRBYFLOAT, as well as DECR and DECRBY (via negative increments), with added support for bounds and expiration control.

Streams: NACKing messages

In real-world applications, stream consumers don’t always successfully process the messages they consume. Failures can happen for many reasons:

A consumer may encounter internal issues unrelated to the message itself. For example, failing to reach an external service it needs for processing the message.
A consumer may need to shut down and release unprocessed messages.
A resource-constrained consumer (CPU, memory) may be unable to process certain messages (at least, in a timely manner).
A message may be malformed, poisoned, or even malicious.

Before Redis 8.8, consumers had no way to explicitly reject (NACK) a message. They could either acknowledge it or leave it pending. In practice, this meant other consumers in the consumer group had to recover these messages using XREADGROUP … CLAIM, XPENDING+XCLAIM or XAUTOCLAIM.

This approach introduces delays, since messages remain idle in the Pending Entries List (PEL) until another consumer claims them – an issue for time-sensitive systems.

Redis 8.8 introduces a new command to address this directly:

XNACK key group [SILENT|FAIL|FATAL] IDS numids id [id ...]

This command allows consumers to explicitly release messages back to the stream, making them immediately available for re-delivery.

XNACK supports three modes, each designed for a different real-world scenario:

SILENT - Used when the failure is unrelated to the message (e.g., shutdown or transient internal errors). The delivery counter is decremented by 1, effectively undoing the increment that occurred when the message was added to the PEL.
FAIL - Used when the message cannot be processed by this consumer but may succeed elsewhere (e.g., requires more resources). The delivery counter remains unchanged (it was already incremented by 1 when added to the group's PEL).
FATAL - used for malformed, poison, or potentially malicious messages. The delivery counter is set to LLONG_MAX, making it easy to detect and route to a dead-letter queue.

These modes map naturally to production scenarios: graceful shutdowns or transient failures, resource-based failures, and poison message handling.

When a message is NACKed, it is:

Marked as unowned (its last consumer is set to an empty string)
Assigned a last delivery time of 0
Placed at the end of the NACKed portion of the PEL

The head of the PEL is reserved for all NACKed messages, ordered FIFO among themselves, followed by pending messages that were neither ACKed nor NACKed in their existing order. This guarantees that NACKed messages are always prioritized over idle pending messages.

The delivery order on XREADGROUP is updated accordingly:

When CLAIM min-idle-time is specified:
- NACKed messages (new behavior)
- Messages pending for at least min-idle-time
- Never-delivered messages
If CLAIM is not specified:
- Only never-delivered messages are returned (unchanged behavior)

Hash: Subkey notifications

Redis key-level notifications let clients subscribe to key-related events in real time via pub/sub channels. There are two types of channels:

Keyspace notifications: clients subscribe to a specific key; each message contains an event type.
Keyevent notifications: clients subscribe to specific events; each message contains a key name.

In Redis 7.4, we introduced hash field expiration. This feature saw strong adoption, and a common request followed: support for hash field-level notifications, since key-level notifications do not include field names.

Redis 8.8 introduces subkey-level notifications. Starting with Hashes, clients can now subscribe to events at the field level, such as field updates, deletions, and expirations.

Subkey notifications include the key, subkeys (for hashes, these are field names), and the event type.

Redis 8.8 adds four new channel types:

Subkeyspace notifications: Clients subscribe to a specific key; each message contains an event type and field names.
Subkeyevent notifications: Clients subscribe to a specific event type; each message contains a key name and field names.
Subkeyspaceitem notifications: Clients subscribe to a specific key+field combination; each message contains an event type.
Subkeyspaceevent notifications: Clients subscribe to a specific event+key combination; each message contains field names.

These mirror the flexibility of keyspace notifications while extending visibility down to the field level.

The following events are emitted for hash fields: hset, hdel, hexpire, hexpired, hpersist, hincrby, and hincrbyfloat.

Time series: Multiple aggregators in a single command

The TS.RANGE, TS.REVRANGE, TS.MRANGE, and TS.MREVRANGE commands support an optional AGGREGATION parameter which allows grouping samples into time buckets and applying an aggregation function.

Users can choose from 15 supported aggregators (such as AVG, SUM, MIN, MAX, FIRST, and LAST), and the results are computed accordingly.

In many real-world scenarios, however, multiple aggregations are needed simultaneously. A common example is candlestick charts, which require MIN (low), MAX (high), FIRST (open), and LAST (close).

Before Redis 8.8, this required issuing multiple commands - one per aggregator - resulting in additional latency and client-side complexity.

Redis 8.8 introduces support for multiple aggregators in a single command, allowing all required aggregations to be computed in one request.

The command syntax remains unchanged. Users can now specify multiple aggregators as a comma-separated list:

TS.RANGE key from to AGGREGATION MIN,MAX,FIRST,LAST bucketDuration

Note that aggregators are comma-separated, with no spaces between them.

JSON: Explicitly declaring floating-point array types

The JSON specification defines a generic “number” type, without enforcing a specific representation such as IEEE-754 FP16, FP32, or FP64 for non-integers. As a result, each implementation must choose how to represent numeric values internally.

Starting with Redis 8.4, JSON numeric arrays (such as vector embeddings) are stored using efficient binary representations, significantly reducing memory usage. Redis automatically selects the most appropriate numeric type, but for non-integers, and without additional hints, this usually defaults to FP64 to preserve precision.

For example, the decimal value 0.3 cannot be represented exactly in binary (similar to how 1/3 cannot be represented exactly in decimal). To avoid loss of precision, Redis typically uses FP64. In practice, this means that many floating-point arrays end up being stored as FP64, even when such high precision is not required.

In many real-world scenarios, the original data was already generated using lower-precision formats. Redis 8.8 addresses this by allowing users to explicitly control how floating-point arrays are stored. Users can now choose between BF16, FP16, FP32, and FP64, enabling better alignment with source data, vector indexing requirements, and memory/precision tradeoffs.

The JSON.SET command includes a new optional parameter:

JSON.SET key path value [NX | XX] **[FPHA BF16|FP16|FP32|FP64]**

FPHA stands for Floating-Point Homogeneous Array
It forces Redis to store any floating-point array in value using the specified format
For large arrays, such as embeddings with hundreds or thousands of elements, the difference becomes substantial, often reducing memory usage by multiple times.

Sorted sets: Union and intersection - COUNT aggregator

Sorted sets support set operations via ZUNION, ZUNIONSTORE, ZINTER, and ZINTERSTORE. For all four commands, users can control how element scores are computed in the result using the SUM, MIN, or MAX aggregators, optionally applying weights to each input set.

In some use cases, however, the original scores are not relevant. Instead, users may want the resulting score to reflect how many input sets contain each element, or, when weights are provided, the sum of the weights of the sets that contain it.

In Redis 8.8, we introduce a new COUNT aggregator to support this directly.

With COUNT:

If no weights are specified, the score becomes the number of input sets containing the element (i.e., 1 + 1 + ...)
If weights are specified, the score becomes the sum of the weights of the sets containing the element (i.e., weight₁ + weight₂ + ...)

This effectively ignores the original element scores and focuses only on set membership.

The COUNT aggregator enables patterns such as:

Ranking items by popularity across multiple sources
Finding elements that appear in many datasets
Implementing voting or scoring systems based on presence rather than value

All without requiring additional client-side logic.

Getting started

All these enhancements are generally available on Redis 8.8 today. You can start using the new commands by downloading Redis 8.8 and experimenting with them in your existing workflows.

Have feedback or questions? Join the discussion on our Discord server or reach out to your account manager.

Hacker Times

Hacker Times

Redis 8.8: New array data structure, rate limiter, performance improvements

Discussion

Discussion

Summary of performance improvements in 8.8

Summary of new features in 8.8

The new features explained

Array: A new general-purpose data structure

Window counter rate limiter

Streams: NACKing messages

Hash: Subkey notifications

Time series: Multiple aggregators in a single command

JSON: Explicitly declaring floating-point array types

Sorted sets: Union and intersection - COUNT aggregator

Getting started