
Redis Cluster V2 project #8948

Open · madolson opened this issue May 15, 2021 · 9 comments

@madolson (Contributor) commented May 15, 2021

This issue covers several high-level areas for improving Redis Cluster, ranked roughly by priority within each pillar.

Improved use case support
This pillar focuses on functionality outside the core cluster code that improves the usability of cluster mode.

Pubsub scaling:
Messages are published into a global channel space that doesn't follow slot conventions. The proposal is to introduce a new "pubsub local" capability in which clients direct messages to the node that owns the channel's slot. The goal is to reduce write amplification; a sketch follows the issue links.
#8029
#8621
#3346
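
Redis 7.0 later shipped this idea as sharded pub/sub, where a channel hashes to a slot the same way a key does. A minimal sketch of that interface (addresses and channel name are illustrative):

```
# Sharded channels hash to a slot, so a message propagates only within
# the shard that owns the slot instead of to every node in the cluster.
127.0.0.1:7000> SSUBSCRIBE orders:{user42}
1) "ssubscribe"
2) "orders:{user42}"
3) (integer) 1

# From another client connected to the shard owning {user42}'s slot:
127.0.0.1:7000> SPUBLISH orders:{user42} "new order"
(integer) 1
```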

Clusterbus as HA for single shard:
Allows the cluster bus to replace Sentinel as the HA mechanism for Redis. This will require voting replicas, which is discussed later.
#10875

Request proxying:
The idea here is that Redis server nodes could proxy incoming requests to the node that owns the slot, instead of relying on heavy client-side logic to track the cluster topology. This simplifies things for workloads that don't want to maintain a heavyweight client, and would be an optional configuration.
#11271
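
For context, a minimal sketch of the redirect dance that proxying would hide (slot number and addresses are illustrative):

```
# Today a request sent to the wrong node returns a redirect that the
# client itself must follow:
127.0.0.1:7000> GET user:1001
(error) MOVED 12739 127.0.0.1:7002

# With request proxying enabled, node 7000 would instead forward the
# command to 127.0.0.1:7002 internally and relay the reply.
```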

Custom hashing support:
Some applications want their own mechanism for determining slots, so we should extend the hashtag semantics to include information about which slot a request is intended for.
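
Today the slot is always derived by hashing the hashtag; the extension would let the application name the slot directly. The second form below is purely illustrative, not an accepted syntax:

```
# Current semantics: the slot is CRC16 of the hashtag, mod 16384.
SET {user42}:profile "..."       # slot = CRC16("user42") mod 16384

# Hypothetical extension: the key names its target slot explicitly.
SET {slot=1234}:profile "..."    # illustrative syntax only
```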

Hashtag scanning/atomic deletion:
A common ask is the ability to use SCAN-like commands to find the keys in a hashtag without scanning the entire keyspace. One proposal is the ability to create a group of keys that can be atomically deleted; a secondary index could also solve this.
(I'm sure there is an issue for this, I'll find it)
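
For contrast, the workaround available today (key pattern is illustrative):

```
# Finding a hashtag's keys currently means cursoring the whole keyspace
# with a pattern filter, touching every key on the node:
127.0.0.1:7000> SCAN 0 MATCH {user42}:* COUNT 1000
1) "2048"                 # cursor for the next SCAN call
2) 1) "{user42}:profile"
   2) "{user42}:cart"
```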

Cluster management improvements
This pillar focuses around improving the ease of use for managing Redis clusters.

Hostname support:
Certain applications want hostname support for SNI (hostname validation for TLS), and it's apparently a common ask for Kubernetes.
#2186
#9530
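
This later landed as announced hostnames (#9530). A minimal configuration sketch, assuming the redis.conf directives from that work (hostname is illustrative):

```
# Each node announces a hostname alongside its IP, and clients can ask
# for it as the preferred endpoint in CLUSTER SLOTS/SHARDS replies.
cluster-announce-hostname redis-node-1.example.internal
cluster-preferred-endpoint-type hostname
```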

Consensus based + Atomic slot migration:
Implement a server-driven slot migration command that atomically migrates the data in a slot from one node to another. (We have a solution we hopefully will someday post for this.)
#2807
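
For contrast, the manual, externally driven procedure this would replace (slot number and node IDs are illustrative):

```
# On the target node: mark the slot as importing from the source.
CLUSTER SETSLOT 1234 IMPORTING <source-node-id>
# On the source node: mark the slot as migrating to the target.
CLUSTER SETSLOT 1234 MIGRATING <target-node-id>
# Move keys in batches until the slot is empty:
CLUSTER GETKEYSINSLOT 1234 100
MIGRATE <target-host> <target-port> "" 0 5000 KEYS key1 key2 key3
# Finally, assign ownership (run on the involved nodes):
CLUSTER SETSLOT 1234 NODE <target-node-id>
```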

Improved metrics for slot performance:
Add metrics for individual slot performance so operators can make decisions about hot shards/keys; this makes it easier to identify slots that should be moved. Easy metrics to grab are key accesses; ideally memory would be better, but that's hard. A hypothetical interface is sketched below.
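
One hypothetical shape for such an interface; the command and fields below are illustrative, not an existing API:

```
# Illustrative only: per-slot counters for spotting hot slots.
127.0.0.1:7000> CLUSTER SLOT-STATS SLOTSRANGE 0 1
1) 1) (integer) 0
   2) 1) "key-count"
      2) (integer) 12
      3) "key-accesses"
      4) (integer) 91203
```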

Dynamic slot ownership
For all-master clusters in caching use cases, data durability is not needed, and nodes in a cluster can simply take over slots from other nodes when a node dies. Adding a node could likewise automatically take over slot ownership from existing nodes.
#4160

Auto scaling
Support automatic rebalancing of clusters when adding or removing nodes, as well as during steady state when traffic load is mismatched across shards.
#3009

Moving the cluster bus to a separate thread, for improved reliability when the server is busy:
Today, if the main thread is busy it may not respond to a health-check ping even though the node is still up and healthy. Refactoring the cluster bus onto its own thread would make it more responsive.

Refactor abstractions in cluster.c:
Several abstractions in cluster.c are hard to follow and should be broken up, including: cluster bus and node handling, slot awareness, and health monitoring.

Human readable names for nodes:
Today individual Redis nodes report their hexadecimal IDs, which are not human readable. We should additionally assign each node a more readable name that is either logical or corresponds to its primary.
#9564
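
As later implemented in the commit referenced near the end of this thread, a human-readable node name can be set via config and gossiped over the cluster bus (the value here is illustrative):

```
# redis.conf: a logical identifier that appears in error logs alongside
# the generated hex node ID.
cluster-announce-human-nodename pod-7f9c-primary
```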

Gossiped node deletion
Typically you need to send CLUSTER FORGET to every node in a cluster to delete a node; if you don't reach them all quickly enough, the node is re-added through gossip. Ideally you would forget a node once and it would eventually be forgotten throughout the cluster. The current procedure is shown below.
#10875
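
The procedure today, for contrast (hosts and node ID are illustrative):

```
# CLUSTER FORGET bans the node for only 60 seconds, and only on the node
# that received the command, so every node must be told within that window:
redis-cli -h node1 -p 7000 CLUSTER FORGET 07c37dfeb235213a872192d90877d0cd55635b91
redis-cli -h node2 -p 7000 CLUSTER FORGET 07c37dfeb235213a872192d90877d0cd55635b91
# ...and so on for every remaining node, or gossip re-adds the node.
```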

Module support for different consensus algorithms:
Today Redis only supports the cluster bus as a consensus mechanism, but we could also expose module hooks for other forms of consensus.

Cluster HA improvements
This pillar focuses on the high-availability aspects of Redis Cluster, particularly failover and health checks.

Reduce messages sent for node health decisions:
The Redis cluster bus maintains an N×N full mesh of gossip health messages. This can cause performance degradation and instability in large clusters, since health detection and voting authorization become slow. There are several ways to solve this, such as making failovers shard-local or being smarter about propagating information.
#3929

Voting replicas: (group this with the other consensus items)
Today replicas don't take part in leader election; letting them vote would be useful for smaller cluster sizes, especially single shards.
#10875

Avoiding cascading failovers leading to data loss:
It's possible for a replica without data to be promoted to the primary role, losing all data in the shard. This is typically the result of a cascading failover. Ideally we should add a stopgap to prevent the last node holding data from being demoted.

Placement awareness:
Today individual nodes have no concept of how they are placed relative to each other, and will happily allow all the primaries to end up in the same zone. This may also include a notion of multi-region awareness.

RESP3 topology updates:
Today clients learn about topology changes only when they send a request to the wrong node, and recovering is inefficient because the client must call CLUSTER SLOTS to re-learn the entire topology. Nodes could instead proactively notify clients: a client opts into topology updates and from then on receives only the changes. A sketch follows the issue link.
#10150
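
A hypothetical opt-in flow over RESP3 push frames; the subcommand and push payload below are illustrative, not an existing API:

```
HELLO 3                        # RESP3 enables out-of-band push messages
CLUSTER TOPOLOGY-NOTIFY ON     # illustrative opt-in, not a real subcommand

# On a slot move, the server would push only the delta, for example:
# >3
# +slot-moved
# :1234
# +10.0.0.5:7002
```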

@iakkus (Contributor) commented May 19, 2021

For 'human-readable names', I guess the basic idea is similar to the way docker assigns names to started containers (unless one is given by the user).

I think it would be useful:

  • to let users assign 'aliases' to nodes,
  • to have replicas be named after their primaries (preferably, in a transparent and automated fashion), and
  • to let users define pools of random names from which node names are picked.

@dmitrypol commented Jun 10, 2021

here are a few more ideas:

  • Better integration story for Redis / Cluster / Sentinels.
  • Integrate Sentinel support into Redis; make it easier to do failovers without needing Sentinels (not really cluster-related, but similar).
  • Support multiple databases in Cluster.
  • When Redis Cluster does a failover, send a Pub/Sub message just like Sentinel does, so that in case of hardware failure someone is notified. Right now you have to poll Redis Cluster for its health.
  • Better user experience for setting up a cluster via redis-cli.

@zuiderkwast (Contributor)

  • +1 for builtin failover/sentinel
  • Use one db for cluster (db 0 like now), other db numbers for non-cluster (e.g. local cache for colocated app server)

@hwware (Collaborator) commented Aug 16, 2021

I have one more idea, for adding and deleting cluster slots:
Currently we can only add or delete slots individually, such as cluster addslots 1 2 3 .... 5000.
If we want to add many slots in a range, we need to use a bash shell.
I think we could add a command like cluster addslots -r 1 5000, meaning add slots 1 through 5000,
and implement a similar command for deleting slots. A sketch follows.
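
For context, the shell workaround versus the proposed shape; Redis 7.0 later added range forms as CLUSTER ADDSLOTSRANGE / DELSLOTSRANGE:

```
# Today: expand the range in the shell.
redis-cli cluster addslots $(seq 1 5000)

# Range commands as later shipped in Redis 7.0:
redis-cli cluster addslotsrange 1 5000
redis-cli cluster delslotsrange 1 5000
```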

@zuiderkwast (Contributor)

Does "Gossiped node deletion" involve a timed blacklist as described in #1410?

@chenyang8094 (Collaborator) commented Feb 23, 2022

Request proxying:
The idea here is that Redis server nodes could proxy incoming requests to the desired node instead of relying on heavy client side logic to know the cluster topology. Simplifies some of the work for workloads that don’t want to maintain a heavy client. This would be an optional configuration.

It seems to be somewhat related to my issue #10307

@judeng (Contributor) commented Oct 9, 2022

Use one db for cluster (db 0 like now), other db numbers for non-cluster (e.g. local cache for colocated app server)

I don't know the history of the cluster very well. Why doesn't Cluster support multi-DB mode? Are there technical difficulties in implementing it? Has it been discussed in the community?

@judeng (Contributor) commented Oct 17, 2022

@madolson Thanks for answering!
In my scenarios a cluster is shared by multiple callers, and using multiple DBs could reduce the dict overhead. I'd like to try it.

madolson added a commit that referenced this issue Jun 18, 2023
This PR adds a human-readable name for cluster nodes that is visible in error logs. This is useful so that admins and operators of Redis Cluster have better visibility into failures, without having to cross-reference the generated node ID against some logical identifier (such as a pod ID or an EC2 instance ID). This is mentioned in #8948. Node names can be set with the cluster-announce-human-nodename config and are gossiped using the cluster bus extension from #9530.

Co-authored-by: Madelyn Olson <madelyneolson@gmail.com>
@judeng (Contributor) commented Apr 17, 2024

Hi everyone, any update on Cluster V2? Could we replace the gossip protocol in 8.0?

@madolson madolson closed this as completed May 1, 2024
@K-Jo K-Jo reopened this May 7, 2024
@redis redis deleted a comment from madolson May 7, 2024