Read Repair vs healing to satisfy ReplicaCount #55

Open · cobexer opened this issue Sep 26, 2020 · 3 comments
Assignees: buraksezer
Labels: enhancement (New feature or request)
cobexer commented Sep 26, 2020

During an upgrade of my application, which uses olric embedded, all replicas get restarted in quick succession.
Most keys will never be read, or will only be read very infrequently, which with read repair means that the cache is effectively "empty" immediately after an upgrade.

That behavior means that olric doesn't actually provide useful functionality for my use case, unless I add code that reads the entire keyspace after startup to repair the cache redundancy before letting Kubernetes know that the Pod has started successfully.
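
For illustration, here is a rough sketch of that workaround against the v0.3-era embedded API (olric.New / NewDMap / Get); the knownKeys parameter is a hypothetical, application-maintained key index, since this sketch deliberately does not assume any key-iteration API in olric itself, and the API may differ in other versions:

```go
package cachewarmup

import "github.com/buraksezer/olric"

// warmUp touches every key the application knows about so that a Get on
// each one triggers olric's read repair and restores the configured
// ReplicaCount. knownKeys is a hypothetical, application-maintained key
// index; olric is only asked to Get each key.
func warmUp(db *olric.Olric, dmapName string, knownKeys []string) error {
	dm, err := db.NewDMap(dmapName)
	if err != nil {
		return err
	}
	for _, key := range knownKeys {
		// The value is discarded; the Get itself is what repairs the key.
		// A missing key is not an error for warm-up purposes.
		if _, err := dm.Get(key); err != nil && err != olric.ErrKeyNotFound {
			return err
		}
	}
	return nil
}
```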

I believe that olric could do this internally more efficiently, and that such functionality would be generally useful:

From the documentation it seems that olric could "easily" know that a part of the keyspace doesn't satisfy the requested ReplicaCount, and actively transfer that data to a newly joined member to repair the cache when a node restarts.

So this is a request for:

  • when joining a cluster, ask it to transfer some data to the new node to satisfy ReplicaCount
  • provide an API to detect when this initial sync is finished, so that the embedding application can tell the Kubernetes API when it is safe to continue the rollout (see the sketch after this list)
  • detect node joins and departures quickly enough to keep such a rollout fast
  • a useful notion of node identity in a Kubernetes cluster, where IP addresses are basically useless
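
To make the second bullet concrete, here is a hypothetical readiness gate in the same package as the previous sketch; nothing below is an existing olric API. It simply wires the warm-up helper (or, ideally, a future built-in "initial sync finished" signal) into an HTTP endpoint that a Kubernetes readiness probe can poll:

```go
package cachewarmup

import (
	"log"
	"net/http"
	"sync/atomic"

	"github.com/buraksezer/olric"
)

// serveReadiness exposes /readyz for a Kubernetes readiness probe. The Pod
// only reports ready once warm-up has finished, so a rolling update waits
// until the cache redundancy has been restored.
func serveReadiness(db *olric.Olric, dmapName string, knownKeys []string) {
	var ready atomic.Bool

	go func() {
		// warmUp is the hypothetical helper sketched above; a built-in
		// "initial sync finished" signal from olric would replace it.
		if err := warmUp(db, dmapName, knownKeys); err != nil {
			log.Printf("cache warm-up failed: %v", err)
			return
		}
		ready.Store(true)
	}()

	http.HandleFunc("/readyz", func(w http.ResponseWriter, r *http.Request) {
		if ready.Load() {
			w.WriteHeader(http.StatusOK)
			return
		}
		w.WriteHeader(http.StatusServiceUnavailable)
	})
	log.Fatal(http.ListenAndServe(":8080", nil))
}
```
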
buraksezer self-assigned this Sep 26, 2020

wliuroku commented Dec 18, 2020

Having the same issue here. I have to do a full read repair of all keys to trigger data transfer when a new node joins. Do we know which version will address this issue?


hacktmz commented Aug 30, 2021

Having the same issue.

buraksezer (Owner) commented:

Hi all,

I'm aware that this is one of the most wanted features among users. I have started working on a solution based on a technique called vector clocks. It may be ready for initial tests in a couple of months, and I plan to make it production-ready by the end of this year.

For anyone who is curious about version vectors, here is some info:

  1. https://riak.com/posts/technical/vector-clocks-revisited/index.html?p=9545.html
  2. https://haslab.wordpress.com/2011/07/08/version-vectors-are-not-vector-clocks/
  3. https://en.wikipedia.org/wiki/Vector_clock
  4. https://github.com/hazelcast/hazelcast/blob/master/docs/design/partitioning/03-fine-grained-anti-entropy-mechanism.md
  5. https://people.cs.rutgers.edu/~pxk/417/notes/logical-clocks.html
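
For readers who want something concrete, here is a minimal version-vector sketch in Go illustrating only the increment/compare/merge idea from the links above; it is not olric code:

```go
package versionvector

// VersionVector maps a node ID to the number of updates that node has
// applied to a value. Comparing two vectors tells us whether one replica's
// copy strictly descends from the other or whether they conflict.
type VersionVector map[string]uint64

// Increment records a local update on the given node.
func (v VersionVector) Increment(node string) {
	v[node]++
}

// Descends reports whether v has seen every update that other has seen,
// i.e. v is equal to or newer than other.
func (v VersionVector) Descends(other VersionVector) bool {
	for node, count := range other {
		if v[node] < count {
			return false
		}
	}
	return true
}

// Merge takes the element-wise maximum, producing a vector that descends
// from both inputs. Used after conflicting copies have been reconciled.
func (v VersionVector) Merge(other VersionVector) {
	for node, count := range other {
		if count > v[node] {
			v[node] = count
		}
	}
}
```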

buraksezer added the enhancement (New feature or request) label Aug 30, 2021