Avoid copying between processes #3

zuiderkwast · 2022-06-09T08:32:46Z

Minimize the copying of data between processes. Do as much as possible in the calling process.

drmull · 2022-06-10T21:42:27Z

One idea is to make a separate benchmark repo where we test ered vs eredis_cluster and maybe include other erlang redis cluster clients.

As we have discussed before, it would be interesting to try out some more optimized way of of doing things. One thing we could do is to remove the ered and ered_client process and instead use atomics for the slot map lookup, persistent term for the connection lookup and the counters module for keeping track of the queue size.

slot -> connection index (atomics)
connection index -> queue size (counters)
connection index -> connection pid (persistent term, local pid fits in a word so no global GC to update)

The connection module would have to handle reconnect and status reporting to the cluster module. The queue would be the connection send process message queue.
Avoid gen_server:call since setting up the link is expensive, rely on a timeout instead.

Not sure if it will work, there might be a catch, but if it works I think it would be quite efficient.

zuiderkwast · 2022-06-11T08:21:35Z

Benchmarking is a good idea. We should include ecredis in the comparison.

Atomics and counters are probably good, but I'm not sure about persistent term. It's true that replacing a pid doesn't trigger a global GC, but it still rewrites the whole persistent term table, which may contain stuff out of control of this lib. Perhaps an ETS table is an acceptable choice for connection index -> pid lookup?

Avoid gen_server:call since setting up the link is expensive, rely on a timeout instead.

You mean gen_server:call's monitor is expensive? With timeout you mean we use cast + receive after?

drmull · 2022-06-11T20:53:18Z

but it still rewrites the whole persistent term table, which may contain stuff out of control of this lib.

Yes you are right, I did not realize the persistent term table was global. Better not go that way.

You mean gen_server:call's monitor is expensive?

Yes, I meant monitor. I remember it showed up when I did some profiling and bang/cast + receive performed better. It is hackish but might be worth if we are going all in for speed. At least we could profile it and see if it makes any difference.

ghost · 2022-07-26T09:51:30Z

Perhaps an ETS table is an acceptable choice for connection index -> pid lookup?

A process dict might also be an option, it has the same lifetime as ETS tables (dies with the owner process). If doing only simple key lookups it should be faster than ETS I guess.

zuiderkwast · 2022-07-26T22:35:15Z

Ideally the lookup should happen in the caller (user's) process before the first gen-server call. We don't want to pollute the process dictionary of the user's process.

zuiderkwast added the enhancement New feature or request label Nov 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid copying between processes #3

Avoid copying between processes #3

zuiderkwast commented Jun 9, 2022

drmull commented Jun 10, 2022

zuiderkwast commented Jun 11, 2022

drmull commented Jun 11, 2022

ghost commented Jul 26, 2022

zuiderkwast commented Jul 26, 2022

Avoid copying between processes #3

Avoid copying between processes #3

Comments

zuiderkwast commented Jun 9, 2022

drmull commented Jun 10, 2022

zuiderkwast commented Jun 11, 2022

drmull commented Jun 11, 2022

ghost commented Jul 26, 2022

zuiderkwast commented Jul 26, 2022