Introduce shared query buffer for client reads #258

Open · uriyage wants to merge 8 commits into unstable from shared_qb

Conversation

@uriyage commented Apr 8, 2024

This PR optimizes client query buffer handling in Valkey by introducing a shared query buffer that is used by default for client reads. This reduces memory usage by ~20KB per client by avoiding allocations for most clients using short (<16KB) complete commands. For larger or partial commands, the client still gets its own private buffer.

The primary changes are:

  • Adding a shared query buffer shared_qb that clients use by default
  • Modifying client querybuf initialization and reset logic
  • Copying any partial query from shared to private buffer before command execution
  • Freeing idle client query buffers when empty to allow reuse of shared buffer
  • Master client query buffers are kept private, as their contents must be preserved for the replication stream

In addition to the memory savings, this change shows a 3% improvement in latency and throughput when running with 1000 active clients.

The memory reduction may also help reduce the need to evict clients when reaching the max memory limit, as the query buffer is the main per-client memory consumer.
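
To make the lifecycle concrete, here is a minimal, self-contained sketch of the pattern the PR describes. All names here (shared_qb, client_t, retain_partial) are hypothetical stand-ins; the actual implementation works on sds strings in src/networking.c.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define SHARED_QB_SIZE (16 * 1024)      /* matches the <16KB fast path */

static char shared_qb[SHARED_QB_SIZE];  /* one buffer reused across clients */

typedef struct {
    char *querybuf;   /* NULL => no private buffer allocated */
    size_t querylen;
} client_t;

/* If a read into shared_qb ends mid-command, the leftover bytes are copied
 * into a private per-client buffer so shared_qb can be reused for the next
 * read. Short complete commands are parsed in place, so most clients never
 * allocate a private buffer at all. */
static void retain_partial(client_t *c, const char *data, size_t len) {
    c->querybuf = malloc(len);
    memcpy(c->querybuf, data, len);
    c->querylen = len;
}

int main(void) {
    client_t c = {0};
    const char *partial = "*3\r\n$3\r\nSET\r\n$3\r\nkey\r\n"; /* truncated SET */
    size_t n = strlen(partial);

    memcpy(shared_qb, partial, n);    /* stands in for read(2) */
    retain_partial(&c, shared_qb, n); /* keep the leftover privately */
    printf("client kept %zu partial bytes privately\n", c.querylen);
    free(c.querybuf);
    return 0;
}
```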

@uriyage force-pushed the shared_qb branch 2 times, most recently from 6ce2ea4 to fc9ee81 on April 8, 2024 at 11:48
@madolson (Member) left a comment

Overall it looks good to me. Can you document the procedure you were using when you saw the 3% performance boost? Did we also test it for pipelined clients?

I would also ideally like some protection in freeClient() to make sure that if a client gets freed, it doesn't somehow also free the shared query buffer.
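
A guard of roughly this shape would do it. This is only a sketch under the hypothetical names from the example in the PR description, not the PR's actual code:

```c
#include <stdlib.h>
#include <string.h>

static char shared_qb[16 * 1024];  /* stand-in for the process-wide buffer */

typedef struct {
    char *querybuf;
    size_t querylen;
} client_t;

/* Free a client's query buffer only if the client actually owns it; the
 * shared buffer must survive any individual client's teardown. */
static void free_client_querybuf(client_t *c) {
    if (c->querybuf != NULL && c->querybuf != shared_qb)
        free(c->querybuf);
    c->querybuf = NULL;
    c->querylen = 0;
}

int main(void) {
    client_t a = {shared_qb, 0};           /* borrowing the shared buffer */
    client_t b = {strdup("PING\r\n"), 6};  /* owns a private copy */

    free_client_querybuf(&a);  /* must NOT free shared_qb */
    free_client_querybuf(&b);  /* frees the private copy */
    return 0;
}
```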

Review threads: src/networking.c (×3, outdated), src/server.c, tests/unit/querybuf.tcl, src/replication.c (outdated)
@uriyage (Author) commented May 5, 2024

Performance Benchmark:

Server Setup:

  • 3 million keys, each with 512 bytes

Benchmark Configuration:

  • 1,000 clients running SET commands
  • Server and benchmark running on separate ARM instances with 64 cores

Server Command:

./valkey-server --save

Benchmark Command:

./valkey-benchmark -t set -d 512 -r 3000000 -c 1000 --threads 50 -h <server_host_address> -n 50000000

Results (Average of 3 runs):

  1. Without shared query buffer:

    • Throughput: 208,361 operations per second
    • Average latency: 4.806 milliseconds
  2. With shared query buffer:

    • Throughput: 214,062 operations per second
    • Average latency: 4.692 milliseconds
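
That works out to roughly a 2.7% throughput gain ((214,062 - 208,361) / 208,361 ≈ 2.7%) and a 2.4% latency reduction, consistent with the ~3% figure in the PR description.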

@PingXie (Member) left a comment

This change LGTM overall. Thanks @uriyage

Review threads: src/server.c (×2), src/networking.c (×3, outdated)
@zuiderkwast (Contributor) left a comment

Generally LGTM

Review thread: src/networking.c (outdated)
uriyage and others added 5 commits on May 9, 2024 (all Signed-off-by: Uri Yagelnik <uriy@amazon.com>; one Co-authored-by: Madelyn Olson <madelyneolson@gmail.com>)
@madolson (Member) left a comment

I'm good with it. @uriyage Can you do the merge to get it up to date?

@madolson added the release-notes (This issue should get a line item in the release notes) and to-be-merged (Almost ready to merge) labels on May 9, 2024
Review thread: src/networking.c (outdated)
codecov bot commented May 13, 2024

Codecov Report

Attention: Patch coverage is 96.22642%, with 2 lines in your changes missing coverage. Please review.

Project coverage is 69.82%. Comparing base (4e18e32) to head (7f7f618).

Additional details and impacted files
@@             Coverage Diff              @@
##           unstable     #258      +/-   ##
============================================
+ Coverage     69.80%   69.82%   +0.01%     
============================================
  Files           109      109              
  Lines         61801    61839      +38     
============================================
+ Hits          43141    43178      +37     
- Misses        18660    18661       +1     
Files              Coverage                Δ
src/replication.c  86.72%  <100.00%>  (-0.05%) ⬇️
src/server.c       88.62%  <100.00%>  (+0.01%) ⬆️
src/networking.c   85.20%  <95.23%>   (+0.30%) ⬆️

... and 13 files with indirect coverage changes

@soloestoy (Member) commented May 13, 2024

@uriyage can you add more tests for the cases where the whole command or an argv exceeds PROTO_IOBUF_LEN?

uriyage added 2 commits (Signed-off-by: Uri Yagelnik <uriy@amazon.com>)
@uriyage (Author) commented May 13, 2024

> @uriyage can you add more tests for the cases where the whole command or an argv exceeds PROTO_IOBUF_LEN?

I checked with 24K values (since with jemalloc we actually allocate more than 16K).

Commands:

./valkey-server --protected-mode no --save

./valkey-benchmark -t set -d 24000 -r 1000 -c 500 --threads 4 -h 192.31.233.204 -n 10000000

I get 83,544 ops/sec without a shared query buffer and 83,238 ops/sec with it.

So, essentially the same, which is expected: with larger values we revert to the existing logic, where each client uses a private buffer.
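
For illustration, the fallback can be thought of as a predicate like the one below. This is a hypothetical sketch; the real decision logic in src/networking.c is more involved (it also handles partially read commands).

```c
#include <stdbool.h>
#include <stddef.h>

#define PROTO_IOBUF_LEN (16 * 1024)  /* generic read buffer size */

/* Hypothetical predicate: a read may target the shared buffer only when the
 * expected payload fits in one 16KB read and the client is not a master,
 * whose stream must be preserved for replication. */
static bool can_use_shared_qb(size_t expected_len, bool is_master) {
    if (is_master) return false;
    return expected_len < PROTO_IOBUF_LEN;
}

int main(void) {
    /* A 512-byte SET stays on the shared path; a 24KB value falls back
     * to a private per-client buffer. */
    return (can_use_shared_qb(512, false) &&
            !can_use_shared_qb(24000, false)) ? 0 : 1;
}
```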

@uriyage (Author) commented May 14, 2024

> I'm good with it. @uriyage Can you do the merge to get it up to date?

@madolson Done

@madolson (Member) commented
@soloestoy Do you have any further followup?

@@ -24,8 +24,24 @@ start_server {tags {"querybuf slow"}} {
# The test will run at least 2s to check if client query
# buffer will be resized when client idle 2s.
test "query buffer resized correctly" {
set rd [valkey_client]

A member commented on the diff above:

It would be better to add two more cases where commands with large arguments exceed 16KB (PROTO_IOBUF_LEN) and 32KB (PROTO_MBULK_BIG_ARG), so that we cover all the special cases.

@soloestoy (Member) commented

I'm wondering whether we can eliminate the size limit on the shared query buffer (and resize it in cron if needed); maybe we can get more benefits.
