Hello,
We are doing optimizations in our project and trying to reduce storage request flushes by batching SQL updates/inserts/deletes. While doing this, we noticed that UPDATE statements are not write buffered, while INSERTs are.
Test case:
“PostgreSQL 11.2-YB-2024.2.3.2-b0 on x86_64-pc-linux-gnu, compiled by clang version 17.0.6 (https://github.com/yugabyte/llvm-project 9b881774e40024e901fc6f3d313607b071c08631), 64-bit”
Table: test_flush(tenant_id int, customer_id bigint, version int, some_value bigint). PK(tenant_id, customer_id)
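For reference, the table was created roughly like this (reconstructed from the description above; the original DDL may differ in details such as partitioning clauses):

-- Reconstructed from the table description above; may differ from the original DDL
CREATE TABLE public.test_flush (
    tenant_id   int,
    customer_id bigint,
    version     int,
    some_value  bigint,
    PRIMARY KEY (tenant_id, customer_id)
);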
Transaction isolation mode: repeatable read
Inserts are write buffered (batched) and blazing fast:
DO
$do$
DECLARE
    i integer;
BEGIN
    FOR i IN 1..500 LOOP
        INSERT INTO public.test_flush(tenant_id, customer_id, version, some_value)
        VALUES (i, i, i, i);
    END LOOP;
END
$do$;
Similar updates are not write buffered (run the inserts first so there is data to update) and are slow. This is the simpler, single-table case:
DO
$do$
DECLARE
    i integer;
BEGIN
    FOR i IN 1..500 LOOP
        UPDATE public.test_flush
        SET version = version + 1, some_value = 2
        WHERE tenant_id = i AND customer_id = i;
    END LOOP;
END
$do$;
I know I can update multiple entries with a single UPDATE statement using unnest (see the sketch below), in which case the writes are buffered and overall execution is fast, but I’m more interested in write buffering for multiple updates against different tables. Is there a way to do this?
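For completeness, the single-statement unnest workaround I mentioned looks roughly like this for the test table (a sketch; in a real application the key list would be a bound array parameter rather than generate_series):

-- Batch the 500 per-row updates into one statement via unnest
UPDATE public.test_flush AS t
SET version = t.version + 1, some_value = 2
FROM unnest(ARRAY(SELECT generate_series(1, 500))) AS u(id)
WHERE t.tenant_id = u.id AND t.customer_id = u.id;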
Best regards, Timur