Improvements to partition export by arthurpassos · Pull Request #1402 · Altinity/ClickHouse

arthurpassos · 2026-02-13T17:49:32Z

Changelog category (leave one):

Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Forward port of unmerged PR #1177

Documentation entry for user-facing changes

As of now, this PR does the following things:

Limit the amount of export part operations that can be scheduled based on the BackgroundMovesExecutor. This applies both to partition and part exports. It is no longer memory bound, it solves the problem of one replica locking all parts in a task just because it ran faster even if it did not have bandwidth to execute all of them.
Introduce ZooKeeper export partition requests specific metrics (I suppose we'll remove this later, for now it is good to have to be able to benchmark different approaches).
Introduce lock a data part inside the task strategy as opposed to locking and only then scheduling a task. This is controlled by export_merge_tree_partition_lock_inside_the_task. I don't think we want users to do it, it is to experiment and benchmark. See ExportPartFromPartitionExportTask
Only run the scheduler if we have available slots. And only schedule as many as we can (based on slots). This is subject to TOCTOU. The background executor has an internal pending queue, so even if we have available slots a task might end up on pending. To tackle this, we would need to write a new background executor, but not a must for now.
Add local system.replicated_partition_exports option. Refactor querying system.replicated_partition_exports to use multi_read requests instead of several different read requests. Throws iff multi_read is not supported (this was vibe coded).
Shuffle parts to export in a given partition task before choosing a part to work on to avoid locking collisions.
Save the entire Settings object instead of only FormatSettings - now more settings should be preserved (part export only)
Clear part references in partition export manifest once it is no longer pending

lock_inside_the_task is not production ready as of now (the entire feature is not production ready tbh) - there is a possible crash in case the user schedules an export, changes the schema of the destination table, and the export executes. This is because validation is not being done on the fly for this setting. But I think it is ok to ignore this corner case for now.

This partially tackles the following:

Implement some sort of limit on how many part exports can be schedule in a given replica #1176 (fully, but can still be improved)
Add metrics / investigate if existing zk metrics is enough to measure chattiness wiith zk on partition export #1174 (fully)
Clear export partition part references once the task has been completed or killed #1153 (fully)

CI/CD Options

Exclude tests:

Regression jobs to run:

…xport request in case pool is full

…ly preserve lock inside task setting

arthurpassos · 2026-03-04T16:08:15Z

@codex please review it deeply

github-actions · 2026-03-04T16:18:56Z

Workflow [PR], commit [1e85917]

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 6fad4e2402

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-04T16:21:32Z

src/Storages/ExportReplicatedMergeTreePartitionManifest.h

            /// what to do if it's not a valid value?
        }

+        manifest.lock_inside_the_task = json->getValue<bool>("lock_inside_the_task");


Keep manifest parsing backward-compatible

Make lock_inside_the_task optional when deserializing metadata. Existing metadata.json znodes created before this change do not contain that key, so json->getValue<bool>("lock_inside_the_task") throws during polling/system-table reads and can prevent processing of in-flight exports after upgrade until those znodes are manually cleaned up.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-04T16:21:32Z

src/Storages/MergeTree/ExportPartFromPartitionExportTask.cpp

+    LOG_INFO(storage.log, "ExportPartFromPartitionExportTask: Failed to lock part {}, skipping", part_name);
+    return false;


Remove export manifest when lock acquisition fails

Clean up the pre-inserted export_manifests entry on the lock-failure path. In lock_inside_the_task mode, the scheduler inserts the manifest before execution, but if tryCreate(.../locks/<part>) fails here, the task exits without erasing that manifest, leaving the part permanently marked as already-exporting on this replica and blocking retries if the current lock owner dies before finishing.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-04T16:21:32Z

src/Storages/MergeTree/ExportPartitionManifestUpdatingTask.cpp

+        info.transaction_id = metadata.transaction_id;
+        info.create_time = metadata.create_time;


Populate query_id in partition export system rows

Assign info.query_id when building ReplicatedPartitionExportInfo. This method now fills many fields from manifest metadata but never sets query_id, so system.replicated_partition_exports.query_id is always empty even for active exports, which is a regression from the previous implementation and breaks query-to-export correlation.

Useful? React with 👍 / 👎.

arthurpassos added 15 commits February 13, 2026 11:39

keep track of export partition zk requests

5c09119

vibe coded getexportpartitioninfo

68ae542

move vibe coded getpartitionexports inside the updating task

4d98ec9

rmv unexistent increment

0fdc716

lock part inside task

f2ae2cc

tmp

a759957

okish

122fdc3

add setting to control locking behavior (inside/outside task), deny e…

16ae536

…xport request in case pool is full

clear part references from partition task when status changes, proper…

6d74239

…ly preserve lock inside task setting

settings history

51fc82f

improvements

645c359

implement local query to system replicated partition exports

0d7768b

address comments

060ce68

glitch

a49476d

fix build

4601ae1

arthurpassos changed the base branch from fp_antalya_26_1_export_part_partition to antalya-26.1 March 4, 2026 16:00

Merge branch 'antalya-26.1' into 26_1_export_improvements_test

6fad4e2

arthurpassos changed the title ~~26 1 export improvements test~~ Improvements to partition export Mar 4, 2026

arthurpassos marked this pull request as ready for review March 4, 2026 16:07

test fixes

f1a8d15

chatgpt-codex-connector bot reviewed Mar 4, 2026

View reviewed changes

fix some issues AI found

ee5a230

CarlosFelipeOR mentioned this pull request Mar 5, 2026

Export Part/Partition integration tests (PR #1388) failing consistently under ASAN builds Altinity/clickhouse-regression#112

Open

Merge branch 'antalya-26.1' into 26_1_export_improvements_test

63d78cf

svb-alt added antalya antalya-26.1 labels Mar 6, 2026

ianton-ru and others added 2 commits March 6, 2026 13:12

Fix test execution in parallel

d2288e6

fix exceptions count bug

4fbcb23

arthurpassos added 3 commits March 6, 2026 14:53

make export partition tests run sequentially

3cbb91d

small fix

22e8bd4

drop table after test execution

1e85917

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements to partition export#1402

Improvements to partition export#1402
arthurpassos wants to merge 24 commits intoantalya-26.1from
26_1_export_improvements_test

arthurpassos commented Feb 13, 2026 •

edited by zvonand

Loading

Uh oh!

arthurpassos commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		LOG_INFO(storage.log, "ExportPartFromPartitionExportTask: Failed to lock part {}, skipping", part_name);
		return false;

		info.transaction_id = metadata.transaction_id;
		info.create_time = metadata.create_time;

Conversation

arthurpassos commented Feb 13, 2026 • edited by zvonand Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Documentation entry for user-facing changes

CI/CD Options

Exclude tests:

Regression jobs to run:

Uh oh!

arthurpassos commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

arthurpassos commented Feb 13, 2026 •

edited by zvonand

Loading

github-actions bot commented Mar 4, 2026 •

edited

Loading