[data] Support multiple datasets in a cluster (2/2): partition cluster resources by subcluster label by TimothySeah · Pull Request #63375 · ray-project/ray

TimothySeah · 2026-05-15T19:24:26Z

Summary

The end goal is to support 2 ray data datasets in 1 cluster with subcluster label scheduling. In such a setup, we have 2 datasets and 2 trainers sharing the same AutoscalingCoordinator. The previous PR in this stack (#63331) made sure that each dataset's tasks ended up in the correct subcluster. This PR ensures that all requesters, whether they are trainers or datasets, only request and receive resources in their subcluster.

To this end, the main change was to AutoscalingCoordinator._tick, which is called at regular intervals. AutoscalingCoordinator._tick calls 3 helper methods, which this PR changes as follows:

merge_and_send_requests: each autoscaling request now includes the subcluster label of the requester
update_cluster_node_resources: we now group cluster nodes by subcluster
_reallocate_resources: we now update OngoingRequests with their subcluster-scoped resources.

I also changed the try_trigger_scaling method, which creates datasets' autoscaling requests. Before this change, this method tried to scale up every node in the cluster. Now, it only scales up the relevant subcluster. Note that this only applies to dataset requesters; trainer requesters attempt scaleup by requesting resource bundles with their corresponding label selectors (which includes subcluster labels), so I didn't need to touch that path.

API summary

To use subcluster scheduling, the user must set the __subcluster__ label in their compute config

- name: training_node
  instance_type: p4d.24xlarge
  max_workers: 2
  min_workers: 2
  use_spot: false
  labels:
    __subcluster__: training

- name: validation_node
  instance_type: p4d.24xlarge
  max_workers: 1
  min_workers: 1
  use_spot: false
  labels:
    __subcluster__: validation

and the label_selector on their dataset

# Option 1: set using ray.train.DataConfig
trainer = TorchTrainer(
	train_fn,
	datasets={"train": train_ds, "val": val_ds},
	dataset_config=ray.train.DataConfig(
		datasets_to_split=["train"],
		data_execution_options=DataExecutionOptions(
			per_dataset_execution_options={
				"train": ExecutionOptions(
					label_selector={"__subcluster__": "train"}
				),
				"val": ExecutionOptoins(
					label_selector={"__subcluster__": "validation"}
				)
			}
		)
	)

# Option 2: set directly using DataContext.ExecutionOptions
train_ds.context.label_selector = {"__subcluster__": "training"}

Testing

Ran multitenancy stress test based on this PR (PR: #63737, test: https://buildkite.com/ray-project/release/builds/95982).

gemini-code-assist

Code Review

This pull request implements label_selector and subcluster_label_key support across Ray Data and Ray Train, allowing users to constrain task and actor placement to specific labeled subsets of a cluster. The changes include updates to ExecutionOptions, the AutoscalingCoordinator for resource bucketing, and broad propagation of these selectors through physical operators, planners, and data source implementations. Feedback was provided regarding the merge_label_selector utility, suggesting that it should always return a new dictionary to resolve a contradiction in its docstring and prevent potential mutation bugs.

justinvyu

Great work!

Signed-off-by: Timothy Seah <tseah@anyscale.com>

…resources + request_remaining=True Signed-off-by: Timothy Seah <tseah@anyscale.com>

justinvyu

Thanks!

Signed-off-by: Timothy Seah <tseah@anyscale.com>

cursor

Cursor Bugbot has reviewed your changes using default effort and found 1 potential issue.

^{Reviewed by Cursor Bugbot for commit ef0e3e5. Configure here.}

justinvyu

when is request_resources(label_selectors) used? what happens if you do request_resources(label_selectors, subcluster_selector)? Does one overwrite the other?

Is that meant to be "non-subcluster related labels"?

Can we also raise an error to explicitly disallow one requester trying to request bundles from multiple subclusters?

around here:

  if subcluster_selector and label_selectors:
      req_subcluster = subcluster_selector.get(SUBCLUSTER_LABEL_KEY)
      for i, sel in enumerate(label_selectors):
          bundle_subcluster = sel.get(SUBCLUSTER_LABEL_KEY)
          if bundle_subcluster is not None and bundle_subcluster != req_subcluster:
              raise ValueError(
                  f"Bundle {i} label_selector targets subcluster "
                  f"{bundle_subcluster!r}, but requester is registered to "
                  f"{req_subcluster!r}. Per-bundle cross-subcluster "
                  f"allocation is not supported."
              )

Signed-off-by: Timothy Seah <tseah@anyscale.com>

…ataset subcluster changes Signed-off-by: Timothy Seah <tseah@anyscale.com>

TimothySeah · 2026-06-08T23:10:02Z

when is request_resources(label_selectors) used?
Is that meant to be "non-subcluster related labels"?

Check out #58845 and #63287. In the former PR, my goal was to support placing Ray Train workers on nodes with particular attributes. These would usually be subcluster labels, but could also be nodes within a subcluster. For example, we may want to place Ray Train workers on gpu nodes within the training subcluster, as opposed to Ray Data workers for the training dataset on the cpu nodes within the training subcluster. However, I forgot to update the AutoscalingCoordinator to scale up these nodes if they don't currently exist, which @liulehui added in the latter PR.

Right now, there are two types of requesters - datasets and ray train. Datasets will always request the subcluster using subcluster_selector, while Ray Train will always request all desired node attributes - including the subcluster - together using label_selectors. I agree it's a bit clunky/confusing though so I am open to suggestions on how to clean up this separation.

what happens if you do request_resources(label_selectors, subcluster_selector)? Does one overwrite the other?
Can we also raise an error to explicitly disallow one requester trying to request bundles from multiple subclusters?

subcluster_selector takes precedence: https://github.com/ray-project/ray/pull/63375/changes#diff-23e42254510d06fc2e4595cb52c69872e0b16f6c52932f06b502d63548e72067R361. I also implemented your ValueError suggestion, so now we should raise an error before we even get to this point.

justinvyu

Thanks! Can you update the PR description?

…utoscaler_v2.py Co-authored-by: Justin Yu <justin.v.yu@gmail.com> Signed-off-by: Timothy Seah <tseah@anyscale.com>

Signed-off-by: Timothy Seah <tseah@anyscale.com>

…r resources by subcluster label (ray-project#63375) The end goal is to support 2 ray data datasets in 1 cluster with subcluster label scheduling. In such a setup, we have 2 datasets sharing the same AutoscalingCoordinator. The previous PR in this stack (ray-project#63331) made sure that each dataset's tasks ended up in the correct subcluster. This PR ensures that all requesters, whether they are trainers or datasets, only request and receive resources in their subcluster. --------- Signed-off-by: Timothy Seah <tseah@anyscale.com> Co-authored-by: Justin Yu <justin.v.yu@gmail.com>

…ter (#63982) #63375 doesn't work because `__subcluster__` is not a valid label name. I am testing whether `subcluster` works on this PR (#63737) and cherrypicked that change here. --------- Signed-off-by: Timothy Seah <tseah@anyscale.com>

…r resources by subcluster label (ray-project#63375) The end goal is to support 2 ray data datasets in 1 cluster with subcluster label scheduling. In such a setup, we have 2 datasets sharing the same AutoscalingCoordinator. The previous PR in this stack (ray-project#63331) made sure that each dataset's tasks ended up in the correct subcluster. This PR ensures that all requesters, whether they are trainers or datasets, only request and receive resources in their subcluster. --------- Signed-off-by: Timothy Seah <tseah@anyscale.com> Co-authored-by: Justin Yu <justin.v.yu@gmail.com>

…ter (#64003) #63375 doesn't work because subcluster is not a valid label name. I am testing whether subcluster works on this PR (#63737) and cherrypicked that change here. Merged to 2.56.0 release branch already #63982 --------- Signed-off-by: Timothy Seah <tseah@anyscale.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com> Co-authored-by: Timothy Seah <tseah@anyscale.com>

…r resources by subcluster label (ray-project#63375) The end goal is to support 2 ray data datasets in 1 cluster with subcluster label scheduling. In such a setup, we have 2 datasets sharing the same AutoscalingCoordinator. The previous PR in this stack (ray-project#63331) made sure that each dataset's tasks ended up in the correct subcluster. This PR ensures that all requesters, whether they are trainers or datasets, only request and receive resources in their subcluster. --------- Signed-off-by: Timothy Seah <tseah@anyscale.com> Co-authored-by: Justin Yu <justin.v.yu@gmail.com>

…ter (ray-project#64003) ray-project#63375 doesn't work because subcluster is not a valid label name. I am testing whether subcluster works on this PR (ray-project#63737) and cherrypicked that change here. Merged to 2.56.0 release branch already ray-project#63982 --------- Signed-off-by: Timothy Seah <tseah@anyscale.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com> Co-authored-by: Timothy Seah <tseah@anyscale.com>

gemini-code-assist Bot reviewed May 15, 2026

View reviewed changes

Comment thread python/ray/data/_internal/execution/util.py

TimothySeah mentioned this pull request May 28, 2026

[data] Support multiple datasets in a cluster (1/2): Pipe DataContext.ExecutionOptions.label_selector to task submissions #63331

Merged

justinvyu approved these changes May 28, 2026

View reviewed changes

TimothySeah marked this pull request as ready for review May 29, 2026 17:40

TimothySeah requested review from a team as code owners May 29, 2026 17:40

TimothySeah changed the title ~~[data] AutoscalingCoordinator _tick loop respects subcluster boundaries~~ May 29, 2026

ray-gardener Bot added the data Ray Data-related issues label May 29, 2026

TimothySeah added 4 commits May 29, 2026 17:39

[data] AutoscalingCoordinator _tick loop respects subcluster boundaries

075aba2

Signed-off-by: Timothy Seah <tseah@anyscale.com>

move tests and remove unnecessary env var

4639a7a

Signed-off-by: Timothy Seah <tseah@anyscale.com>

address pr feedback

cba5443

Signed-off-by: Timothy Seah <tseah@anyscale.com>

safeguard against None.get

7061322

Signed-off-by: Timothy Seah <tseah@anyscale.com>

TimothySeah force-pushed the tseah/2-datasets-prototype-2 branch from 60ccec0 to 7061322 Compare May 30, 2026 00:40

cursor Bot reviewed May 30, 2026

View reviewed changes

Comment thread python/ray/data/_internal/cluster_autoscaler/default_cluster_autoscaler_v2.py Outdated

autoscalingcoordinator tick loop respects subcluster even with empty …

683a3fd

…resources + request_remaining=True Signed-off-by: Timothy Seah <tseah@anyscale.com>

justinvyu reviewed Jun 4, 2026

View reviewed changes

justinvyu requested changes Jun 4, 2026

View reviewed changes

TimothySeah added 2 commits June 5, 2026 14:26

Use per-requester registration instead of subcluster_label_selector

3faaf1f

Signed-off-by: Timothy Seah <tseah@anyscale.com>

address other comments

b049c80

Signed-off-by: Timothy Seah <tseah@anyscale.com>

cursor Bot reviewed Jun 5, 2026

View reviewed changes

Comment thread python/ray/data/_internal/cluster_autoscaler/default_autoscaling_coordinator.py Outdated

Comment thread python/ray/data/_internal/cluster_autoscaler/default_autoscaling_coordinator.py Outdated

fix bugs

a8d5863

Signed-off-by: Timothy Seah <tseah@anyscale.com>

cursor Bot reviewed Jun 5, 2026

View reviewed changes

Comment thread python/ray/data/_internal/cluster_autoscaler/default_cluster_autoscaler_v2.py

try_trigger_scaling should also filter by subcluster

ef0e3e5

Signed-off-by: Timothy Seah <tseah@anyscale.com>

cursor Bot reviewed Jun 6, 2026

View reviewed changes

Comment thread python/ray/data/_internal/cluster_autoscaler/default_autoscaling_coordinator.py

justinvyu reviewed Jun 8, 2026

View reviewed changes

TimothySeah added 2 commits June 8, 2026 15:22

address comments

cdeed95

Signed-off-by: Timothy Seah <tseah@anyscale.com>

Raise when label_selectors and subcluster_selector disagree or when d…

aaea07b

…ataset subcluster changes Signed-off-by: Timothy Seah <tseah@anyscale.com>

TimothySeah requested a review from justinvyu June 8, 2026 23:17

justinvyu approved these changes Jun 9, 2026

View reviewed changes

Comment thread python/ray/data/_internal/cluster_autoscaler/default_cluster_autoscaler_v2.py Outdated

TimothySeah changed the title ~~[data] Support multiple datasets in a cluster (2/2): AutoscalingCoordinator _tick loop respects subcluster boundaries~~ Jun 9, 2026

TimothySeah and others added 2 commits June 8, 2026 18:01

Update python/ray/data/_internal/cluster_autoscaler/default_cluster_a…

655aa43

…utoscaler_v2.py Co-authored-by: Justin Yu <justin.v.yu@gmail.com> Signed-off-by: Timothy Seah <tseah@anyscale.com>

clarify comment

6cc713b

Signed-off-by: Timothy Seah <tseah@anyscale.com>

justinvyu enabled auto-merge (squash) June 9, 2026 02:07

github-actions Bot added the go add ONLY when ready to merge, run all tests label Jun 9, 2026

justinvyu merged commit 5d2c4e7 into ray-project:master Jun 9, 2026
8 checks passed

rayhhome mentioned this pull request Jun 9, 2026

[Data] Pin cluster autoscaler version in label_selector forwarding test #63970

Closed

This was referenced Jun 9, 2026

Revert "[data] Support multiple datasets in a cluster (2/2): partition cluster resources by subcluster label" #63973

Closed

[data] Rename subcluster label key from __subcluster__ to ray-subcluster #63982

Merged

This was referenced Jun 10, 2026

Tseah/cp subcluster rename #63997

Closed

[data] Rename subcluster label key from __subcluster__ to ray-subcluster #64003

Merged

Uh oh!

Conversation

TimothySeah commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

API summary

Testing

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

justinvyu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

justinvyu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

justinvyu left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TimothySeah commented Jun 8, 2026

justinvyu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Labels

2 participants

TimothySeah commented May 15, 2026 •

edited

Loading

justinvyu left a comment •

edited

Loading