Initial checkin of G4 Blueprint #4585
base: develop
@@ -0,0 +1,81 @@
# Copyright 2024 Google LLC
Suggested change:
- # Copyright 2024 Google LLC
+ # Copyright 2025 Google LLC
new_image:
family: slurm-gcp-6-11-ubuntu-2204-lts-nvidia-570
project: schedmd-slurm-public
disk_size_gb: 200
Suggested change:
- disk_size_gb: 200
+ disk_size_gb: 100
enable_placement: false
node_count_static: 1
bandwidth_tier: gvnic_enabled
machine_type: g4-standard-48
How should we expose the fact that g4-standard-384 has 2 NICs? I'd consider having comments here then a commented out section below for the second NIC.
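A minimal sketch of what that could look like, not a tested configuration. It assumes the nodeset module's `additional_networks` setting is used for the second NIC, and `g4_network_2` is a hypothetical id for a second VPC module that this blueprint does not currently define. Depending on the module version, more fields of each `additional_networks` entry may need to be set explicitly.

```yaml
settings:
  machine_type: g4-standard-48
  bandwidth_tier: gvnic_enabled
  # g4-standard-384 has 2 NICs. To use the second NIC, switch the machine
  # type and uncomment the block below.
  # ASSUMPTION: g4_network_2 is a hypothetical second VPC module id; it is
  # not defined in this blueprint.
  # machine_type: g4-standard-384
  # additional_networks:
  # - network: null
  #   subnetwork: $(g4_network_2.subnetwork_self_link)
  #   subnetwork_project: $(vars.project_id)
  #   nic_type: GVNIC
```

Keeping the second NIC commented out leaves the default single-NIC `g4-standard-48` deployment unchanged while documenting the option in place.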
enable_controller_public_ips: true
instance_image: $(vars.new_image)
instance_image_custom: true
- g4_partition
- slurm_login
settings:
machine_type: e2-standard-2
Suggested change:
- machine_type: e2-standard-2
settings:
machine_type: e2-standard-2
endpoint_versions:
enable_controller_public_ips: true
Suggested change:
- enable_controller_public_ips: true
deployment_name: ## Set Deployment Name Here ##
region: ## Set GCP Region Here ##
zone: ## Set GCP Zone ID Here ##
new_image:
Suggested change:
- new_image:
+ instance_image:
machine_type: e2-standard-2
endpoint_versions:
enable_controller_public_ips: true
instance_image: $(vars.new_image)
Suggested change:
- instance_image: $(vars.new_image)
source: community/modules/scheduler/schedmd-slurm-gcp-v6-login
use: [network]
settings:
machine_type: e2-standard-2
Remove and default to c2-standard-4
Suggested change:
- machine_type: e2-standard-2
This is the initial check-in of the G4 blueprint. It has been regression-tested on up to 2 nodes. The G4 VM type was introduced in June 2025 and is a family of VMs with NVIDIA RTX PRO 6000 Blackwell GPUs.
G4 VMs can power a variety of workloads, from cost-efficient inference to advanced physical AI, robotics simulations, generative AI-enabled content creation, and next-generation game rendering.