Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
231 commits
Select commit Hold shift + click to select a range
f2372d0
debug bazel cpu rbe presubmit for windows
nitins17 Nov 13, 2024
fc82f13
enable workflows
nitins17 Nov 13, 2024
08ef29a
update linux arm64 container to ml-build
nitins17 Nov 14, 2024
b77573d
upload and download artifacts from gcs bucket
nitins17 Nov 14, 2024
26744b6
update container images to one that has gcloud tools
nitins17 Nov 14, 2024
afcee12
update image and runner type
nitins17 Nov 14, 2024
a2c78a1
change upload path
nitins17 Nov 14, 2024
1c27adb
adjust upload and download command
nitins17 Nov 14, 2024
84f072e
create dist folder before downloading artifacts
nitins17 Nov 14, 2024
b4f7c8e
experiment with reusable workflows
nitins17 Nov 15, 2024
e123c8e
Refine if conditions and update the upload destination
nitins17 Nov 15, 2024
0a54f97
refine if condition
nitins17 Nov 15, 2024
2f122fd
refine if condition
nitins17 Nov 15, 2024
866d149
fix file name
nitins17 Nov 15, 2024
9ecfc00
create artifact list dynamically
nitins17 Nov 15, 2024
0f5783f
try a fix for the dynamic artifact list
nitins17 Nov 15, 2024
e210fd2
try a fix for the dynamic artifact list (2)
nitins17 Nov 15, 2024
f9a90b9
try a fix for the dynamic artifact list (3)
nitins17 Nov 15, 2024
22b9076
disable workflows
nitins17 Nov 15, 2024
09c576b
try a fix for the dynamic artifact list (4)
nitins17 Nov 15, 2024
117186b
switch to 2 job strategy
nitins17 Nov 15, 2024
6aa801f
fix syntax error
nitins17 Nov 15, 2024
98742b2
add missing job dep
nitins17 Nov 15, 2024
e4b45d1
Change input type to be able to parse when generating matrix
nitins17 Nov 15, 2024
6eca13c
change how artifact matrix is constructed
nitins17 Nov 15, 2024
c356513
fix syntax issue
nitins17 Nov 15, 2024
196c3de
fix syntax issue
nitins17 Nov 15, 2024
42253cc
update build.py
nitins17 Nov 18, 2024
9793661
set default values if inputs context is unavailable
nitins17 Nov 18, 2024
cce45ca
set default shell to bash when determining the artifact matrix
nitins17 Nov 18, 2024
6471a96
add wait for connect step to debug issue
nitins17 Nov 18, 2024
03f8d24
fix syntax error
nitins17 Nov 18, 2024
ab1ae97
try fix
nitins17 Nov 18, 2024
8b43482
try fix
nitins17 Nov 18, 2024
354bc83
Add variable to know if a workflow call was made, enable all package …
nitins17 Nov 18, 2024
d1ca997
try fix
nitins17 Nov 18, 2024
aaa2a70
make is_workflow_call metadata input as not required
nitins17 Nov 18, 2024
55a5d03
try fix
nitins17 Nov 18, 2024
bf48bb3
define inputs as bash variables
nitins17 Nov 18, 2024
c8714df
Change determine_matrix job name
nitins17 Nov 18, 2024
ddd84f7
update build.py
nitins17 Nov 18, 2024
1767756
fix exclude filters
nitins17 Nov 18, 2024
6c0e0fb
introduce rbe env var to decide when to use rbe flags
nitins17 Nov 18, 2024
ea7b99f
update scripts to match upstream
nitins17 Nov 19, 2024
dd464ff
change upload destination and only enable linux x86 builds
nitins17 Nov 19, 2024
6f1995d
make workflow calls simpler
nitins17 Nov 19, 2024
28a2652
simplify workflow calls
nitins17 Nov 20, 2024
e72f19a
fix print statemetns and fix comments
nitins17 Nov 20, 2024
4498912
enable resuable workflow for pytest gpu and disable build artifact wo…
nitins17 Nov 20, 2024
960796a
Set number of processes to be a multiple of the gpu count
nitins17 Nov 20, 2024
d14a236
Set test environment variables in a single section
nitins17 Nov 20, 2024
aa01d86
rework tpu job and scripts to match upstream
nitins17 Nov 20, 2024
6b7720f
Make the resuable pytest cpu workflow as default
nitins17 Nov 20, 2024
e43a85e
Reuse the Build artifact workflow
nitins17 Nov 20, 2024
b61691c
stop building jax and remove python exclude filter for pjrt
nitins17 Nov 20, 2024
2af5f0c
enable windows runners and run for all python versions
nitins17 Nov 20, 2024
75695a0
Add all Python versions to run_pytest job
nitins17 Nov 20, 2024
bd34770
update docker image to one with gsutil
nitins17 Nov 20, 2024
68ea9eb
Use the correct Python binary to install deps
nitins17 Nov 20, 2024
bb1130c
update site-packages to dist-packages
nitins17 Nov 20, 2024
3ad80f1
change input name to upload_destination_prefix
nitins17 Nov 20, 2024
58ca848
remove unused env var
nitins17 Nov 20, 2024
6fef417
adjust os name when running on windows runners
nitins17 Nov 20, 2024
67d7d05
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Nov 20, 2024
191389b
add concurrency settings and remove condition that disables running o…
nitins17 Nov 20, 2024
f9165b1
replace continue-on-error with fail-fast
nitins17 Nov 20, 2024
dc9dec4
move log_stream outside of class
nitins17 Nov 22, 2024
8857b65
add the use_clang flag
nitins17 Nov 25, 2024
a2632fd
enable the build artifacts workflow
nitins17 Nov 25, 2024
482635c
debug asan workflow
nitins17 Nov 26, 2024
a7fa98f
add a check for return code
nitins17 Nov 26, 2024
c41f1c2
add a ci workflow to debug
nitins17 Nov 26, 2024
0603e25
add a python3 binary
nitins17 Nov 26, 2024
c9f5b86
add python binary to path
nitins17 Nov 26, 2024
eae1124
change to v4 checkout
nitins17 Nov 26, 2024
9625f1c
change the fetch depth
nitins17 Nov 26, 2024
93b724a
debug
nitins17 Nov 26, 2024
69d963e
disable workflows
nitins17 Nov 26, 2024
3a88546
use ml build image
nitins17 Nov 26, 2024
dc3070a
add deps
nitins17 Nov 26, 2024
643996a
install git before actions/checkout
nitins17 Nov 26, 2024
1b81bc3
revert other changes
nitins17 Nov 26, 2024
d0a2664
try a fix for path
nitins17 Nov 26, 2024
204c7cf
try a fix for path
nitins17 Nov 26, 2024
630a1df
try a fix for path
nitins17 Nov 26, 2024
68bf70f
try a fix for path
nitins17 Nov 27, 2024
53b7d02
try a fix for path
nitins17 Nov 27, 2024
de1fcd1
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Nov 27, 2024
bddc349
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Nov 27, 2024
2afda3d
update workflows after sync to upstream
nitins17 Nov 27, 2024
6a52899
fix workflow names
nitins17 Nov 27, 2024
1397c94
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Nov 27, 2024
1611e30
sync to upstream
nitins17 Nov 27, 2024
5788853
fix rbe setting and change msys conversion script
nitins17 Nov 27, 2024
2135bb7
change msys conversion script call to match its new usage
nitins17 Nov 27, 2024
6328485
fix comment
nitins17 Nov 27, 2024
140173d
fix conversion script
nitins17 Nov 27, 2024
e49d60d
fix conversion script
nitins17 Nov 27, 2024
2a50a63
fix comment
nitins17 Nov 27, 2024
645d3cc
fix conversion script
nitins17 Nov 27, 2024
387cc60
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Nov 27, 2024
fdb6ef9
update msys conversion script
nitins17 Nov 27, 2024
c9b5bc3
Change to ml build container
nitins17 Nov 28, 2024
d47d72d
update docker scripts
nitins17 Nov 28, 2024
0ea43ca
Enable workflows
nitins17 Nov 28, 2024
1a33626
fix gsutil download link
nitins17 Nov 28, 2024
2dc0af2
add a workflow to test building mulitple artifacts with build CLI
nitins17 Nov 28, 2024
b308da6
mark github repo as safe
nitins17 Nov 28, 2024
3c41e56
switch to 16 core windows runner
nitins17 Dec 3, 2024
9531ed2
add flag to enable detailed timestamped logging
nitins17 Dec 4, 2024
58e6611
test non ci build
nitins17 Dec 4, 2024
844afaa
revert change
nitins17 Dec 4, 2024
e1e7828
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Dec 4, 2024
cf02fd3
fix exclude filters
nitins17 Dec 4, 2024
97a2b1e
replace t2a with c4a runners
nitins17 Dec 5, 2024
cbf4548
test GPU workflow with nosla image
nitins17 Dec 5, 2024
5981613
remove exclude filters on cuda pjrt
nitins17 Dec 5, 2024
d7704a0
change image to one with gsutil
nitins17 Dec 5, 2024
ce247f9
disable tpu workflow
nitins17 Dec 6, 2024
e23e7fb
try adding gpus=all option to debug gpu failures
nitins17 Dec 6, 2024
4d44155
add a workflow to debug matrix in resuable workflows
nitins17 Dec 6, 2024
c871fd1
add workflows to debug matrix strategy for reusable workflows
nitins17 Dec 10, 2024
5d8e9a4
Merge branch 'main' into srnitin/test-scripts-workflows
nitins17 Dec 10, 2024
0d47fe1
try 2
nitins17 Dec 10, 2024
063255a
Merge branch 'srnitin/test-scripts-workflows' of https://github.com/g…
nitins17 Dec 10, 2024
0405669
use inputs to define matrix
nitins17 Dec 10, 2024
dd8b3d4
use inputs to define matrix
nitins17 Dec 10, 2024
2bd68cb
pass artifact as input
nitins17 Dec 10, 2024
f8e1ce6
run only jaxlib 3.10 build
nitins17 Dec 10, 2024
bad1a51
update reusable workflows to new matrix strategy
nitins17 Dec 11, 2024
43f698b
remove concurrency settings
nitins17 Dec 11, 2024
24f0d1e
adjust upload/download logic for windows
nitins17 Dec 11, 2024
0ad69cc
fix syntax issue
nitins17 Dec 11, 2024
a20be40
add pipe to enable multi-line if condition
nitins17 Dec 11, 2024
cfaf3d0
Fix syntax issue
nitins17 Dec 11, 2024
fb87575
try fix for if condition evaluation
nitins17 Dec 11, 2024
6278a73
combine command with and operator
nitins17 Dec 11, 2024
97c4c70
adjust paths when on windows
nitins17 Dec 11, 2024
1bfcc26
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Dec 11, 2024
774e5dc
merge changes from upstream
nitins17 Dec 11, 2024
c905115
enable the bazel cpu rbe workflow
nitins17 Dec 12, 2024
c25c78f
fix syntax issues
nitins17 Dec 12, 2024
cd587a5
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Dec 12, 2024
a7b4933
Utilize cache better by keeping the set of build options same when bu…
nitins17 Dec 16, 2024
1615c23
Fix syntax issue
nitins17 Dec 16, 2024
16f74bb
remove rbe
nitins17 Dec 16, 2024
8fa2800
build jaxlib first
nitins17 Dec 16, 2024
8d77de1
update .bazelrc to upstream
nitins17 Dec 16, 2024
17928f5
update build.py to upstream
nitins17 Dec 16, 2024
0878b0e
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Dec 16, 2024
ac02e1a
remove ci config flag
nitins17 Dec 17, 2024
b162517
apply --config=clang on all platforms
nitins17 Dec 17, 2024
b9ccb7f
add a matrix strtegy for building with the ml-build rbe images
nitins17 Dec 17, 2024
639f636
Merge branch 'main' into srnitin/test-scripts-workflows
nitins17 Dec 17, 2024
87cfd2d
update workflow call
nitins17 Dec 17, 2024
501ec7e
update python verison to match workflow call
nitins17 Dec 17, 2024
6123ecd
update python verison to match workflow call
nitins17 Dec 17, 2024
c31f0c3
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Dec 18, 2024
2eb10cc
add new workflows for running wheel tests
nitins17 Dec 18, 2024
a8da0ae
fix indent
nitins17 Dec 18, 2024
bb2a90c
fix indent
nitins17 Dec 18, 2024
fc484e6
add missing type to iputs
nitins17 Dec 18, 2024
148a6a0
move matrix stategy to calling workflow
nitins17 Dec 19, 2024
18947b3
remove concurrency
nitins17 Dec 19, 2024
3c1eda7
add bazel gpu to wheel tests wokrflow
nitins17 Dec 19, 2024
1c83465
switch to t2a to debug crashes
nitins17 Dec 19, 2024
db08947
disable windows
nitins17 Dec 19, 2024
a6fdb20
update string match
nitins17 Dec 19, 2024
c0c7ec8
update string match
nitins17 Dec 19, 2024
9232ee3
update build.py to upstream
nitins17 Dec 19, 2024
08e708d
update utils.py to upstream
nitins17 Dec 19, 2024
71699a2
update to upstream
nitins17 Dec 19, 2024
c7df7a8
switch to t2a machines, delete old workflows
nitins17 Dec 20, 2024
2619e7c
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Dec 20, 2024
6d23c1a
use the t2a machine for running tests
nitins17 Dec 20, 2024
ae0d99b
move output_path to be inside for loop
nitins17 Dec 21, 2024
7da8aca
change to building an editable wheel
nitins17 Dec 21, 2024
740a92c
sync to upstream
nitins17 Dec 21, 2024
47c7afa
create a pytest cpu workflow to debug arm64 crashes
nitins17 Dec 21, 2024
be0e54c
Change name and build config
nitins17 Dec 21, 2024
903910a
upload artifacts to a single location in gcs to match internal ci
nitins17 Dec 23, 2024
2ac88b7
debug error
nitins17 Dec 23, 2024
e76aab2
copy paste command from other workflow that works
nitins17 Dec 23, 2024
4878b6e
add a workflow to debug env var issue
nitins17 Dec 23, 2024
2ed7af3
switch out image
nitins17 Dec 23, 2024
df59b2d
set env var to 3.11
nitins17 Dec 23, 2024
729cbb5
use tr to delete periods
nitins17 Dec 23, 2024
dfb12e3
use tr to get python major minor versions
nitins17 Dec 23, 2024
266746d
modify download wheel commands
nitins17 Dec 23, 2024
c89b47b
add an entrypoint workflow for nightly/release testing
nitins17 Jan 2, 2025
ada4e63
change the name of pytest and bazel non-rbe workflows
nitins17 Jan 2, 2025
2f52a39
Add env var to decide if to install jax at current commit
nitins17 Jan 3, 2025
fa61d91
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Jan 3, 2025
c2a4cd0
Pass the install_jax_current_commit env var as an input to workflow c…
nitins17 Jan 3, 2025
018e764
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Jan 3, 2025
806a161
Fix syntax error
nitins17 Jan 3, 2025
0b92e10
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Jan 10, 2025
50ef46e
Remove windows specific download steps
nitins17 Jan 10, 2025
34774fc
Enable remote cache support for platforms where rbe is not supported
nitins17 Jan 13, 2025
937f951
fix syntax error
nitins17 Jan 13, 2025
a5c8f2c
fix syntax error
nitins17 Jan 13, 2025
4c37d90
dummy change
nitins17 Jan 13, 2025
a07fed4
update checkout version
nitins17 Jan 14, 2025
3fddb21
remove spaces in upload/download uri
nitins17 Jan 14, 2025
2b362cc
update alias
nitins17 Jan 14, 2025
82e8d9d
update shell
nitins17 Jan 14, 2025
1eb8871
update alias
nitins17 Jan 14, 2025
157a293
remove gsutil aliases now that dockerfile is updated
nitins17 Jan 14, 2025
c943c25
update to new ml build gpu containers
nitins17 Jan 31, 2025
2060105
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Jan 31, 2025
9fa26dd
make continuous wokrkflow run on presubmits
nitins17 Jan 31, 2025
2014eb0
fix syntax issue
nitins17 Jan 31, 2025
3fd003d
fix workflow files
nitins17 Jan 31, 2025
60b0a11
fix workflow file
nitins17 Jan 31, 2025
be3fca8
fix merge
nitins17 Jan 31, 2025
492b104
fix merge
nitins17 Jan 31, 2025
45479bd
fix merge
nitins17 Jan 31, 2025
7819589
sync to main
nitins17 Feb 3, 2025
88582f1
use new mlbuild cuda images
nitins17 Feb 13, 2025
d25cc25
Merge branch 'main' into srnitin/test-scripts-workflows
nitins17 Feb 13, 2025
4ccc3c5
Merge branch 'main' into srnitin/test-scripts-workflows
nitins17 Feb 20, 2025
1b20f8c
Get XLA via actions/checkout
nitins17 Feb 28, 2025
b28ff8a
Merge branch 'srnitin/test-scripts-workflows' of https://github.com/g…
nitins17 Feb 28, 2025
9300174
Merge branch 'main' into srnitin/test-scripts-workflows
nitins17 Feb 28, 2025
47ad0a6
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Apr 10, 2025
6733533
Merge branch 'srnitin/test-scripts-workflows' of https://github.com/g…
nitins17 Apr 10, 2025
80628d6
remove obsolete files
nitins17 Apr 10, 2025
f03298e
Switch the CUDA workflow to use a non-cuda docker image to test jax i…
nitins17 Apr 10, 2025
8d4ec3c
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Apr 10, 2025
4d978df
Merge branch 'main' of https://github.com/google-ml-infra/jax-fork in…
nitins17 Apr 11, 2025
919155f
Revert cuda changes done to test wheel breakage
nitins17 Apr 11, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .bazelrc
Original file line number Diff line number Diff line change
Expand Up @@ -210,6 +210,9 @@ build:macos_cache --remote_cache="https://storage.googleapis.com/tensorflow-maco
# Cache pushes are limited to JAX's CI system.
build:macos_cache_push --config=macos_cache --remote_upload_local_results=true --google_default_credentials

build:public_cache --remote_cache="https://storage.googleapis.com/jax-bazel-cache/" --remote_upload_local_results=false
build:public_cache_push --config=public_cache --remote_upload_local_results=true --google_default_credentials

# #############################################################################
# CI Build config options below.
# JAX uses these configs in CI builds for building artifacts and when running
Expand Down
10 changes: 8 additions & 2 deletions .github/workflows/build_artifacts.yml
Original file line number Diff line number Diff line change
Expand Up @@ -109,7 +109,6 @@ jobs:

env:
JAXCI_HERMETIC_PYTHON_VERSION: "${{ inputs.python }}"
JAXCI_CLONE_MAIN_XLA: "${{ inputs.clone_main_xla }}"

name: Build ${{ inputs.artifact }} (${{ inputs.runner }}, Python ${{ inputs.python }}, clone main XLA=${{ inputs.clone_main_xla }})

Expand All @@ -118,7 +117,14 @@ jobs:
gcs_upload_uri: ${{ steps.store-gcs-upload-uri.outputs.gcs_upload_uri }}

steps:
- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- name: Checkout JAX repository
uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- name: Checkout XLA repository
if: contains("${{ inputs.clone_main_xla }}", '1')
uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
with:
repository: openxla/xla
path: jax/xla
- name: Enable RBE if building on Linux x86 or Windows x86
if: contains(inputs.runner, 'linux-x86') || contains(inputs.runner, 'windows-x86')
run: echo "JAXCI_BUILD_ARTIFACT_WITH_RBE=1" >> $GITHUB_ENV
Expand Down
37 changes: 37 additions & 0 deletions .github/workflows/build_artifacts_single_invocation.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
name: Build multiple artifacts with build CLI

on:
pull_request:
branches:
- main
workflow_dispatch:
inputs:
halt-for-connection:
description: 'Should this workflow run wait for a remote connection?'
type: choice
required: true
default: 'no'
options:
- 'yes'
- 'no'

jobs:
build_artifacts:

runs-on: "linux-x86-n2-64"
container: "us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build:latest"

env:
JAXCI_HERMETIC_PYTHON_VERSION: 3.12

steps:
- uses: actions/checkout@v3
# Halt for testing
- name: Wait For Connection
uses: google-ml-infra/actions/ci_connection@main
with:
halt-dispatch-input: ${{ inputs.halt-for-connection }}
- name: Mark GitHub repo as safe
run: git config --global --add safe.directory "$GITHUB_WORKSPACE"
- name: Build all artifacts with a single invocation of build CLI
run: python build/build.py build --editable --use_clang --output_path=$(pwd)/dist --wheels=jaxlib,jax-cuda-plugin,jax-cuda-pjrt --python_version=$JAXCI_HERMETIC_PYTHON_VERSION --verbose
41 changes: 41 additions & 0 deletions .github/workflows/ci_duplicate.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
name: CI (duplicate)

on:
pull_request:
branches:
- main

permissions:
contents: read # to fetch code
actions: write # to cancel previous workflows

jobs:
lint_and_typecheck:
runs-on: linux-x86-n2-16
container:
image: "us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build:latest" # ratchet:ubuntu:20.04
timeout-minutes: 5
steps:
- name: Install dependencies for setting up Python
env:
DEBIAN_FRONTEND: noninteractive
run: |
apt update && apt install libssl-dev libsqlite3-dev git -y
- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- name: Set up Python 3.11
uses: actions/setup-python@v5
with:
python-version: "3.11"
- run: python -m pip install pre-commit
# - name: Add to PATH
# run: export PATH=$PATH:/__w/_tool/Python/3.11.10/x64/bin
# - name: Set python3 alias
# run: echo "alias python3=/__w/_tool/Python/3.11.10/x64/bin/python3" >> ~/.bashrc
# Halt for testing
- name: Wait For Connection
uses: google-ml-infra/actions/ci_connection@main
- uses: actions/cache@6849a6489940f00c2f30c0fb92c6274307ccb58a # v4.1.2
with:
path: ~/.cache/pre-commit
key: pre-commit-${{ env.pythonLocation }}-${{ hashFiles('.pre-commit-config.yaml', 'setup.py') }}
- run: pre-commit run --show-diff-on-failure --color=always --all-files
46 changes: 46 additions & 0 deletions .github/workflows/env_var_debug.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
name: CI - Debug

on:
pull_request:
branches:
- main

jobs:
run-tests:

strategy:
fail-fast: false # don't cancel all jobs on failure
matrix:
# Python values need to match the matrix stategy in the build artifacts job above
runner: ["linux-x86-g2-48-l4-4gpu",]
python: ["3.11",]

runs-on: ${{ matrix.runner }}
container: "us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build:latest"

name: "Debug env var setting"

env:
JAXCI_HERMETIC_PYTHON_VERSION: "${{ matrix.python }}"
JAXCI_PYTHON: "python${{ matrix.python }}"

steps:
- uses: actions/checkout@v4
- name: Set env vars for use in artifact download URL
run: |
os=$(uname -s | awk '{print tolower($0)}')
arch=$(uname -m)

# Get the major and minor version of Python.
# E.g if JAXCI_HERMETIC_PYTHON_VERSION=3.10, then python_major_minor=310
echo $os $arch $JAXCI_HERMETIC_PYTHON_VERSION

python_major_minor_1=$(echo "$JAXCI_HERMETIC_PYTHON_VERSION" | tr -d '.')
echo $os $arch $JAXCI_HERMETIC_PYTHON_VERSION $python_major_minor_1

python_major_minor_2="${JAXCI_HERMETIC_PYTHON_VERSION//./}"
echo $os $arch $JAXCI_HERMETIC_PYTHON_VERSION $python_major_minor_1 $python_major_minor_2

echo "OS=${os}" >> $GITHUB_ENV
echo "ARCH=${arch}" >> $GITHUB_ENV
echo "PYTHON_MAJOR_MINOR=${python_major_minor}" >> $GITHUB_ENV
57 changes: 57 additions & 0 deletions .github/workflows/pytest_cpu_crash_debug.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,57 @@
name: CI - Pytest CPU (crash debug)

on:
pull_request:
branches:
- main

jobs:
run-tests:
defaults:
run:
# Explicitly set the shell to bash to override the default Windows environment, i.e, cmd.
shell: bash

strategy:
fail-fast: false # don't cancel all jobs on failure
matrix:
runner: ["linux-x86-t2a-48-dev",]
python: ["3.10",]
enable-x64: [1, 0]

runs-on: ${{ matrix.runner }}
container: "us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build-arm64:latest"

name: "Pytest CPU (${{ matrix.runner }}, Python ${{ matrix.python }}, x64=${{ matrix.enable-x64 }})"

env:
JAXCI_HERMETIC_PYTHON_VERSION: "${{ matrix.python }}"
JAXCI_PYTHON: "python${{ matrix.python }}"
JAXCI_ENABLE_X64: ${{ matrix.enable-x64 }}


steps:
- uses: actions/checkout@v3
# Halt for testing
- name: Wait For Connection
uses: google-ml-infra/actions/ci_connection@main
# Checkout XLA at head, if we're building jaxlib at head.
- name: Checkout XLA at head
uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
with:
repository: openxla/xla
path: xla
# We need to mark the GitHub workspace as safe as otherwise git commands will fail.
- name: Mark GitHub workspace as safe
run: |
git config --global --add safe.directory "$GITHUB_WORKSPACE"
- name: Build jaxlib
run: |
$JAXCI_PYTHON build/build.py build --wheels=jaxlib \
--bazel_options=--config=ci_linux_aarch64 \
--local_xla_path="$(pwd)/xla" \
--verbose
- name: Install dependencies
run: $JAXCI_PYTHON -m pip install -r build/requirements.in
- name: Run Pytest CPU tests
run: ./ci/run_pytest_cpu.sh
2 changes: 1 addition & 1 deletion .github/workflows/pytest_cuda.yml
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ jobs:
runs-on: ${{ inputs.runner }}
# Test the oldest and newest supported CUDA versions.
container: ${{ (contains(inputs.cuda, '12.1') && 'us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build-cuda12.1-cudnn9.8:latest') ||
(contains(inputs.cuda, '12.8') && 'us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build-cuda12.8-cudnn9.8:latest') }}
(contains(inputs.cuda, '12.8') && 'us-central1-docker.pkg.dev/tensorflow-sigs/tensorflow/ml-build-cuda12.8-cudnn9.8:latest')}}
name: "Pytest CUDA (${{ inputs.runner }}, CUDA ${{ inputs.cuda }}, Python ${{ inputs.python }}, x64=${{ inputs.enable-x64 }})"
env:
JAXCI_HERMETIC_PYTHON_VERSION: "${{ inputs.python }}"
Expand Down
7 changes: 5 additions & 2 deletions .github/workflows/wheel_tests_continuous.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,9 @@ name: CI - Wheel Tests (Continuous)
on:
schedule:
- cron: "0 0 * * *" # Run once per day
pull_request:
branches:
- main
workflow_dispatch: # allows triggering the workflow run manually

concurrency:
Expand Down Expand Up @@ -55,7 +58,7 @@ jobs:
runner: ${{ matrix.runner }}
artifact: ${{ matrix.artifact }}
python: ${{ matrix.python }}
clone_main_xla: 1
clone_main_xla: 0
upload_artifacts_to_gcs: true
gcs_upload_uri: 'gs://general-ml-ci-transient/jax-github-actions/jax-fork/${{ github.workflow }}/${{ github.run_number }}/${{ github.run_attempt }}'

Expand All @@ -73,7 +76,7 @@ jobs:
runner: ${{ matrix.runner }}
artifact: ${{ matrix.artifact }}
python: ${{ matrix.python }}
clone_main_xla: 1
clone_main_xla: 0
upload_artifacts_to_gcs: true
gcs_upload_uri: 'gs://general-ml-ci-transient/jax-github-actions/jax-fork/${{ github.workflow }}/${{ github.run_number }}/${{ github.run_attempt }}'

Expand Down
Loading