547 Commits

Author SHA1 Message Date
Mathew Odden
ec4b8ee1ed Fixes for 0.5.0 build ported to rocm-main
(cherry picked from commit c23a81461192a2b6da3d364076a261714d2dc64f)
2025-03-25 17:51:30 -05:00
charleshofer
dd7f96b27c Fix ROCm build README (#284) 2025-03-18 14:35:36 -05:00
Charles Hofer
c7b407c9f0 Merge branch 'rocm-main' into ci-upstream-sync-151_1 2025-03-18 15:27:35 +00:00
Mathew Odden
d864b4fbf4 Fix auditwheel version issue (#288)
Auditwheel 6.3.0 changed/removed the lddtree function
so cap constraint to 6.2.x
2025-03-17 12:30:56 -05:00
Nitin Srinivasan
031614c22b Pin numpy~=2.1.0 in workflow file instead of test-requirements.txt
PiperOrigin-RevId: 737632771
2025-03-17 08:59:06 -07:00
GitHub Actions
e275d5cf6c Merge remote-tracking branch 'origin/rocm-main' into ci-upstream-sync-147_1 2025-03-14 22:42:07 +00:00
Nitin Srinivasan
5944c9ed65 Install test dependencies from test-requirements.txt instead of requirements.in
PiperOrigin-RevId: 736878834
2025-03-14 08:57:20 -07:00
GitHub Actions
a0edd3fbb2 Merge remote-tracking branch 'origin/rocm-main' into ci-upstream-sync-144_1 2025-03-12 16:57:18 +00:00
charleshofer
f14a1d0b71 Add JSON output to multi-GPU tests (#274) 2025-03-12 11:30:55 -05:00
jax authors
13eb8d3ae7 Upgrade ml-dtypes version in py3.10-py3.13 hermetic python lock files.
This change is needed to add testing of int2/uint2 dtypes via bazel in presubmit (see https://github.com/jax-ml/jax/pull/21395).

PiperOrigin-RevId: 735895293
2025-03-11 14:41:34 -07:00
jax authors
0db14aa342 Add NVIDIA wheel requirements only for Linux builds.
PiperOrigin-RevId: 735850240
2025-03-11 12:33:54 -07:00
github-actions[bot]
6ee76a8a6d Merge pull request #271 from ROCm/ci-upstream-sync-142_1
CI: 03/11/25 upstream sync
2025-03-11 14:10:03 -05:00
JD
ce53e374fc Deprecate obsolete gfx versions (#273) 2025-03-11 12:19:51 -05:00
Charles Hofer
fb89a4b427 Merge branch 'rocm-main' into ci-upstream-sync-142_1 2025-03-11 16:33:59 +00:00
jax authors
1aca76fc13 Update :build_jaxlib flag to control whether we should add py_import dependencies to the test targets.
This change enables testing the wheels produced by the build rules in the presubmit using one `bazel test` command only.

There are three options for running the tests:

1) `build_jaxlib=true`: the tests depend on JAX targets.
2) `build_jaxlib=false`: the tests depend on the wheel files located in the `dist` folder.
3) `build_jaxlib=wheel`: the tests depend on the py_import targets.

PiperOrigin-RevId: 735765819
2025-03-11 08:31:43 -07:00
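The three `build_jaxlib` values described in commit 1aca76fc13 can be summarized as a small lookup. This is only an illustrative sketch: the dictionary and helper names below are hypothetical and are not part of the JAX build files; only the three values and what the tests depend on come from the commit message.

```python
# What the test targets depend on for each build_jaxlib value,
# as stated in the commit message. The names here are hypothetical.
BUILD_JAXLIB_TEST_DEPS = {
    "true": "JAX targets",
    "false": "wheel files located in the `dist` folder",
    "wheel": "py_import targets",
}


def describe_build_jaxlib(value: str) -> str:
    """Return the dependency source for a build_jaxlib value, or raise on an unknown one."""
    try:
        return BUILD_JAXLIB_TEST_DEPS[value]
    except KeyError:
        allowed = ", ".join(sorted(BUILD_JAXLIB_TEST_DEPS))
        raise ValueError(f"build_jaxlib must be one of: {allowed}") from None


if __name__ == "__main__":
    for value in ("true", "false", "wheel"):
        print(f"build_jaxlib={value}: tests depend on {describe_build_jaxlib(value)}")
```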
jax authors
007fc7a6f1 Remove version limit for setuptools dependency.
PiperOrigin-RevId: 735453796
2025-03-10 11:36:17 -07:00
Nitin Srinivasan
721d1a3211 Add functionality to allow promoting RC wheels during release
List of changes:
1. Allow us to build an RC wheel when building release artifacts. This is done by modifying the build CLI to use the new JAX build rule and passing in the build options that control the wheel tag. A new build argument `use_new_wheel_build_rule` is introduced to the build CLI to avoid breaking anyone who uses the CLI and the old build rule. Note that this option will go away in the future when the build CLI migrates fully to the new build rule.
2. Change the upload script to upload both RC- and release-tagged wheels (internal changes)

PiperOrigin-RevId: 733464219
2025-03-04 14:21:12 -08:00
Charles Hofer
07cd809ba8 Merge branch 'rocm-main' into ci-upstream-sync-135_1 2025-03-04 16:22:17 +00:00
JD
a701022ec4 add gfx1101 target (#249) 2025-03-03 09:29:47 -06:00
Zahid Iqbal
15255dd69e removing csv result compilation after Unit test... (#248) 2025-03-02 08:30:34 -06:00
jax authors
615219b1f6 Remove tensorstore dependency from //jax/experimental/array_serialization:serialization in OSS (see https://github.com/google/tensorstore/issues/218)
Disable serialization_test in OSS.

PiperOrigin-RevId: 731463136
2025-02-26 14:47:16 -08:00
jax authors
c9c7250dd4 Upgrade to Bazel 7.4.1
PiperOrigin-RevId: 731278247
2025-02-26 05:33:24 -08:00
Charles Hofer
1217ba9054 Fix numactl-devel name 2025-02-25 20:48:15 +00:00
Charles Hofer
72ecacd870 Fix numa package 2025-02-25 20:41:23 +00:00
Charles Hofer
e82b4e22dc Install numa library 2025-02-25 20:24:44 +00:00
Charles Hofer
45e2060b90 Merge branch 'rocm-main' into ci-upstream-sync-127_1 2025-02-25 19:30:08 +00:00
jax authors
eb912ad0d9 Create jax wheel build target.
This change introduces a uniform way of building the artifacts and controlling the filename version suffixes (see the changes for `jaxlib`, `jax-cuda-plugin` and `jax-cuda-pjrt` in https://github.com/jax-ml/jax/pull/25126)

Previously, the `jax` wheel was built via the `python3 -m build` command. The resulting wheel contained the Python package files in the `jax` folder (e.g. the files in the subdirs that have an `__init__.py` file).

You can still build the `jax` wheel with the `python3 -m build` command.

Bazel `jax` wheel target: `//:jax_wheel`

Environment variable combinations for creating wheels with different versions:
  * self-built wheel (default build rule behavior): `--repo_env=ML_WHEEL_TYPE=snapshot`
  * release: `--repo_env=ML_WHEEL_TYPE=release`
  * release candidate: `--repo_env=ML_WHEEL_TYPE=release --repo_env=ML_WHEEL_VERSION_SUFFIX=-rc1`
  * nightly build: `--repo_env=ML_WHEEL_TYPE=custom --repo_env=ML_WHEEL_BUILD_DATE=<YYYYmmdd> --repo_env=ML_WHEEL_GIT_HASH=$(git rev-parse HEAD)`

PiperOrigin-RevId: 730916743
2025-02-25 09:30:08 -08:00
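The `--repo_env` combinations listed in commit eb912ad0d9 can be assembled programmatically. The sketch below is a minimal illustration, not the project's build CLI: the function names and the `kind` labels ("snapshot", "release", "rc", "nightly") are hypothetical, and pairing the flags with `bazel build //:jax_wheel` is an assumption based on the target named in the commit message. The command is only constructed, never executed.

```python
from typing import List, Optional

# Bazel target named in the commit message.
JAX_WHEEL_TARGET = "//:jax_wheel"


def wheel_repo_env_flags(
    kind: str,
    version_suffix: str = "-rc1",
    build_date: Optional[str] = None,
    git_hash: Optional[str] = None,
) -> List[str]:
    """Return the --repo_env flags the commit message lists for one wheel kind.

    `kind` is a hypothetical label used only in this sketch.
    """
    if kind == "snapshot":  # self-built wheel, the default build rule behavior
        return ["--repo_env=ML_WHEEL_TYPE=snapshot"]
    if kind == "release":
        return ["--repo_env=ML_WHEEL_TYPE=release"]
    if kind == "rc":  # release candidate: release type plus a version suffix
        return [
            "--repo_env=ML_WHEEL_TYPE=release",
            f"--repo_env=ML_WHEEL_VERSION_SUFFIX={version_suffix}",
        ]
    if kind == "nightly":  # needs a build date (YYYYmmdd) and a git hash
        if not build_date or not git_hash:
            raise ValueError("nightly wheels need build_date and git_hash")
        return [
            "--repo_env=ML_WHEEL_TYPE=custom",
            f"--repo_env=ML_WHEEL_BUILD_DATE={build_date}",
            f"--repo_env=ML_WHEEL_GIT_HASH={git_hash}",
        ]
    raise ValueError(f"unknown wheel kind: {kind!r}")


def wheel_build_command(kind: str, **kwargs) -> List[str]:
    """Assemble (but do not run) a `bazel build` command line for the jax wheel."""
    return ["bazel", "build", JAX_WHEEL_TARGET, *wheel_repo_env_flags(kind, **kwargs)]


if __name__ == "__main__":
    print(" ".join(wheel_build_command("rc")))
```

Returning the arguments as a list rather than a single string keeps the flags safe to pass to a process launcher without shell quoting.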
charleshofer
dd3f34ca2b Use bazel for PR tests (#216)
* Use bazel for running pre-merge CI tests

* Don't use HEREDOC

* Fix block text

* Use bash array

* Add bazel install

* Put Bazel in the build image

* Use Bazelisk

* Remove bazel install in Docker

* Go back to upstream XLA

* Remove bazel test command from workflow

* Move test command to build container

* Fix string format typos
2025-02-24 11:38:06 -06:00
H. Vetinari
dd4aa79d6a fix getting gcc major version 2025-02-24 08:03:57 +11:00
Charles Hofer
3745591d68 Merge branch 'rocm-main' into ci-upstream-sync-112_1 2025-02-11 20:36:19 +00:00
Charles Hofer
9133253c20 Change to make CI run 2025-02-11 16:54:51 +00:00
jax authors
e64650e2ba Add --config=cuda_libraries_from_stubs at the end of all additional bazel options for CUDA wheels.
Build.py shouldn't be used for building the wheels with real CUDA libraries in the dependencies. This change prevents overriding the default configuration.

PiperOrigin-RevId: 725326252
2025-02-10 13:24:47 -08:00
Charles Hofer
4e403d29d2 Remove conflicting param for ci_build 2025-02-10 18:16:21 +00:00
Charles Hofer
31c1f25425 Merge branch 'rocm-main' into ci-upstream-sync-110_1 2025-02-10 18:02:25 +00:00
jax authors
7ffb613b8f Merge pull request #26409 from hawkinsp:fstring
PiperOrigin-RevId: 724390055
2025-02-07 10:23:50 -08:00
Peter Hawkins
d01520c63f Fix a missing "f" on an f-string.
While I'm here, reword the text a bit.
2025-02-07 12:53:24 -05:00
jax authors
1ca8807dca Merge pull request #25810 from ROCm:gh-9948-add-gpu-ci-upstream
PiperOrigin-RevId: 724378710
2025-02-07 09:51:59 -08:00
charleshofer
ebf4a54f4f Add AMD ROCm GPU CI post-build check (#137) 2025-02-06 21:41:53 +00:00
Charles Hofer
c3e27f86bc Merge branch 'rocm-main' into ci-upstream-sync-106_1 2025-02-04 17:28:34 +00:00
Kanglan Tang
59a3552ae6 Remove portpicker for free threaded python 3.13t in test-requirements.txt
PiperOrigin-RevId: 722776783
2025-02-03 13:30:01 -08:00
Charles Hofer
47580efda5 Merge branch 'rocm-main' into ci-upstream-sync-98_1 2025-01-28 21:18:47 +00:00
jax authors
727d0367a4 Update --config=cuda to add direct dependencies on CUDA libraries both for bazel build and bazel test phases.
With this configuration the same cache is used both for `bazel build` and `bazel test` commands (provided the same target is specified).

Add `--config=no_cuda_libs` for building targets with CUDA libraries from stubs.

PiperOrigin-RevId: 720334587
2025-01-27 15:46:17 -08:00
Charles Hofer
63e6442bdf Merge branch 'rocm-main' into ci-upstream-sync-97_1 2025-01-27 17:19:08 +00:00
jax authors
9a60e6fce4 Merge pull request #25917 from ROCm:ci_fix_multi_gpu_test_logic-upstream
PiperOrigin-RevId: 716153760
2025-01-16 02:45:54 -08:00
Ruturaj4
8e88adcd3f Fix run_multi_gpu script multi-gpu issue and refactor code 2025-01-15 22:33:03 +00:00
Ruturaj4
435edf1f8c Add gfx12xx archs 2025-01-15 16:14:40 +00:00
charleshofer
ea6903b6a8 Merge pull request #199 from ROCm/run-less-tests 2025-01-14 10:09:54 -06:00
charleshofer
c22bed5592 Merge pull request #197 from ROCm/ci-upstream-sync-83_1
CI: 01/09/25 upstream sync
2025-01-09 10:59:13 -06:00
Charles Hofer
1fa6e91af6 Add option to ci_build to run different tests 2025-01-09 16:53:30 +00:00
Charles Hofer
9d34a49d94 Commit to trigger CI 2025-01-09 15:36:46 +00:00