The OpenXLA project is working on an open source, MLIR-based, named-axis propagation (and, in the future, SPMD partitioning) system that will be dialect agnostic (it will work with any dialect: MHLO, StableHLO, YourDialect). We plan to have frontends like JAX and PyTorch target this when using XLA and wanting SPMD propagation/partitioning. See www.github.com/openxla/shardy for more info.
Currently Shardy is implemented inside the XLA compiler, which requires round-tripping between StableHLO and HLO with `mhlo.sharding`s. Eventually we will make Shardy the first pass in the XLA pipeline, while the module is still in StableHLO. Partitioning (the system that adds collectives like all-gathers/all-reduces) will still be done by the GSPMD partitioner, but next year the Shardy partitioner will be developed, allowing propagation and partitioning to happen entirely in MLIR as the first passes in the pipeline. The pipeline would then be:
1. Traced jaxpr
2. Jaxpr -> StableHLO
3. StableHLO with Shardy propagation
4. StableHLO with Shardy partitioning
5. StableHLO -> HLO
6. XLA optimizations
The following test:
```py
from functools import partial
import numpy as np
import jax
from jax.sharding import PartitionSpec as P
from jax._src import test_util as jtu

def test_sdy_lowering(self):
  mesh = jtu.create_global_mesh((4, 2), ('x', 'y'))
  np_inp = np.arange(16).reshape(8, 2)
  s = jax.sharding.NamedSharding(mesh, P('x', 'y'))
  arr = jax.device_put(np_inp, s)

  @partial(jax.jit, out_shardings=s)
  def f(x):
    return x * 2

  print(f.lower(arr).as_text())
```
outputs:
```
module @jit_f attributes {mhlo.num_partitions = 8 : i32, mhlo.num_replicas = 1 : i32} {
  sdy.mesh @mesh = <"x"=4, "y"=2>
  func.func public @main(%arg0: tensor<8x2xi64> {mhlo.layout_mode = "{1,0}", sdy.sharding = #sdy.sharding<@mesh, [{"x"}, {"y"}]>}) -> (tensor<8x2xi64> {jax.result_info = "", mhlo.layout_mode = "default", sdy.sharding = #sdy.sharding<@mesh, [{"x"}, {"y"}]>}) {
    %c = stablehlo.constant dense<2> : tensor<i64>
    %0 = stablehlo.broadcast_in_dim %c, dims = [] : (tensor<i64>) -> tensor<8x2xi64>
    %1 = stablehlo.multiply %arg0, %0 : tensor<8x2xi64>
    return %1 : tensor<8x2xi64>
  }
}
```
Shardy will initially be hidden behind the `jax_use_shardy_partitioner` flag before being enabled by default in the future.
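A minimal way to opt in from Python, assuming the standard `jax.config` mechanism applies to this flag:
```py
import jax

# Opt in to Shardy explicitly while it is still behind the flag.
jax.config.update('jax_use_shardy_partitioner', True)
```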
PiperOrigin-RevId: 655127611
The stock MLIR pipeline was a good way to get the prototype off the ground, but
its default passes can be problematic. In particular, each gpu.launch is compiled
into a sequence of instructions that loads the kernel onto the GPU, runs the kernel,
and immediately unloads it again. This has the correct semantics, but loading the
kernel is both expensive and forces a synchronization point, which leads to performance
issues.
To resolve this, I implemented a new MLIR pass that finds the gpu.launch ops and splits
each function that contains one into two functions: one that preloads the kernel onto the
GPU, and another that consumes the handle produced by the first. We call
the first function at compile time, while only the second one is used at run time.
There are other overheads in MLIR's implementation of kernel launch, but I will
fix those later.
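In Python terms, the shape of the rewrite looks roughly like the sketch below. All names here are hypothetical stand-ins; the real pass rewrites MLIR functions around gpu.launch ops, not Python code.
```py
import functools

def load_kernel_onto_gpu(name: str) -> object:
    # Hypothetical stand-in for the expensive, synchronizing module load.
    print(f'loading {name} onto the GPU')
    return object()  # opaque kernel handle

@functools.cache
def preload(name: str) -> object:
    # The "compile-time" half: runs once; the handle stays alive afterwards.
    return load_kernel_onto_gpu(name)

def launch(name: str, *args) -> None:
    # The "run-time" half: consumes the cached handle, so the hot path
    # performs no load/unload.
    handle = preload(name)
    print(f'launching {handle} with args {args}')

launch('my_kernel', 1, 2)  # loads the kernel once
launch('my_kernel', 3, 4)  # reuses the handle
```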
PiperOrigin-RevId: 627670773
JAX isn't using this, and in fact our code to build it wasn't including the C++ parts, so it was broken anyway. Remove it until someone actually needs it for something.
PiperOrigin-RevId: 587323808
Metadata, in particular code location information, is present in the HLO generated by JAX. The compilation cache uses the serialized HLO as a cache key, which raises the question: should code location information be part of that key? Simply changing the line number on which a function appears shouldn't necessarily cause a cache miss.
There are pros and cons: the main advantage of excluding metadata is that we will get more cache hits, and the main disadvantage is that debug information and profiling data in the HLO might become confusing, since it may refer to a different program entirely, or to a version of a program that does not correspond to the current state of the source tree. We argue that saving compilation time is the more important concern.
This change adds a tiny MLIR pass that strips Locations from a StableHLO module, and applies it in the compilation cache if metadata stripping is enabled.
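To see why locations matter for the key, note that two textually identical functions defined on different source lines lower to modules that differ only in their loc(...) metadata. A minimal sketch, assuming the `debug_info` keyword of `Lowered.as_text` is available in your JAX version:
```py
import jax

def f(x):
  return x + 1

def g(x):  # the same computation, defined on a different line
  return x + 1

# With debug info included, the two modules differ only in their loc(...)
# entries, which is exactly what the stripping pass removes before hashing.
print(jax.jit(f).lower(1.0).as_text(debug_info=True))
print(jax.jit(g).lower(1.0).as_text(debug_info=True))
```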
PiperOrigin-RevId: 525534901
As discussed over the last few months, it is desirable to migrate JAX from producing MHLO to producing StableHLO, and this CL makes this happen. More specifically:
1) MLIR lowerings now produce StableHLO ops instead of MHLO ops.
2) Fallback lowerings now produce StableHLO ops as well.
3) Occurrences of "MHLO" in prose have been changed to "StableHLO", unless the documents are immutable (changelog, JEPs).
From time to time, it might be useful to produce MHLO directly, so MHLO is not going away and is still within arm's reach (although compatibility guarantees will only be provided for StableHLO and not for MHLO):
a) `from jax._src.lib.mlir.dialects import mhlo` still does the same thing.
b) `XlaLowering.mhlo()` is available as well, but its implementation has changed - it calls `stablehlo-legalize-to-hlo` underneath.
c) `Lowering.as_text()/compiler_ir()` still support `dialect="mhlo"`, but the default has changed to "stablehlo" (see the sketch after this list).
d) We're still using `mhlo.is_same_data_across_replicas` and `mhlo.sharding` because StableHLO currently lacks comparable functionality. https://github.com/openxla/stablehlo/issues/744 tracks the corresponding work, but it is not a blocker - we can use these attributes with StableHLO without any issues.
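A short sketch of (b) and (c), using only the API surface described above:
```py
import jax

lowered = jax.jit(lambda x: x * 2).lower(1.0)
print(lowered.as_text())                # StableHLO, the new default
print(lowered.as_text(dialect="mhlo"))  # MHLO, converted on demand
```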
PiperOrigin-RevId: 497978733
This CL renames occurrences of "mhlo" in: 1) names, 2) tests, 3) prose in order
to prepare for the upcoming migration.
Unchanged occurrences:
1) Public API that contains "mhlo", e.g. XlaLowering.mhlo and the "mhlo"
argument value in Lowering.as_text and Lowering.compiler_ir.
2) Documentation (changelog, JEPs, IR examples, etc).
3) One rare situation where prose says "StableHLO" and "MHLO" in one sentence,
so both are necessary to disambiguate.
PiperOrigin-RevId: 495771153
See tests/api_test.py for usage examples of the new `stablehlo()` method.
At the moment, `stablehlo()` works by using the `hlo-legalize-to-stablehlo` pass, which takes the MHLO natively produced by JAX and converts it into StableHLO. This is an intermediate step towards switching JAX to natively produce StableHLO.
This CL adds both mhlo_to_stablehlo and stablehlo_to_mhlo to jaxlib, even though only the former is used at the moment. This is done in anticipation of switching JAX to natively produce StableHLO, where stablehlo_to_mhlo will be needed to provide backward compatibility for XlaLowering::mhlo(). We're adding stablehlo_to_mhlo now so that we won't have to update jaxlib again in the future, which will make deployment easier.
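A sketch of the user-visible effect, assuming the `dialect` argument of `as_text` accepts "stablehlo" as described:
```py
import jax

lowered = jax.jit(lambda x: x + 1).lower(1.0)
# The natively produced MHLO, converted by hlo-legalize-to-stablehlo:
print(lowered.as_text(dialect="stablehlo"))
```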
PiperOrigin-RevId: 487144342
We now have an ml_program dialect that describes global variables,
including load and store operations. Expose this dialect to allow
exporting variables and constants.
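A minimal sketch of what this exposes, assuming jaxlib's MLIR Python bindings (import path assumed) and the upstream ml_program assembly syntax:
```py
from jaxlib.mlir import ir  # assumed import path for the bindings

ctx = ir.Context()
# Parse a mutable global, as defined by the upstream ml_program dialect.
module = ir.Module.parse(
    'ml_program.global private mutable @counter(dense<0> : tensor<i64>)'
    ' : tensor<i64>', ctx)
print(module)
```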
This is an initial prototype of an alternate JAX compilation path
that emits the MLIR MHLO/CHLO dialects, together with sparse tensor
types, instead of classic XLA HLO.
PiperOrigin-RevId: 443438043
This lowering is missing a number of features, but it is complete enough that many tests pass, and I would like to start checking it in.
PiperOrigin-RevId: 409134016