Also update the `ragged_all_to_all` docstring. Pseudocode in the style of the shard_map tutorial would be better and cleaner, but it needs the tutorial's context to explain; I'll add ragged_all_to_all to the shard_map tutorial in the future.
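In the meantime, here is a hedged per-device sketch of the semantics (plain Python, not runnable JAX; the indexing convention for `output_offsets` is my reading and should be checked against the docstring):

```
# Each device i sends send_sizes[i][j] elements of its operand, starting
# at input_offsets[i][j], to device j, which writes them into its output
# at output_offsets[i][j] (offsets expressed in the target's output).
def ragged_all_to_all_ref(operand, output, input_offsets, send_sizes,
                          output_offsets, recv_sizes, axis_size):
    result = [out.copy() for out in output]   # per-device output shards
    for i in range(axis_size):                # sending device
        for j in range(axis_size):            # receiving device
            chunk = operand[i][input_offsets[i][j]:
                              input_offsets[i][j] + send_sizes[i][j]]
            result[j][output_offsets[i][j]:
                      output_offsets[i][j] + send_sizes[i][j]] = chunk
    return result
```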
PiperOrigin-RevId: 735957604
* When the current mesh is `Manual` and the aval mesh is `Auto`.
* When the current mesh is set and the aval mesh is unset.
* Final-style primitives skip this canonicalization; they are free to add it in their own `bind` method.
* `mesh_cast` itself is exempt from this canonicalization to avoid recursion errors.
This is required to make sure that by the time we hit the abstract_eval rules and check_jaxpr, everything is properly typed in JAX's type system.
`Auto` is currently a bit more permissive because we need to keep the existing code at HEAD working, but `Explicit` and `Manual` are very strict.
PiperOrigin-RevId: 722868091
In the singleton-group case, unlike a regular `all_to_all`, the ragged op becomes a generic equivalent of DynamicUpdateSlice, except that the update size is not statically known. This operation can't be expressed with standard HLO instructions -- the backend will handle this case separately.
Added small improvement to error messages.
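A minimal NumPy sketch of that degenerate case (helper name hypothetical): every chunk is copied into `output` at a runtime offset with a runtime size, which is exactly a DynamicUpdateSlice whose update size is not static.

```
import numpy as np

def ragged_update_reference(operand, output, input_offsets, send_sizes,
                            output_offsets):
    result = output.copy()
    for i in range(len(send_sizes)):
        chunk = operand[input_offsets[i]:input_offsets[i] + send_sizes[i]]
        result[output_offsets[i]:output_offsets[i] + send_sizes[i]] = chunk
    return result
```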
PiperOrigin-RevId: 721473063
Also allow users to enter `Auto`/`User` mode inside `jit` along all or some of the mesh axes.
Add checks to make sure that avals inside a context match the surrounding context. This check happens inside `abstract_eval` rules, but we may want a more central place for it, which we can create later on.
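A sketch of what mixing axis types looks like from the user side (hedged: the public spelling has evolved across JAX releases; `jax.make_mesh` with `axis_types` is one current form):

```
import jax
from jax.sharding import AxisType

# 'x' is Explicit (shardings visible on avals), 'y' stays Auto
# (left to the compiler); this example requires 4 devices.
mesh = jax.make_mesh((2, 2), ('x', 'y'),
                     axis_types=(AxisType.Explicit, AxisType.Auto))
```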
PiperOrigin-RevId: 707128096
We need to guarantee that the outermost dimension of the output is big enough to fit all received elements, but the input and output outermost dimensions need not be exactly equal. For example, an operand with outermost dimension 6 may produce an output with outermost dimension 8, as long as every received chunk fits.
PiperOrigin-RevId: 707011916
This version emits a StableHLO custom call. The test outputs the following MLIR module:
```
module @jit_ragged_all_to_all {
  func.func public @main(%arg0: tensor<6xf32>, %arg1: tensor<6xf32>, %arg2: tensor<3xi32>, %arg3: tensor<3xi32>, %arg4: tensor<3xi32>, %arg5: tensor<3xi32>) -> (tensor<6xf32>) {
    %0 = stablehlo.custom_call @ragged_all_to_all(%arg0, %arg1, %arg2, %arg3, %arg4, %arg5) {api_version = 4 : i32, backend_config = {replica_groups = dense<[[0, 1, 2]]> : tensor<1x3xi64>}} : (tensor<6xf32>, tensor<6xf32>, tensor<3xi32>, tensor<3xi32>, tensor<3xi32>, tensor<3xi32>) -> tensor<6xf32>
    return %0 : tensor<6xf32>
  }
}
```
For now, the API assumes the `split_axis` and `concat_axis` of `all_to_all` to be the outermost (ragged) dimension, and `axis_index_groups` defaults to all replicas (i.e. there is a single group covering all axis indices, like the iota replica group in the example above).
The current API is inspired by https://www.mpich.org/static/docs/v3.1/www3/MPI_Alltoallv.html, which essentially also performs a ragged all-to-all.
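A hedged usage sketch under `shard_map` (shapes illustrative; needs as many devices as the mesh axis size, and assumes the public `jax.lax.ragged_all_to_all` signature of operand, output, four per-peer offset/size arrays, and `axis_name`):

```
import jax
from jax.sharding import Mesh, PartitionSpec as P
from jax.experimental.shard_map import shard_map

mesh = Mesh(jax.devices()[:2], ('x',))

# operand/output and the four offset/size arrays are all sharded along
# 'x'; each device sees its own row, as in MPI_Alltoallv.
f = shard_map(
    lambda *args: jax.lax.ragged_all_to_all(*args, axis_name='x'),
    mesh=mesh, in_specs=P('x'), out_specs=P('x'))
```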
PiperOrigin-RevId: 704550890
This change also uses the new batching dims for gather/scatter batching rules, to avoid concatenating the indices with iota.
See https://github.com/openxla/stablehlo/pull/2259
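For illustration (hedged: this shows the pattern, not the exact batching rule), vmapping a gather over a shared leading axis previously required materializing iota batch indices and concatenating them onto the gather indices; with the new batching dims it can lower to a single batched gather:

```
import jax
import jax.numpy as jnp

operand = jnp.arange(4 * 8).reshape(4, 8)        # a batch of 4 tables
indices = jnp.array([[0, 3], [1, 2], [7, 7], [4, 5]])

out = jax.vmap(lambda o, i: o[i])(operand, indices)  # shape (4, 2)
```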
PiperOrigin-RevId: 678649138
In general a JAX value might correspond to multiple HLO values, which is why the HLO lowering represents each value as a tuple of zero or more ir.Values. However, the common case is that there is exactly one value, and almost all such lists are singletons.
To reduce the number of singleton list and tuple objects allocated during MLIR lowering, represent singleton values as unwrapped ir.Values instead, and use a tuple only when a JAX value is not backed by exactly one ir.Value.
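A minimal sketch of the representation (helper names hypothetical):

```
# A lowered JAX value is a bare ir.Value in the common case, and a
# tuple of ir.Values otherwise.
def wrap(ir_values):
    return ir_values[0] if len(ir_values) == 1 else tuple(ir_values)

def unwrap(value):
    return (value,) if not isinstance(value, tuple) else value
```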
The `_all_to_all_transpose_rule` calls `all_to_all` which can accept a `tiled`
argument. Thus, for the transpose to know the right value of `tiled` to pass, we
need to plumb the `tiled` argument through the primitive and various
interpreters, even though it's a no-op because the `tiled` argument is handled
outside the primitive. It would be cleaner to handle `tiled` inside the
primitive, but I will leave that for followup work.
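A simplified sketch of the rule (the internal signature has more parameters): transposing an all_to_all swaps `split_axis` and `concat_axis` and must forward the same `tiled` flag, which is why `tiled` has to be visible to the transpose.

```
from jax.lax import all_to_all

def _all_to_all_transpose_rule(cts, *, axis_name, split_axis, concat_axis,
                               tiled):
    return all_to_all(cts, axis_name=axis_name,
                      split_axis=concat_axis, concat_axis=split_axis,
                      tiled=tiled)
```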
Fixes #15982.
PiperOrigin-RevId: 612628600
This allows propagating the names bottom up -- from equations to the jaxpr,
instead of "discovering" them top-down by traversing (and rebuilding) the
jaxpr via core.subst_axis_names.
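A hypothetical illustration of the direction change (the per-equation accessor is an assumption, not JAX's real API):

```
def eqn_axis_names(eqn):
    # Assumption: collectives carry their names in eqn.params['axis_name'].
    axis = eqn.params.get('axis_name')
    if axis is None:
        return set()
    return set(axis) if isinstance(axis, tuple) else {axis}

def jaxpr_axis_names(jaxpr):
    # Bottom-up: union over equations, no substitution/rebuild pass.
    names = set()
    for eqn in jaxpr.eqns:
        names |= eqn_axis_names(eqn)
    return names
```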
PiperOrigin-RevId: 612416803
We are migrating some attrs on some StableHLO ops to use DenseI64ArrayAttr instead of DenseIntElementsAttr. Using DenseI64ArrayAttr enforces that the attr values are 1-dimensional and provides nicer APIs. (see https://github.com/openxla/stablehlo/issues/1578 for additional context)
Unfortunately, we have to duplicate the `dense_int_array` function because we migrated the ops in batches. We can't use the existing `dense_int_array` function because it would produce arrays for ops that hadn't yet been migrated. This PR makes the final batch of changes, so no additional methods should be added going forward.
We also have to introduce a new `dense_bool_array` function, with a similar version check.
When the minimum supported jaxlib version uses a recent enough version of StableHLO (v6 or above), it will be possible to remove the version checks and remove the duplicated `dense_int_array_v6` function.
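A simplified sketch of the duplicated helper (the version predicate is an assumption; the real check keys on the jaxlib/plugin version):

```
import numpy as np
from jaxlib.mlir import ir

def dense_int_array_v6(xs, stablehlo_at_least_v6: bool):
    # Requires an active MLIR context, as in the lowering code.
    if stablehlo_at_least_v6:
        return ir.DenseI64ArrayAttr.get([int(x) for x in xs])
    return ir.DenseIntElementsAttr.get(np.asarray(xs, np.int64))
```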
PiperOrigin-RevId: 601271749
This PR is a follow-up to #18881.
The changes were generated by adding

```
from __future__ import annotations
```

to the files which did not already have it, and running

```
pyupgrade --py39-plus --keep-percent-format {jax,tests,jaxlib,examples,benchmarks}/**/*.py
```
We don't need the fallback path for CPU: XLA:CPU already does its own lowering of ReduceScatter as AllReduce + DynamicSlice, and I plan to teach it a direct lowering in an upcoming change.
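For reference, the decomposition in question, as a conceptual sketch inside `shard_map`/`pmap` over axis `'x'`:

```
from jax import lax

def reduce_scatter_ref(x, axis_name='x'):
    summed = lax.psum(x, axis_name)          # AllReduce
    n = lax.psum(1, axis_name)               # static axis size
    shard = summed.shape[0] // n
    start = lax.axis_index(axis_name) * shard
    return lax.dynamic_slice_in_dim(summed, start, shard)  # DynamicSlice
```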
PiperOrigin-RevId: 586311031
As far as I can tell this is no longer necessary on GPU, which handles arbitrary allgather dimensions (by making the dimension the major-most dimension in layout assignment), or on CPU, where at present XLA would do the same lowering that JAX would.
I'm planning to improve the XLA:CPU lowering in a subsequent change.
PiperOrigin-RevId: 586291911