rocm_jax

mirror of https://github.com/ROCm/jax.git synced 2025-04-27 04:26:07 +00:00

Author	SHA1	Message	Date
Sergei Lebedev	194884d311	Migrated to mypy 1.14.1 with --allow_redefinition I initially wanted to upgrade to 1.15, but it seems to have a bug in how ternary expressions are type checked. For example, def f(x: int) -> str: ... def g(x: int) -> str: ... callback = f if ... else g # has type object!	2025-02-13 15:38:28 +00:00
Ruturaj4	fe68eb8b25	[ROCm] Implement RNN support	2025-01-14 19:04:49 -06:00
Dan Foreman-Mackey	28bbbf894f	Simplify and consolidate dot algorithm control in lax. In https://github.com/jax-ml/jax/pull/23574, we added a new `algorithm` parameter to `lax.dot_general` with the goal of giving users explicit control over the specific algorithm used to control dot product accumulation. When using this feature in real use cases, we have found that the API is both too conservative (it required the user to pass the appropriate input types) and too restrictive for common use cases. In this change, I simplify the API to bring it more in line with user expectations, and generalize it to support a broader range of use cases. The core change is to update the dot_general lowering rule to add explicit type casts to the inputs, making sure that they always have the appropriate storage types going into the `DotGeneral` StableHLO op. Before this change, some backends would implicitly cast for some algorithms (e.g. f32 -> bf16), but error for others. It seems more user friendly to include automatic casts in all cases where a specific algorithm is requested. Another change in behavior is to (if needed) cast the result of the `DotGeneral` op (which is defined by the algorithm's `accumulation_type`) to match the input types. This means that, regardless of the algorithm choice, the output type will match the value that a user would expect from past use of `lax.dot_general`. The `preferred_element_type` parameter can now be used to control the output type, even when an algorithm is selected. To summarize, the updated version of `dot_general` accepts _any_ input dtypes, and the output will always match the inputs (under the existing promotion rules if the LHS and RHS don't match) unless `preferred_element_type` is used to select a specific output type. The specified "algorithm" is now more of an implementation detail, rather than the defining feature of the API, and JAX will do whatever it can to satisfy the user's request. (If an algorithm is not supported on the current device, we will still get a compile time error.) With the above changes in mind, it's no longer really necessary to have a `transpose_algorithm` parameter, because we can now use the same algorithm for the backwards pass. For users who need to customize the algorithm on the backwards pass, that is still possible using `custom_vjp`. Given the above changes, @sbodenstein made the excellent point that we don't really need the `algorithm` parameter anymore: just accept `DotAlgorithm` inputs to `precision`. I think this is a really nice suggestion, so I have updated the interface to implement this. One minor negative of this approach is that `preferred_element_type` isn't a great name for what that parameter does when it is used in conjunction with an algorithm. In the long run, I'd like to rename this parameter, but keeping it as is for now seems like the best short term approach. PiperOrigin-RevId: 683302687	2024-10-07 13:21:34 -07:00
Sergei Lebedev	f5617d7323	Removed noop # type: ignore comments mypy should now flag these by default.	2024-05-19 21:01:29 +01:00
Sergei Lebedev	c3bc88d5e4	Bumped mypy to 1.10.0 and ruff to 0.4.4	2024-05-16 23:16:32 +01:00
Peter Hawkins	30a0136813	Increase minimum jaxlib version to 0.4.19. 0.4.19 has xla_extension version 207 and mlir_api_version 54. PiperOrigin-RevId: 583412447	2023-11-17 09:38:31 -08:00
Jake VanderPlas	4a5bd9e046	Fix typos across the package	2023-09-22 14:54:31 -07:00
Peter Hawkins	f52926e832	Fix test breakage in RNN test with old jaxlibs. Remove some outdated version guards.	2023-09-20 11:50:04 -04:00
Andrey Portnoy	fc1c31d958	Run LSTM test using FP32 math (as opposed to TF32) 1. Add (limited) precision specifier handling to LSTM Enables differentiating between TF32 and FP32 math. TF32 math had insufficient precision to reliably pass LSTM correctness tests on A100 and H100. 2. Run the test using FP32 TF32 precision is not sufficient for the test to pass reliably on Ampere+ GPUs such as A100 and H100.	2023-09-19 14:45:14 -04:00
Peter Hawkins	816ba91263	Use lower-case PEP 585 names for types. Issue https://github.com/google/jax/issues/16537 PiperOrigin-RevId: 542969282	2023-06-23 15:12:14 -07:00
Yash Katariya	47fc23d7ba	Make rnn_bwd_abstract_eval backwards compatible by guarding it agains the jaxlib version PiperOrigin-RevId: 529260653	2023-05-03 19:28:42 -07:00
Yash Katariya	260a4305ac	Guard the rnn changes on the jaxlib version to be backwards compatible PiperOrigin-RevId: 529184172	2023-05-03 13:42:49 -07:00
Matthew Johnson	56feaca7f9	update cuDNN RNN code not to save 'workspace' scratch between fwd and bwd PiperOrigin-RevId: 528928263	2023-05-02 17:05:42 -07:00
Cristian Garcia	aa12e3597b	handle seq_lengths in lstm_ref	2023-04-03 22:22:54 +00:00
Sharad Vikram	3c3fa042e3	Copy seq_lengths before creating descriptor PiperOrigin-RevId: 519771897	2023-03-27 10:59:44 -07:00
Peter Hawkins	8fb1fd318d	Replace jax._src.util.prod with math.prod. math.prod() was added in Python 3.8, so we can assume it is always present. PiperOrigin-RevId: 513011144	2023-02-28 12:41:00 -08:00
Roy Frostig	cb8dcce2fe	migrate more internal dependencies from `jax.core` to `jax._src.core` PiperOrigin-RevId: 509736368	2023-02-14 23:01:11 -08:00
Jake VanderPlas	94af71a24c	CI: fix mypy jaxlib version	2023-01-24 06:57:23 -08:00
Qiao Zhang	4d1c4bc761	Add CUDNN custom call for LSTM. Exposed as jax.experimental.rnn module. PiperOrigin-RevId: 491445515	2022-11-28 14:31:48 -08:00
jax authors	d1fbdbc1cf	Rollback of "Add CUDNN custom call for LSTM. Exposed as jax.experimental.rnn module." PiperOrigin-RevId: 490499003	2022-11-23 07:48:05 -08:00
Qiao Zhang	78963b6020	Add CUDNN custom call for LSTM. Exposed as jax.experimental.rnn module. PiperOrigin-RevId: 490387796	2022-11-22 18:53:29 -08:00

21 Commits