This is a follow-up to #25640, which enabled lowering with AbstractMesh. It required adding `num_devices` to `lowering.compiler_args`, because in the presence of an AbstractMesh the device assignment is not accurate.
These flags control the amount of effort the compiler spends minimizing execution time and memory usage, respectively. They can be set via the command line. Valid values are between -1.0 and 1.0; the default is 0.0.
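For instance, JAX configuration options of this kind can typically be set programmatically as well; a minimal sketch follows, where the flag names are an assumption (this excerpt does not spell them out):
```
import jax

# Assumed flag names, not given in this excerpt. Values range from
# -1.0 (least effort) to 1.0 (most effort); the default is 0.0.
jax.config.update("jax_exec_time_optimization_effort", 0.5)
jax.config.update("jax_memory_fitting_effort", -0.3)
```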
This commit reworks the JAX build CLI to a subcommand-based approach, where each CLI use case is defined as its own subcommand. Two subcommands are defined: `build` and `requirements_update`. `build` is used to build a JAX wheel package; `requirements_update` is used to update the requirements_lock.txt files. The new structure offers a clear, organized CLI that lets users execute specific build tasks without navigating a monolithic script.
Each subcommand has specific arguments that apply to its respective build process. In addition, arguments are separated into groups, which gives a cleaner separation, improves readability when the CLI subcommands are run with `--help`, and makes it clear which parts of the build each argument affects (e.g., CUDA arguments only apply to CUDA builds, ROCm arguments only apply to ROCm builds). This reduces complexity and the potential for errors during the build process. Segregating functionality into distinct subcommands also simplifies the code, which should help with maintenance and future extensions.
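A minimal sketch of this structure (hypothetical argument subset; not the actual build.py code):
```
import argparse

parser = argparse.ArgumentParser(prog="build.py")
subparsers = parser.add_subparsers(dest="command", required=True)

# "build" subcommand, with its arguments split into groups so that
# --help output shows which parts of the build each group affects.
build = subparsers.add_parser("build", help="Build a JAX wheel package")
build.add_argument("--wheels", default="jaxlib")
cuda = build.add_argument_group("CUDA arguments (CUDA builds only)")
cuda.add_argument("--cuda_version")
cuda.add_argument("--cudnn_version")
rocm = build.add_argument_group("ROCm arguments (ROCm builds only)")
rocm.add_argument("--rocm_version")

# "requirements_update" subcommand.
req = subparsers.add_parser(
    "requirements_update", help="Update requirements_lock.txt files")
req.add_argument("--python_version", default="3.10")

args = parser.parse_args()
```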
There is also a transition from `subprocess.check_output` to `asyncio.create_subprocess_shell` for executing the build commands, which allows logs to be streamed and build progress to be shown in real time.
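A minimal sketch of that streaming pattern (not the actual build.py code; the command string is a placeholder):
```
import asyncio

async def run_and_stream(command: str) -> int:
    # Launch the command in a shell, merging stderr into stdout.
    proc = await asyncio.create_subprocess_shell(
        command,
        stdout=asyncio.subprocess.PIPE,
        stderr=asyncio.subprocess.STDOUT,
    )
    # Stream output line by line as the build progresses.
    while line := await proc.stdout.readline():
        print(line.decode(errors="replace").rstrip())
    return await proc.wait()

# Placeholder command, not the real build invocation.
exit_code = asyncio.run(run_and_stream("echo building..."))
```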
Usage:
* Building `jaxlib`:
```
python build/build.py build --wheels=jaxlib --python_version=3.10
```
* Building `jax-cuda-plugin`:
```
python build/build.py build --wheels=jax-cuda-plugin --cuda_version=12.3.2 --cudnn_version=9.1.1 --python_version=3.10
```
* Building multiple packages:
```
python build/build.py build --wheels=jaxlib,jax-cuda-plugin,jax-cuda-pjrt --cuda_version=12.3.2 --cudnn_version=9.1.1 --python_version=3.10
```
* Building `jax-rocm-pjrt`:
```
python build/build.py build --wheels=jax-rocm-pjrt --rocm_version=60 --rocm_path=/path/to/rocm
```
* Using a local XLA path:
```
python build/build.py build --wheels=jaxlib --local_xla_path=/path/to/xla
```
* Updating requirements_lock.txt files:
```
python build/build.py requirements_update --python_version=3.10
```
For more details on each argument and to see available options, run:
```
python build/build.py build --help
```
or
```
python build/build.py requirements_update --help
```
PiperOrigin-RevId: 700075411
This API does not add expressive power, since it is already possible to split arrays by repeated slicing. Its purpose is to provide a primitive that is the transpose of `lax.concatenate`, so that functions like `jnp.unstack` can be differentiated more efficiently.
Before:
```
In [1]: import jax.numpy as jnp, jax
In [2]: x = jnp.ones((3,))
In [3]: jax.jit(jax.linear_transpose(lambda xs: jnp.unstack(xs), jnp.ones((5, 3)))).trace((x,)*5).jaxpr
Out[3]:
{ lambda ; a:f32[3] b:f32[3] c:f32[3] d:f32[3] e:f32[3]. let
f:f32[5,3] = pjit[
name=unstack
jaxpr={ lambda ; g:f32[3] h:f32[3] i:f32[3] j:f32[3] k:f32[3]. let
l:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] k
m:f32[5,3] = pad[padding_config=((4, 0, 0), (0, 0, 0))] l 0.0
n:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] j
o:f32[5,3] = pad[padding_config=((3, 1, 0), (0, 0, 0))] n 0.0
p:f32[5,3] = add_any m o
q:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] i
r:f32[5,3] = pad[padding_config=((2, 2, 0), (0, 0, 0))] q 0.0
s:f32[5,3] = add_any p r
t:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] h
u:f32[5,3] = pad[padding_config=((1, 3, 0), (0, 0, 0))] t 0.0
v:f32[5,3] = add_any s u
w:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] g
x:f32[5,3] = pad[padding_config=((0, 4, 0), (0, 0, 0))] w 0.0
y:f32[5,3] = add_any v x
in (y,) }
] a b c d e
in (f,) }
```
Note in particular the `pad` calls, which are the transpose of `slice`. Transposing the split has the effect of forming many dense intermediate cotangents.
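As a standalone check of that claim, transposing a bare `slice` indeed yields a `pad` (a minimal sketch):
```
import jax
import jax.numpy as jnp

# Slicing out the middle row of a (5, 3) array...
f = lambda x: jax.lax.slice(x, (2, 0), (3, 3))

# ...transposes to a pad that scatters the cotangent back:
t = jax.linear_transpose(f, jnp.zeros((5, 3)))
print(jax.make_jaxpr(t)(jnp.ones((1, 3))))
# The printed jaxpr contains, roughly:
#   pad[padding_config=((2, 2, 0), (0, 0, 0))] a 0.0
```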
After:
```
In [1]: import jax.numpy as jnp, jax
In [2]: x = jnp.ones((3,))
In [3]: jax.jit(jax.linear_transpose(lambda xs: jnp.unstack(xs), jnp.ones((5, 3)))).trace((x,)*5).jaxpr
Out[3]:
{ lambda ; a:f32[3] b:f32[3] c:f32[3] d:f32[3] e:f32[3]. let
f:f32[5,3] = pjit[
name=unstack
jaxpr={ lambda ; g:f32[3] h:f32[3] i:f32[3] j:f32[3] k:f32[3]. let
l:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] k
m:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] j
n:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] i
o:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] h
p:f32[1,3] = broadcast_in_dim[
broadcast_dimensions=(1,)
shape=(1, 3)
sharding=None
] g
q:f32[5,3] = concatenate[dimension=0] p o n m l
in (q,) }
] a b c d e
in (f,) }
```
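For example, assuming the new primitive is exposed as `jax.lax.split` with a `sizes` argument, as in current JAX releases (this excerpt does not name the API, so the call below is a sketch):
```
import jax.numpy as jnp
from jax import lax

x = jnp.arange(12.0).reshape(6, 2)
# Split the leading axis into pieces of sizes 1, 2, and 3; the sizes
# must sum to the axis length. This is the inverse of lax.concatenate.
a, b, c = lax.split(x, (1, 2, 3), axis=0)
print(a.shape, b.shape, c.shape)  # (1, 2) (2, 2) (3, 2)
```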
This feature has been in the queue for a long time (see https://github.com/jax-ml/jax/issues/1259), and some folks have found that they can use `pure_callback` to call the CPU version as a workaround. It has recently come up that there can be issues when using `pure_callback` with JAX calls in the body (https://github.com/jax-ml/jax/issues/24255; this should be investigated separately).
This change adds a native solution for computing `lax.linalg.eig` on GPU. By default, this is implemented by calling LAPACK on the host directly, because this has good performance for small to moderately sized problems (smaller than about 2048x2048). For larger matrices, a GPU-backed implementation based on [MAGMA](https://icl.utk.edu/magma/) can have significantly better performance. (I should note that I haven't done a huge amount of benchmarking yet, but this was the break-even point used by PyTorch, and I find roughly similar behavior so far.)
We don't want to add MAGMA as a required dependency, but if a user has installed it, JAX can use it when the `jax_gpu_use_magma` configuration variable is set to `"on"`. By default, we try to dlopen `libmagma.so`, but the path to a non-standard installation location can be specified using the `JAX_GPU_MAGMA_PATH` environment variable.
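For example, a sketch using the knobs named above (the MAGMA path is a placeholder):
```
import os

# Optional: point JAX at a non-standard MAGMA installation
# (placeholder path) before the library is dlopened.
os.environ["JAX_GPU_MAGMA_PATH"] = "/opt/magma/lib/libmagma.so"

import jax
import jax.numpy as jnp
from jax.lax import linalg

# Opt in to the MAGMA-backed implementation.
jax.config.update("jax_gpu_use_magma", "on")

x = jnp.array([[0.0, 1.0], [-1.0, 0.0]])
w, vl, vr = linalg.eig(x)  # eigenvalues plus left/right eigenvectors
```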
PiperOrigin-RevId: 697631402