rocm_jax

mirror of https://github.com/ROCm/jax.git synced 2025-04-16 20:06:05 +00:00

Author	SHA1	Message	Date
Yue Sheng	c2d4373535	Make `core.Token` a non-trivial class which wraps a `jax.Array`. Currently, we use a singleton and empty `core.token` object everywhere. After the change, tokens could be created and threaded in and out of computations to build up dependency. Also update ordered side-effects to use the new `core.Token` class (NFC for this part, just to unify token usage). PiperOrigin-RevId: 626091210	2024-04-18 11:09:55 -07:00
Jake VanderPlas	dc2d8c13d0	[key reuse] call key reuse logic directly in dispatch	2024-04-11 17:08:32 -07:00
Jake VanderPlas	1b3aea8205	Finalize the deprecation of the arr.device() method The method has been emitting an DeprecationWarning since JAX v0.4.21, released December 2023. Existing uses can be replaced with `arr.devices()` or `arr.sharding`, depending on the context. PiperOrigin-RevId: 623015500	2024-04-08 19:04:15 -07:00
Jake VanderPlas	5115b89538	Fix typos in comments	2024-04-08 15:16:39 -07:00
Matthew Johnson	46a516275f	[mutable-arrays] enable refs without cps, and not just at top level Co-authored-by: Dougal Maclaurin <dougalm@google.com>	2024-04-03 16:23:19 -07:00
Matthew Johnson	fa9f02ba2f	Reverts 0dde8f7f9607d09841ece7125dfc0773c3613fab PiperOrigin-RevId: 619416732	2024-03-26 22:26:41 -07:00
Matthew Johnson	9474b46012	[scan] don't traverse body jaxpr in lowering This is an attempt to re-land #19819 aka cl/607570860 after a small number of performance regressions. As before, the main changes are: 1. simplify the scan impl that we trace through to get the lowering, and 2. ensure that when tracing it to a jaxpr, we don't rebuild the scan body jaxpr we already have in hand. The main motivation was (2), but (1) seems like a useful win too. The way we achieve (2) is with a new trick: in our scan_impl function, which is only ever traced to a jaxpr, instead of calling `core.jaxpr_as_fun(jaxpr)(args)` we call a new primitive `eval_jaxpr_p.bind(args, jaxpr=jaxpr)`. This new primitive only has a staging rule defined for it (i.e. all we can do with it is stage it into a jaxpr), and that rule just generates a call into the jaxpr of interest. Therefore we will not traverse into the jaxpr just to rebuild it inline (as before). The code in #19819 was simpler in that it avoided reshapes, concats, and un-concats. But it caused at least one apparent performance regression (an XLA bug?) and it was unrelated to the original goal of reducing tracing time. So here we just land the trace time improvement.	2024-03-26 17:17:58 -07:00
Yash Katariya	0b4634170e	Don't report origin_msg if any execption is raised in `self._origin_msg` PiperOrigin-RevId: 618237231	2024-03-22 11:23:46 -07:00
Jake VanderPlas	8949a63ce1	[key reuse] rename flag to jax_debug_key_reuse	2024-03-22 05:37:30 -07:00
jax authors	ce0d0c17c3	Merge pull request #20218 from mattjj:mutable-arrays-closure PiperOrigin-RevId: 615463712	2024-03-13 10:23:23 -07:00
Matthew Johnson	649cd50681	[mutable-arrays] support closed-over mutable arrays in jit	2024-03-13 09:59:03 -07:00
Roy Frostig	98f790f5d5	update package/API reference docs to new-style typed PRNG keys	2024-03-07 12:40:09 -08:00
Jake VanderPlas	b349328d5d	Remove some dead code	2024-03-06 11:30:48 -08:00
Sergei Lebedev	5283d4b4a5	Axis names are now tracked via an effect This allows propagating the names bottom up -- from equations to the jaxpr, instead of "discovering" them top-down by traversing (and rebuilding) the jaxpr via core.subst_axis_names. PiperOrigin-RevId: 612416803	2024-03-04 05:42:03 -08:00
Matthew Johnson	3a403f2a0e	[mutable-arrays] move MutableArray, add eager, improve tests, fix bug 1. move MutableArray to core.py, and some handlers to their respective files 2. fix a bug in aliasing setup (it was just broken before, now better test coverage) 3. add eager support by enabling get_p, swap_p, and addupdate_p impls 4. improve tests slightly	2024-03-01 15:03:23 -08:00
Matthew Johnson	ab0f7061ad	[mutable-arrays] allow state effects in jit by building in run_state with help from @sharadmv, @yashkatariya, @dougalm, and others The basic strategy is to apply discharge_state when lowering a jaxpr with state effects to HLO, and update the dispatch path accordingly. Specifically: 1. in tests only for now, introduce a MutableArray data type; 2. teach jit to abstract it to a Ref(ShapedArray) type, register an input handler, etc; 3. call discharge_state in `lower_sharding_computation` to lower a jaxpr with refs to a jaxpr (and then to an HLO) with extra outputs, and set up aliasing; 4. teach the output side of the dispatch path to drop those outputs. As an alternative to (3), we could potentially lower away the effects at a higher level, like in _pjit_lower_cached. They are similar because _pjit_lower_cached is the only (non-xmap) caller of lower_sharding_computation. I decided to do it in lower_sharding_computation mainly because that's closer to where we set up aliases, and I wanted to make mutable arrays correspond to aliased inputs/outputs on the XLA computation.	2024-02-29 21:50:19 -08:00
Peter Hawkins	9dee375dc6	Avoid use of operator.attrgetter in core.find_top_Trace, since it allocates and is less efficient than a lambda. PiperOrigin-RevId: 609739110	2024-02-23 08:47:32 -08:00
George Necula	6705a96b56	[shape_poly] Cleaning up naming of terms and factors. In the past symbolic expressions were polynomials, consisting of sums of monomials, which were products of atoms. Over time the language of symbolic expressions has become richer. Now expressions are sums of terms, which are products of factors. Here we rename references to monomials to terms, and `_DimMon` to `_DimTerm`. We also rename reference of atoms to factors, and `_DimAtom` to `_DimFactor`. At the same time we rename most of the methods of `_DimExpr` to have a leading underscore, to indicate that they are private methods.	2024-02-21 09:18:22 +01:00
Anselm Levskaya	772743e6a4	Internal change Reverts 330afdc8bebe900d999202c4d59613e99cadb0ad PiperOrigin-RevId: 607783139	2024-02-19 14:03:09 +00:00
Jake VanderPlas	1fe46aa8be	Error for deprecated scalar conversions of non-scalar arrays	2024-02-16 11:26:30 -08:00
Peter Hawkins	67df647988	Reland https://github.com/google/jax/pull/10573 . The original PR was reverted because of downstream breakage. Originally we used the `Var.count` attribute to ensure `Var` instances were printed consistently regardless of context, even though only their object id was load-bearing. That is, `Var.count` was only used for pretty printing. (#1949 added a total_ordering on `Var` for reasons out of scope of JAX's core code. I'm going to figure out if that's still needed... Haiku tests all seem to pass without it.) But #8019 revised our pretty-printing so as not to use `Var.count`. Instead it chose how to pretty-print Var instances based on their order of appearance in a jaxpr. That meant `Var.count` really wasn't useful anymore. So this PR removes `Var.count`. Since we no longer have `Var.count`, we also don't need core.gensym to take an optional sequence of jaxprs, since that was just used to set the starting count index for new `Var`s. In fact, `Var.__repr__` and `JaxprEqn.__repr__` were made confusing after #8019, since they could print variable names totally different from the names that would appear when the same `JaxprEqn` or `Var` objects were printed as part of a jaxpr. That is, before this PR we might have a jaxpr which printed like: ``` import jax def f(x): for _ in range(3): x = jax.numpy.sin(x) return x jaxpr = jax.make_jaxpr(f)(3.) print(jaxpr) # { lambda ; a:f32[]. let # b:f32[] = sin a # c:f32[] = sin b # d:f32[] = sin c # in (d,) } _, eqn, _ = jaxpr.jaxpr.eqns print(eqn) # a:f32[] = sin b ``` Notice the variable names in the equation pretty-print don't correspond to any in the jaxpr pretty-print! So this PR changes `JaxprEqn.__repr__` and `Var.__repr__` to show `Var` object ids, and in general just do less formatting (which seems consistent with the spirit of `__repr__`): ``` JaxprEqn(invars=[Var(id=140202705341552):float32[]], outvars=[Var(id=140202705339584):float32[]], primitive=sin, params={}, effects=set(), source_info=SourceInfo(traceback=<jaxlib.xla_extension.Traceback object at 0x7f837c73d770>, name_stack=NameStack(stack=()))) ``` PiperOrigin-RevId: 607664497	2024-02-16 05:57:12 -08:00
jax authors	330afdc8be	Merge pull request #19819 from mattjj:scan-dont-traverse-body-jaxpr-in-lowering PiperOrigin-RevId: 607570860	2024-02-15 22:36:35 -08:00
Matthew Johnson	5ead7a6ef2	[scan] don't traverse body jaxpr in lowering The main changes are: 1. simplify the scan impl that we trace through to get the lowering, and 2. ensure that when tracing it to a jaxpr, we don't rebuild the scan body jaxpr we already have in hand. The main motivation was (2), but (1) seems like a useful win too. The way we achieve (2) is with a new trick: in our scan_impl function, which is only ever traced to a jaxpr, instead of calling `core.jaxpr_as_fun(jaxpr)(args)` we call a new primitive `eval_jaxpr_p.bind(args, jaxpr=jaxpr)`. This new primitive only has a staging rule defined for it (i.e. all we can do with it is stage it into a jaxpr), and that rule just generates a call into the jaxpr of interest. Therefore we will not traverse into the jaxpr just to rebuild it inline (as before.	2024-02-15 22:13:22 -08:00
Peter Hawkins	885e8a2311	Don't recompute abstract eval rules when inlining a jit jaxpr. The current implementation of jit inlining uses core.eval_jaxpr() and retraces the subjaxpr. This ends up performing abstract evaluation a second time. Instead, write a direct implementation of inlining that doesn't use the tracing machinery. PiperOrigin-RevId: 607418006	2024-02-15 12:28:48 -08:00
George Necula	18698a1f19	[shape_poly] Add support for jnp.split	2024-02-15 14:43:41 +01:00
Peter Hawkins	5833b0767b	Use an isinstance check rather than dtypes.issubdtype to check whether the dtype in an aval is an extended dtype. We don't need the full generality of issubdtype, and this is slightly faster. This operation is very common (e.g., for every aval construction, even with a non-extended dtype). On my laptop: ``` In [18]: d = jnp.dtype(jnp.int32) In [20]: %timeit jax.dtypes.issubdtype(d, jax.dtypes.extended) 490 ns ± 2.78 ns per loop (mean ± std. dev. of 7 runs, 1,000,000 loops each) In [22]: %timeit isinstance(d, jax._src.dtypes.ExtendedDType) 78.3 ns ± 0.111 ns per loop (mean ± std. dev. of 7 runs, 10,000,000 loops each) ``` PiperOrigin-RevId: 606616884	2024-02-13 07:38:02 -08:00
Peter Hawkins	70edc40d4b	Don't look for dimension_as_value on Tracers in core.full_raise. This code triggers the relatively slow `Tracer.__getattr__` path on tracers, but as far as I can see a tracer can never have this attribute. PiperOrigin-RevId: 606612790	2024-02-13 07:19:11 -08:00
George Necula	983bb32ae6	[shape_poly] Add limited support for equality explicit constraints. Previously, we have added support for inequality constraints. Now we also add equality constraints. These are useful when encountering errors due to inability to check equalities of symbolic expressions, e.g., in the broadcasting rules. For example, the current decision procedure cannot decide that `mod(mod(a, 2), 2) == mod(a, 2)`. To work around such limitations, it is now possible to add the above as an equality constraint. Like other explicit constraints, this will be used to decide equalities during staging, and will be checked at shape refinement time. See more details in the README.md changes.	2024-02-08 10:09:47 +01:00
Sergei Lebedev	078bb00fdb	Replaced most usages of abc.ABC with util.StrictABC StrictABC does not allow registering virtual subclasses and can thus avoid using relatively expensive __instancecheck__/__sublclasscheck__ defined in abc.ABCMeta. The only abc.ABC subclass left is jax.Array which does use virtual subclasses for natively-defined array types.	2024-01-29 12:40:43 +00:00
George Necula	a1286d0021	[shape_poly] Improve core.max_dim and core.min_dim Previously, we optimized `core.max_dim(a, b)` to `a` if `a >= b` and to `b` if `a < b`. Now we also optimize it to `b` if `a <= b`. Similarly for `core.min_dim`. At the same time we move more of the logic from `core.py` to `shape_poly.py`.	2024-01-15 15:10:28 +02:00
George Necula	6b7b3a3902	[shape_poly] Replace non_negative_dim with max_dim and min_dim. Previously, we had `core.non_negative_dim` and we used it to express `max(d, 0)`. This is needed in several places internally to express index computations involving clamping (for numpy indexing), or striding and dilation (which have a conditional semantics). It seemed that this special case was sufficient, and we expressed `max(a, b)` as `a + non_negative(b - a)` and `min(a, b)` as `a - non_negative(a - b)`. One drawback was that `non_negative` can be a surprising construct when it appears in error messages. Also, users need `max` and `min` computations with dimensions. It is clearer if we use `max` and `min` directly instead of rewriting these to use `non_negative`. The drawback is that we now have to duplicate some internal logic to for `max` and `min`, but overall I feel this is worth it for the better error messages we get.	2024-01-08 20:54:18 +02:00
Matthew Johnson	05da18ab54	tweaks to enable adding custom tangent dtypes tweaks to enable adding custom tangent dtypes: * fix a bug in zeros_like_shaped_array and KeyTyRules.zero to ensure `scalar_zero` is actually a scalar * upgrade the adder handler for ShapedArray to delegate to an extended dtype rule for addition * convert_element_type shouldnt blanket-disallow extended dtypes; actually that can be a key operation for working with them! instead, add new `convert_from` and `convert_to` rules. instead of letting these rules perform arbitrary logic, for now they can just return a bool indicating whether the conversion is legit; if false, an error is raised, and if true, the existing convert_element_type lowering rule just generates a ConvertElementType HLO from one physical type to the other this pr also adds a test for a custom tangent dtype of interest for plumbing quantization scales out of a backward pass	2023-12-22 11:33:14 -08:00
Matthew Johnson	ec7d28c0b2	revise logic for tangent types of extended dtypes * remove the dead code KeyTangentTy * replace TyRules.make_tangent with TyRules.zero * removed ad.instantiate_zeros_aval, which was redundant with ad.instantiate_zeros ever since (1) we removed units and (2) we made Zero carry an aval on it * fix a bug in backward_pass where we instantiated a Zero at the primal type rather than the corresponding tangent type * fix _f_bwd in test_keyarray_custom_vjp, which had the wrong type (need to return cotangents for all inputs, we were returning a (float_tangent, key_tangent) pair instead of a (float_tangent, (float_tangent, key_tangent)) nested tuple, see #19009 for a check which catches this and hence includes the same test change We probably also need a TyRules.add for any extended dtypes that can occur as tangent dtypes, but we currently don't have any tests that exercise that (because all extended dtype tangent types are currently float0). I have some follow-up work to add such a case though!	2023-12-20 14:24:52 -08:00
Sergei Lebedev	f936613b06	Upgrade remaining sources to Python 3.9 This PR is a follow up to #18881. The changes were generated by adding from __future__ import annotations to the files which did not already have them and running pyupgrade --py39-plus --keep-percent-format {jax,tests,jaxlib,examples,benchmarks}/*/.py	2023-12-13 10:29:45 +00:00
Jake VanderPlas	a1ee8c1743	Improve shape validation when jax_dynamic_shapes=True	2023-12-12 13:58:46 -08:00
jax authors	616f4d29bb	Merge pull request #18888 from superbobry:pp-improvement PiperOrigin-RevId: 590269555	2023-12-12 11:12:42 -08:00
Sergei Lebedev	840abfb7ab	The pretty printer now de-duplicates identical jaxprs This compresses the output e.g. when a jitted function is called repeatedly in a Python loop.	2023-12-12 17:14:43 +00:00
Jake VanderPlas	a52d18781e	Add experimental static key reuse checking	2023-12-11 12:03:48 -08:00
Sergei Lebedev	352e10ed68	`Effects` is now an immutable set This allows safely using `no_effects` as a default value. PiperOrigin-RevId: 589836905	2023-12-11 08:45:52 -08:00
Sergei Lebedev	ea158d3109	Print pjit name= before other params The jaxpr sometimes gets pretty big, making it hard to see the name.	2023-12-07 16:54:07 +00:00
George Necula	0a02d83015	[shape_poly] Add simpler APIs max_dim and min_dim, improve >= 0 Add core.max_dim and core.min_dim as nicer wrappers around the core.non_negative_dim. Also improve the completeness of the heuristics for deciding >= 0, and add more tests.	2023-12-07 09:41:47 +01:00
Jake VanderPlas	c2a0530274	jaxpr: improve printed repr when eqn has no return values	2023-12-06 10:45:24 -08:00
Yash Katariya	e624610e72	Replace apply_primitive internals with `jax.jit`. This allows deletion of a lot of code and leads to ~40% eager performance speedup. Benchmarks: ``` name old time/op new time/op delta eager_unary_dispatch 31.3µs ± 1% 19.4µs ± 6% -37.91% (p=0.016 n=4+5) eager_unary 32.1µs ± 0% 19.8µs ± 4% -38.26% (p=0.016 n=4+5) eager_binary_dispatch 35.9µs ± 1% 20.5µs ± 4% -42.93% (p=0.016 n=4+5) eager_binary 36.6µs ± 1% 21.1µs ± 4% -42.29% (p=0.016 n=4+5) jit_trivial_dispatch 3.87µs ± 2% 4.12µs ±25% ~ (p=1.000 n=5+5) jit_trivial 4.75µs ± 2% 4.82µs ±11% ~ (p=0.690 n=5+5) jit_simple_dispatch 2.95µs ± 2% 2.97µs ± 7% ~ (p=1.000 n=5+5) jit_simple 3.52µs ± 6% 3.51µs ± 5% ~ (p=0.841 n=5+5) jit_simple_dispatch_array 2.95µs ± 2% 2.96µs ± 6% ~ (p=1.000 n=5+5) jit_simple_array 3.46µs ± 2% 3.51µs ± 5% ~ (p=0.690 n=5+5) jit_small_matmul 3.01µs ± 1% 3.00µs ± 4% ~ (p=0.548 n=5+5) jit_big_matmul 34.0µs ±18% 35.5µs ±17% ~ (p=0.310 n=5+5) jit_simple_many_args_dispatch/num_args:10 6.93µs ± 6% 6.80µs ± 6% ~ (p=0.481 n=10+10) jit_simple_many_args_dispatch/num_args:100 47.7µs ± 7% 45.4µs ± 2% ~ (p=0.237 n=10+8) jit_simple_many_args_dispatch/num_args:1000 545µs ± 8% 516µs ± 2% ~ (p=0.101 n=10+8) jit_simple_many_args_dispatch/num_args:2000 1.12ms ± 7% 1.07ms ± 2% ~ (p=0.237 n=10+8) jit_simple_many_args/num_args:10 7.42µs ± 5% 7.23µs ± 2% ~ (p=0.173 n=10+8) jit_simple_many_args/num_args:100 48.4µs ± 7% 45.6µs ± 2% ~ (p=0.237 n=10+8) jit_simple_many_args/num_args:1000 542µs ± 6% 524µs ± 8% ~ (p=0.089 n=10+10) jit_simple_many_args/num_args:2000 1.12ms ± 7% 1.08ms ± 1% ~ (p=0.068 n=10+8) jit_simple_pruned_args_dispatch_10 4.79µs ± 8% 4.98µs ±10% ~ (p=0.421 n=5+5) jit_simple_pruned_args_10 5.32µs ± 6% 5.30µs ± 4% ~ (p=1.000 n=5+5) jit_simple_pruned_args_dispatch_100 24.7µs ± 6% 23.8µs ± 8% ~ (p=0.548 n=5+5) jit_simple_pruned_args_100 25.2µs ± 6% 24.4µs ± 8% ~ (p=0.690 n=5+5) jit_simple_pruned_args_dispatch_1000 238µs ± 7% 232µs ± 8% ~ (p=0.841 n=5+5) jit_simple_pruned_args_1000 240µs ± 7% 234µs ± 8% ~ (p=1.000 n=5+5) jit_simple_pruned_args_dispatch_2000 516µs ± 6% 497µs ± 1% ~ (p=0.413 n=5+4) jit_simple_pruned_args_2000 517µs ± 6% 505µs ± 7% ~ (p=0.690 n=5+5) jit_dispatch_without_transfer 719µs ± 9% 751µs ± 8% ~ (p=0.222 n=5+5) jit_dispatch_with_transfer 799µs ±14% 793µs ± 9% ~ (p=1.000 n=5+5) pmap_trivial_2_devices 49.9µs ±40% 48.2µs ±42% ~ (p=0.841 n=5+5) pmap_trivial_dispatch_8_devices 74.5µs ±24% 78.9µs ±29% ~ (p=0.421 n=5+5) pmap_trivial_8_devices 79.3µs ± 6% 82.7µs ±20% ~ (p=0.841 n=5+5) pmap_simple_2_devices 47.1µs ±17% 49.1µs ±20% ~ (p=0.548 n=5+5) pmap_simple_dispatch_8_devices 73.4µs ±16% 76.8µs ±21% ~ (p=0.690 n=5+5) pmap_simple_8_devices 76.0µs ±10% 80.6µs ±29% ~ (p=1.000 n=5+5) pmap_simple_dispatch_8_devices_100_args 1.12ms ±22% 1.08ms ±42% ~ (p=0.841 n=5+5) pmap_simple_8_devices_100_args 12.5ms ± 8% 12.8ms ±10% ~ (p=1.000 n=5+5) sda_index_1 413µs ± 1% 686µs ± 4% +66.08% (p=0.008 n=5+5) sda_index_2 850µs ± 1% 1378µs ± 4% +62.02% (p=0.008 n=5+5) sda_index_8 3.60ms ± 1% 5.69ms ± 4% +58.00% (p=0.008 n=5+5) bench_shaped_abstractify 300µs ± 1% 305µs ± 3% ~ (p=0.056 n=5+5) bench_xla_abstractify_scalar_int 6.45µs ± 1% 6.50µs ± 3% ~ (p=0.548 n=5+5) bench_xla_abstractify_scalar_float 3.73µs ± 1% 3.73µs ± 3% ~ (p=0.690 n=5+5) bench_xla_abstractify_scalar_numpy_int32 4.97µs ± 1% 4.83µs ± 3% ~ (p=0.095 n=5+5) bench_xla_abstractify_scalar_numpy_uint32 4.91µs ± 1% 4.75µs ± 0% -3.30% (p=0.016 n=5+4) bench_xla_abstractify_numpy_random 4.34µs ± 2% 4.31µs ± 3% ~ (p=0.310 n=5+5) bench_xla_abstractify_numpy_arange_100_float32 3.94µs ± 1% 3.93µs ± 3% ~ (p=0.548 n=5+5) bench_xla_abstractify_enum 6.85µs ± 1% 7.06µs ± 7% +3.07% (p=0.032 n=5+5) bench_are_op_shardings_equal 26.9µs ± 2% 27.0µs ± 3% ~ (p=0.841 n=5+5) bench_pjit_check_aval_sharding 691µs ± 2% 711µs ±13% ~ (p=0.841 n=5+5) bench_addressable_shards_index 656ns ± 4% 688ns ± 9% ~ (p=0.095 n=5+5) bench_remat_eager_retracing_overheads 12.7ms ± 4% 10.7ms ± 1% -15.48% (p=0.016 n=5+4) bench_remat_eager_retracing_overheads_static_argnums 13.0ms ± 2% 11.3ms ± 6% -13.71% (p=0.008 n=5+5) bench_slicing_compilation 12.1ms ± 1% 12.3ms ± 4% ~ (p=0.690 n=5+5) bench_slicing_compilation2 11.3ms ± 0% 11.5ms ± 6% ~ (p=0.690 n=5+5) bench_repeated_static_indexing 62.5ms ± 2% 40.8ms ± 8% -34.77% (p=0.008 n=5+5) bench_repeated_static_slicing 46.7ms ± 1% 31.4ms ± 2% -32.76% (p=0.008 n=5+5) pjit_simple_1_device/num_args:1 2.72µs ± 2% 2.68µs ± 5% ~ (p=0.151 n=5+5) pjit_simple_1_device/num_args:10 12.6µs ± 7% 12.3µs ± 3% ~ (p=0.310 n=5+5) pjit_simple_1_device/num_args:100 109µs ± 3% 108µs ± 4% ~ (p=0.548 n=5+5) pjit_simple_4_device/num_args:1 38.0µs ±26% 36.8µs ±19% ~ (p=0.690 n=5+5) pjit_simple_4_device/num_args:10 93.3µs ±19% 96.6µs ±23% ~ (p=0.841 n=5+5) pjit_simple_4_device/num_args:100 730µs ±16% 698µs ±48% ~ (p=0.841 n=5+5) pjit_aot_1_device/num_args:1 3.29µs ± 2% 3.12µs ± 4% -5.24% (p=0.016 n=4+5) pjit_aot_1_device/num_args:10 13.0µs ± 1% 12.7µs ± 2% ~ (p=0.063 n=4+5) pjit_aot_1_device/num_args:100 111µs ± 5% 110µs ±11% ~ (p=0.421 n=5+5) pjit_aot_4_device/num_args:1 38.4µs ±19% 38.9µs ±24% ~ (p=1.000 n=5+5) pjit_aot_4_device/num_args:10 91.3µs ±15% 96.9µs ±29% ~ (p=0.548 n=5+5) pjit_aot_4_device/num_args:100 676µs ±20% 689µs ±41% ~ (p=0.841 n=5+5) host_local_array_to_global_array 196µs ± 6% 194µs ± 4% ~ (p=0.548 n=5+5) device_put 50.8µs ± 1% 50.7µs ± 4% ~ (p=0.413 n=4+5) device_put_sharded 176µs ± 0% 177µs ± 4% ~ (p=0.190 n=4+5) device_get_8_devices 3.96ms ± 4% 4.03ms ± 7% ~ (p=0.413 n=4+5) np_asarray_8_devices 3.34ms ±18% 3.30ms ±10% ~ (p=0.548 n=5+5) jax_array_arrays_8_devices 5.01ms ±10% 5.09ms ±21% ~ (p=0.421 n=5+5) batch_inplace_while_scatter 440µs ± 1% 439µs ± 1% ~ (p=0.421 n=5+5) batch_inplace_while_dynamic_update_slice 454µs ± 0% 457µs ± 1% ~ (p=0.905 n=4+5) serial_dot_products 4.51µs ± 3% 4.41µs ± 2% ~ (p=0.151 n=5+5) bench_make_array_from_callback_fully_replicated_sharding 26.6µs ± 1% 27.0µs ± 2% ~ (p=0.056 n=5+5) ``` PiperOrigin-RevId: 586505950	2023-11-29 18:07:13 -08:00
Neil Girdhar	3dcf0fc520	Annotate Jaxpr properties	2023-11-10 13:48:56 -05:00
Jake VanderPlas	cd3ea05665	Ensure sharding-related array properties are documented	2023-11-03 09:56:33 -07:00
Sergei Lebedev	f2ce5dbd01	MAINT Do not use `str()` and `repr()` in f-string replacement fields `str()` is called by default by the formatting machinery, and `repr()` only needs `!r`.	2023-10-23 15:12:04 +01:00
Jake VanderPlas	a794bebb33	CI: update mypy to v1.6.0	2023-10-11 12:54:51 -07:00
Sergei Lebedev	65d3058944	Migrate a subset of internal modules to use state objects The motivation here is to gradually replace all dynamic lookups on `jax.config` with statically-typed state objects, which are more type checker/IDE friendly. PiperOrigin-RevId: 571932143	2023-10-09 07:29:53 -07:00
Jake VanderPlas	bfed3d862e	Improve behavior of core.valid_jaxtype	2023-09-22 13:46:09 -07:00
jax authors	256612bb80	Merge pull request #17720 from superbobry:tuple-list-comp PiperOrigin-RevId: 567433086	2023-09-21 15:16:12 -07:00

1 2 3

127 Commits