Conversation

@ricardoV94 (Member)

Related to #1806 and #1827.

- Fix a bug when passing a simple Tensor shape to `split_dims`
- Change `grad_undefined` -> `grad_disconnected` for `split_sizes` in `SplitOp` (see #1827 for more context)
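For context, a hedged sketch of the difference between the two gradient conventions. Both names exist in pytensor.gradient, but the exact SplitOp change isn't shown in this excerpt:

```python
from pytensor.gradient import DisconnectedType, grad_undefined

# Before: the gradient w.r.t. split_sizes was declared undefined, so asking
# for it raised an error at grad time, roughly:
# g_sizes = grad_undefined(self, 1, split_sizes)

# After: the output is declared disconnected from split_sizes, so
# pytensor.grad treats that gradient as structurally absent instead of erroring
g_sizes = DisconnectedType()()
```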

@jessegrabowski (Member)

I reverted the changes to as_tensor_variable. At a minimum, they're out of scope for this PR. Implementing more careful checks of the shape argument (based on the analysis in the comment above) was sufficient to clear the test failures. We can revisit the ndims argument later.

Something else I noticed is that we're passing dtype to as_tensor_variable. This doesn't do anything in the Variable case, so I changed it to an explicit cast inside the Op's make_node (I left it in the wrapper to handle the Sequence case).
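A minimal sketch of the behavior being described, with illustrative variable names:

```python
import pytensor.tensor as pt

# A float64 shape variable, as a user might (incorrectly) pass it
shape = pt.vector("shape", dtype="float64")

# as_tensor_variable returns Variables unchanged, so a dtype passed
# alongside it cannot retype them
converted = pt.as_tensor_variable(shape)
assert converted.dtype == "float64"

# An explicit cast is what actually changes the dtype
shape_int = pt.cast(shape, "int64")
assert shape_int.dtype == "int64"
```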

@ricardoV94 (Member Author)

No, better not to cast variables in make_node but to raise like before; that's what shape Ops always do. If a user passes a float as a shape argument, it's likely a bug, and casting would mask it.
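A hedged sketch of the raise-instead-of-cast approach; the helper name is hypothetical, not the PR's actual code:

```python
import pytensor.tensor as pt
from pytensor.tensor.type import integer_dtypes

def as_int_shape(shape):
    # Hypothetical helper: convert TensorLike input, then refuse non-integer
    # dtypes instead of silently casting, so a float shape argument surfaces
    # as the bug it likely is
    shape = pt.as_tensor_variable(shape)
    if shape.type.dtype not in integer_dtypes:
        raise TypeError(f"shape must have an integer dtype, got {shape.type.dtype}")
    return shape
```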

@jessegrabowski (Member)

Someday I will merge a PR

```diff
 )

-if not shape:
+if empty_shape:
```
@ricardoV94 (Member Author) commented on Jan 10, 2026

What about just `shape.type.shape == (0,)` for the variable case? Also, if you standardize through as_tensor_variable first, you don't need the variable vs. non-variable case.
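A sketch of the suggested check, assuming the argument has already gone through as_tensor_variable:

```python
import pytensor.tensor as pt

# Constants and sequences become length-0 vectors with a fully static type,
# so a single static-shape check covers both the variable and non-variable case
shape = pt.as_tensor_variable(())
empty_shape = shape.type.shape == (0,)  # True only when statically known to be empty
assert empty_shape
```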

@ricardoV94 (Member Author)

But also, do we need the special squeeze branch, or would the Op do the right thing anyway?

@jessegrabowski (Member)

Tests pass without it (as long as I adjust the existing test_split_size_zero_shape test to pass dtype int to the shape argument), so I guess not.

@ricardoV94 (Member Author)

I'm happy with the PR. I'll fix the git history and merge

@ricardoV94 force-pushed the split_dims_tweak branch 2 times, most recently from 39f8dc4 to deaf670 on January 11, 2026 at 12:21
@ricardoV94 (Member Author)

I made some further simplifications and also cleaned up the type hints. The ShapeValueType from shape.py is not the right thing, because it also allows Ellipsis and None, which split_dims and unpack do not.
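A hedged sketch of the distinction; the alias definition below is an assumption about shape.py's intent, not its actual code:

```python
from types import EllipsisType

from pytensor.tensor.variable import TensorVariable

# Assumed rough shape of the alias in shape.py: entries may be None or Ellipsis
ShapeValueType = int | None | EllipsisType | TensorVariable

# What split_dims/unpack actually work with: concrete entries only
StrictShapeValueType = int | TensorVariable
```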

Remove cases where no type-hints are better than bad type-hints

```diff
-    outputs: Sequence[Variable],
-    output_grads: Sequence[Variable],
-) -> list[Variable]:
+def L_op(self, inputs, outputs, output_grads):
```
@ricardoV94 (Member Author)

I strongly disagree with appeasing mypy here and pretending we don't know that we can only ever get and return TensorVariable.

```diff
 self.axis = axis

-def make_node(self, x: Variable, shape: Variable) -> Apply:  # type: ignore[override]
+def make_node(self, x, shape):
```
@ricardoV94 (Member Author)

This was wrong, as x and shape may be TensorLike.
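A small sketch of why the old annotation was too narrow, with illustrative variable names:

```python
import numpy as np
import pytensor.tensor as pt

# TensorLike means make_node can receive raw Python/NumPy objects, not only Variables
x_raw = np.zeros((2, 3))
shape_raw = [2, 3]

# Inside make_node, these get converted along the lines of:
x = pt.as_tensor_variable(x_raw)
shape = pt.as_tensor_variable(shape_raw)
assert x.ndim == 2 and shape.ndim == 1
```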

```python
# example when splitting a packed tensor that had its dims expanded before packing (e.g. when packing shapes
# (3,) and (3, 3) to (3, 4))
return squeeze(x, axis=axis)  # type: ignore[no-any-return]
axis = normalize_axis_index(axis, x.ndim)
```
@ricardoV94 (Member Author)

It can only be an index, not a tuple, so be more pedantic.
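For context, normalize_axis_index accepts exactly one integer axis. The import path below is the NumPy >= 2.0 location; PyTensor wraps this internally, so the exact import in the PR may differ:

```python
from numpy.exceptions import AxisError
from numpy.lib.array_utils import normalize_axis_index  # NumPy >= 2.0 location

assert normalize_axis_index(-1, 3) == 2  # negative axes wrap around
try:
    normalize_axis_index(3, 3)  # out of range for ndim=3
except AxisError:
    pass
# A tuple is rejected outright; normalize_axis_tuple is the helper for that case
```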

```diff
 def pack(
     *tensors: TensorLike, axes: Sequence[int] | int | None = None
-) -> tuple[TensorVariable, list[ShapeValueType]]:
+) -> tuple[TensorVariable, list[TensorVariable]]:
```
@ricardoV94 (Member Author)

We only return TensorVariable shapes, not the flexible input types
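A usage sketch implied purely by the annotated signature above; the module hosting pack isn't visible in this excerpt, so the import is deliberately omitted rather than guessed:

```python
import pytensor.tensor as pt

x = pt.tensor3("x")
y = pt.matrix("y")

# `pack` is the function from the diff above; no import path is asserted here
packed, shapes = pack(x, y)  # axes defaults to None

# Per the corrected annotation, each element of `shapes` is a symbolic
# TensorVariable, never an int/None/Ellipsis ShapeValueType entry
```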

@ricardoV94 merged commit d8b51df into pymc-devs:main on Jan 11, 2026 (66 checks passed)
@ricardoV94 deleted the split_dims_tweak branch on January 11, 2026 at 19:47