
Enable dynamic shape for XLATensorImpl::sym_sizes_custom() #3829


Open · wants to merge 38 commits into master

Conversation

miladm (Collaborator) commented Aug 4, 2022

Enable dynamic shape for XLATensorImpl::sym_sizes_custom()

miladm (Collaborator, Author) commented Aug 4, 2022

Here is the reference to `is_dynamic_dimension`.

miladm (Collaborator, Author) commented Aug 7, 2022

Local tests pass for `python ../test/test_view_ops.py -v TestViewOpsXLA.test_view_copy_xla` as shown below, though they fail on CI:

```
2022-08-07 23:30:32.889533: W 1463191 tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcuda.so.1'; dlerror: libcuda.so.1: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda/lib64:/usr/local/nvidia/lib:/usr/local/nvidia/lib64
2022-08-07 23:30:32.889591: W 1463191 tensorflow/stream_executor/cuda/cuda_driver.cc:269] failed call to cuInit: UNKNOWN ERROR (303)
test_view_copy_xla (__main__.TestViewOpsXLA) ... ok

----------------------------------------------------------------------
Ran 1 test in 0.233s

OK
```

```
@@ -110,7 +114,11 @@ void XLATensorImpl::shallow_copy_from(
}

at::IntArrayRef XLATensorImpl::sizes_custom() const {
  const_cast<XLATensorImpl*>(this)->SetupSizeProperties();
  if (true) { /* TODO(@miladm): replace this with a flag */
    const_cast<XLATensorImpl*>(this)->SetupSymSizeProperties();
```
Collaborator:

@Krovatkin Is the plan to always use sym_sizes even for the static ints?

Contributor:

No. In his later PR, @miladm uses is_dynamic to decide whether we need to create SymIntNodes or static ints:

https://github.com/pytorch/xla/pull/3909/files#diff-c4e1dd39b63d78af7c207b2d48ac29553d74214b1c185ae34e084dd2f583879eR197
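For context, here is a minimal sketch of that idea. The helper `MakeSymIntFromSizeNode` is hypothetical, not the actual #3909 code; only `is_dynamic_dimension` and `dimensions` come from the real `xla::Shape` API:

```cpp
// Sketch only: decide per dimension whether to create a symbolic node or
// a plain static int, based on the XLA shape's dynamic-dimension flags.
std::vector<c10::SymInt> sym_sizes;
const xla::Shape& shape = tensor_->shape().get();
for (int64_t i = 0; i < shape.rank(); ++i) {
  if (shape.is_dynamic_dimension(i)) {
    // Dynamic dimension: back the SymInt with a node that can lazily
    // materialize the exact runtime size when Python asks for it.
    sym_sizes.push_back(MakeSymIntFromSizeNode(tensor_, i));  // hypothetical helper
  } else {
    // Static dimension: wrap the concrete integer directly.
    sym_sizes.push_back(c10::SymInt(shape.dimensions(i)));
  }
}
```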

Contributor:

In this PR, @miladm is forcing both `sym_sizes_` and `sizes_and_strides_` to be filled with upper bounds; there are no real SymIntNodes yet (see the sketch below).
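A rough sketch of what that amounts to (simplified, not the literal PR diff; `shape` stands for the tensor's `xla::Shape`, as in the surrounding code):

```cpp
// Both storages hold the same static upper bounds for now: for a dynamic
// dimension, xla::Shape::dimensions(i) is its upper bound. The values are
// wrapped in c10::SymInt only because upstream now uses SymInt as the
// unified size type.
std::vector<c10::SymInt> upper_bounds;
for (int64_t i = 0; i < shape.rank(); ++i) {
  upper_bounds.push_back(c10::SymInt(shape.dimensions(i)));
}
sym_sizes_ = upper_bounds;                   // served via sym_sizes_custom()
sizes_and_strides_.set_sizes(upper_bounds);  // served via sizes_custom()
```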

```
    numel_ *= tensor_->shape().get().dimensions(i);
    // }
  }
  sizes_and_strides_.set_sizes(sym_sizes);
```
Collaborator:

@Krovatkin From what I can tell, this PR switches from setting sizes to setting sym_sizes for the TensorImpl's `sizes_and_strides_`. I am guessing upstream already supports setting only SymInt sizes?

Contributor:

> From what I can tell, this PR switches from setting sizes to setting sym_sizes for the TensorImpl's `sizes_and_strides_`.

Kind of. This PR is the first step in enabling real dynamic shapes. Note that we have two separate storages here:

- `sizes_and_strides_`
- `sym_sizes_`

Unfortunately, we can't just make C++ sizes() start throwing a "NotImplemented" exception; otherwise we would break a lot of code. We want sizes() to return upper bounds.
This is why @miladm is populating set_sizes_and_strides with concrete integers wrapped in SymInts. You could think of this logic as just saving upper bounds; we only need to wrap them in SymInts because upstream now uses the unified type SymInt.

Now the doubly unfortunate part. We know that upstream can store real SymIntNodes in sizes_and_strides_, wrapped in SymInts. However, we can't take advantage of that: if we store real SymIntNodes in sizes_and_strides_ and someone calls sizes() in C++, that would trigger conversions from SymIntNodes to ints, which would trigger materialization.

So we decided to have separate storage for sym_sizes(), namely sym_sizes_.

This way, when someone calls size() in Python, we would call XLATensorImpl::sym_sizes_custom, which returns sym_sizes_, which may contain both concrete ints and SymIntNodes.

Now, if a user uses one of those SymIntNodes in Python, it will trigger materialization of the SymIntNode (since we do want the exact result, at least according to our discussions).

Phew....
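To make the two read paths concrete, here is a simplified sketch (not the exact torch_xla code) of how the C++ and Python entry points stay separate:

```cpp
// C++ callers keep getting plain ints: only static upper bounds live in
// sizes_and_strides_, so nothing on this path can accidentally
// materialize a symbolic value.
at::IntArrayRef XLATensorImpl::sizes_custom() const {
  const_cast<XLATensorImpl*>(this)->SetupSizeProperties();
  return sizes_default();
}

// Python's tensor.size() ends up here instead: sym_sizes_ may mix
// concrete ints and SymIntNodes, and using a symbolic entry as an int in
// Python is what triggers materialization of the exact value.
c10::SymIntArrayRef XLATensorImpl::sym_sizes_custom() const {
  const_cast<XLATensorImpl*>(this)->SetupSymSizeProperties();
  return c10::SymIntArrayRef(sym_sizes_.data(), sym_sizes_.size());
}
```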

Contributor:

@JackCaoG the part I'm not 100% sure about is why we need to populate sizes_and_strides_ in addition to sym_sizes_ when sym_sizes is called. I'd think that when someone calls sizes() it would update sizes_and_strides_, and when someone calls sym_sizes() it would set sym_sizes_.
Presumably, @miladm ran into some issues, so we do need to set both when sym_sizes is called.
