At the outset, thanks for a very well-structured course. I am coming from a TensorFlow background and this course is helping me a lot in picking up PyTorch-related concepts quickly! Working with devices in TensorFlow is very intuitive: we just select a device for the session or specify a distributed strategy and things work out of the box. I have two questions related to this.

2a. Why should we choose a device for the data?

2b. If we choose GPU:1 for the data and GPU:0 for the model (separate devices for model and data), it should introduce latency that will be noticeable for bigger datasets. Why is this option given? The design should have been for the data and model to reside on the same device to speed things up. Surely I am missing something trivial. Need your inputs.
-
Hi @ra9hur, Good questions!
This is for maximum customizability of the code, and for the ability to choose which device each computation happens on. E.g. some computations may run better on a GPU, others better on a CPU.
Model + data have to be on the same device for computation to happen. If the model is on the GPU and the data is on the CPU, an error will occur (they aren't stored in the same place).
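To make that concrete, here's a minimal sketch of the device-matching rule (the linear model and random tensor are just illustrative stand-ins, not from the course materials):

```python
import torch
from torch import nn

# Use a GPU if one is available, otherwise fall back to the CPU
device = "cuda" if torch.cuda.is_available() else "cpu"

# A toy model and a batch of data, both created on the CPU by default
model = nn.Linear(in_features=10, out_features=1)
data = torch.rand(32, 10)

# Move only the model to the target device
model = model.to(device)

# On a GPU machine, the next line would raise a RuntimeError, because the
# model's weights live on "cuda" while the data still lives on "cpu":
# output = model(data)

# Moving the data to the same device fixes it
output = model(data.to(device))
```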
This is correct: for speed, the model and its data should reside on the same device. If you want to scale across GPUs, you'll likely want to use tooling that supports multi-GPU usage, such as PyTorch's built-in `torch.nn.parallel.DistributedDataParallel` or higher-level wrappers like PyTorch Lightning.
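As a quick sketch of the simplest built-in option (one assumption here is that a data-parallel setup fits your use case; for serious training runs, `DistributedDataParallel` is the recommended, faster choice):

```python
import torch
from torch import nn

model = nn.Linear(in_features=10, out_features=1)

# nn.DataParallel replicates the model on each visible GPU and splits
# every input batch across them; it's the quickest way to try multi-GPU
# from a single script.
if torch.cuda.device_count() > 1:
    model = nn.DataParallel(model)

model = model.to("cuda" if torch.cuda.is_available() else "cpu")
```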