[rllib] Parallel-data loading and multi-gpu support for IMPALA#2766
Test FAILed.
ericl left a comment
Personally, I think the multi-GPU stuff is too messy and we should move away from it. However, it's not clear what the best way to do that is; perhaps TF eager, or moving a lot of the logic into numpy, would make it simpler.
      return Resources(
          cpu=1,
-         gpu=cf["gpu"] and cf["gpu_fraction"] or 0,
+         gpu=cf["num_gpus"] and cf["num_gpus"] * cf["gpu_fraction"] or 0,
I don't know what will happen if gpu_fraction > 1, but for < 1 I believe it should work.
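For reference, a minimal sketch of how the new expression evaluates for a few configs. The helper name `gpu_request` is hypothetical and exists only for illustration; only the `and`/`or` expression comes from the diff. It shows the value that would be requested, not how the scheduler treats a request above 1, which remains the open question.

```python
def gpu_request(cf):
    """Evaluate the GPU request expression from the diff for a config dict.

    Hypothetical helper for illustration only; not part of the PR.
    """
    # `a and b or 0` falls back to 0 whenever `a` is falsy (num_gpus == 0)
    # or the product `b` is falsy (gpu_fraction == 0).
    return cf["num_gpus"] and cf["num_gpus"] * cf["gpu_fraction"] or 0


# No GPUs configured: the request is 0 regardless of gpu_fraction.
print(gpu_request({"num_gpus": 0, "gpu_fraction": 1}))    # 0
# One GPU at half usage: a fractional request.
print(gpu_request({"num_gpus": 1, "gpu_fraction": 0.5}))  # 0.5
# Two whole GPUs.
print(gpu_request({"num_gpus": 2, "gpu_fraction": 1}))    # 2
# gpu_fraction > 1 simply scales the request; downstream behavior is untested here.
print(gpu_request({"num_gpus": 1, "gpu_fraction": 2}))    # 2
```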
It would be good to figure this out. Sphinx doesn't seem to have a good way to do this, unless you duplicate the comments in the dict, or change it to a class or something. For now I think it's reasonable to expect users to read the code, since it's at the top of the file.
Test FAILed.
I just did a quick multi-GPU run on PPO / IMPALA Atari and perf still looks good.
Test FAILed.
Test PASSed.
What do these changes do?
Related issue number