Refine automatic mixed precision support via hyper param by vinhngx · Pull Request #1681 · tensorflow/tensor2tensor

vinhngx · 2019-08-28T01:54:19Z

In continuation of #1637 and in response to @afrozenator 's comments in #1680

In this PR, we re-organize automatic mixed precision training support to provide a cleaner implementation and an easier interface via using hyper parameters.

In particular, GPU automatic mixed precision training can now be enabled via setting a flag (and correspondingly a so-named hyper-parameter) gpu_automatic_mixed_precision for all tensor2tensor models, for example:

Transformer

PROBLEM=translate_ende_wmt32k
MODEL=transformer
HPARAMS=transformer_big
DATA_DIR=/data/translate_ende_wmt32k
TRAIN_DIR=/tmp/$MODEL-$HPARAMS

t2t-trainer \
  --data_dir=$DATA_DIR \
  --problem=$PROBLEM \
  --model=$MODEL \
  --hparams_set=$HPARAMS \
  --output_dir=$TRAIN_DIR \
  --train_steps=100000 \
  --eval_steps=1000 \
  --gpu_automatic_mixed_precision=True

Resnet:

PROBLEM=image_imagenet224
MODEL=resnet
HPARAMS=resnet_50
DATA_DIR=/data/ImageNet
TRAIN_DIR=/tmp/$HPARAMS

t2t-trainer \
  --data_dir=$DATA_DIR \
  --problem=$PROBLEM \
  --model=$MODEL \
  --hparams_set=$HPARAMS \
  --output_dir=$TRAIN_DIR \
  --hparams='batch_size=256' \
  --worker_gpu=8 \
  --gpu_automatic_mixed_precision=True

This is opposed to the previous approaches of setting the OS flag TF_ENABLE_AUTO_MIXED_PRECISION which is a non-programatic approach, or passing the flag gpu_auto_mixed_precision directly to the optimizer (which will require modification of individual models to make call to optimizer with mixed precision training option).

afrozenator · 2019-08-28T05:26:28Z

    opt = tf.contrib.tpu.CrossShardOptimizer(opt)
-  if gpu_auto_mixed_precision or os.environ.get(
-      "TF_ENABLE_AUTO_MIXED_PRECISION", "0") == "1":
+  if hparams.gpu_automatic_mixed_precision:


get(hparams, "gpu_automatic_mixed_precision", False) is preferable -- since people may pass an hparam that doesn't have this param -- for example in tests etc.

good one. I fixed this

afrozenator · 2019-08-28T05:27:18Z

-      memory_height=1
+      memory_height=1,
+      # Whether to use GPU automatic mixed precision (via graph rewrite)
+      gpu_automatic_mixed_precision=False


This is good, but as in your earlier PR, based on a flag can you set this to true?

i.e. after we make the hparams in t2t_trainer, based on your flag, flip this on

Just add the flag again to trainer and turn hparams on accordingly.

afrozenator

Left a few comments -- thanks for the changes

vinhngx · 2019-08-30T02:57:21Z

Thanks for the feedbacks @afrozenator . Let me know if the latest revision works.

afrozenator · 2019-08-30T04:30:27Z

Thanks a lot @vinhngx for contributing this in the first place and now making it better!

Will merge it in shortly.

vinhngx · 2019-08-30T13:28:10Z

great thanks. I'm closing #1680 then.

PiperOrigin-RevId: 266390503

vinhngx added 6 commits August 27, 2019 23:13

moving gpu_auto_mixed_precision parameter to hparams

a627219

fix param naming

3bef6da

move gpu automixed precision training to trainer flag

2f58e79

remove unused os lib

3a133d9

adding automatic mixed precision support as a hparam

7bffd08

revert t2t_trainer changes

152504f

googlebot added the cla: yes PR author has signed CLA label Aug 28, 2019

vinhngx mentioned this pull request Aug 28, 2019

Refine automatic mixed precision support #1680

Closed

afrozenator reviewed Aug 28, 2019

View reviewed changes

adding gpu_automatic_mixed_precision flag to trainer

ecb1ce6

afrozenator merged commit d973bc8 into tensorflow:master Aug 30, 2019

tensorflow-copybara pushed a commit that referenced this pull request Aug 30, 2019

Merge of PR #1681

1a2542e

PiperOrigin-RevId: 266390503

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refine automatic mixed precision support via hyper param#1681

Refine automatic mixed precision support via hyper param#1681
afrozenator merged 7 commits into
tensorflow:masterfrom
vinhngx:v1.14.0-AMP-hparams

vinhngx commented Aug 28, 2019 •

edited

Loading

afrozenator Aug 28, 2019

vinhngx Aug 30, 2019

afrozenator Aug 28, 2019

vinhngx Aug 30, 2019

vinhngx Aug 30, 2019

afrozenator left a comment

vinhngx commented Aug 30, 2019

afrozenator commented Aug 30, 2019

vinhngx commented Aug 30, 2019

Labels

3 participants

Uh oh!

Conversation

vinhngx commented Aug 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

afrozenator Aug 28, 2019

Choose a reason for hiding this comment

vinhngx Aug 30, 2019

Choose a reason for hiding this comment

afrozenator Aug 28, 2019

Choose a reason for hiding this comment

vinhngx Aug 30, 2019

Choose a reason for hiding this comment

vinhngx Aug 30, 2019

Choose a reason for hiding this comment

afrozenator left a comment

Choose a reason for hiding this comment

vinhngx commented Aug 30, 2019

afrozenator commented Aug 30, 2019

vinhngx commented Aug 30, 2019

Labels

3 participants

vinhngx commented Aug 28, 2019 •

edited

Loading