add ADE20K dataset #3853

walkerlala · 2018-04-03T12:06:49Z

This PR provide scripts/documents for downloading/converting the ADE20K dataset and training deeplabv3 on it. One particular thing to note about is the exclude_list variable in utils/train_utils.py. I think it is OK to do this, but it might hurt performance of other models.

YknZhu

Thank you @walkerlala ! Mainly style nits.

YknZhu · 2018-04-03T16:45:48Z

research/deeplab/datasets/build_ade20k_data.py

+FLAGS = tf.app.flags.FLAGS
+flags = tf.app.flags
+
+tf.app.flags.DEFINE_string(


Ah.. just some ancient code. I will remove it ;-)

YknZhu · 2018-04-03T16:55:54Z

research/deeplab/datasets/build_ade20k_data.py

+import random
+import string
+import sys
+from PIL import Image


Import PIL

https://google.github.io/styleguide/pyguide.html?showone=Imports#Imports

This is legacy code. I will remove it.

YknZhu · 2018-04-03T17:49:24Z

research/deeplab/datasets/build_ade20k_data.py

+    RuntimeError: If loaded image and label have different shape.
+  """
+
+  img_names = glob.glob(os.path.join(dataset_dir, '*.jpg'))


maybe tf.gfile.Glob(...)?

YknZhu · 2018-04-03T17:55:34Z

research/deeplab/datasets/build_ade20k_data.py

+    seg_names.append(seg)
+
+  num_images = len(img_names)
+  num_per_shard = int(math.ceil(num_images) / float(_NUM_SHARDS))


math.ceil(num_images / float(_NUM_SHARDS)) ?

YknZhu · 2018-04-03T17:59:32Z

research/deeplab/datasets/download_and_convert_ade20k.sh

+
+CURRENT_DIR=$(pwd)
+WORK_DIR="./ADE20K"
+mkdir -p ${WORK_DIR}


"${WORK_DIR}", please also fix variables below.

https://google.github.io/styleguide/shell.xml?showone=Variable_expansion#Variable_expansion

I am just trying to "stay consistent with existing code ;-)". Anyway, I will fix those.

aquariusjay

Thanks, walkerlala!
Overall looks great. A few suggestions. Please take a look.

aquariusjay · 2018-04-03T18:06:23Z

research/deeplab/datasets/build_ade20k_data.py

+_NUM_SHARDS = 4
+
+def _convert_dataset(dataset_split, dataset_dir, dataset_label_dir):
+  """ Convert the ADE20k dataset into into tfrecord format (SSTable).


"" Convert -> """Converts

aquariusjay · 2018-04-03T18:06:56Z

research/deeplab/datasets/build_ade20k_data.py

+  """ Convert the ADE20k dataset into into tfrecord format (SSTable).
+
+  Args:
+    dataset_split: dataset split (e.g., train, val)


Make first word capital.

And put a period (i.e., '.') at the end of each sentence.

aquariusjay · 2018-04-03T18:09:25Z

research/deeplab/datasets/segmentation_dataset.py

+    splits_to_sizes = {
+        'train': 20210, # num of samples in images/training
+        'val': 2000, # num of samples in images/validation
+        'eval': 2,


What is this split? Add comment for it? It seems that this split is not generated?

Oh yes, this split is used for my own testing. I just forgot to remove it. I will remove it.

aquariusjay · 2018-04-03T18:20:29Z

research/deeplab/utils/train_utils.py

@@ -99,7 +99,7 @@ def get_model_init_fn(train_logdir,
  tf.logging.info('Initializing model from path: %s', tf_initial_checkpoint)

  # Variables that will not be restored.
-  exclude_list = ['global_step']
+  exclude_list = ['global_step', 'logits']


There are cases where you do want to restore `logits' weights (e.g., further fine-tuning them on validation set).
My proposal will be: Could you please

Modify get_extra_layer_scopes in model.py as follows.

def get_extra_layer_scopes(last_layers_contain_logits_only=False):
"""Gets the scopes for extra layers.

Args:
last_layers_contain_logits_only: Boolean, True if only consider logits
as the last layers (i.e., exclude ASPP module, decoder module and so on).

Returns:
A list of scopes for extra layers.
"""
if last_layers_contain_logits_only:
return [_LOGITS_SCOPE_NAME]
else:
return [
_LOGITS_SCOPE_NAME,
_IMAGE_POOLING_SCOPE,
_ASPP_SCOPE,
_CONCAT_PROJECTION_SCOPE,
_DECODER_SCOPE,
]

Add a flag in train.py.

flags.DEFINE_boolean('last_layers_contain_logits_only', False,
'Only consider logits as last layers or not.')

Modify line 295 in train.py as follows.

last_layers = model.get_extra_layer_scopes(
FLAGS.last_layers_contain_logits_only)

aquariusjay · 2018-04-03T18:23:44Z

research/deeplab/g3doc/ade20k.md

+    fine_tune_batch_norm = False.
+
+2. User should fine tune the `min_resize_value` and `max_resize_value` to get
+   better result. Note that `resize_factor` has to equals to `output_stride`.


has to equals -> has to be equal

walkerlala · 2018-04-06T13:52:42Z

@YknZhu @aquariusjay All fixed. Please review.

YknZhu

Looks great! Please wait for @aquariusjay's review.

aquariusjay

Looks great! Thank you, walkerlala!

RomRoc · 2018-04-29T15:01:15Z

Hello @walkerlala , could you share a script similar to local_test.sh with all steps to download, convert and retrain a deeplab model with ADE20K dataset, instead of Pascal dataset like in local_test.sh ?
Thanks

add ADE20K dataset

17ba1ca

walkerlala requested review from aquariusjay and YknZhu as code owners April 3, 2018 12:06

googlebot added the cla: yes label Apr 3, 2018

walkerlala mentioned this pull request Apr 3, 2018

[deeplab] Training deeplab model with ADE20K dataset #3730

Open

YknZhu reviewed Apr 3, 2018

View reviewed changes

aquariusjay reviewed Apr 3, 2018

View reviewed changes

fix code style problem and add option "last_layers_contain_logits_only"

13c9de3

YknZhu reviewed Apr 6, 2018

View reviewed changes

aquariusjay approved these changes Apr 6, 2018

View reviewed changes

aquariusjay merged commit 6741cfc into tensorflow:master Apr 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add ADE20K dataset #3853

add ADE20K dataset #3853

walkerlala commented Apr 3, 2018

YknZhu left a comment

YknZhu Apr 3, 2018

walkerlala Apr 6, 2018

YknZhu Apr 3, 2018

walkerlala Apr 6, 2018

YknZhu Apr 3, 2018

YknZhu Apr 3, 2018

walkerlala Apr 6, 2018

YknZhu Apr 3, 2018

walkerlala Apr 6, 2018

aquariusjay left a comment

aquariusjay Apr 3, 2018

aquariusjay Apr 3, 2018

aquariusjay Apr 3, 2018

aquariusjay Apr 3, 2018

walkerlala Apr 6, 2018 •

edited

Loading

aquariusjay Apr 3, 2018

aquariusjay Apr 3, 2018

walkerlala commented Apr 6, 2018

YknZhu left a comment

aquariusjay left a comment

RomRoc commented Apr 29, 2018

add ADE20K dataset #3853

add ADE20K dataset #3853

Conversation

walkerlala commented Apr 3, 2018

YknZhu left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aquariusjay left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

walkerlala Apr 6, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

walkerlala commented Apr 6, 2018

YknZhu left a comment

Choose a reason for hiding this comment

aquariusjay left a comment

Choose a reason for hiding this comment

RomRoc commented Apr 29, 2018

walkerlala Apr 6, 2018 •

edited

Loading