change reference_data.py to use tf.gfile #3921

robieta · 2018-04-10T00:45:05Z

Reference tests are failing on TAP because they use open(). Due to some quirks with gfile (in py2 it defaults to bytes and in py3 it defaults to string, but it is illegal to pass "rt" or "rb") it is necessary to only interact with files as bytes. This leads to some awkward encode/decode calls with json, but those calls keep the code identical between py2 and py3.

qlzh727

I am curious about why read/write to JSON file as binary format. What's the content of JSON file? TF version should just be text.

robieta · 2018-04-10T05:06:03Z

My contention would be that we should adopt one absolutely safe way to dump JSON, and use it everywhere. Which unfortunately is serializing and then writing to the file. I understand the position that sometimes it's overkill, but that way the code is robust and you never have to think about whether the code will ever see ASCII.

karmel

Please add lots of comments explaining why this is necessary to avoid the next helpful engineer from thinking, gee, this could be simpler if we...

qlzh727 · 2018-04-10T18:50:11Z

As chatted offline, I think for now its better to skip the encode/decode for utf8 since most of the data you dump to json are just text data. Using "wb" and "rb" confuses people.

robieta · 2018-04-10T20:08:34Z

Alright, everything should be good now. The tests were also throwing warnings due to a small change in batch_norm, so I updated them.

karmel · 2018-04-10T23:25:09Z

"Superficial change in batch_norm" in theory shouldn't require an update here, right? I thought we were going to be robust to that? I worry that we're about to introduce flaky tests.

robieta · 2018-04-10T23:30:03Z

We are robust to that. If you don't change the reference files it still passes, you just get warnings that the op graph is different than what is expected. So once we confirm that the change is innocuous we can make the change at our convenience.

* change reference_data.py to use tf.gfile * simplify json treatment * Update reference files to account for a superficial change in batch_norm

robieta requested review from karmel and qlzh727 April 10, 2018 00:45

robieta requested a review from a team as a code owner April 10, 2018 00:45

googlebot added the cla: yes label Apr 10, 2018

qlzh727 reviewed Apr 10, 2018

View reviewed changes

karmel suggested changes Apr 10, 2018

View reviewed changes

robieta force-pushed the reference_testing_gfile branch from f3c519b to 3f29187 Compare April 10, 2018 19:44

Taylor Robie added 3 commits April 10, 2018 15:50

change reference_data.py to use tf.gfile

d65aac1

simplify json treatment

c514af5

Update reference files to account for a superficial change in batch_norm

7f6373d

robieta force-pushed the reference_testing_gfile branch from b3a2443 to 7f6373d Compare April 10, 2018 22:51

karmel approved these changes Apr 10, 2018

View reviewed changes

robieta merged commit 2661eb9 into master Apr 10, 2018

robieta deleted the reference_testing_gfile branch April 11, 2018 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

change reference_data.py to use tf.gfile #3921

change reference_data.py to use tf.gfile #3921

robieta commented Apr 10, 2018

qlzh727 left a comment

robieta commented Apr 10, 2018

karmel left a comment

qlzh727 commented Apr 10, 2018

robieta commented Apr 10, 2018

karmel commented Apr 10, 2018

robieta commented Apr 10, 2018

change reference_data.py to use tf.gfile #3921

change reference_data.py to use tf.gfile #3921

Conversation

robieta commented Apr 10, 2018

qlzh727 left a comment

Choose a reason for hiding this comment

robieta commented Apr 10, 2018

karmel left a comment

Choose a reason for hiding this comment

qlzh727 commented Apr 10, 2018

robieta commented Apr 10, 2018

karmel commented Apr 10, 2018

robieta commented Apr 10, 2018