Agenda

This page hosts the agenda for each TA meeting, held on Mondays.

Week 2 (26-04-2021)

  • Introduction with TA
  • Propose CURL paper

Week 3 (03-05-2021)

  • CURL is an improvement on Pixel SAC. Would it be good to also implement the baseline? Doing so mostly involves swapping the first couple of CNN layers.
  • Currently we have copied the utility classes and training structure from the original repo. Is this okay to do?
  • Discuss further research options in case the implementation turns out to be too trivial.
  • Discuss which repo to use as a basis for the reproduction.

Week 4 (10-05-2021)

  • See Week 3

Week 5 (17-05-2021)

  • We are unsure exactly how to update the encoder from the SAC side. Should we use the actor, the critic, or both?

Week 6 (24-05-2021)

  • Training for 100k timesteps will take approximately 15 hours on cheetah run; other environments might be slower or faster. What is your opinion on this duration?

Week 7 (31-05-2021)

Week 8 (07-06-2021)

Week 9 (14-06-2021)

Week 10 (21-06-2021)

Week 11 (28-06-2021)