PEARL: Meta-RL
22 Mar 2019, Prathyush SPIt is 20-100x faster than prior methods, with better final performance, using soft actor-critic and order-invariant context embedding:
For more details, visit the source.
It is 20-100x faster than prior methods, with better final performance, using soft actor-critic and order-invariant context embedding:
For more details, visit the source.