
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning

TAAC: Temporally Abstract Actor-Critic for Continuous Control

An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards

Generative Particle Variational Inference via Estimation of Functional Gradients

Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers

Mutual Information State Intrinsic Control

Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning
