MindStorm

Soft Actor-Critic

RL > Preliminary
#SAC

Soft Q-Learning

RL > Preliminary
#SQL

Maximum Entropy RL

RL > Preliminary
#MaxEntRL

Gaussian Policy

RL > Preliminary
#Gaussian Policy

DPG

RL > Preliminary
#DDPG #TD3

TRPO and PPO

RL > Preliminary
#TRPO #PPO

Actor-Critic

RL > Preliminary
#AC #A2C

REINFORCE

RL > Preliminary
#REINFORCE #Baseline

Policy Gradient

RL > Preliminary
#Baseline #PG

Value Learning Technique

RL > Preliminary
#ER #Target Q #Double Q
