Reinforcement Learning Q Learning [code] Sarsa [code] Deep Q Network (DQN)[code] Policy gradient [code] Actor-Critic [code] Proximal Policy Optimization (PPO) [code] Deep Deterministic Policy Gradient (DDPG) [code] Soft Actor-Critic [code] Control as inference [code]