Deep Reinforcement Learning

Advantage Actor-Critic (A2C) algorithm on Breakout (left) and Space Invaders (right)

Deep Deterministic Policy Gradient (DDPG) on MuJoCo virtual creatures
The objective is to make these creatures walk.

Neural Style Transfer

Variational AutoEncoders & Art