Machine Learning with Phil dives into Deep Q Learning with Tensorflow 2 and Keras.

Dueling Deep Q Learning is easier than ever with Tensorflow 2 and Keras. In this tutorial for deep reinforcement learning beginners we’ll code up the dueling deep q network and agent from scratch, with no prior experience needed. We’ll train an agent to land a spacecraft on the surface of the moon, using the lunar lander environment from the OpenAI Gym.

The dueling network can be applied to both regular and double q learning, as it’s just a new network architecture. It doesn’t require any change to the q learning or double q learning algorithms. We simply have to change up our feed forward to accommodate the new value and advantage streams, and combine them in a way that makes sense.

Samuel Arzt shows off a project where an AI learns to park a car in a parking lot in a 3D physics simulation.

The simulation was implemented using Unity’s ML-Agents framework (

From the video description:

The AI consists of a deep Neural Network with 3 hidden layers of 128 neurons each. It is trained with the Proximal Policy Optimization (PPO) algorithm, which is a Reinforcement Learning approach.