DeepMind’s New AI MuZero Mastered More Than 50 Games

DeepMind’s New AI MuZero Mastered More Than 50 Games

Two Minute Papers explores the paper "Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model." Details
OpenAI Safety Gym: A Safe Place For AIs To Learn

OpenAI Safety Gym: A Safe Place For AIs To Learn

Two Minute Papers takes a look at the paper "Benchmarking Safe Exploration in Deep Reinforcement Learning.” Details
DeepMind’s AlphaStar: A Grandmaster Level StarCraft 2 AI

DeepMind’s AlphaStar: A Grandmaster Level StarCraft 2 AI

Two Minute Papers explores the paper "AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning" in this video.  Details
Why Deep Q Learning Needs A Target Network and Replay Memory

Why Deep Q Learning Needs A Target Network and Replay Memory

Machine Learning with Phil has got another interesting look at Deep Q Learning as part of a preview of his course. The two biggest innovations... Details
Naive Actor Critic With Experience Replay

Naive Actor Critic With Experience Replay

Machine Learning with Phil posted this tutorial to apply experience replay to the actor critic algorithm. It seems smart, but it turns out that it... Details
Deep Q Learning From Paper to Code

Deep Q Learning From Paper to Code

After a particularly fascinating talk I attended last week at MLADS, I want to spend more time focused on Deep Q Learning. Fortunately, YouTuber Phil... Details
Reinforcement Learning Tutorial For Beginners

Reinforcement Learning Tutorial For Beginners

edureka! covers the basics of reinforcement learning in this tutorial for beginners. Details
OpenAI Plays Hide and Seek and Outsmarts the Rules

OpenAI Plays Hide and Seek and Outsmarts the Rules

Two Minute Papers examines the paper "Emergent Tool Use from Multi-Agent Interaction" and why we may have a hard time controlling AI. Details
AI Learns to Park with Deep Reinforcement Learning

AI Learns to Park with Deep Reinforcement Learning

Samuel Arzt shows off a project where an AI learns to park a car in a parking lot in a 3D physics simulation. The simulation... Details
Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare

Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare

Microsoft Research jsut posted this talk from Reinforcement Learning Day 2019: “Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare.”  See more at... Details