Naive Actor Critic With Experience Replay

Naive Actor Critic With Experience Replay

Machine Learning with Phil posted this tutorial to apply experience replay to the actor critic algorithm. It seems smart, but it turns out that it... Details
Deep Q Learning From Paper to Code

Deep Q Learning From Paper to Code

After a particularly fascinating talk I attended last week at MLADS, I want to spend more time focused on Deep Q Learning. Fortunately, YouTuber Phil... Details
Reinforcement Learning Tutorial For Beginners

Reinforcement Learning Tutorial For Beginners

edureka! covers the basics of reinforcement learning in this tutorial for beginners. Details
OpenAI Plays Hide and Seek and Outsmarts the Rules

OpenAI Plays Hide and Seek and Outsmarts the Rules

Two Minute Papers examines the paper "Emergent Tool Use from Multi-Agent Interaction" and why we may have a hard time controlling AI. Details
AI Learns to Park with Deep Reinforcement Learning

AI Learns to Park with Deep Reinforcement Learning

Samuel Arzt shows off a project where an AI learns to park a car in a parking lot in a 3D physics simulation. The simulation... Details
Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare

Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare

Microsoft Research jsut posted this talk from Reinforcement Learning Day 2019: “Towards Using Batch Reinforcement Learning to Identify Treatment Options in Healthcare.”  See more at... Details
The 7 Capabilities Every AI Should Have

The 7 Capabilities Every AI Should Have

Two Minute Papers examines the paper "Behaviour Suite for Reinforcement Learning" is available here: https://arxiv.org/abs/1908.03568 and the source code is here  https://github.com/deepmind/bsuite Details
Multi-Agent Hide and Seek

Multi-Agent Hide and Seek

OpenAI has an interesting video explaining some of their latest research behind training reinforcement learning agents how to play hide and seek. Details
This Adorable Baby T-Rex AI Learned To Dribble

This Adorable Baby T-Rex AI Learned To Dribble

Two Minute Papers explores the paper "MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies" in this video. Details
Superhuman Poker AI That Was Trained in 20 Hours

Superhuman Poker AI That Was Trained in 20 Hours

Two Minute Papers explores the paper “Superhuman AI for Multiplayer Poker” and why poker presents a formidable challenge to AI system in this video. Details