tensorflow와 openAI의 gym을 활용하여 강화학습 알고리즘들을 설명하고, 구현해놓은 블로그

Deep Learning 2016. 12. 30. 16:32

안녕하세요.

tensorflow와 openAI의 gym을 활용하여 강화학습 알고리즘들을 설명하고, 구현해놓은 블로그가 있어서 소개드립니다.

https://medium.com/@awjuliani/latest

강화학습 외에도 AI전반에 대해 다루고 계신 분이신데,
내용 및 코드에 대한 설명이 친절하게 잘 되어 있네요.

강화학습 쪽 목차는 다음과 같습니다.

Part 0 — Q-Learning Agents
Part 1 — Two-Armed Bandit
Part 1.5 — Contextual Bandits
Part 2 — Policy-Based Agents
Part 3 — Model-Based RL
Part 4 — Deep Q-Networks and Beyond
Part 5 — Visualizing an Agent’s Thoughts and Actions
Part 6 — Partial Observability and Deep Recurrent Q-Networks
Part 7 — Action-Selection Strategies for Exploration
Part 8 — Asynchronous Actor-Critic Agents (A3C)

Part 4만 잠시 봤는데, double DQN까지도 다루셨더군요.

'Deep Learning' 카테고리의 다른 글

Modern optimization methods from gradient descent family are the key to the majority of modern data-driven solutions. Sebastian Ruder does the great job explaining the modern optimization methods. (0)	2017.01.03
Popular HN Deep Learning Stroies (0)	2016.12.30
인공지능: 변화와 능력개발 Taegyun Jeon (0)	2016.12.27
인공지능, 기계학습 그리고 딥러닝 6 views Jinwon Lee (0)	2016.12.27
Electronic health records could be used for the prediction of the future of patient. (0)	2016.12.26