Reinforcement Learning 5: Методы на основе политики агента
В этом видео разберемся с новой группой методов, которые основаны непосредственно на политике агента. Познакомимся с методом REINFORCE, рассмотрим комбинацию алгоритмов Actor Critic, основанных на значениях, похожих на Policy Gradient и Q-Learning.
In this video, we will understand a new group of methods that are based directly on the agent’s policy. Let’s get acquainted with the REINFORCE method, consider a combination of Actor Critic algorithms based on values similar to Policy Gradient and Q-Learning.
00:00:00 Начало видео
00:01:05 Deep Q-Network (DQN) method
00:03:26 Policy function
00:05:34 Policy Gradients method
00:17:14 Метод REINFORCE
00:23:51 Actor-Critic
00:25:05 A2C (Advantage Actor-Critic)
00:34:00 A3C (Asynchronous Advantage Actor-Critic)
00:45:40 Actor-Critic for continuous action spaces
00:53:25 Actor-Critic: Model
00:56:58 Actor-Critic: Policy and Training
01:10:07 Mountain Car Continuous
01:14:42 Actor-Critic: Гиперпараметры
Ukrainian IT-company. Machine Learning | Data Science | Artificial Intelligence
#artificialintelligence
#MachineLearning #ReinforcementLearning
#ИскусственныйИнтеллект #Машинноеобучение
8 views
5
2
7 months ago 00:02:30 1
Learning the bathroom - Vocabulary for kids
7 months ago 00:03:22 1
Renewable Energy Sources - Types of Energy for Kids
7 months ago 00:01:25 1
UK | United Kingdom | United Kingdom Song | A Geography Song About the UK and its Capitals
7 months ago 00:17:54 1
Petite Pretty Crystal Pendant or Ring - DIY Jewelry Making Tutorial by PotomacBeads
7 months ago 00:04:08 1
Wild animals for kids - Vocabulary for kids
7 months ago 00:00:59 1
A New Approach to Disney’s Robotic Character Pipeline
7 months ago 00:03:06 1
Deluxe Kick & Play Piano Gym from Fisher-Price
7 months ago 00:16:04 1
START TO UNDERSTAND French with a Simple Story (A1-A2)
7 months ago 00:03:41 1
How to Take Care of the Environment - 10 Ways to Take Care of the Environment
7 months ago 00:02:17 1
Prepositions of place for children - The concept of space, for kids - Where things are
7 months ago 00:02:50 1
LimX Dynamics’ Biped Robot P1 Conquers the Wild Based on Reinforcement Learning
7 months ago 00:13:07 1
Learn How to Talk about Age in English! Also, Happy Birthday to Me! 🍰
7 months ago 00:08:06 1
Weather Talk! Learn English Words and Phrases to Talk about the Weather | Video with Subtitles
7 months ago 02:31:11 1
If Christians Did This They’d Never be Spiritually Bound
7 months ago 00:00:49 15
Body Design and Gait Generation of Chair-Type Asymmetrical Tripedal Low-rigidity Robot
7 months ago 00:00:48 1
Cat Taught To Play Piano Using Classical Conditioning || ViralHog
7 months ago 00:03:38 1
Adventure into the Digital Age with Steve Jobs - History for Kids
7 months ago 00:18:12 1
Learning styles. Типы обучающихся, или как найти свой стиль обучения
7 months ago 00:01:09 1
Phonics ee Sound Song | ee Sound | Digraph ee | ee | Phonics Resource | Vowel Digraph
7 months ago 00:09:06 1
It’s Spring! Let’s Learn English Outside! An English Lesson about the Season of Spring
7 months ago 00:08:29 1
Learn English While Baking Cookies with Me in the Kitchen!
7 months ago 00:03:03 1
张碧晨 - 光的方向 | 张碧晨燃情歌唱长歌一生 |《长歌行》片头主题曲MV | The Long Ballad - OST&Opening Song
7 months ago 00:02:34 2
Learning the kitchen - Vocabulary for kids
7 months ago 00:50:44 1
Using EEG & Machine Learning to Perform Lie Detection • Jennifer Marsman • YOW! 2017