Reinforcement Learning 4: Непрерывные состояния. Метод DQN. Продолжение.
Продолжаем обсуждать метод глубокого Q-обучения (DQN), рассмотрим программную реализацию DQN, ознакомимся с библиотекой Ray.
We continue to discuss the method of deep Q-learning (DQN), consider the software implementation of DQN, get acquainted with the Ray library.
00:00:00 Начало видео
00:00:12 Нейронная сеть для Q-функции
00:00:38 Deep Q-Network (DQN) method
00:01:41 DQN, DeepMind (2015)
00:06:56 DQN: Memory buffer (replay memory)
00:09:00 DQN: Один эпизод
00:16:38 DQN: Обучение
00:23:21 DQN: Гиперпараметры
00:33:42 Проблема нестабильности
00:38:08 DQN: Непрерывные действия
00:44:34 Double DQN
00:46:18 Dueling DQN
00:50:49 Библиотека Ray (rrlib)
00:55:00 Ray: Инициализация
00:57:32 Ray: Обучение
00:59:00 Ray: Mountain Car
01:02:17 Ray: Включаешь - не работает
01:09:56 Lunar Lander
01:12:55 Ray: А иногда работает
01:26:10 Lunar Lander
Ukrainian IT-company. Machine Learning | Data Science | Artificial Intelligence
#artificialintelligence
#MachineLearning #ReinforcementLearning
#ИскусственныйИнтеллект #Машинноеобучение
5 views
9
0
2 months ago 00:02:02 12
DayZ Update Teaser
2 months ago 00:38:19 1
L’horreur existentielle de l’usine à trombones.
2 months ago 00:01:29 1
Introducing the World’s Coolest Humanoid Robot — EngineAI SE01!
2 months ago 00:25:48 1
What do tech pioneers think about the AI revolution? - BBC World Service
2 months ago 00:19:32 1
How to Make a Carbon Fiber Car Bonnet/Hood - Part 2/3 : Resin Infusion
2 months ago 00:01:44 1
Unitree Introducing | Unitree G1 Humanoid Agent | AI Avatar | Price from $16K
2 months ago 00:02:50 1
LimX Dynamics’ Biped Robot P1 Conquers the Wild Based on Reinforcement Learning
2 months ago 00:32:28 1
Free English Class! Topic: Our Daily Routines! 🐕⏰🥙 (Lesson Only)
2 months ago 00:11:18 1
Learn How To Talk About Your Daily Routine in English Part 2
2 months ago 00:07:45 1
Learn How To Talk About Your Daily Routine in English by Watching Me Act Out Mine
2 months ago 00:14:45 1
Как установить Stable Diffusion 3.5 Large и Turbo на компьютер? Пошаговая инструкция для Windows.
2 months ago 00:02:15 1
Number and Counting song | Learn Counting to 1000 | Math for 2nd Grade | Kids Academy
2 months ago 00:01:58 1
Morse Code Alphabet Receiving Practice (1)
2 months ago 00:09:12 3
Learn English Through Story Level 1, Graded Reader Level 1, Stories Short Beginners, Basic English
2 months ago 00:29:52 7
[LeatherCraft] Baguette Bag 4K / FREE PDF PATTERN
3 months ago 00:59:40 1
’Little Learning Machines’ Postmortem: A Game About Training Neural Networks
3 months ago 01:08:43 1
#dobetter Podcast Episode 4: Learned Behavior during Extinction
3 months ago 00:01:11 3
MEVIUS: A Quadruped Robot Easily Constructed through E-Commerce (Humanoids 2024)
3 months ago 00:11:46 1
Jobs and Occupations - Vocabulary for Kids - Compilation
3 months ago 00:02:28 1
Yamaha | Artist Profile | Krissy Morash of Escuela Grind
3 months ago 00:04:08 1
Wild animals for kids - Vocabulary for kids
3 months ago 00:10:12 1
Let’s Learn English Around the House and Home | English Video with Subtitles
3 months ago 00:32:16 1
The Teacher Series #10
3 months ago 00:03:30 1
The Ancient Egypt - 5 things you should know - History for kids