Aleksei Petrenko: SampleFactory and high-throughput reinforcement learning
Data Fest Online 2020
Reinforcement Learning track
Sample Factory: Egocentric 3D Control from Pixels at 100 000 FPS with Asynchronous Reinforcement Learning
The quest for sample efficiency in general-purpose RL algorithms has proven to be rather challenging. The level results in RL has been growing largely due to the increased amount of compute research labs are willing to use in their projects. As a result, SOTA-level results have become increasingly unreachable for regular researchers.
Our goal is to bring the deep RL back to the community by improving the efficiency of training and reducing the cost of data collection. We present the SampleFactory - an on-policy RL training system optimized for speed. By maximizing the hardware utilization of our algorithm we approach 150000 FPS of training on a single machine, 10x faster than many popular frameworks. Our agents trained with SampleFactory APPO approach human level of performance in challenging and immersive 3D games.
Register and get access to the tracks:
Join the community:
3 views
605
141
2 months ago 00:05:04 1
«ПРИПЛЫЛИ» 4 серия/ мини-сериал. Янтарный
3 months ago 00:39:39 3
Aleksei Petrenko: SampleFactory and high-throughput reinforcement learning
6 months ago 00:03:28 1
The Barber of Siberia./ Сибирский цирюльник..
7 months ago 02:11:38 1
King Lear - Grigori Kozintsev - Jüri Järvet - Shakespeare - 1970 - HD Restored - 4K
2 years ago 02:01:19 90
Adiós a Matiora (.)
3 years ago 00:49:35 529
ОФИЦЕРЫ 1 сезон (2006) 8 серий, 5 серия (Актер Алексей Макаров - Егор Осоргин по кличке СТАВР)
5 years ago 00:26:24 9
VizDoom as a Research Platform | Aleksei Petrenko
7 years ago 00:03:29 2.4K
March 26 ♈ Famous BirthDays
10 years ago 02:35:06 1
12 Nikita Mikhalkov FULL MOVIE Двенадцать Никита Михалков