Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call “reinforcement learning“.
Their new paper is called “Reward-free curricula for training robust world models“
Interviewer: Dr. Tim Scarfe
Please support us on Patreon, Tim is now doing MLST full-time and taking a massive financial hit. If you love MLST and want this to continue, please show your support! In return you get access to shows very early and private discord and networking.
We are also looking for show sponsors, please get in touch if interested mlstreettalk at gmail.
MLST Discord:
00:00:00 - Intro
00:01:05 - Mod
1 view
48
15
3 weeks ago 00:02:42 5
Royal & the Serpent - “Wasteland” (from Arcane Season 2) [Official Visualizer]
4 weeks ago 00:20:33 1
Movie “Wicked” Sign of the End Time
4 weeks ago 00:29:28 1
🎥 Can you guess the Anime by the First 10 Seconds? 🔥 Anime Quiz
4 weeks ago 00:04:14 1
Paramore: Decode [OFFICIAL VIDEO]
4 weeks ago 00:00:32 1
…but the people are retarded
4 weeks ago 00:02:36 5
Jingle Bells | Christmas Song | Super Simple Songs
4 weeks ago 00:03:02 1
Milk & Cookies | Holiday Song for Kids | Rhymington Square
4 weeks ago 00:02:59 1
Monster Girl Quest Paradox Part 3 OST - Ending 4-3
4 weeks ago 00:12:48 1
The Brand New STIHL MS400.1! WHY is this CHAINSAW different?
4 weeks ago 00:04:12 2
Stuart Townend - Christ Be In My Waking
4 weeks ago 00:08:10 1
AI Agents Will Create MILLIONAIRES in 2025 – Are You Ready
1 month ago 00:03:04 1
Arena Of Valor Hack - How to Get Unlimited Vouchers! iOS Android