Multimodal Question Answering for Language and Vision (Richard Socher, Founder & CEO, MetaMind)

This presentation took place at the RE•WORK Deep Learning Summit in San Francisco on 28-29 January 2016.

Deep learning has made tremendous breakthroughs possible in visual understanding and speech recognition. Ostensibly, this is not the case in natural language processing (NLP) and higher-level reasoning. However, it only appears that way because there are so many different tasks in NLP and no singl
