Gael Varoquaux- Dirty Data Science Machine Learning On Non Curated Data| PyData Global 2020

Talk Cleaning data to analyze it is a major roadblock to data science. I will discuss two specific problems, missing values and categories which variants and typos, in the context of machine learning. This talk will be on recent publications but give simple solutions in Python. Speaker I am a research director at Inria (French National Computer Science Research Institute), studying machine learning for health, as well as a visiting professor at McGill university. I have a strong academic track record in f
Back to Top