Riccardo Cappuzzo
I am a research engineer at Inria, part of P16 and of the SODA research team. I am the lead developer of the skrub Python package and spend most of my time on it, but I am also interested in research on tabular learning and tabular foundation models.
Session
Skrub is an open-source package that simplifies machine learning with dataframes. It provides tools to explore, prepare, and feature-engineer dataframes so they can be integrated into scikit-learn pipelines. Skrub DataOps let users build extensive multi-table wrangling plans, explore hyperparameter spaces, and export the resulting objects for deployment.
Through code examples and demonstrations, the talk showcases use cases where skrub simplifies a data scientist's work, from data preparation to deployment.