BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//pretalx.com//euroscipy-2026//talk//MLEJZS
BEGIN:VTIMEZONE
TZID:CET
BEGIN:STANDARD
DTSTART:20001029T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:CET
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:CEST
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-euroscipy-2026-MLEJZS@pretalx.com
DTSTART;TZID=CET:20260721T093000
DTEND;TZID=CET:20260721T100000
DESCRIPTION:Skrub is a package that eases preparing dataframes so they can 
 be used in machine-learning tasks. In practice\, data can be spread over m
 ultiple tables\, represent various types of information (tabular\, textual
 \, graphical)\, or be stored on external database systems rather than data
 frames. \n\nSkrub Data Ops help with constructing versatile pipelines that
  can handle this variety of scenarios\, while at the same time avoiding da
 ta leakage and allowing to build rich hyper-parameter grids that can be ex
 plored to maximize the performance of the final machine learning model. \n
 \nIn this talk\, we give a brief introduction of the Data Ops framework be
 fore presenting three separate use cases highlighting their versatility: a
  traditional machine learning pipeline that uses Optuna to perform hyper-p
 arameter tuning\, a pipeline that trains on data stored in a relational da
 tabase rather than a dataframe\, and an image classification task with Pyt
 orch. \n\nBy the end of the talk\, attendees will learn about the skrub Da
 ta Ops\,  their main features and how they can be used successfully in dif
 ferent practical scenarios.
DTSTAMP:20260603T195528Z
LOCATION:Room 1.38 (Ground Floor\, Turing)
SUMMARY:How to use skrub Data Ops in practice - Riccardo Cappuzzo\, Guillau
 me Lemaitre
URL:https://pretalx.com/euroscipy-2026/talk/MLEJZS/
END:VEVENT
END:VCALENDAR
