2024-04-05 –, Room 111
Polars conquered dataframes, and now it is coming for machine learning! With Polars-powered feature-extraction and a best-of-the-class set of diagnostic tools, functime enables forecasting thousands of time series all at once, from the comfort of your laptop.
Though forecasting practitioners are the intended audience, the talk has something for every data scientist. With Polars, we can push the boundary for what "reasonable scale" means - and build a new generation of tools for machine learning.
Polars is mature, production ready, intuitive to write and pleasant to read. And it's fast. Thanks to Rust and Rayon, you can achieve speeds greater than numba's. If you combine it with top-of-the-class evaluation methods, not only can you get speedups of about 1-2x order of magnitude in feature engineering and cross-validation, but also dramatically improve your development workflow.
That's what we set out to demonstrate with functime. We chose to write a time-series library first, because forecasting can be a costly undertaking, with significant problems of scale. Making predictions with big panel datasets usually required fitting thousands of univariate models, one at a time, using distributed systems. On the other hand, functime unlocks an efficient forecasting workflow, from your laptop.
📍Intended audience. This talk is a hands-on demonstration for forecasting practitioners and data scientists alike. It will showcase how to build clean and performant forecasting pipelines with rich feature-engineering capabilities - enabling a seamless and more efficient modelling workflow.
Nevertheless, the principles behind functime can be grasped by every machine learning practitioner: forecasting is just a use-case to shows off Polars' potential. With Polars, we can improve the current state of machine learning modelling and raise the ceiling for what reasonable scales means.
🚩 Talk outline
• minutes 0-3. Problem setting: the current problem with forecasting.
• minutes 3-7. What is Polars and why it is so fast.
• minutes 7-10. What is global forecasting and why it is so effective.
• minutes 10-20. A simple fit-evaluate modelling workflow.
• minutes 20-25. An advanced workflow with blazingly fast feature extractors and cross-validation.
• minutes 25-30. Wrap up and QA.
ML Engineer