PyCon UK 2022

Training, Deploying, and Running a ML model using Python and Snowpark
2022-09-16 , Room C

In this session, we will train a ML model to predict ROI of variable advertising spend budgets across multiple channels including search, video, social media, and email using Snowpark for Python and scikit-learn.


In this session, we will train a Linear Regression model to predict future ROI (Return On Investment) of variable advertising spend budgets across multiple channels including search, video, social media, and email using Snowpark for Python and scikit-learn. By the end of the session, you will have an interactive web application deployed visualizing the ROI of different allocated advertising spend budgets.

During this hands-on session, we will:

  • Set up your favorite IDE (e.g. Jupyter, VSCode) for Snowpark and ML
  • Analyze data and perform data engineering tasks using Snowpark DataFrames
  • Use open-source Python libraries from a curated Anaconda channel with near-zero maintenance or overhead
  • Deploy ML model training code to Snowflake using Python Stored Procedures
  • Create and register Python User-Defined Functions (UDFs) for inference
  • Create Streamlit web application that uses the UDF for real-time prediction

With experience in big data, data science, and machine learning Dash is able to apply 18+ years of full-stack, hands-on software engineering skills to help build solutions that solve business problems and surface trends that shape markets in new ways than imagined before. As a developer advocate, he is passionate about evaluating new ideas, and helping articulate how technology can address a given business problem.

Dash has worked for global enterprises and in agile environments–for tech startups in the Bay Area in varying verticals, such as VoIP, Online Gaming, Digital Health, NoSQL database, and Data Cloud platforms.