2026-04-14 –, Platinum [2nd Floor]
Writing talks is hard, but being a good conference speaker is even harder. As a result, this talk is recursive: I'll take a talk previously written for a London data science meetup on using Apache Spark and Apache Kafka to build ML data processing pipelines, and revamp it using Snowflake's Cortex Code CLI!
In this talk, we'll walk through a basic Apache Spark data pipeline which reads in an image dataset, processes it, and detects raccoons. That said, sponsored talks are always boring: let's see what we can do to spice things up using AI! Together, we'll use Snowflake's Cortex Code CLI coding agent to improve the talk live, taking suggestions from the audience as we go!
Attendees can expect to learn the following:
- What Apache Spark is, what it excels at, and how to set up a basic cluster
- How to use a Hugging Face ViT (vision transformer) model to run a basic computer vision setup
- A little bit about Snowflake's new coding agent, Cortex Code CLI (the part where we advertise at you, but I promise it will be fun)
- Building a basic Streamlit app
- ... and whatever other fun we get up to together!
Join us for a session full of fun experimentation with interesting tools – and learn a bit about data pipelines too! This session is suitable for beginners and intermediates!
Celeste Horgan is a Sr. OSS Developer Advocate and OSPO Lead at Snowflake. Previous roles include work at Aiven, The Linux Foundation, Stripe and commercetools. She has worked in open source since 2020, is a former contributor to the Kubernetes project, and is currently immersed in the Postgres open source ecosystem. Her work has been featured in the New York Times and she regularly speaks internationally at technical conferences.