The Spark of Big Data: An Introduction to Apache Spark
04-19, 10:00–10:45 (Europe/Berlin), B09

Get ready to level up your big data processing skills! Join us for an introductory talk on Apache
Spark, the distributed computing system used by tech giants like Netflix and Amazon. We'll
cover PySpark DataFrames and how to use them. Whether you're a Python developer new to
big data or looking to explore new technologies, this talk is for you. You'll gain foundational
knowledge about Apache Spark and its capabilities, and learn how to leverage DataFrames and
SQL APIs to efficiently process large amounts of data. Don't miss out on this opportunity to up
your big data game!


Expected audience expertise: Domain

None

Expected audience expertise: Python

Intermediate

Abstract as a tweet

Spark your big data skills! Learn Apache Spark basics: data frames, SQL APIs, and merging data for Python devs new to big data & tech explorers. Don't miss out! #ApacheSpark #BigData #Python

Pasha Finkelshteyn is a developer advocate for data engineering at JetBrains with more than a
decade of experience in the industry. He has a passion for making big data processing
accessible to all and has spent most of his career working with the JVM. Pasha later
switched to data engineering, where he discovered the power of Python