Hyukjin Kwon

Hyukjin is a Databricks software engineer as the tech-lead in OSS PySpark team, Apache Spark PMC member and committer, working on many different areas in Apache Spark such as PySpark, Spark SQL, SparkR, infrastructure, etc. He is the top contributor in Apache Spark, and leads efforts such as Project Zen, Pandas API on Spark, and Python Spark Connect.


Institute / Company

Databricks

Git*hub|lab

https://github.com/HyukjinKwon


Session

08-17
16:05
30min
Scaling pandas to any size with PySpark
Hyukjin Kwon, Allan Folting

This talk discusses using the pandas API on Apache Spark to handle big data, and the introduction of Pandas Function APIs. Presented by an Apache Spark committer and a product manager, it offers technical and managerial insights.

High Performance Computing
Aula