David Li
I am an engineer at Columnar Technologies. Previously I worked at Voltron Data (formerly known as Ursa Computing) and Two Sigma Investments. I'm a longtime open source contributor; I have worked on the Apache Arrow project since 2019 and was one of the creators of the ADBC subproject. I am currently a PMC member of the Arrow project.
Session
Data scientists and AI engineers today have more tools than ever at their disposal. But more choices has also led to fragmentation, even when it comes to the basic task of loading data. With Apache Arrow ADBC (Arrow Database Connectivity), anyone working with databases in Python can access systems from BigQuery to PostgreSQL to Snowflake using familiar DB-API interfaces and maximum performance. We’ll show how ADBC makes it easy and fast to work with different data sources and different engines (like pandas and Polars). We’ll also look under the hood, with an introduction to the Apache Arrow ecosystem at the heart of the modern data science and AI ecosystems.