Joris Van den Bossche

I am a core contributor to pandas and Apache Arrow, and a maintainer of GeoPandas. I did a PhD at Ghent University and VITO in air quality research, and worked at the Paris-Saclay Center for Data Science. Currently, I work at Voltron Data, contributing to Apache Arrow, and am a freelance teacher of Python (pandas).


Institute / Company

Voltron Data

Git*hub|lab

github.com/jorisvandenbossche

Homepage

https://jorisvandenbossche.github.io/

Twitter handle

@jorisvdbossche


Sessions

08-16
11:05
30min
Pandas 2.0 and beyond
Joris Van den Bossche, Richard Shadrach

Pandas reached the 2.0 milestone in 2023. But what does that mean? And what is coming after 2.0? This talk will give an overview of what happened in the latest releases of pandas and highlight some topics and major new features the pandas project is working on.

Data Science and Visualisation
Aula
08-17
10:30
90min
Interoperability in the Scientific Python Ecosystem
Tim Head, Mridul Seth, Olivier Grisel, Franck Charras, Sebastian Berg, Joris Van den Bossche

This slot covers ongoing efforts toward interoperability in the scientific Python ecosystem. Topics:

  • Using the Array API for array-producing and array-consuming libraries
  • DataFrame interchange and namespace APIs
  • Apache Arrow: connecting and accelerating dataframe libraries across the PyData ecosystem
  • Entry Points: Enabling backends and plugins for your libraries

Using the Array API for array-producing and array-consuming libraries

Already using the Array API or wondering if you should in a project you maintain? Join this maintainer track session to share your experience and exchange knowledge and tips around building array libraries that implement the standard or libraries that consume arrays.

DataFrame-agnostic code using the DataFrame API standard

The DataFrame Standard provides a minimal, strict, and predictable API for writing code that works regardless of whether the caller uses pandas, polars, or some other library.

DataFrame Interchange protocol and Apache Arrow

The DataFrame interchange protocol and the Arrow C Data Interface are two ways to interchange data between dataframe libraries. What are the challenges and requirements that maintainers encounter when integrating these into consuming libraries?
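As a rough illustration of the consumer side of the interchange protocol, the sketch below converts an object that exposes `__dataframe__` into a pandas DataFrame. It assumes pandas 1.5 or later is installed. `ForeignFrame` and `to_pandas` are hypothetical names invented for this example; `ForeignFrame` merely stands in for a dataframe from another library and is not part of pandas or of the protocol.

```python
import pandas as pd
from pandas.api.interchange import from_dataframe


class ForeignFrame:
    """Hypothetical stand-in for a dataframe object from another library.

    It exposes only the interchange entry point, ``__dataframe__``,
    and delegates to a wrapped pandas DataFrame to keep the sketch short.
    """

    def __init__(self, df):
        self._df = df

    def __dataframe__(self, allow_copy=True):
        # Return the interchange object produced by the wrapped frame.
        return self._df.__dataframe__(allow_copy=allow_copy)


def to_pandas(obj):
    """Convert any object that supports the interchange protocol to pandas."""
    if not hasattr(obj, "__dataframe__"):
        raise TypeError("object does not support the DataFrame interchange protocol")
    return from_dataframe(obj)


foreign = ForeignFrame(pd.DataFrame({"a": [1, 2, 3], "b": [0.5, 1.5, 2.5]}))
result = to_pandas(foreign)
print(result)
```

A consuming library written this way never needs to import the producer's package; it only checks for the `__dataframe__` method.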

Entry Points: Enabling backends and plugins for your libraries

In this talk, we will discuss how NetworkX used entry points to enable more efficient computation backends to plug into NetworkX.
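For readers unfamiliar with the mechanism, here is a minimal sketch of how a library might discover backends through entry points, using only the standard library. The group name `mylib.backends` and the helpers `discover_backends` and `load_backend` are hypothetical and chosen for illustration; this is not NetworkX's actual implementation.

```python
from importlib.metadata import entry_points

# Hypothetical group name. A backend package would register itself under it,
# for example in its pyproject.toml:
#   [project.entry-points."mylib.backends"]
#   fast = "fast_backend:Backend"
BACKEND_GROUP = "mylib.backends"


def discover_backends(group=BACKEND_GROUP):
    """Return a dict mapping entry point names to entry points in `group`."""
    eps = entry_points()
    if hasattr(eps, "select"):
        # Python 3.10+: selectable entry points
        found = eps.select(group=group)
    else:
        # Python 3.8 and 3.9: a plain dict keyed by group name
        found = eps.get(group, [])
    return {ep.name: ep for ep in found}


def load_backend(name, group=BACKEND_GROUP):
    """Import and return the object registered under `name`."""
    backends = discover_backends(group)
    if name not in backends:
        raise LookupError(f"no backend named {name!r} in group {group!r}")
    # Importing happens only here, so unused backends cost nothing.
    return backends[name].load()


print(sorted(discover_backends()))
```

Discovery only reads installed package metadata; a backend module is imported when `load_backend` is called, which keeps the core library's import time independent of how many backends are installed.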

Scientific Applications
HS 119 - Maintainer track