Johannes Dröge PyCon DE & PyData 2026

Johannes Dröge

Johannes holds a PhD in computer science, has developed open-source software, algorithms, and statistical methods for genome data analysis, worked as a data scientist, and led a group of data engineers in a mid-sized startup. He is currently bootstrapping SaaS infrastructure software projects with a focus on cross-organizational data sharing.

Homepage:

https://www.parsimonit.com

LinkedIn:

https://www.linkedin.com/in/johdro/

GitHub:

https://github.com/fungs/

Session

04-15

15:00

45min

Beyond Kafka and S3: HTTP-Native Bytestreams for Python Data Pipelines

Johannes Dröge

Setting up real-time bytestreams between systems in different organizations or secured environments, whether for batch dataset delivery or continuous streaming, is surprisingly hard. Traditional solutions fall short: message brokers like Kafka exchange discrete messages rather than continuous streams, file storage like S3 works for batch exchange but lacks streaming and coordination, and HTTP client-server approaches require one side to host and expose server endpoints, which adds security and operational overhead.

This talk introduces the ZebraStream Protocol: an open, HTTP-based bytestream protocol with coordination mechanisms that let you stream data—Parquet files, compressed archives, encrypted content—directly between decoupled systems using Python's file-like interface. No message framing, no server hosting, no exposed endpoints.

We'll explore the design of a bytestream protocol for data sharing and integration that crosses the file-stream boundary. This enables seamless integration with pandas, DuckDB, and any Python library expecting file-like objects, and supports use cases ranging from ETL pipelines and IoT data delivery to cross-org collaboration and home network automation.
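
As a rough illustration of what "file-like" means here, the sketch below passes compressed CSV data through a plain binary stream using only the standard library. It does not use the ZebraStream API, which this abstract does not show: `io.BytesIO` merely stands in for a network bytestream, and the names `produce` and `consume` are hypothetical.

```python
import csv
import gzip
import io


def produce(sink):
    """Write gzip-compressed CSV rows into any writable binary file-like object."""
    with gzip.GzipFile(fileobj=sink, mode="wb") as gz:
        text = io.TextIOWrapper(gz, encoding="utf-8", newline="")
        writer = csv.writer(text)
        writer.writerow(["sensor", "value"])
        writer.writerow(["a", "1.5"])
        writer.writerow(["b", "2.5"])
        text.flush()
        text.detach()  # leave closing of the gzip layer to the with-block


def consume(source):
    """Read CSV rows back from any readable binary file-like object."""
    with gzip.GzipFile(fileobj=source, mode="rb") as gz:
        text = io.TextIOWrapper(gz, encoding="utf-8", newline="")
        return list(csv.DictReader(text))


# Assumption: an in-memory buffer plays the role of the stream between systems.
buffer = io.BytesIO()
produce(buffer)
buffer.seek(0)  # rewind so the consumer starts at the first byte
rows = consume(buffer)
```

Neither function knows what sits behind the object it receives, so the same code would work with a file, a socket wrapper, or any other object exposing `read` and `write`. That is the property the abstract relies on for libraries such as pandas and DuckDB.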

PyData: Data Handling & Data Engineering

Palladium [2nd Floor]
