BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//pretalx.com//pydata-london-2026//talk//BPJEKV
BEGIN:VTIMEZONE
TZID:GMT
BEGIN:STANDARD
DTSTART:20001029T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:GMT
TZOFFSETFROM:+0100
TZOFFSETTO:+0000
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T020000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:BST
TZOFFSETFROM:+0000
TZOFFSETTO:+0100
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-pydata-london-2026-BPJEKV@pretalx.com
DTSTART;TZID=GMT:20260606T102000
DTEND;TZID=GMT:20260606T110500
DESCRIPTION:Adopting a streaming architecture as a Python developer often m
 eans abandoning the tools and abstractions you know: DataFrames\, batch pr
 ocessing\, familiar data workflows\, in favour of an entirely different me
 ntal model. After ten years of tackling this problem across multiple compa
 nies\, I've learned it doesn't have to be that way.\n\nIn this talk\, I'll
  show how to treat Kafka not as a stream of individual messages but as a s
 ource of micro-batches\, and how to deserialize those messages\, whether J
 SON or Protobuf\, into Arrow-backed DataFrames. The result: your processin
 g code looks the same whether the data comes from a Parquet file or a Kafk
 a topic.\n\nNo heavy framework required. Using confluent-kafka and Apache 
 Arrow\, I'll walk through how to build this from the ground up\, so you un
 derstand every layer of the stack.
DTSTAMP:20260602T223427Z
LOCATION:Doddington Forum
SUMMARY:Kafka Streaming\, the Pythonic Way - Arthur Andres
URL:https://pretalx.com/pydata-london-2026/talk/BPJEKV/
END:VEVENT
END:VCALENDAR
