BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//pretalx.com//euroscipy-2026//talk//VPYLDF
BEGIN:VTIMEZONE
TZID:CET
BEGIN:STANDARD
DTSTART:20001029T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:CET
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:CEST
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-euroscipy-2026-VPYLDF@pretalx.com
DTSTART;TZID=CET:20260720T121000
DTEND;TZID=CET:20260720T123000
DESCRIPTION:Your GPU is fast\, so why does your Python code still feel slow
 ? This talk shows a practical\, Python-first profiling workflow with Nsigh
 t Systems\, Nsight Compute\, and NVTX for CuPy\, Numba\, PyTorch\, JAX\, a
 nd CUDA extensions. We will use timelines to find launch overhead\, hidden
  synchronizations\, and host-device copies\, then drill into kernel bottle
 necks like memory throughput and occupancy. You will leave with a repeatab
 le loop for turning profiles into measurable speedups.
DTSTAMP:20260603T201611Z
LOCATION:Room 2.41 (First Floor\, Turing)
SUMMARY:Profiling Python GPU Code - Bryce Adelstein Lelbach
URL:https://pretalx.com/euroscipy-2026/talk/VPYLDF/
END:VEVENT
END:VCALENDAR
