BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//pretalx.com//pycon-lt-2023//talk//NBRSE9
BEGIN:VTIMEZONE
TZID:EET
BEGIN:STANDARD
DTSTART:20000101T000000
RRULE:FREQ=YEARLY;BYMONTH=1;UNTIL=20011231T220000Z
TZNAME:EET
TZOFFSETFROM:+0200
TZOFFSETTO:+0200
END:STANDARD
BEGIN:STANDARD
DTSTART:20031026T050000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:EET
TZOFFSETFROM:+0300
TZOFFSETTO:+0200
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20030330T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:EEST
TZOFFSETFROM:+0200
TZOFFSETTO:+0300
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-pycon-lt-2023-NBRSE9@pretalx.com
DTSTART;TZID=EET:20230519T140000
DTEND;TZID=EET:20230519T153000
DESCRIPTION:Are you struggling with big data in your business? Join us to d
 iscover how PySpark can help you solve your problems efficiently and effec
 tively. In this workshop\, we will revisit the key concepts of PySpark\, i
 ncluding parallel processing and lazy evaluation. We will explore DataFram
 es as a convenient layer of so called RDDs and work with an optimizer to g
 et the most out of our transformations. \n\nWe'll also take a look the Spa
 rk UI\, which allows us to monitor and optimize our processes. To put our 
 knowledge into practice\, we'll simulate a business problem and walk throu
 gh the entire process of data preparation (preprocessing)\, training a mod
 el with MLLib\, and performing inference on preprocessed test data. We'll 
 also add a business logic layer to our solution for further customization 
 (postprocessing). \n\nOptional content includes lessons learned from large
 -scale production systems based on PySpark. We'll share insights on how to
  optimize performance and scale your solution to handle big data with ease
 .
DTSTAMP:20260307T214610Z
LOCATION:Coral A - Workshop
SUMMARY:Unlocking the Power of PySpark: A Comprehensive Workshop - Carsten 
 Frommhold
URL:https://pretalx.com/pycon-lt-2023/talk/NBRSE9/
END:VEVENT
END:VCALENDAR
