PyConDE & PyData Berlin 2024

Katharine Jarmul

Katharine Jarmul is a privacy activist and data scientist whose work and research focus on privacy and security in data science workflows. She works as a Principal Data Scientist at Thoughtworks and is the author of Practical Data Privacy. She is a passionate and internationally recognized data scientist, programmer, and lecturer.


Sessions

04-22
16:10
45min
Your Model _Probably_ Memorized the Training Data
Katharine Jarmul

I know you probably don't want to hear about it, but your deep learning model probably memorized some of its training data. In this talk, we'll review active research on deep learning and memorization, particularly for large models such as large language and multi-modal models.

We'll also explore when this memorization is actually desired (and why), as well as the threat vectors and legal risks of using models that have memorized training data. We'll also look at potential privacy protections that could address some of these issues, and at how to embrace memorization by thinking through different types of models and their use.

PyData: Machine Learning & Deep Learning & Stats
B07-B08
04-24
13:10
60min
(PyLadies Panel) Reflecting Within: Challenging Narratives in Tech Feminism
Paloma Oliveira, Katharine Jarmul, Cheuk Ting Ho, Naa Ashiorkor Nortey

For the third year in a row, the PyLadies Panel at PyCon PyData engages with a broader audience on critical issues related to gender disparities, ethics, and the ongoing importance of women-focused tech groups. Adopting unconventional formats, the PyLadies Panel aims to foster meaningful discussions among PyLadies members and the Python community, encouraging open dialogue and community solidarity.

Plenary
Kuppelsaal