PyCon DE & PyData 2026

Subhosri Basu

I am a GenAI researcher at Fraunhofer Institute, Germany. Born in India, I decided to move to Germany in search of new challenges. My professional journey has been shaped by a passion to solve problems in various domains. Academically, I have graduated with a Master's degree from the department of electrical and computer science. My focus has always been around statistics. I have been able to work on projects related to artificial intelligence and deep learning, especially in the field of signal processing and imaging. With my experience, I want to guide the growth of next generation of ML researcher. When I am not working, you will find me exploring Europe.


Session

04-14
16:30
30min
Catch the LLM if you Can: Watermarking LLMs
Subhosri Basu

With Large Language Models (LLMs), generating high-quality text and images is easy and so is
misusing it. As AI-generated content becomes harder to distinguish from human generated content,
developers are increasingly asking: How can we verify whether a piece of text comes from an LLM?
We’ll explore Python’s simplicity and rich ecosystem of libraries to solve this problem.

This talk introduces the foundations of LLM watermarking and shows how developers can implement
these techniques entirely in Python. We’ll discuss two core approaches, EXP sampling method and
KGW method. We will walk through the implementation of the KGW method using simple,
transparent code, and compare it with the EXP approach. There's no need for a large model or a GPU
cluster to understand how these systems work and the core ideas can be implemented in pure
Python using simple code. The code repositories, which includes both methods will be provided so
that the attendees can follow along.

Along the way, we’ll discuss the trade-offs and the limitations of current research. And for those
wondering, “Do I have to implement all this myself?”, the talk concludes with a demo of MarkLLM, an
existing open-source toolkit that provides a unified Python interface for experimenting with
watermarking algorithms.

Attendees will leave with a clear understanding of how watermarking works, when it’s useful, and
how to integrate these techniques into real-world Python projects.

General: Ethics & Privacy
Dynamicum [Ground Floor]