PyCon Lithuania 2024

Einat Orr

Oz Katz is the Co-Creator of the open source lakeFS Project, an open source platform that delivers resilience and manageability to object-storage based data lakes, as well as the CTO and co-founder of Treeverse, the company behind lakeFS. Oz engineered and maintained petabyte-scale data infrastructure at analytics giant SmilarWeb, which he joined after the acquisition of Swayy.


Twitter handle. For example (@handle-name)

https://x.com/ozkatz100

Notable open source projects that you contribute to. Add URLs, one per line.

lakeFS, co-creator


Session

04-05
15:00
25min
Data Version Control Done Right with Python and Unity
Einat Orr, Nir Ozeri

Python is a leading language of choice for the Databricks and ML ecosystem, alongside a delta tables stack leveraging Unity catalog to manage petabytes of structured data. To build and experiment with ML data and models, version control has become the backbone of modern machine learning (ML) projects, bringing critical aspects of reproducibility and experimentation to teams who are able to experiment in isolation, while still collaborating on projects.

Data
Room 203