EuroSciPy 2025

Michał Szczepanik


Affiliation

Institute of Neuroscience and Medicine, Brain and Behaviour (INM‑7), Forschungszentrum Jülich

Position / Job

postdoc

Homepage

https://mszczepanik.eu

GitHub/GitLab profile URL

https://github.com/mslw/

Photo euroscipy-2025/question_uploads/IMG_3853_512_SpwVb9o.jpg

Session

08-19
15:30
90min
Managing Scientific Data and Workflows with DataLad
Ole Bialas, Michał Szczepanik

The flourishing of open science has created an unprecedented opportunity for scientific discovery through the global exchange of data and collaboration between researchers. DataLad (datalad.org) supports this by providing the tools to develop flexible and decentralized collaborative workflows while upholding scientific rigor. It is free and open source data management software, built on top of the version control systems Git and git-annex. Among its major features are version control for files of any size or type, data transport logistics, and digital process provenance capture for reproducible digital transformations.
In this hands-on workshop, we will start by exploring DataLad’s basic functionality and learn how to run and re-run analyses while versioning and keeping track of your data. Following this, we will explore DataLad’s collaborative features and learn how to install and work with existing datasets and how to share and distribute your work online. After completing this tutorial, you will be equipped to start using DataLad to manage your own research projects and share them with the world.

Interdisciplinary Frontiers and other Scientific Python Applications
Small room