Ada is a Rust and Python developer. She is a tool builder in various areas, including schedulers for clusters, tools to support AI experiments, or a tool to create slides . She is an active contributor to several open source projects. Ada has a PhD in Computer Science.
- Breaking the Constraints of Linear Notebook Environments
I am a fifth-year computer science student at AGH University of Science and Technology (AGH UST) in Kraków, Poland, where I am currently conducting research in the fields of Natural Language Processing (NLP) and Chemoinformatics. My academic work is focused on developing innovative solutions that leverage computational techniques to analyze language data and chemical information. In addition to my studies, I am professionally working as a backend engineer, where I develop, and maintain server-side applications, ensuring scalability, efficiency, and reliability of systems. This combination of research and industry experience allows me to stay on the cutting edge of technology while applying practical solutions to real-world problems.
- How To Accelerate Molecular Insights - Efficient Distance Calculations In Python
Hi, I am Aditi. I mostly work around API dispatching things in the scientific python ecosystem, mostly in NetworkX, nx-parallel backend and scikit-image.
GitHub: https://github.com/Schefflera-Arboricola
Previous talks: https://github.com/Schefflera-Arboricola/blogs/tree/main/archive
- Understanding Dispatching Approaches in the Scientific Python Ecosystem
I am a PhD student in the Environmental Physics Group (zfs.agh.edu.pl/en) at the Faculty of Physics and Applied Computer Science, AGH University of Krakow. My research interests lie at the intersection between cloud microphysics and climate-science applications of water isotopes. I am a member of the github.com/open-atmos community and the maintainer of PySDM project.
- Can Your Research Notebook Survive a Code Update? On maintaining Reproducibility with Continuous Integration
I am the developer of the open-source libraries https://github.com/uxlfoundation/oneDAL and https://github.com/uxlfoundation/scikit-learn-intelex , which provide optimized implementations of classical machine learning algorithms.
- Using Cython and C++ kernels to speed up Python libraries
After obtaining her master degree in technomathematics in 2013, Anna Lührs joined the (former) division HPC in Neuroscience at Jülich Supercomputing Centre (JSC), for which she also acted as deputy lead, initially as research associate focusing on image segmentation on GPU clusters. She shifted her focus towards project management, research coordination and science communication for her department and the Human Brain Project, an EU-funded project with more than 100 project partners and a total duration of 10 years. Anna now works in and is deputy lead of the Office for (Inter)national Coordination and Networking at the JSC, which she joined 2023.
- Women in HPC – Breaking Barriers and Shaping the Future
I am an Associate Professor in AI and Robotics at Aivancity based in Paris, France. I got my PhD from the University of Galway in Ireland in Electrical and Electronic Engineering. I then worked at ENS Lyon in collaboration with Inria and Inrae on deep learning for 3D biological image analysis, then joined the Paris Brain Institute with Inria on deep learning for data analysis of Alzheimer's patients, and the Pasteur Institute in Paris on applications of deep learning in the field of drug discovery. My research and teaching interests focus on applications of deep learning in computer vision, computational biology and health, as well as human-machine interactions and intelligent systems.
- Automated Chess Analysis: Real-Time Move Detection and Game Narration Using Computer Vision and Large Language Models
- Guardians of Science: A Python Tutorial on a RAG-Powered Compliance Plug-In and Ethical AI tools
- Automated Chess Analysis: Real-Time Move Detection and Game Narration Using Computer Vision and Large Language Models
- Guardians of Science: A Python Tutorial on a RAG-Powered Compliance Plug-In and Ethical AI tools
After having a career as a Data Scientist and Developer Advocate, Cheuk dedicated her work to the open-source community. Currently, she is working as an AI developer advocate for JetBrains. She has co-founded Humble Data, a beginner Python workshop that has been happening around the world. She has served the EuroPython Society board for two years and is now a fellow and director of the Python Software Foundation.
- Deploy your Machine Learning model with Fast API
Daniel Alejandro Coll Tejeda is a dedicated researcher of Cloud and Distributed Systems Lab at the University Rovira i Virgili (URV), Tarragona. Specializing in cloud computing, his current research involves intensive data analysis and the creation of sophisticated tools designed to optimize the management of diverse cloud infrastructures, encompassing Kubernetes orchestration, serverless computing paradigms (such as AWS Lambda and Lithops), and virtual machine environments.
- Processing Cloud-optimized data in Python (Dataplug)
- Using Cython and C++ kernels to speed up Python libraries
Davide De Marchi is a researcher and software engineer specializing in geospatial big data. He has significant experience in Big Data, Cloud Computing, GIS, Remote Sensing, and Data Visualization. His career includes contributions to the design and implementation of interactive visualization tools, notably at the European Commission - Joint Research Centre where he has been a key developer of the BDAP platform. Earlier in his career, he gained substantial experience in the development of geospatial data processing software and served as an adjunct professor at the University of Urbino
- Voilà Meta-Dashboards for Streamlined Geospatial Data Visualization
Evgeni is a long-term contributor to SciPy and related open-source projects. His native programming language is Fortran, which he used to write physics simulations software; with time he gravitated more and more towards Python. Evgeni's background is computational physics and applied maths, and he currently works at Quansight Labs on expanding the reach of open source software.
- The road(map) towards SciPy 2.0
Florian is Head of Data Science & Mathematical Modeling at inovex GmbH, an IT project center driven by innovation and quality, focusing its services on ‘Digital Transformation’. He holds a PhD in mathematics, has more than 10 years of experience in predictive & prescriptive analytics use-cases and likes everything math 🤯
- Solving Hard Optimization Problems with Pyomo and HiGHS: A Practical Introduction
I am a curious person who studied Physics and Applied Maths. I spent over a year at CERN for my MSc in High Energy Physics. However, I found maths and computer sciences equally fascinating, so I left academia to pursue these fields. Over the years, I developed a passion for handling large datasets and using compression to enable their analysis on commodity hardware accessible to everyone.
I am the CEO of ironArray SLU and also leading the Blosc Development Team. I am very excited in working in providing a way for sharing Blosc2 datasets in the network in an easy and effective way via Caterva2, and Cat2Cloud, a software as a service that we are introducing.
As an Open Source believer, I started the PyTables project more than 20 years ago. After 25 years in this business, I started several other useful open source projects like Blosc, Caterva2 and Btune; those efforts won me two prizes that mean a lot to me:
- 2023: NumFOCUS Project Sustainability Award
- 2017: Google’s Open Source Peer Bonus
You can know more on what I am working on by reading my latest blogs.
- Python-Blosc2: Compress Better, Compute Bigger!
- Compress, Compute, and Conquer: Python-Blosc2 for Efficient Data Analysis
Highly skilled Software Engineer and Data Scientist at Intesa Sanpaolo Bank, based in Milan, Italy. With a strong foundation in computational subjects, I hold a PhD in Physics with a specialization in computational modelling of biophysical processes. Having transitioned to the banking sector, I have applied my expertise to drive innovation in financial-related projects for numerous years. Additionally, I contribute to the open-source community as a maintainer and developer of the SymPy library, a widely-used computer algebra system.
- Enhancing SymPy Algorithms with MatchPy's Efficient Pattern Matching
Frank became a self-employed software developer and consultant while studying Physics in Freiburg, contributing to open-source projects. During his Master's, he specialized in data analysis for particle physics at CERN and obtained a doctoral degree in 2022, working with the ATLAS collaboration. Since 2023, he has been AI Technical Leader and AI Engineer Lead at MDPI, one of the largest open-access publishers.
- Annotating the dynamic: Type Annotation for DataFrames
I have joined to napari project while working on PhD project, as I understand that great visualization significantly improves data understanding, so improves the quality of research. Now I'm shifting into full-time open-source contributor. I'm mainly focusing now on improving tooling to lower entry level.
On university, I teach students about Python.
- Use napari for easier interactive extraction of knowledge from images and other spatial data
I'm chief machine learning officer and open source software engineer at :probabl. I'm a core developer of scikit-learn and imbalanced-learn.
- Skrub: machine learning for dataframes
- Predictive Modeling with Imbalanced Datasets Using Scikit-learn
Igor Tatarnikov is a Research Software Engineer at University College London’s Sainsbury Wellcome Centre, where he aspires to create easy to use software tools for neuroscientists with a focus on image analysis.
Igor holds a BSc in Microbiology and Immunology and an MSc in Neuroscience from the University of British Columbia (Vancouver, Canada), as well as a Bachelor in Computer Science from Dalhousie University (Halifax, Canada). For his MSc, Igor explored the electrophysiological characteristics of genetic mouse models of Parkinson’s disease. Igor’s multidisciplinary background is particularly useful for his current work, where he creates open-source tools for neuroanatomical image analysis.
- The BrainGlobe initiative - image analysis in a common coordinate space.
Jacob Tomlinson is a senior software engineer at NVIDIA. His work involves maintaining open source projects including RAPIDS and Dask. He also tinkers with kr8s in his spare time. He lives in Exeter, UK.
- GPU Python for the Real World: Practical GPU-Accelerated Python with RAPIDS
I studied Physics at Technische Universität Dresden and Freie Universität Berlin.
After achieving my master's degree in 2021, I started working at Technische Universität Berlin in the ELVA project as technical lead and software developer.
- ELVA - Local-First Real-Time Collaboration Apps in Your Terminal
I am a PhD candidate in Computer Science at AGH University of Krakow, and a member of Graph ML and Chemoinformatics Lab at Faculty of Computer Science. My research concerns fair evaluation, graph representation learning, graph classification, chemoinformatics, and molecular property prediction. I'm also interested in time series, NLP, and MLOps, and I'm also teaching all of those things at AGH. I also work at Placewise as Data Science Engineer, focusing on various ML problems in tabular learning, CV and NLP, and their end-to-end MLOps. Beside my professional work, I train Historical European Martial Arts (HEMA) with messer and longsword, and like reading and tabletop RPGs.
- Machine learning for ecotoxicology and bee pesticide toxicity prediction
Jan initially immersed himself in the realms of cognitive science and computational neuroscience. However, he couldn’t resist the siren call of Bayesian machine learning, and his PhD evolved into a mission to enhance the user-friendliness of this complex field. He set out to bridge cutting-edge methods with user-friendly software, making the world of simulation-based inference more accessible for practitioners. In 2024, he joined the TransferLab, ready to continue his journey of making advanced methodologies approachable and transformative.
- Pyro Meets SBI: Unlocking Hierarchical Bayesian Inference for Complex Simulators
- Beyond Likelihoods: Bayesian Parameter Inference for Black-Box Simulators with sbi
Author of optimagic | Head of TransferLab at appliedAI | Expert in numerical optimization | Building Bandsaws, Pizza Ovens and Furniture
- Let's rewrite optimagic from scratch in half an hour and see what we can learn
Jérôme Dockès is a research engineer at Inria and one of the developers of the Skrub and Nilearn python packages.
- Skrub: machine learning for dataframes
Jesse Grabowski is a PhD candidate at Paris 1 Pantheon-Sorbonne. He is also a principal data scientist at PyMC labs, and a core developer of PyMC, Pytensor, and related packages. His area of research includes time series modeling, macroeconomics, and finance.
- Recent Developments in Pytensor, the Successor Package to Theano
Jon is a Machine Learning Engineer specialized in IoT systems. He has a Master in Data Science and a Bachelor in Electronics Engineering, and has published several papers on applied Machine Learning.
He has been contributing to open-source software since 2010.
These days Jon is co-founder and Head of Data Science at Soundsensing, a leading provider of condition monitoring solutions for commercial buildings and HVAC systems.
He is also the creator and maintainer of emlearn, an open-source Machine Learning library for microcontrollers and embedded systems.
- Sensor data processing on microcontrollers with MicroPython
Jost is a Senior Research Software Engineer at King’s College London, where they work on software projects ranging from astrophysics to Trusted Research Environments for analysing medical data. They are a certified Carpentries instructor and regularly develop and deliver training courses.
Jost received a PhD in astroparticle physics from the University of Sheffield and is a maintainer of several widely used supernova neutrino codes, including SNEWPY and sntools.
- Python Profiling and Optimisation—A Training Course for Researchers
Juan Luis (he/him/él) is an Aerospace Engineer with a passion for tech communities and sustainability. He works at QuantumBlack, AI by McKinsey, as Product Manager for Kedro, an open source Python framework for reproducible, maintainable and modular data science code. He has worked as Developer Advocate at Read the Docs, as software engineer in the space, consulting, and banking industries, and as a Python trainer for several private and public entities.
Apart from being a long-time user and contributor to many projects in the scientific Python stack (NumPy, SciPy, Astropy) he has published several open-source packages, the most important one being poliastro, an open-source Python library for interactive astrodynamics used in academia and industry.
Finally, Juan Luis is the founder and former chair of the Python España association, the point of contact for the Spanish Python community, former organizer of PyCon Spain, and current organizer of the PyData Madrid monthly meetups.
- Accelerate your scientific Python code with Rust
Magister of engineering in Computer Science, currently research assistant at University of Szczecin, Institute of Management.
- Introduction to Scikit-learn
Kai is a senior software engineer and open source contributor with a focus on scientific computing and mathematical optimization. He currently works at Cartesian Software in Sydney, Australia, where he develops high-performance tools for solving large-scale linear programming problems. With a background in both software engineering and applied mathematics, his work bridges the gap between research-grade algorithms and production-ready systems.
He is an active member of the scientific Python ecosystem and a passionate advocate for sustainable open source development. His contributions span core libraries, tooling, and infrastructure that support numerical analysis, data workflows, and optimization.
- Maintaining People, Not Just Projects: Attracting and Retaining Talent in FOSS
Likhita Yerra, a Master’s student in AI and Data Science, specializes in Python, computer vision, and large language models. I develop innovative machine learning solutions with PyTorch, TensorFlow, Docker, and Streamlit, passionate about advancing AI and scientific computing for real-world impact.
- Automated Chess Analysis: Real-Time Move Detection and Game Narration Using Computer Vision and Large Language Models
- Guardians of Science: A Python Tutorial on a RAG-Powered Compliance Plug-In and Ethical AI tools
Loïc has a Particle Physics background, which is how he discovered Python towards the end of his PhD.
He is a scikit-learn and joblib core contributor and has been involved in a number of Python open-source projects in the past 10 years, amongst which Pyodide, dask-jobqueue, sphinx-gallery and nilearn.
- PyPI in the face: running jokes that PyPI download stats can play on you
- Use napari for easier interactive extraction of knowledge from images and other spatial data
I am an undergraduate student studying Computer Science & Philosophy at the University of Oxford. Currently, I am a Mentee @ prefix.dev for European Summer of Code, working on Pixi and rattler-build.
I am also a maintainer of SciPy and array-api-extra, a member of the Consortium for Python Data API Standards, and a founding member of quantity-dev.
- A Hitchhiker's Guide to the Array API Standard Ecosystem
- Standardised Quantity/Unit APIs discussion
2019 BS in Physics (Princeton University), cum laude
2020 MSc in Applied Mathematics (University of Edinburgh), with distinction
2024 PhD in Applied Mathematics (Universitat Jaume I), sobresaliente cum laude
- Python-Blosc2: Compress Better, Compute Bigger!
- Compress, Compute, and Conquer: Python-Blosc2 for Efficient Data Analysis
Engineer and PhD student at BMW Group in cooperation with FAU Erlangen-Nuremberg, specializing in radar signal processing and perception with a focus on deep learning. Currently developing a Python framework for large-scale radar data generation.
- Python Framework for Large-Scale Radar Data Generation and Visualization
Applied AI Researcher focussing on uncertainty quantification and Bayesian inference in various industrial settings.
- Beyond Likelihoods: Bayesian Parameter Inference for Black-Box Simulators with sbi
- Managing Scientific Data and Workflows with DataLad
I've been a Python user since 1999, teaching Python professionally since 2004. I am also active in the community, organizing Python conferences such as PyCon DE, EuroSciPy, and BarCamps. I am a PSF Fellow, PSF Community Service Award winner, and chair of the German Python Software Verband.
- Introduction to Python and JupyterLab
- Units next to your Data: Arrays with Scipp
PhD researcher at LMU Munich with a background in software engineering and a M.Sc. degree in physics.
- Introduction to NumPy and DataFrames
I studied Biology at the University of Tübingen, where I first learned how to code using Matlab. Then, I moved to Leipzig, where I did a master’s degree and later a PhD in neurobiology. In my research, I studied how the brain processes sound location using electroencephalography (EEG) and custom experimental setups for spatial audio. During that time, I started using Python and eventually co-authored “slab”, a Python toolbox for psychoacoustic experiments. After my PhD, I moved to the University of Rochester in New York, where I studied how the brain processes naturalistic speech by modeling EEG that was recorded while the participants listened to audiobooks. For this research I published another toolbox, originally written in Matlab, called “mTRFpy”. As my postdoc was coming to an end, I was looking for a position where I could combine my interest in neuroscience with my passion for programming. I found such a position at the University Clinic Bonn where I currently work as a research software consultant. In this position, I develop and teach workshops where neuroscience researchers can improve their software skills. I also do one-on-one consulting to help neuroscientists deal with the computational challenges they are faced in their research.
- Managing Scientific Data and Workflows with DataLad
Olivier Grisel is a machine learning engineer at Probabl and a contributor to the scikit-learn library.
- Predictive Modeling with Imbalanced Datasets Using Scikit-learn
I am a data science and computer science student at AGH University of Kraków. My primary interests include machine learning and chemoinformatics.
- Efficient and accurate models for peptide function prediction
Web & mobile security researcher with a few years of experience. MSc in computer sciences. Currently working on network security, including kubernetes infrastructure. In free time doing hackthebox, sharing knowledge and analysing applications in Apple ecosystem.
- How to become a software detective and perform security research
- Industrial-Level Documentation for Scientific Projects
- Recent Developments in Pytensor, the Successor Package to Theano
- Skrub: machine learning for dataframes
Samarth Bachkheti is a geophysicist and AI practitioner specializing in seismic imaging, quantitative interpretation, and geomechanical assessment for offshore energy projects. With expertise in machine learning applications for geoscience, he leads the development of deep-learning tools for seabed imaging and offshore site assessments. His recent work focuses on deep learning for underwater object detection, integrating AI into geophysical workflows to enhance efficiency and accuracy in offshore engineering. He has presented at industry conferences and actively collaborate with research institutions to advance AI adoption in geoscience.
- Python for subsea engineering: A case study on seabed object detection using AI/ML
Sebastian has been a NumPy developer for about 10 years now. After a PhD in phsyics he worked at as a postdoc at the Berkeley Institute for Datascience on NumPy as grants byt the Alfred P. Sloan Foundation and the Gordon and Betty Moore Foundation. Since 2022 he has been a software engineer at NVIDIA where he continues to contribute to NumPy.
- Understanding Dispatching Approaches in the Scientific Python Ecosystem
- Array API and library dispatching
Stefanie Molin is a software engineer at Bloomberg in New York City, where she tackles tough problems in information security, particularly those revolving around data wrangling/visualization, building tools for gathering data, and knowledge sharing. She is also a core developer of numpydoc and the author of “Hands-On Data Analysis with Pandas: A Python data science handbook for data collection, wrangling, analysis, and visualization,” which is currently in its second edition and has been translated into Korean and Chinese. She holds a bachelor’s of science degree in operations research from Columbia University's Fu Foundation School of Engineering and Applied Science, as well as a master’s degree in computer science, with a specialization in machine learning, from Georgia Tech. In her free time, she enjoys traveling the world, inventing new recipes, and learning new languages spoken among both people and computers.
- Beyond the Basics: Data Visualization in Python
I am a scikit-learn core maintainer and work at NVIDIA.
Before working on scikit-learn I helped build mybinder.org and worked on JupyterHub.
Many years ago I was a particle physicist at CERN in Geneva.
- Array API and library dispatching
- ELVA - Local-First Real-Time Collaboration Apps in Your Terminal
Pedro Garcia is professor of the Computer Engineering and Mathematics Department at the University Rovira i Virgili (Spain). He leads he “Cloud and Distributed Systems Lab” research group and coordinates large research european projects. In particular, he leads CloudStars (2023-2027), NearData (2023-2025), CloudSkin (2023-2025), and he participates as partner in EXTRACT (2023-2025). He also coordinated FP7 CloudSpaces (2013-1015), H2020 IOStack (2015-2017) and H2020 CloudButton (2019-2022).
During 2019-2020 he worked as visiting scientist in IBM Watson Research in the Hybrid Clouds group focused on serverless technologies. His research topics are distributed systems, cloud computing, data analytics, software architectures and middleware. He has published more than 100 papers on journals and prestigious conferences (ACM Middleware, IEEE ICDCS, USENIX FAST, ICDE, IMC). He has participated in scientific committees of different conferences like Middleware, CCGRID, CloudCom, CIC, P2P, CLOSER, or WETICE among others. He is currenlty co-organizing the International Workshop on Serverless Computing (WoSC).
- Processing Cloud-optimized data in Python (Dataplug)