The Rapid Analytics and Model Prototyping (RAMP) framework: tools for collaborative data science challenges
2019-09-04, 16:45–17:00, Track 2 (Baroja)

The RAMP (Rapid Analytics and Model Prototyping) framework provides a platform to organize reproducible and transparent data challenges. We will present the building blocks of the framework.

The framework is split into four packages: RAMP workflow, RAMP database, RAMP engine, and RAMP frontend.

RAMP workflow is a Python package used to define and formalize the data science problem to be solved. It can be used as a standalone package and lets a user prototype different solutions. In addition to RAMP workflow, a set of packages has been developed to let users share their solutions and collaborate around them. RAMP database provides a database structure to store the solutions of different users and the performance of these solutions. RAMP engine is the package that runs the user solutions (possibly in the cloud) and populates the database. Finally, RAMP frontend is the web frontend where users upload their solutions and which displays the challenge leaderboard.
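The division of roles described above can be illustrated with a small toy sketch. It does not use the RAMP packages or their API: every name in it (`DATA`, `cross_validation_folds`, `evaluate`, `build_leaderboard`, the two submissions, the `submissions` table) is hypothetical and only mimics, with the Python standard library, how a problem definition, an evaluation engine, a results database, and a leaderboard might fit together.

```python
import sqlite3

# Hypothetical toy problem (not a real RAMP problem): predict y from x.
DATA = [(float(x), 2.0 * x + 1.0) for x in range(20)]


def mean(values):
    values = list(values)
    return sum(values) / len(values)


# "Workflow" role: fix the cross-validation scheme and the score.
def cross_validation_folds(n_samples, n_folds=4):
    """Yield (train_indices, test_indices) pairs."""
    for k in range(n_folds):
        test = [i for i in range(n_samples) if i % n_folds == k]
        train = [i for i in range(n_samples) if i % n_folds != k]
        yield train, test


def mean_absolute_error(y_true, y_pred):
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


# Two hypothetical user submissions: fit(xs, ys) returns a predictor.
def fit_mean(xs, ys):
    mean_y = mean(ys)
    return lambda x: mean_y


def fit_line(xs, ys):
    mx, my = mean(xs), mean(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return lambda x: my + slope * (x - mx)


# "Engine" role: train and score one submission on every fold.
def evaluate(fit):
    scores = []
    for train, test in cross_validation_folds(len(DATA)):
        model = fit([DATA[i][0] for i in train], [DATA[i][1] for i in train])
        y_true = [DATA[i][1] for i in test]
        y_pred = [model(DATA[i][0]) for i in test]
        scores.append(mean_absolute_error(y_true, y_pred))
    return mean(scores)


# "Database" and "frontend" roles: store scores, then rank them.
def build_leaderboard(submissions):
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE submissions (user TEXT, name TEXT, score REAL)")
    for user, name, fit in submissions:
        conn.execute("INSERT INTO submissions VALUES (?, ?, ?)",
                     (user, name, evaluate(fit)))
    rows = conn.execute(
        "SELECT user, name, score FROM submissions ORDER BY score ASC"
    ).fetchall()
    conn.close()
    return rows


leaderboard = build_leaderboard([
    ("alice", "mean_baseline", fit_mean),
    ("bob", "least_squares", fit_line),
])
for rank, (user, name, score) in enumerate(leaderboard, start=1):
    print(rank, user, name, round(score, 3))
```

Because the scoring rule and the folds are fixed in one place, every submission is evaluated under the same conditions, which is what makes the resulting ranking comparable across users.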

The project is open-source and can be deployed on any local server. The framework has been used at the Paris-Saclay Center for Data Science for setting up and solving about twenty scientific problems, for organizing collaborative data challenges, for organizing scientific sub-communities around these events, and for training novice data scientists.

Abstract as a tweet – RAMP framework: a solution to collaborate on your data science challenge
Domains – Machine Learning, Parallel computing / HPC, Statistics
Project Homepage / Git –
Domain Expertise – some
Python Skill Level – basic