David Berenstein
Hi there 👋
From failing to study medicine ➡️ BSc industrial engineer ➡️ MSc computer scientist.
Life can be strange, so better enjoy it.
I´m sure I do by: 👨🏽🍳 Cooking, 👨🏽💻 Coding, 🏆 Committing.
@davidbstein1957
Notable open source projects that you contribute to. Add URLs, one per line. –https://github.com/argilla-io/argilla
https://github.com/explosion/spaCy
https://github.com/davidberenstein1957/concise-concepts
https://github.com/davidberenstein1957/classy-classification
https://github.com/davidberenstein1957/spacy-setfit
Session
04-05
14:00
25min
🧼 From GPU-poor to data-rich: data quality practices for LLM fine-tuning
Gabriel Martín Blázquez, David Berenstein
If you are GPU-poor you need to become data-rich. I will give an overview of what we learned from looking at Alpaca, LIMA, Dolly, UltraFeedback and Zephyr and how we applied that to fine-tuning a state-of-the-art open source LLM called Notus and Notux by becoming data-rich.
Data
Room 111