Creating Your Own LLM Tuning Platform with Open Source Technologies Devconf.US

Creating Your Own LLM Tuning Platform with Open Source Technologies
.ical

2024-08-14 13:00–13:35, Conference Auditorium (capacity 260)

FMS HF Tuning is an open source package by IBM that leverages Supervised Fine-tuning Trainer from HuggingFace to support multiple tuning techniques for LLMs. We will give an overview of the tuning techniques available and demonstrate how the library can be utilized from a Jupyter notebook from the Open Data Hub platform.

The session will include:
- Introduction into fms-hf-tuning, when, why and where you can use it.
- Architectural overview of how it fits into the Open Data Hub and Red Hat OpenShift AI platform.
- Exploring different tuning techniques like Low-rank adaptation (LoRA), prompt tuning, fine tuning, and inference.
- Deploy and run production ready LLM model tuning and inference on ODH.

Attendees will leave with a greater understanding of the complexity and benefits of LLM tuning, and the open source tools and platforms available that they can leverage to improve their AI solutions.

See also: PowerPoint Presentation (3.3 MB)

James Busche

James Busche is a senior software engineer in the IBM Open Technologies Group, currently focused on the Open Source CodeFlare project. Previously, James has been a DevOps Cloud engineer for IBM Watson and the worldwide Watson Kubernetes deployments.

This speaker also appears in:

How To Win Friends & Influence LLMs (with Prompt Engineering)

Kelly Abuelsaad

Kelly is a software engineer at IBM Research with extensive experience in developing cloud platforms tailored for AI workloads. She is currently leading the development of a Kubernetes-based tuning management platform for Large Language Models (LLMs), contributing back to the open source community.

Creating Your Own LLM Tuning Platform with Open Source Technologies .ical 2024-08-14 13:00–13:35, Conference Auditorium (capacity 260)

Creating Your Own LLM Tuning Platform with Open Source Technologies
.ical

2024-08-14 13:00–13:35, Conference Auditorium (capacity 260)