BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//pretalx.com//good-2026//speaker//SSEBUZ
BEGIN:VTIMEZONE
TZID:MST
BEGIN:STANDARD
DTSTART:20001029T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10;UNTIL=20061029T090000Z
TZNAME:MST
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
END:STANDARD
BEGIN:STANDARD
DTSTART:20071104T030000
RRULE:FREQ=YEARLY;BYDAY=1SU;BYMONTH=11
TZNAME:MST
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000402T030000
RRULE:FREQ=YEARLY;BYDAY=1SU;BYMONTH=4;UNTIL=20060402T100000Z
TZNAME:MDT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
END:DAYLIGHT
BEGIN:DAYLIGHT
DTSTART:20070311T030000
RRULE:FREQ=YEARLY;BYDAY=2SU;BYMONTH=3
TZNAME:MDT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-good-2026-AWC8QJ@pretalx.com
DTSTART;TZID=MST:20260310T113000
DTEND;TZID=MST:20260310T115500
DESCRIPTION:We share how we built LLM inference services for researchers at
  Ohio Supercomputer Center (OSC) using Open OnDemand. Our users need LLMs 
 in many different ways\, so we support several service models. We started 
 with an Ollama module on the cluster\, added OpenWebUI\, and later created
  dedicated OnDemand apps that start the LLM server and give users a simple
  interface. We also provide a Jupyter + Ollama app that exposes an OpenAI 
 API interface in a Jupyter kernel. We maintain shared models so users can 
 start quickly\, while still allowing local models. We have also deployed t
 he Ollama + OpenWebUI service on our Kubernetes cluster\, but scaling it f
 or a large number of users remains a challenge. We plan to add additional 
 LLM engines and front-ends as part of our future improvements as well.
DTSTAMP:20260619T081719Z
LOCATION:Main Hall
SUMMARY:Building LLM Inference Services for Researchers on HPC with Open On
 Demand - Heechang Na [Ohio Supercomputer Center]
URL:https://pretalx.com/good-2026/talk/AWC8QJ/
END:VEVENT
END:VCALENDAR
