Toward Modern Linear Algebra: Single API Kernels for HPC
Modern hardware like NVIDIA’s H100, GB100, and AMD’s MI300 accelerators demand flexible, high-performance software. DLA.jl modernizes dense linear algebra with a unified, hardware-agnostic API, while Dagger.jl enables dynamic task scheduling across CPUs and GPUs. Together, they provide scalable, efficient computation without vendor lock-in. This talk explores their impact on HPC, AI, and scientific computing, highlighting future directions in mixed precision and adaptive scheduling.