Patrick Deziel
Patrick Deziel is a machine learning engineer and Python and Go programmer. Patrick has extensive experience building machine learning powered applications and contributing to open source projects such as Yellowbrick, an ML visualization library written for Python. He currently works at Rotational Labs where he builds software to support prototyping and evaluation of AI/ML powered solutions.
Session
Optical character recognition (OCR) has been a long standing method of extracting text data from images. Traditional OCR models rely on pattern recognition and feature extraction using computer vision techniques and specialized Python libraries. Recently, large language models (LLMs) and generic AI assistants have provided an alternative method of text extraction. This talk explores the efficacy of using LLMs and VLMs for information extraction in production data pipelines and a data-driven approach for evaluating them against traditional OCR methods in terms of accuracy, reliability, latency, and cost.