PyConDE & PyData Berlin 2024

Select ML from Databases
2024-04-22 , B07-B08

This talk introduces a new workflow for building your machine learning models using the capabilities of modern databases that support machine learning use cases natively. There is an overview of how machine learning models are being created today to how they could look in the near future by utilising the features provided by current databases.


Developing machine learning models involves the use of data to identify patterns that would help solve business problems. Over the years as the scale of data increased, data started to get stored in databases. The model-building workflows would typically fetch the data from the databases, perform some transformations to create features, and use them to train the models. In some cases, these features would get stored in databases known as feature stores for reuse. To infer the model output in real-time, typically, there would be a small service or an API endpoint that would be deployed to get the results to the consumers.

As these use cases became more common, modern databases started incorporating features that aid in building machine learning models. This talk covers some of the features provided by some of the databases like including common models like linear regression, image classification, text processing, support for functions with custom models, etc. Apart from these features, many of them also make it easy to deploy the model without needing an external service for the inference. Instead, they provide native interfaces for inference like querying in SQL like languages.

This talk includes an example of how to build your custom model in Python and then include it inside your Couchbase database making inference a matter of using database queries. The example would help to understand some of the capabilities of modern databases in building machine learning model


Expected audience expertise: Domain:

Intermediate

Expected audience expertise: Python:

Intermediate

Abstract as a tweet (X) or toot (Mastodon):

Select ML from Databases: New workflow for building your machine learning models using the capabilities of modern databases

Public link to supporting material, e.g. videos, Github, etc.:

https://github.com/couchbaselabs/neural-translation-example/

Gregor Bauer from Couchbase is driven by understanding customer needs and delivering suitable solutions. With a telecommunications background, he has led technical teams in delivering customized device management and IoT solutions globally. As Manager Solutions Engineering CEUR at Couchbase, he specializes in application modernization, multi-cloud strategies, and sustainable high user experience, with a particular focus on edge computing.