Devconf.US

Working with filesystem in Time Series database
08-14, 16:40–17:15 (US/Eastern), Metcalf Small Ballroom (capacity 100)

Time Series databases face the significant challenge of processing vast amounts of data. At VictoriaMetrics, we are actively developing an open-source Time Series database entirely from scratch using Go. Our average installation handles between 2 to 4 million samples per second during ingestion, with larger setups managing over 100 million samples per second on a single cluster.
In his presentation, Roman will explore various techniques essential for constructing write-heavy applications such as:
- Understanding and mitigating write amplification.
- Implementing instant database snapshots.
- Safeguarding against data corruption post power outages.
- Evaluating the advantages and disadvantages of utilizing Write Ahead Log.
- Enhancing reliability in Network File System (NFS) environments.
Throughout the talk, Roman will illustrate these concepts with real code examples sourced from open-source projects.

Roman is a software engineer with experience in distributed systems, databases, monitoring, and high-performance microservices. Roman's passion is open source and he's proud to have contributions to Prometheus, Grafana, and ClickHouse. Currently, Roman is working on the open source time series database and monitoring solution VictoriaMetrics.