Version 0.15 June 14, 2022
We released a new schedule version!
We had to move some sessions, so if you were planning on seeing them, check their new dates or locations:
- “URL Frontier, an open source API and implementation for crawl frontiers” by Julien Nioche (June 14, 2022, 12:20 p.m. → June 14, 2022, 3:20 p.m.)
- “Word2Vec model to generate synonyms on the fly in Apache Lucene” by Daniele Antuzi, Ilaria Petreti (June 14, 2022, 2:50 p.m. → June 14, 2022, 2:40 p.m.)
Version 0.14 June 13, 2022
We released a new schedule version!
Sadly, we had to cancel sessions:
- “Streams, SQL, Action! Up & Running with Materialize” by Marta Paes
- “Combining Data in Headless Architectures” by Roy Derks
Version 0.13 June 6, 2022
We released a new schedule version!
Version 0.12 June 1, 2022
We released a new schedule version!
We have new sessions!
- “Working in the Open...Search” by Charlotte Henkle, Sean Neumann
- “A smooth ride: Online car buying and selling at mobile.de” by Ricardo Kawase
We sadly had to cancel a session: “In Layman’s TermsQuery: A walk through the life of a Search Query” by Conor Landry
We had to move some sessions, so if you were planning on seeing them, check their new dates or locations:
- “Apache Kafka simply explained” by Olena Kutsenko (Frannz Salon → Palais Atelier)
- “Compress giant language models to effective and resource-saving models using knowledge distillation” by Qi Wu (June 13, 2022, 11 a.m., Palais Atelier → June 13, 2022, 5:20 p.m., Maschinenhaus)
- “Barcamp” by Nick Burch (Maschinenhaus → Palais Atelier)
- “Should we stop using distance in our location-based data recommendation models?” by Charlie Davies (June 14, 2022, 4:50 p.m., Frannz Salon → June 14, 2022, 10:40 a.m., Maschinenhaus)
- “URL Frontier, an open source API and implementation for crawl frontiers” by Julien Nioche (June 14, 2022, 10:10 a.m., Maschinenhaus → June 14, 2022, 12:20 p.m., Kesselhaus)
- “Matscholar: The search engine for materials science researchers” by John Dagdelen (June 14, 2022, 5:20 p.m., Maschinenhaus → June 14, 2022, 10:10 a.m., Frannz Salon)
- “Scaling your Kafka pipeline can be a pain - but it doesn’t have to be!!” by Opher Dubrovsky, Ido Nadler (June 14, 2022, 12:20 p.m. → June 14, 2022, 11:30 a.m.)
Version 0.11 May 19, 2022
We released a new schedule version!
We have a new session: “Goodbye Tracking, Hello Privacy: The Technology & Architecture behind Ethical Search & Discovery” by Nina Müller, Lara Menéndez García.
Version 0.10 May 11, 2022
We released a new schedule version!
We have a new session: “URL Frontier, an open source API and implementation for crawl frontiers” by Julien Nioche.
Version 0.9 May 10, 2022
We released a new schedule version!
We have moved a session around: “What's new in Apache Solr 9.0” by Anshum Gupta (June 14, 2022, 11:30 a.m., Kesselhaus → June 14, 2022, 2:50 p.m., Palais Atelier).
Version 0.8 May 4, 2022
We released a new schedule version!
We have new sessions!
- “Unpaired Sentiment-to-Sentiment Translation: A Cycled Reinforcement Learning Approach” by Sakshi Deo Shukla
- “Understanding Vespa with a Lucene mindset” by Atita Arora
- “Cloud-native ETL with Java Quarkus, Kubernetes, and Jib Container Builder” by Hakan Lofcali
- “Patterns and anti-patterns for production ready Kafka Streams apps” by Christoph Schubert
- “Min and Max Aggregations with Updates in Real Time.” by Minakshi Korad
Sadly, we had to cancel sessions:
- “Into the flamegraph: From the primitives through advanced concepts” by Yonatan Goldschmidt
- “Building useful resources to learn streaming concepts” by Aizhamal Nurmamat kyzy, Pablo Estrada
We have moved a session around: “Next generation OLAP stack using Apache Pinot” by Chinmay Soman (June 14, 2022, 2:50 p.m. → June 14, 2022, 4:50 p.m.)
Version 0.7 May 2, 2022
We released a new schedule version!
We have moved a session around: “What we learned from reading 100+ Kubernetes Post-Mortems” by Noaa Barki (June 14, 2022, 10:40 a.m. → June 13, 2022, 2 p.m.)
Version 0.6 April 28, 2022
We released a new schedule version!
We have a new session: “Barcamp” by Nick Burch.
Version 0.5 April 27, 2022
We released a new schedule version!
We had to move some sessions, so if you were planning on seeing them, check their new dates or locations:
- “Benefits of MQTT for IoT Messaging and Beyond” by Mary Grygleski (Kesselhaus → Palais Atelier)
- “Scaling your Kafka pipeline can be a pain - but it doesn’t have to be!!” by Opher Dubrovsky, Ido Nadler (June 14, 2022, 2:50 p.m., Maschinenhaus → June 14, 2022, 12:20 p.m., Kesselhaus)
Version 0.4 April 21, 2022
We released a new schedule version!
We have a new session: “Meet the people fighting surveillance capitalism” by Fiona Coath.
Version 0.3 April 20, 2022
We released a new schedule version!
We have new sessions!
- “Dense Concept Retrieval” by Konstantinos Perifanos, Lily Davies
- “Do It Yourself: Programmable Metrics using OpenTelemetry” by Ricardo Ferreira
We sadly had to cancel a session: “What's going on in my cluster?” by Matthias Haeussler
Version 0.2 April 14, 2022
We released a new schedule version!
We have new sessions!
- “Into the flamegraph: From the primitives through advanced concepts” by Yonatan Goldschmidt
- “Kafka Monitoring: What Matters!” by Amrit Sarkar
- “Build Real-time Analytic Applications: The Easy Way.” by Sergio Ferragut
- “Effective CI/CD for Large Systems” by Josh Reed
- “Optimizing Containers for Security and Scaling” by Thomas Fricke
- “Neural Search - Let's talk about quality” by Maximilian Werk, Florian Hoenicke
- “Help! I Need To UnSQLize My Application” by Joel Lord
- “The perils of building a democratic data platform” by Andre Jasiskis, Joaquim Torres
- “Benefits of MQTT for IoT Messaging and Beyond” by Mary Grygleski
- “Luxuries, necessities, and the challenges that remain: some experiences with accelerated data science” by William Benton, Sophie Watson
- “What's new in Apache Solr 9.0” by Anshum Gupta
- “Building an Open-source Framework for Generating Embedding Vectors” by Frank Liu
- “Hybrid search > sum of its parts?” by Lester Solbakken
- “Scaling your Kafka pipeline can be a pain - but it doesn’t have to be!!” by Opher Dubrovsky, Ido Nadler
- “Entity Linking at scale with Lucene” by Edoardo Tosca
- “Logging Apache Spark - How we made it easy” by Simona Meriam
- “Using Solr unconventionally to serve 26bn+ documents” by Richard Goodman
- “Streams, SQL, Action! Up & Running with Materialize” by Marta Paes
- “In Layman’s TermsQuery: A walk through the life of a Search Query” by Conor Landry
- “Should we stop using distance in our location-based data recommendation models?” by Charlie Davies
- “Running Apache Spark on K8s: From AWS EMR to K8s” by Ramiro Alvarez Fernandez, Álvaro Panizo, Daniel Hernández Alfageme
- “AI-powered Semantic Search; A story of broken promises?” by Jo Kristian Bergum
- “NrtSearch: Yelp’s fast, scalable, and cost-effective open source search engine” by Umesh Dangat
- “Scaling an online search engine to thousands of physical stores” by Aline Paponaud
- “What's going on in my cluster?” by Matthias Haeussler
- “Offline Ranking Validation - Predicting A/B Test Results” by Andrea Schuett, Yunus Lutz
- “Searching through large graphs using Elasticsearch” by Radu Pop
- “Reproducible and shareable notebooks across a data science team” by Mike Tapi Nzali, Pascal Godbillot
- “Open Science: Building Models Like We Build Open-Source Software” by Steven Kolawole
- “Relevance is not a Thing but a Perception” by Ana Maria García Sánchez
- “Muves: Multimodal and multilingual vector search with Hardware Acceleration” by Aarne Talman, Dmitry Kan
- “Apache Kafka simply explained” by Olena Kutsenko
- “Live build: How to harness streaming data in real time to track, transform and build on heart rate data” by Tomáš Neubauer, Javier Blanco Cordero
- “Building useful resources to learn streaming concepts” by Aizhamal Nurmamat kyzy, Pablo Estrada
- “Next generation OLAP stack using Apache Pinot” by Chinmay Soman
- “The Race to the Bottom - Low Latency in the age of the Transformer” by Max Irwin
- “Combining Data in Headless Architectures” by Roy Derks
- “Solving the knapsack problem with recursive queries and PostgreSQL” by Francesco Tisiot
- “Learning about AI/ML for Text, with Wordle!” by Nick Burch
- “Scaling the Open Source Climate Community” by Erik Erlandson
- “Why a Search Engine Makes a Great Log Analytics Solution” by Eli Fisher
- “The life of a search engine administrator” by Lucian Precup, Vincent Bréhin
- “Architecting Solr indexing pipelines in Google Cloud Platform” by Shubhro Jyoti Roy
- “The future of Lucene's MMapDirectory: Why use it and what's coming with Java 19 and later?” by Uwe Schindler
- “Neural Search Comes to Apache Solr: Approximate Nearest Neighbor, BERT and More (Buzzwords)!” by Alessandro Benedetti
- “What we learned from reading 100+ Kubernetes Post-Mortems” by Noaa Barki
- “Compress giant language models to effective and resource-saving models using knowledge distillation” by Qi Wu
- “Changelog Stream Processing with Apache Flink” by Timo Walther
- “Cross-Platform Data Lineage with OpenLineage” by Julien Le Dem
- “Change data capture with Debezium…and without” by Petros Angelatos
- “Autoscaling Elasticsearch for Logs on Kubernetes” by Radu Gheorghe, Ciprian Hacman
- “Word2Vec model to generate synonyms on the fly in Apache Lucene” by Daniele Antuzi, Ilaria Petreti
- “Don't Panic: Getting Your Infrastructure Drift Under Control” by Eran Bibi
- “Matscholar: The search engine for materials science researchers” by John Dagdelen
Version 0.1 April 6, 2022
These are our first confirmed sessions!