Version 0.15 June 14, 2022
We released a new schedule version!
We had to move some sessions, so if you were planning on seeing them, check their new dates or locations:
- “URL Frontier, an open source API and implementation for crawl frontiers” by Julien Nioche (June 14, 2022, 12:20 p.m. → June 14, 2022, 3:20 p.m.)
- “Word2Vec model to generate synonyms on the fly in Apache Lucene” by Daniele Antuzi, Ilaria Petreti (June 14, 2022, 2:50 p.m. → June 14, 2022, 2:40 p.m.)
Version 0.14 June 13, 2022
We released a new schedule version!
Sadly, we had to cancel sessions:
- “Streams, SQL, Action! Up & Running with Materialize” by Marta Paes
- “Combining Data in Headless Architectures” by Roy Derks
Version 0.13 June 6, 2022
We released a new schedule version!
Version 0.12 June 1, 2022
We released a new schedule version!
We have new sessions!
We sadly had to cancel a session: “In Layman’s TermsQuery: A walk through the life of a Search Query” by Conor Landry.
We had to move some sessions, so if you were planning on seeing them, check their new dates or locations:
- “Apache Kafka simply explained” by Olena Kutsenko (Frannz Salon → Palais Atelier)
- “Compress giant language models to effective and resource-saving models using knowledge distillation” by Qi Wu (June 13, 2022, 11 a.m., Palais Atelier → June 13, 2022, 5:20 p.m., Maschinenhaus)
- “Barcamp” by Nick Burch (Maschinenhaus → Palais Atelier)
- “Should we stop using distance in our location-based data recommendation models?” by Charlie Davies (June 14, 2022, 4:50 p.m., Frannz Salon → June 14, 2022, 10:40 a.m., Maschinenhaus)
- “URL Frontier, an open source API and implementation for crawl frontiers” by Julien Nioche (June 14, 2022, 10:10 a.m., Maschinenhaus → June 14, 2022, 12:20 p.m., Kesselhaus)
- “Matscholar: The search engine for materials science researchers” by John Dagdelen (June 14, 2022, 5:20 p.m., Maschinenhaus → June 14, 2022, 10:10 a.m., Frannz Salon)
- “Scaling your Kafka pipeline can be a pain - but it doesn’t have to be!!” by Opher Dubrovsky, Ido Nadler (June 14, 2022, 12:20 p.m. → June 14, 2022, 11:30 a.m.)
Version 0.11 May 19, 2022
We released a new schedule version!
We have a new session: “Goodbye Tracking, Hello Privacy: The Technology & Architecture behind Ethical Search & Discovery” by Nina Müller, Lara Menéndez García .
Version 0.10 May 11, 2022
We released a new schedule version!
We have a new session: “URL Frontier, an open source API and implementation for crawl frontiers” by Julien Nioche .
Version 0.9 May 10, 2022
We released a new schedule version!
We have moved a session around: “What's new in Apache Solr 9.0” by Anshum Gupta (June 14, 2022, 11:30 a.m., Kesselhaus → June 14, 2022, 2:50 p.m., Palais Atelier).
Version 0.8 May 4, 2022
We released a new schedule version!
We have new sessions!
- “Unpaired Sentiment-to-Sentiment Translation: A Cycled Reinforcement Learning Approach”
- “Understanding Vespa with a Lucene mindset”
- “Cloud-native ETL with Java Quarkus, Kubernetes, and Jib Container Builder”
- “Patterns and anti-patterns for production ready Kafka Streams apps”
- “Min and Max Aggregations with Updates in Real Time.”
Sadly, we had to cancel sessions:
- “Into the flamegraph: From the primitives through advanced concepts” by Yonatan Goldschmidt
- “Building useful resources to learn streaming concepts” by Aizhamal Nurmamat kyzy, Pablo Estrada
We have moved a session around: “Next generation OLAP stack using Apache Pinot” by Chinmay Soman (June 14, 2022, 2:50 p.m. → June 14, 2022, 4:50 p.m.)
Version 0.7 May 2, 2022
We released a new schedule version!
We have moved a session around: “What we learned from reading 100+ Kubernetes Post-Mortems” by Noaa Barki (June 14, 2022, 10:40 a.m. → June 13, 2022, 2 p.m.)
Version 0.6 April 28, 2022
We released a new schedule version!
We have a new session: “Barcamp” by Nick Burch .
Version 0.5 April 27, 2022
We released a new schedule version!
We had to move some sessions, so if you were planning on seeing them, check their new dates or locations:
- “Benefits of MQTT for IoT Messaging and Beyond” by Mary Grygleski (Kesselhaus → Palais Atelier)
- “Scaling your Kafka pipeline can be a pain - but it doesn’t have to be!!” by Opher Dubrovsky, Ido Nadler (June 14, 2022, 2:50 p.m., Maschinenhaus → June 14, 2022, 12:20 p.m., Kesselhaus)
Version 0.4 April 21, 2022
We released a new schedule version!
We have a new session: “Meet the people fighting surveillance capitalism” by Fiona Coath .
Version 0.3 April 20, 2022
We released a new schedule version!
We have new sessions!
We sadly had to cancel a session: “What's going on in my cluster?” by Matthias Haeussler.
Version 0.2 April 14, 2022
We released a new schedule version!
We have new sessions!
- “Into the flamegraph: From the primitives through advanced concepts”
- “Kafka Monitoring: What Matters!”
- “Build Real-time Analytic Applications: The Easy Way.”
- “Effective CI/CD for Large Systems”
- “Optimizing Containers for Security and Scaling”
- “Neural Search - Let's talk about quality”
- “Help! I Need To UnSQLize My Application”
- “The perils of building a democratic data platform”
- “Benefits of MQTT for IoT Messaging and Beyond”
- “Luxuries, necessities, and the challenges that remain: some experiences with accelerated data science”
- “What's new in Apache Solr 9.0”
- “Building an Open-source Framework for Generating Embedding Vectors”
- “Hybrid search > sum of its parts?”
- “Scaling your Kafka pipeline can be a pain - but it doesn’t have to be!!”
- “Entity Linking at scale with Lucene”
- “Logging Apache Spark - How we made it easy”
- “Using Solr unconventionally to serve 26bn+ documents”
- “Streams, SQL, Action! Up & Running with Materialize”
- “In Layman’s TermsQuery: A walk through the life of a Search Query”
- “Should we stop using distance in our location-based data recommendation models?”
- “Running Apache Spark on K8s: From AWS EMR to K8s”
- “AI-powered Semantic Search; A story of broken promises?”
- “NrtSearch: Yelp’s fast, scalable, and cost-effective open source search engine”
- “Scaling an online search engine to thousands of physical stores”
- “What's going on in my cluster?”
- “Offline Ranking Validation - Predicting A/B Test Results”
- “Searching through large graphs using Elasticsearch”
- “Reproducible and shareable notebooks across a data science team”
- “Open Science: Building Models Like We Build Open-Source Software”
- “Relevance is not a Thing but a Perception”
- “Muves: Multimodal and multilingual vector search with Hardware Acceleration”
- “Apache Kafka simply explained”
- “Live build: How to harness streaming data in real time to track, transform and build on heart rate data”
- “Building useful resources to learn streaming concepts”
- “Next generation OLAP stack using Apache Pinot”
- “The Race to the Bottom - Low Latency in the age of the Transformer”
- “Combining Data in Headless Architectures”
- “Solving the knapsack problem with recursive queries and PostgreSQL”
- “Learning about AI/ML for Text, with Wordle!”
- “Scaling the Open Source Climate Community”
- “Why a Search Engine Makes a Great Log Analytics Solution”
- “The life of a search engine administrator”
- “Architecting Solr indexing pipelines in Google Cloud Platform”
- “The future of Lucene's MMapDirectory: Why use it and what's coming with Java 19 and later?”
- “Neural Search Comes to Apache Solr: Approximate Nearest Neighbor, BERT and More (Buzzwords)!”
- “What we learned from reading 100+ Kubernetes Post-Mortems”
- “Compress giant language models to effective and resource-saving models using knowledge distillation”
- “Changelog Stream Processing with Apache Flink”
- “Cross-Platform Data Lineage with OpenLineage”
- “Change data capture with Debezium…and without”
- “Autoscaling Elasticsearch for Logs on Kubernetes”
- “Word2Vec model to generate synonyms on the fly in Apache Lucene”
- “Don't Panic: Getting Your Infrastructure Drift Under Control”
- “Matscholar: The search engine for materials science researchers”
Version 0.1 April 6, 2022
These are our first confirmed sessions!