PyData London 2026

“Beyond Spark MLlib: Deduplicating Common Crawl at Scale”

Feedback is a valuable tool for speakers to improve their content and presentation. Even short feedback can help a speaker! Please take the time to communicate your thoughts in a constructive way. Thank you for your feedback!