PyCon Lithuania 2024

Mastering Web Scraping: Unleash Your Data Extraction Wizardry!
2024-04-03 –, Room 228

Unlock the full potential of web scraping with this session! From novice to virtuoso, join us on an exciting journey of data extraction as we unravel secrets and advanced techniques.

πŸ” Session Highlights:
1/ Building Web Scrapers - The Art Unveiled πŸ› οΈ
2/ Proxy and Browser Farms Adventure 🌐
3/ Scrapoxy Orchestration - Elevate Your Scalability πŸš€
4/ Protection Measures Disclosed πŸ”’

This concise session will immerse you in the fascinating world of web scraping.


Unlock the full potential of web scraping with this session! From novice to virtuoso, join us on an exciting journey of data extraction as we unravel secrets and advanced techniques.

πŸ” Session Highlights

1/ Building Web Scrapers - The Art Unveiled πŸ› οΈ
- Master the craft of constructing resilient scrapers for diverse websites.
- Explore best practices, strategies, and common pitfalls.

2/ Proxy and Browser Farms Adventure 🌐
- Understand the different proxy types: datacenter, residential, and mobile proxies.
- Dive into browser farms for dynamic and efficient data extraction.
- Become an expert in mastering browser farms with Puppeteer and Playwright.

3/ Scrapoxy Orchestration - Elevate Your Scalability πŸš€
- Discover Scrapoxy, an open-source proxies aggregator designed for intelligent traffic routing.
- Scrapoxy will be your ultimate tool to be present anywhere on the planet.

4/ Protection Measures Disclosed πŸ”’
- Overcome challenges like captchas and anti-bot measures.
- Gain insights into reverse engineering protection mechanisms for token generation.

This concise session will immerse you in the fascinating world of web scraping.

Don't miss the opportunity to master these essential skills and revolutionise your approach to data extraction!

WebScraping #Proxy #ReverseEngineering πŸ•΅οΈβ€β™‚οΈ

See also: banner (1008.0Β KB)

Fabien Vauchelles is the Anti-Ban Expert at Wiremind. With over a decade of experience in web scraping, Fabien's passion for code and technology helps him to bypass bans. He is the creator of Scrapoxy, an opensource proxy aggregator for webscraping.

He had the opportunity to speak at Devoxx FR, Zyte’s Extract Summit and Voxxed Days.

This speaker also appears in: