Alfonso de la Guarda

CTO and Technology Architect for Veo365.com, Prix.tips, Scraprix.com, Prixlead and Machinalix.
Old School Hacker.
Background in computer science, anthropology and social communication.
Game and low-level programmer since 1983, starting on the CBM 64 and Amiga and continuing through the most important computer technologies, operating systems and programming languages.
Community Developer for Be Inc. (BeOS).
Community Developer for OLPC Project.
Free Software and Open Source guy.
Linux fan since 1997, with implementations ranging from basic network servers to flight simulators for defense.
Technology consultant for many institutions, including the Peruvian Army, EsSalud, SISOL, SALUDPOL and the Health Ministry of Chile.


Sessions

10-21
10:30
25min
Architecture for the extraction, automation and massive data processing
Alfonso de la Guarda

Live broadcast: https://www.youtube.com/watch?v=OcgLuOs1Hrc

This talk presents a solution whose architecture integrates several components: computational resources, databases, in-house Python applications and other open source ones. The idea is to show the problems and challenges posed by traditional scraping and how we have built solutions that reduce them, especially when the goal is to scrape en masse and in parallel. This also means building an automated flow for post-processing and transforming the data using machine learning services such as NLP and classification.

Data Science, AI, and Machine Learning
Data