Alfonso de la Guarda
CTO and Technology Architect for Veo365.com, Prix.tips, Scraprix.com, Prixlead and Machinalix.
Old School Hacker.
Background in Computer Science, Anthropology and Social Communication.
Game and low-level programmer since 1983, starting with the CBM 64 and Amiga and continuing through the most important computer technologies, operating systems and programming languages.
Community Developer for Be Inc. (BeOS).
Community Developer for the OLPC Project.
Free Software and Open Source guy.
Linux fan since 1997, with implementations ranging from basic network servers to flight simulators for defense.
Technology consultant for many institutions, including the Peruvian Army, EsSalud, SISOL, SALUDPOL and the Health Ministry of Chile.
Session
Live broadcast: https://www.youtube.com/watch?v=OcgLuOs1Hrc
This session presents a solution whose architecture integrates several components: computational resources, databases, our own Python applications and other open source ones. The idea is to show the problems and challenges posed by traditional scraping and how we have built solutions that reduce them, especially when the scraping must run at scale and in parallel. This also means building an automated flow for post-processing and transforming the data using machine learning services such as NLP and classification.
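As a rough illustration of the flow described above (parallel scraping followed by automated post-processing), here is a minimal Python sketch. It is not the architecture presented in the session. The names `fetch`, `extract_text`, `classify` and `scrape_all` are hypothetical, `fetch` returns canned HTML instead of making a real HTTP request, and the keyword rule in `classify` is only a stand-in for an actual machine learning classification service.

```python
import re
from concurrent.futures import ThreadPoolExecutor


def fetch(url):
    # Hypothetical stand-in for a real HTTP request: returns canned HTML
    # so the sketch runs offline.
    return f"<html><title>Page for {url}</title><body>Price: 10 USD</body></html>"


def extract_text(html):
    # Crude post-processing step: strip tags and collapse whitespace.
    return " ".join(re.sub(r"<[^>]+>", " ", html).split())


def classify(text):
    # Placeholder for an ML classification service (assumption: a simple
    # keyword rule is enough to illustrate where the model would sit).
    return "pricing" if "price" in text.lower() else "other"


def scrape_all(urls, fetcher=fetch, workers=4):
    # Fetch pages in parallel; pool.map preserves the input order.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        pages = list(pool.map(fetcher, urls))
    # Post-process each page and attach a label.
    results = []
    for url, html in zip(urls, pages):
        text = extract_text(html)
        results.append({"url": url, "text": text, "label": classify(text)})
    return results


if __name__ == "__main__":
    for row in scrape_all(["https://example.com/a", "https://example.com/b"]):
        print(row["url"], row["label"])
```

In a real deployment the thread pool would likely be replaced by a distributed queue of workers, and `fetcher` by a client that handles retries, rate limits and blocking, which are among the typical pain points of traditional scraping.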