Julien Nioche

Having studied Russian language and culture in Paris and taught French in a school in Kiev, Ukraine, Julien went on to graduate in Text Engineering and Natural Language Processing. He moved to the UK to work as a researcher at the University of Sheffield in 2005 and founded DigitalPebble in 2008.

Julien has been involved in several open source projects, mainly at the Apache Software Foundation, and was the PMC chair for Apache Nutch. He is an Emeritus member of the Apache Software Foundation.

Julien runs workshops on web crawling, speaks at conferences and reviews technical books. He has over 20 years experience in the Java programming language.


Sessions

06-14
15:20
20min
URL Frontier, an open source API and implementation for crawl frontiers
Julien Nioche

This talk will present URLFrontier, an API and service implementation of a crawl frontier. After an introduction to how it fits in a distributed crawl architecture, we will go in more details on what the project provides, how it has been used so far and future works.

Store
Kesselhaus