PyCon UK 2019

The Companies We Keep: Using network graphs to map corporate control
2019-09-16, 17:00–17:30, Assembly Room

We present how we built a graph database mapping UK companies and who controls them using Python and a Neo4j graph database, and the insights and impact this work has had.


In June 2016 the UK government launched the world’s first “beneficial ownership” register; a requirement for all UK companies to register who were the “persons of significant control”, PSCs, who actually controlled the company. Recent investigative journalism has made headlines with the leaking of the Panama and Paradise papers and it is clear that transparency in corporate ownership needs to be a significant factor within modern democracy. In a partnership between DataKind UK and Global Witness we have built the worlds first network graph mapping all of the UK public data on those who control corporate interests in the UK; it comprises in excess of 4.5 million companies and 4 million individual people. It has been enriched with company officer data and metrics of financial secrecy based upon geographic regions.

The goal of the project was to enable Global Witness to search for "shady patterns" within corporate ownership networks to act as leads for investigative journalism to expose corrupt practices. Further more, we were able to analyse the completeness of the register and identify ways of improving such data structures to inform other world governments how to best build similar public registers of corporate ownership.

We present here how we built this amazing data structure using Python tools for cleaning and data processing and a Neo4j graph database storing the network graph itself. In addition, we share the outcomes and insights of the project ranging from changing how the NGO approaches data to being debated in the Houses of Parliament.


Is your proposal suitable for beginners? – False