Dr. Pedro Szekely is a Principal Scientist and Director of the AI division at the USC Information Sciences Institute (ISI. His research focuses on table understanding, knowledge graphs and applications of knowledge graphs. He teaches a graduate course at USC on Building Knowledge Graphs, and has given tutorials on knowledge graph construction at KDD, ISWC, AAAI and WWW.
In many use cases, projects want to reuse a subset of Wikidata focused on project-specific topics, combining the Wikidata data with project-specific data. Extracting a subgraph of Wikidata is difficult because Wikidata is very large, and it is difficult to specify what to keep and what to discard. This talk presents Knowledge Graph Toolkit (KGTK, https://github.com/usc-isi-i2/kgtk), a toolkit that can process the full Wikidata on a laptop, and provides a rich suite of commands for query and path following that can be used to flexibly extract topic specific subgraphs.