Language: 漢語
07-31, 13:50–14:10 (Asia/Taipei), TR412-2
Wikidata 至今已經累計九億條條目,面臨資料問題(bad data)的挑戰。例如重複條目名稱或內容、維基百科的來源條目移動或版本變動,而與維基百科、開放街圖條目相互參照的 Wikidata 並未連動調整。
本講座分享採用開源工具、MediaWiki 開放資料 API 偵測與協助解決資料著錄問題。
Target audience –WikiData 社群與 WikiData 使用者
Translate Title –How to fix bad data issues of WikiData by using the open data tool
Talk Length –20
您是否知悉並同意如採遠端形式分享,需提供預錄影片(您需同意大會才能接受您的稿件) – yes Difficulty –中階
講者所屬社群 –WikiData
other info –Open source 🤘
hackmd url –https://hackmd.io/@coscup/BJl_ETwA_/%2F%40coscup%2FHJ_8NTwA_
slido url – English Abstract –Wikidata, which has accumulated 900 million entries to date, but also faces the challenge of bad data. For example, duplicate entry names or content. Second, the Wikipedia source entries that have moved or the content have been updated, but the cross-referenced Wikidata entries have not been adjusted.
This talk shares the experience of using open source tools and the MediaWiki open data APIs to detect and help to fix the bad data issues.