Linked Jazz & Wikibase
Matt Miller
Semlab at Pratt Institute, semlab.io
@thisismmiller
The Semantic Lab
D
F
In conjunction with
Drawings of the Florentine Painters
DADAlytics
Grant-funded through
Linked Jazz
Linked Jazz
Uses oral history transcripts to build an RDF based social network of Jazz related entities. Check out linkedjazz.org
History with Wikimedia
Linked Jazz Data Management History
“You are here”
Heap of files: RDF, CSV, JSON, etc
(~2012)
Relational DB
Linked Data Platform (LDP) server
Why are we trying Wikibase?
Wikibase: Infrastructure
We are testing two instances of wikibase:
http://base.semlab.io/ (4 vCPUs 8GB / 160GB Disk / Digital Ocean)
http://sandbase.semlab.io/ (2 vCPUs 4GB / 80GB Disk / Digital Ocean)
Wikibase: Infrastructure
Using the Docker image to run our instances.
Our fork: https://github.com/SemanticLab/wikibase-docker
For now our only modifications are using the build script (not just pulling) to build the images, adding files to the wikibase image and adding additional configuration to LocalSettings.php
Has been a very smooth experience once you get over Docker learning curve.
Wikibase: Bootstrapping Data
We already have lots of data we want to load into our instance.
First step was preparing legacy data for ingest: https://github.com/linkedjazz/lj_database_cleanup
Transcript
Entity
Individual Statement
Host Institution
Entity
Wikibase: Bootstrapping Data
First attempt: https://github.com/SemanticLab/data-2-wikibase
Next attempt:
Using References to track provenance at the statement leve
Working around field limitations
Thinking about seralizations based on context and use case
Next Steps
Thanks!