1 of 33

Wicked data with Wikidata

Dan Scott, Laurentian University

@denials

2 of 33

100 ‡a We love authorities

3 of 33

Unifying and disambiguating

100 1 _ ‡a Trump, Donald, ‡d 1946-

400 1 _ ‡a Tramps, D. Dž. ‡q (Donalds Džons), ‡d 1946- National Library of Latvia

https://viaf.org/viaf/49272447

4 of 33

Hot cross references

150 __ ‡a Dictators

450 __ ‡a Tyrants

550 __ ‡w g ‡a Heads of state

5 of 33

Authorities / overlords

Library of Congress

MeSH (NLM)

ISNI (ISO)

VIAF (OCLC)

6 of 33

Google Knowledge Graph

7 of 33

Browse

8 of 33

Rich relationships

9 of 33

Freebase

10 of 33

"We believe strongly in a robust community-driven effort to collect and curate structured knowledge about the world" - Google

11 of 33

100,000 requests per day, free

12 of 33

Google Knowledge Graph Search API result != Knowledge Graph card info

{
  "@type": "EntitySearchResult",
  "result": {
    "@id": "kg:/m/0cqt90",
    "name": "Donald Trump",
    "@type": [ "Person", "Thing" ],
    "description": "45th U.S. President",
    "image": {
      "contentUrl": "http://t2.gstatic.com/images?q=tbn:ANd9GcRJNzU33KiweQzfvzmlUgMp8vXBA5jGMRfylj5FQ7TwJz0JlNuI",
      "url": "https://en.wikipedia.org/wiki/Donald_Trump_and_Billy_Bush_recording",
      "license": "http://creativecommons.org/licenses/by-sa/2.0"
    },
    "detailedDescription": {
      "articleBody": "Donald John Trump is an American businessman, television personality, politician, and the 45th President of the United States.\nBorn and raised in Queens, New York City, Trump received an economics degree from the Wharton School of the University of Pennsylvania in 1968. ",
      "url": "https://en.wikipedia.org/wiki/Donald_Trump",
      "license": "https://en.wikipedia.org/wiki/Wikipedia:Text_of_Creative_Commons_Attribution-ShareAlike_3.0_Unported_License"
    },
    "url": "https://www.donaldjtrump.com/"
  },
  "resultScore": 1670.089844
}
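A minimal Python sketch of how a result like this one might be requested and boiled down to the handful of fields it carries. It assumes the public Knowledge Graph Search API endpoint and its `query`, `key`, and `limit` parameters; the function names and the `KEY` placeholder are illustrative only, and the full response is assumed to wrap elements like this one in an `itemListElement` list.

```python
from urllib.parse import urlencode

# Public Knowledge Graph Search API endpoint (an API key is required).
KG_ENDPOINT = "https://kgsearch.googleapis.com/v1/entities:search"


def build_kg_search_url(query, api_key, limit=1):
    """Build the GET URL for an entity search; no request is sent here."""
    params = {"query": query, "key": api_key, "limit": limit}
    return KG_ENDPOINT + "?" + urlencode(params)


def summarize_result(element):
    """Reduce one EntitySearchResult element to the fields it contains."""
    result = element.get("result", {})
    detailed = result.get("detailedDescription", {})
    return {
        "id": result.get("@id"),
        "name": result.get("name"),
        "types": result.get("@type", []),
        "description": result.get("description"),
        "wikipedia": detailed.get("url"),
        "score": element.get("resultScore"),
    }


# The element from the slide, trimmed to the fields used above.
sample = {
    "@type": "EntitySearchResult",
    "result": {
        "@id": "kg:/m/0cqt90",
        "name": "Donald Trump",
        "@type": ["Person", "Thing"],
        "description": "45th U.S. President",
        "detailedDescription": {
            "url": "https://en.wikipedia.org/wiki/Donald_Trump",
        },
        "url": "https://www.donaldjtrump.com/",
    },
    "resultScore": 1670.089844,
}

summary = summarize_result(sample)
for field, value in summary.items():
    print(f"{field}: {value}")
```

Everything the summary can hold is a label, a type list, a short description, and a couple of URLs: there are no statements linking this entity to other entities.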

13 of 33

Google Knowledge Graph Search API result != Knowledge Graph card info


No facts or relationships

14 of 33

Bad guy Google / good guy Google?

15 of 33

Wikidata growing nicely

16 of 33

17 of 33

DBpedia is not Wikidata

18 of 33

Searching Wikidata for entities

19 of 33

Complete with auto-complete
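The same entity search that drives the auto-complete box is also reachable over the MediaWiki API. A hedged Python sketch, assuming the `wbsearchentities` action on `https://www.wikidata.org/w/api.php`; the helper names and the hand-built sample response are illustrative, not captured output.

```python
from urllib.parse import urlencode

WIKIDATA_API = "https://www.wikidata.org/w/api.php"


def build_entity_search_url(text, language="en", limit=5):
    """Build a wbsearchentities URL; no request is sent here."""
    params = {
        "action": "wbsearchentities",
        "search": text,
        "language": language,
        "limit": limit,
        "format": "json",
    }
    return WIKIDATA_API + "?" + urlencode(params)


def pick_matches(response):
    """Return (id, label, description) for each hit in the 'search' list."""
    return [
        (hit["id"], hit.get("label"), hit.get("description"))
        for hit in response.get("search", [])
    ]


url = build_entity_search_url("Donald Duck")

# Hand-built stand-in for a response, reduced to the keys read above.
sample_response = {"search": [{"id": "Q6550", "label": "Donald Duck"}]}
matches = pick_matches(sample_response)
print(url)
print(matches)
```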

20 of 33

Item page - human readable

21 of 33

https://www.wikidata.org/wiki/Q6550

22 of 33

https://www.wikidata.org/wiki/Q6550.json

{
  "entities": {
    "Q6550": {
      "pageid": 7674,
      "ns": 0,
      "title": "Q6550",
      "lastrevid": 439312714,
      "modified": "2017-01-30T10:54:43Z",
      "type": "item",
      "id": "Q6550",
      "labels": {
        "fr": { "language": "fr", "value": "Donald Duck" },
        "en": { "language": "en", "value": "Donald Duck" },
        "ar": { "language": "ar", "value": "بطوط" }
      },
      "claims": {
        "P1080": [
          {
            "mainsnak": {
              "snaktype": "value",
              "property": "P1080",
              "datavalue": {
                "value": {
                  "entity-type": "item",
                  "numeric-id": 12301358,
                  "id": "Q12301358"
                },
                "type": "wikibase-entityid"
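A short Python sketch of walking this structure: labels are keyed by language code, and claims are keyed by property ID, each holding a list of statements whose `mainsnak` carries the value. The helper names are illustrative, and the sample document is the fragment above closed off by hand and trimmed to two labels.

```python
def label_for(entity, language="en"):
    """Return the entity's label in one language, or None if absent."""
    return entity.get("labels", {}).get(language, {}).get("value")


def item_valued_claims(entity):
    """Yield (property_id, target_item_id) for claims that point at items."""
    for prop, statements in entity.get("claims", {}).items():
        for statement in statements:
            snak = statement.get("mainsnak", {})
            if snak.get("snaktype") != "value":
                continue  # "novalue" / "somevalue" snaks carry no datavalue
            datavalue = snak.get("datavalue", {})
            if datavalue.get("type") == "wikibase-entityid":
                yield prop, datavalue["value"]["id"]


# Fragment from the slide, closed by hand so that it parses.
document = {
    "entities": {
        "Q6550": {
            "id": "Q6550",
            "labels": {
                "fr": {"language": "fr", "value": "Donald Duck"},
                "en": {"language": "en", "value": "Donald Duck"},
            },
            "claims": {
                "P1080": [
                    {
                        "mainsnak": {
                            "snaktype": "value",
                            "property": "P1080",
                            "datavalue": {
                                "value": {
                                    "entity-type": "item",
                                    "numeric-id": 12301358,
                                    "id": "Q12301358",
                                },
                                "type": "wikibase-entityid",
                            },
                        }
                    }
                ]
            },
        }
    }
}

entity = document["entities"]["Q6550"]
links = list(item_valued_claims(entity))
print(label_for(entity), links)
```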

23 of 33

SPARQL is magic

24 of 33

Awards received by Donald Trump

SELECT ?awardLabel ?year ?forWorkLabel
WHERE {
  BIND(wd:Q22686 AS ?human) .
  ?human p:P166 ?awardStmt .
  ?awardStmt ps:P166 ?award .
  ?awardStmt pq:P585 ?date .
  BIND(YEAR(?date) AS ?year) .
  OPTIONAL { ?awardStmt pq:P1686 ?forWork . }
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
ORDER BY DESC(?year)

awardLabel                                         year  forWorkLabel
Time Person of the Year                            2016
Star on Hollywood Walk of Fame                     2007
Golden Raspberry Award for Worst Supporting Actor  1990  Ghosts Can't Do It
Ellis Island Medal of Honor                        1986
Jewish National Fund Tree of Life Award            1983
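A query like this can also be sent from a script. The sketch below assumes the public Wikidata Query Service endpoint at `https://query.wikidata.org/sparql` and the standard SPARQL JSON results layout; the function names and the User-Agent string are placeholders, and the sample results are hand-built from two rows of the table rather than captured from a live run.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"


def run_query(sparql, user_agent="slides-example/0.1 (placeholder contact)"):
    """Send a SPARQL query and return the parsed JSON results (needs network)."""
    url = SPARQL_ENDPOINT + "?" + urlencode({"query": sparql, "format": "json"})
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request, timeout=60) as response:
        return json.load(response)


def rows_from_results(results, columns):
    """Flatten SPARQL JSON bindings into tuples; unbound variables become ''."""
    return [
        tuple(binding.get(column, {}).get("value", "") for column in columns)
        for binding in results["results"]["bindings"]
    ]


# Hand-built results in the SPARQL JSON layout, mirroring two table rows.
sample_results = {
    "head": {"vars": ["awardLabel", "year", "forWorkLabel"]},
    "results": {
        "bindings": [
            {
                "awardLabel": {"type": "literal", "value": "Time Person of the Year"},
                "year": {"type": "literal", "value": "2016"},
            },
            {
                "awardLabel": {
                    "type": "literal",
                    "value": "Golden Raspberry Award for Worst Supporting Actor",
                },
                "year": {"type": "literal", "value": "1990"},
                "forWorkLabel": {"type": "literal", "value": "Ghosts Can't Do It"},
            },
        ]
    },
}

rows = rows_from_results(sample_results, ["awardLabel", "year", "forWorkLabel"])
for row in rows:
    print(row)
```

Because `?forWork` sits inside `OPTIONAL`, rows without a qualifying work simply omit that binding, which is why the flattening step substitutes an empty string.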

25 of 33

Other humans who have received the same awards
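One possible shape for such a query, sketched as a Python helper that fills in the starting item. This is an assumption about how the question could be asked, not the query used on the slide: it relies on `wdt:P166` (award received) and `wdt:P31 wd:Q5` (instance of human), and the function name and placeholder tokens are illustrative.

```python
import re

# Template with placeholder tokens; "__QID__" and "__LIMIT__" are filled in below.
SAME_AWARDS_TEMPLATE = """\
SELECT DISTINCT ?other ?otherLabel ?awardLabel
WHERE {
  wd:__QID__ wdt:P166 ?award .
  ?other wdt:P166 ?award ;
         wdt:P31 wd:Q5 .
  FILTER(?other != wd:__QID__)
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
LIMIT __LIMIT__
"""


def same_awards_query(qid, limit=50):
    """Return a SPARQL query for humans sharing at least one award with qid."""
    if not re.fullmatch(r"Q[1-9][0-9]*", qid):
        raise ValueError(f"not a Wikidata item ID: {qid!r}")
    query = SAME_AWARDS_TEMPLATE.replace("__QID__", qid)
    return query.replace("__LIMIT__", str(int(limit)))


query = same_awards_query("Q22686", limit=25)
print(query)
```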

26 of 33

27 of 33

Notability

Wikidata in its first phases has two main goals: to centralize interlanguage links across Wikimedia projects and to serve as a general knowledge base for the world at large.

An item is notable if "[i]t fulfills some structural need, for example: it is needed to make statements made in other items more useful."

https://www.wikidata.org/wiki/Wikidata:Notability

28 of 33

Anonymous or authenticated edits

29 of 33

Promiscuous authority linkage

OCLC

WorldCat Entities
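On the Wikidata side, links out to other authorities are ordinary claims on the item, one property per authority. A minimal Python sketch of collecting them from entity JSON, assuming the property IDs P214 (VIAF), P244 (LCCN), P213 (ISNI), P496 (ORCID), and P227 (GND); the helper name and the hand-built sample entity are illustrative.

```python
# Wikidata properties for a few external authority identifiers.
AUTHORITY_PROPERTIES = {
    "P214": "VIAF",
    "P244": "LCCN",
    "P213": "ISNI",
    "P496": "ORCID",
    "P227": "GND",
}


def authority_ids(entity):
    """Map authority name -> list of identifier strings found on the entity."""
    found = {}
    claims = entity.get("claims", {})
    for prop, name in AUTHORITY_PROPERTIES.items():
        for statement in claims.get(prop, []):
            snak = statement.get("mainsnak", {})
            if snak.get("snaktype") == "value":
                # External identifiers are stored as plain strings.
                found.setdefault(name, []).append(snak["datavalue"]["value"])
    return found


# Hand-built entity carrying only the VIAF ID shown earlier in the deck.
sample_entity = {
    "id": "Q22686",
    "claims": {
        "P214": [
            {
                "mainsnak": {
                    "snaktype": "value",
                    "property": "P214",
                    "datavalue": {"value": "49272447", "type": "string"},
                }
            }
        ]
    },
}

ids = authority_ids(sample_entity)
print(ids)
```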

30 of 33

Wikipedia simplicity win

From old inline Wikipedia markup:

{{Authority control |VIAF=xxxxxx |LCCN=n/xx/xxxxxx |ISNI=xxxx xxxx xxxx xxxx |ORCID=xxxx-xxxx-xxxx-xxxx |GND=xxxxxx |SELIBR=xxxxxx |SUDOC=xxxxxxxxx |BNF=xxxxxx |BPN=xxxxx |RID=xxxxx |BIBSYS=xxxxx |ULAN=xxxxx |MBA=xxxxxx |NLA=xxxxxxx |NDL=xxxxxxxx}}

To new inline Wikipedia markup:

{{Authority control}}

31 of 33

Opportunity:

  • Rich local collections
  • Autonomy
  • Authority control
  • Globally visible

32 of 33

Amplify Canadian music in Wikipedia!

33 of 33