Learning to generate one-sentence biographies from Wikidata
Andrew Chisholm, Will Radford, Ben Hachey
School of Information Technologies
University of Sydney
The Task
Title | Mathias Tuomi |
Gender | male |
Date of birth | 1985-09-03 |
Occupation | squash player |
Citizenship | finland |
Matias Tuomi, (born September 30, 1985 in Espoo) is a professional squash player who represents Finland.
Relations
Text
2
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
The Plan
3
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Motivation
4
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Motivation
5
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Motivation » Consistency checking
Barack Hussein Obama II (born August 4, 1961 in Hawaii) is an American politician who served as the 44th President of the United States from 2009 to 2017.
Barack Hussein Obama II (born August 4, 1961 in Kenya) is an American politician who served as the 44th President of the United States from 2009 to 2017.
Relation | Value |
Title | Barack Obama |
Gender | male |
Date of birth | 1961-08-04 |
Place of birth | Hawaii |
Occupation | ... |
6
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Creating a Dataset
7
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Creating a dataset » Sources
8
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Creating a dataset » Constraints
9
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Dataset » Relation coverage
10
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Creating a dataset » Constraints
11
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Dataset » Task complexity
Robert Charles Cortner (April 16, 1927 – May 19, 1959) was an American automobile racing driver from Redlands, California.�
Barry MacKay (8 January 1906 – 12 December 1985) was a British actor.�
Joseph "Flip" Nuñez (August 27, 1931 – November 3, 1995) was an American jazz pianist, composer, and vocalist of Filipino descent.
12
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Dataset » Language modelling benchmark
13
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling
14
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling » Baseline
TITLE, known as GIVEN NAME, (born DATE OF BIRTH in PLACE OF BIRTH; died DATE OF DEATH in PLACE OF DEATH) is an POSITION HELD and OCCUPATION from CITIZENSHIP.
15
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling » Neural model
16
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling » Fact linearization
Source Language: Linearized Facts (Vinyals et al., 2015, Gillick et al., 2016, Xiao et al., 2016)�
#TITLE matias tuomi #SEX_OR_GENDER male #DATE_OF_BIRTH 1985 09 03 #OCCUPATION squash player #CITIZENSHIP finland�
Target Language: English
matias tuomi , ( born september 30 , 1985 in espoo ) is a professional squash player who represents finland .
17
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling » Base sequence-to-sequence model
18
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling » Constraining generation
19
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Modelling » Sequence-to-sequence Auto-encoding
20
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Examples
21
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Examples
22
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Examples
23
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation
24
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation
25
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Reference similarity
26
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Human preference
27
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Crowd Task
28
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Human preference
29
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Content Selection
30
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Comparing to Wikidata
31
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Evaluation » Comparing to Wikipedia
32
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Summing up
33
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Thanks!
Code + Data: github.com/andychisholm/mimo
34
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Examples
35
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
That’s it!
Relations
Text
Facts
36
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
37
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Annotation challenges
38
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Dataset » Relation Occurrence
39
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey
Human Preference Evaluation
40
Learning to generate one-sentence biographies from Wikidata
EACL 2017
Chisholm, Radford, Hachey