ABCDEFGHIJ
1
Education Index Dataset - v1 via Gapminder
2
About this fileThis file has multiple sheets with data for one or more indicators used by Gapminder. In this sheet below you'll first find an overview of the indicators (measures) and the list of underlying sources. The actual data we use, is found in the sheet(s) labeled "data-...".
This file is also a documentation of the data process. To follow how the data was transformed from the original sources, start in the sheet to the far right, which holds the input data. You can then follow the process step by step, by looking at the formulas in the sheets from right to left, until you reach the output in the "data-..." sheets.
3
Version:v1
4
Updated:September 5 2020
5
Download latest version:Excel file »
6
Latest version online:http://gapm.io/deducation_idx
7
Contributor(s) to this version: Claudia S.
8
FeedbackPlease give feedback here
9
10
#Indicator(s)DescriptionFull nameUnitIDtypeUsage
11
1Mean years of schoolingAverage years of schooling based on Lee-Lee(2016), Barro-Lee(2018) and UNDP(2018)Mean years of total schooling across all education levels.yearsavg_years_schoolmeasure2
12
2OWID Education IndexEducation index calculated based on Avg years of schooling, taking values 0 as minimum and 15 as maximumOWID Education Index%owid_education_idxmeasure2
13
14
Sources
15
Dataset description:The Average years of schooling index combines figures from three published dataset:
For the period 1870-1949 inclusive, the estimates correspond to population aged 25-64, and are taken from Lee-Lee (2016). For the period 1950-1990 inclusive, the estimates correspond to population aged 25+, and are taken from Barro-Lee (2018). For the period 1991-2017 inclusive, the estimated correspond to population 25+, and are taken from the UNDP, HDR (2018).
The Education index should use the expected years of schooling and the mean years of schooling, but for this dataset it is only considering the mean years of schooling.
16
Link to documentation:http://gapm.io/deducation_idx
17
Short source summary:OWID
18
19
#Source idNameLink
20
1OWIDOWID - Years of Schooling - based on Lee-Lee (2016), Barro-Lee (2018) and UNDP (2018)
https://github.com/owid/owid-datasets/blob/master/datasets/Years%20of%20Schooling%20-%20based%20on%20Lee-Lee%20(2016)%2C%20Barro-Lee%20(2018)%20and%20UNDP%20(2018)/Years%20of%20Schooling%20-%20based%20on%20Lee-Lee%20(2016)%2C%20Barro-Lee%20(2018)%20and%20UNDP%20(2018).csv
21
22
License
23
Attribution:
We believe in free knowledge and therefor we share free data. Most sheets in this file are provided under the open license, called Creative Commmon Attribution License CC BY 4.0., except those sheets mentioned in the exceptions section below. This means you can freely use, copy, and spread the data in those sheets, as long as you mention the following: 'Free data from Gapminder.org'.
You should also mention the underlaying data sources listed above and include this link: http://gapm.io/deducation_idx
24
License link:Creative Common License CC BY 4.0
25
Exceptions:
The sheets starting with the word "data" are covered by this license. Other sheets are included for documentation purpose, and may include data that is governed by other licenses. Check the underlying sources for the specific licenses in these cases.
26
27
VersionsLinkChanges compared to previousDateContributors
28
v1
https://docs.google.com/spreadsheets/d/1gK1UfJMvoeaK28f9HiqdeQ3cOU3V-EfImnsX_CrVFrA/edit#gid=501532268
First version2020 September 5Claudia S.
29
30
Technical stuff
31
Dataset name:Education Index
32
Dataset id:education_idx
33
Doc urlhttps://docs.google.com/spreadsheets/d/1gK1UfJMvoeaK28f9HiqdeQ3cOU3V-EfImnsX_CrVFrA/edit#gid=501532268
34
Doc id of work doc spreadsheet1gK1UfJMvoeaK28f9HiqdeQ3cOU3V-EfImnsX_CrVFrA
35
FormulasThe formulas in this workbook may be referring to other spreadsheets online, by their named ranges, and not by sheet names. Search for "named ranges" to see how to use those instead of cell ranges.
36
For developersIf you like the data we use into your own products, it's better if you fetch data from our standardized gitHub repo on https://open-numbers.github.io
These spreadsheets are part of Gapminder's data compilation process and allow end users to track how we combine data.
37
Read more:gapm.io/dataworks
38
CHART PREVIEWS
39
[c]
data-for-countries-etc-by-year
https://www.gapminder.org/tools/#$state$time$dim=time;&entities$dim=geo;&entities_colorlegend$dim=geo;&marker$axis_x$which=income_per_person_gdppercapita_ppp_inflation_adjusted&scaleType=log&spaceRef:null;&axis_y$data=data_&which=Mean%20years%20of%20schooling&spaceRef:null;&label$which=name&scaleType=ordinal;&size$which=population_total&use=indicator&scaleType=linear;&color$which=_default&use=constant&scaleType=ordinal;;;&data$reader=ddfbw&service=https:////big-waffle.gapminder.org&dataset=sg-master&translateContributionLink:///crowdin.com//project//systema-globalis;&data_$reader=google_csv&path=https:////docs.google.com//spreadsheets//d//1gK1UfJMvoeaK28f9HiqdeQ3cOU3V-EfImnsX_CrVFrA//gviz//tq?tqx=out:csv/&sheet=data-for-countries-etc-by-year&hasNameColumn:true&nameColumnIndex:1;&chart-type=bubbles
40
Mean%20years%20of%20schooling
41
DDF mapping:schema for indicator table
42
concept_id6
43
name_short2
44
name4
45
description3
46
unit5
47
type7
48
usage8
49
50
Catalog statusIndicator IDTime unit
Countries etc
RegionsIn. LevelsWorld
51
Mean years of schoolingavg_years_schoolyearBAD: Request failed for https://spreadsheets.google.com returned code 404. Truncated server response: <!DOCTYPE html><html lang="en"><head><meta name="description" content="Web word processing, presentations and spreadsheets"><meta name="viewport" c... (use muteHttpExceptions option to examine full response)BAD: Request failed for https://spreadsheets.google.com returned code 404. Truncated server response: <!DOCTYPE html><html lang="en"><head><meta name="description" content="Web word processing, presentations and spreadsheets"><meta name="viewport" c... (use muteHttpExceptions option to examine full response)BAD: Request failed for https://spreadsheets.google.com returned code 404. Truncated server response: <!DOCTYPE html><html lang="en"><head><meta name="description" content="Web word processing, presentations and spreadsheets"><meta name="viewport" c... (use muteHttpExceptions option to examine full response)BAD: Request failed for https://spreadsheets.google.com returned code 404. Truncated server response: <!DOCTYPE html><html lang="en"><head><meta name="description" content="Web word processing, presentations and spreadsheets"><meta name="viewport" c... (use muteHttpExceptions option to examine full response)
52
53
#Validation
54
1output-sheetsGOOD: There is at least one output sheet present (sheets starting with 'data-for-' and not ending with '-in-columns')
55
2
output-sheet:data-for-world-by-year
GOOD: The 'data-for-world-by-year' output sheet has at least 4 header columns
56
3
output-sheet:data-for-world-by-year
GOOD: The 'data-for-world-by-year' output sheet does not have filter mode turned on (since it breaks the CSV endpoint)
57
4
output-sheet:data-for-regions-by-year
GOOD: The 'data-for-regions-by-year' output sheet has at least 4 header columns
58
5
output-sheet:data-for-regions-by-year
GOOD: The 'data-for-regions-by-year' output sheet does not have filter mode turned on (since it breaks the CSV endpoint)
59
6
output-sheet:data-for-countries-etc-by-year
GOOD: The 'data-for-countries-etc-by-year' output sheet has at least 4 header columns
60
7
output-sheet:data-for-countries-etc-by-year
GOOD: The 'data-for-countries-etc-by-year' output sheet does not have filter mode turned on (since it breaks the CSV endpoint)
61
8versionGOOD: Named range 'version' exists
62
9versionGOOD: 'Version:' is filled in
63
10versionGOOD: The version at 'Version:' starts with a v, followed by an integer
64
11dateGOOD: Named range 'date' exists
65
12dateGOOD: 'Updated:' is filled in
66
13gapmioGOOD: Named range 'gapmio' exists
67
14gapmioGOOD: 'Latest version online:' is filled in
68
15contributorsGOOD: Named range 'contributors' exists
69
16contributorsGOOD: 'Contributor(s) to this version:' is filled in
70
17indicator_tableGOOD: Named range 'indicator_table' exists
71
18indicator_tableGOOD: The named range 'indicator_table' covers the whole Indicator(s) table (the rows immediately above and below the table are empty)
72
19indicator_table:row_1GOOD: This first column of row '1' in the indicator(s) table is incremental (from 1 and up)
73
20indicator_table:row_1GOOD: Indicator 1 has a short indicator name (Column 2)
74
21
indicator_table:row_1:data-for-world-by-year
GOOD: The indicator name cell of indicator 1 is referenced in the 'data-for-world-by-year' output sheet in column 4 as "=ABOUT!C11"
75
22
indicator_table:row_1:data-for-regions-by-year
GOOD: The indicator name cell of indicator 1 is referenced in the 'data-for-regions-by-year' output sheet in column 4 as "=ABOUT!C11"
76
23
indicator_table:row_1:data-for-countries-etc-by-year
GOOD: The indicator name cell of indicator 1 is referenced in the 'data-for-countries-etc-by-year' output sheet in column 4 as "=ABOUT!C11"
77
24indicator_table:row_1GOOD: Indicator 1 has a description (Column 3)
78
25indicator_table:row_1GOOD: Indicator 1 has a full name (Column 4)
79
26indicator_table:row_1GOOD: Indicator 1 has a unit (Column 5)
80
27indicator_table:row_1GOOD: Indicator 1's unit does not start or end with a space
81
28indicator_table:row_1GOOD: Indicator 1 has an ID (Column 6)
82
29indicator_table:row_1GOOD: Indicator 1's ID contains only lowercase latin characters (a-z) or numbers, and no space, dashes or underscores. (Column 6)
83
30indicator_table:row_1GOOD: Indicator 1's ID has less than or equal to 20 characters
84
31indicator_table:row_1GOOD: Indicator 1 has a type set (Column 7)
85
32indicator_table:row_1GOOD: Indicator 1 has a usage level set (Column 8)
86
33indicator_table:row_2GOOD: This first column of row '2' in the indicator(s) table is incremental (from 1 and up)
87
34indicator_table:row_2GOOD: Indicator 2 has a short indicator name (Column 2)
88
35
indicator_table:row_2:data-for-world-by-year
GOOD: The indicator name cell of indicator 2 is referenced in the 'data-for-world-by-year' output sheet in column 5 as "=ABOUT!C12"
89
36
indicator_table:row_2:data-for-regions-by-year
GOOD: The indicator name cell of indicator 2 is referenced in the 'data-for-regions-by-year' output sheet in column 5 as "=ABOUT!C12"
90
37
indicator_table:row_2:data-for-countries-etc-by-year
GOOD: The indicator name cell of indicator 2 is referenced in the 'data-for-countries-etc-by-year' output sheet in column 5 as "=ABOUT!C12"
91
38indicator_table:row_2GOOD: Indicator 2 has a description (Column 3)
92
39indicator_table:row_2GOOD: Indicator 2 has a full name (Column 4)
93
40indicator_table:row_2GOOD: Indicator 2 has a unit (Column 5)
94
41indicator_table:row_2GOOD: Indicator 2's unit does not start or end with a space
95
42indicator_table:row_2GOOD: Indicator 2 has an ID (Column 6)
96
43indicator_table:row_2GOOD: Indicator 2's ID contains only lowercase latin characters (a-z) or numbers, and no space, dashes or underscores. (Column 6)
97
44indicator_table:row_2GOOD: Indicator 2's ID has less than or equal to 20 characters
98
45indicator_table:row_2GOOD: Indicator 2 has a type set (Column 7)
99
46indicator_table:row_2GOOD: Indicator 2 has a usage level set (Column 8)
100
47dataset_descriptionGOOD: Named range 'dataset_description' exists