ABCDEFGHIJKLMNOPQRSTUVWXYZ
1
Billionaires Dataset - v2 via Gapminder
2
About this fileThis file has multiple sheets with data for one or more indicators used by Gapminder. In this sheet below you'll first find an overview of the indicators (measures) and the list of underlying sources. The actual data we use, is found in the sheet labeled "eo_final".
This file is also a documentation of the data process. To follow how the data was transformed from the original sources, start in the sheet to the far right, which holds the input data. You can then follow the process step by step, by looking at the formulas in the sheets from right to left, until you reach the output in the "eo_final" sheet.
3
Version:v2
4
Updated:September 6 2022
5
Download latest version:Excel file »
6
Contributor(s) to this version: Hisham N.
7
8
#Indicator(s)DescriptionFull nameUnitIDtypeUsage
9
1Gapminder IDA unique Gapminder Id for a billionaire listed in one or more sources (Hurun, Forbes)Gapminder IDStringgm_idcategory1
10
2
11
Sources
12
Dataset description:Gapminder has scraped a list of billionaires from Forbes' 'The World's Billionaires' and Hurun's 'Hurun Global Rich List' with the goal of combining both sources, and in that process, removing duplicates. Forbes has a detailed profile of each billionaire that has been on the list with their full name, age, source of wealth, country, and their most recent net worth. According to Forbes, they keep track of each billionaires' moves and take into account their assets, including stakes in public and private companies, real estate, yachts, art, and cash. Hurun also provides information of the billionaires' full name, age, the name of the company they work for, and their industry. Gapminder has then given a unique ID to each of the listed billionaire (as a combination of their country code, birth year, and full name) to first identify the matches and then filter through unmatched names to check for existing duplicates.
The final output, in the 'eo_final' sheet is a 3-column table, with a unique gapminder id, Forbes id, and Hurun id, replaced with 'n/a' if the name was only visible in one of the sources. Keep in mind that the data is not listed in alphabetical order -- as they are manually pasted from different 3-column table in the 'eo_draft' sheet. You can follow through the various waves of filtering from the sheet to the right.
13
Link to documentation:https://docs.google.com/document/d/1D5E093Wa__rQo7xbZHawpRaHLL2PQGGpF6BHMsfBdTo/edit?usp=sharing
14
Short source summary:Gapminder based on Billionaires list from Forbes and Hurun
15
16
#Source idName
17
1ForbesBillionaires' information scraped from Forbes World's Billionaires List
18
2HurunBillionaires' information scraped from Hurun Global Rich List
19
20
License
21
Attribution:#NAME?
22
License link:Creative Common License CC BY 4.0
23
Exceptions:
The sheets starting with the word "data" are covered by this license. Other sheets are included for documentation purpose, and may include data that is governed by other licenses. Check the underlying sources for the specific licenses in these cases.
24
25
VersionsLink
26
v1
https://docs.google.com/spreadsheets/d/1ITY5YjQIs4ypUdjZVe8Dcsa8tZolIMhcWXzz81Ada9M/edit#gid=671387805
27
28
29
30
Validation
31
output-sheetGOOD: output sheet present: eo_final
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100