Global Open Data Index 2015
Coordinators workshop
Present yourself!
Before we start -
Who are you?
Where are you from?
What is your favourite sweet?
What will we cover today?
What is the Global Open Data Index?
The Global Open Data Index collects and presents information on the current state of open data release around the world. The Global Open Data Index is run by Open Knowledge with the assistance of volunteers from the Open Knowledge Network and around the world.
What makes the index?
The Index assumptions
Assumption 1: Open Data is defined by the Open Definition
“Open means anyone can freely access, use, modify, and share for any purpose(subject, at most, to requirements that preserve provenance and openness).”
Assumption 2: The role of government in publishing data
for the key datasets we survey, the national government has a responsibility to ensure the open publication of such data even if is it held and managed by a third-party. Therefore, even if the data is not produced by the government, we see it as responsible to ensure the open publication of the data.
Assumption 3: National government as aggregator of data �Federal (or national) government is accountable for the open publication by all its sub-governments.
How does the index work?
First step - Survey
Users can do the following:
Datasets
This year, we refined the definition of datasets:
Elections results has three qualities (“Results by constituency / district for all major national electoral contests”).
This year’s datasets
National Statistics
Government Budget
Government Spending
Legislation
Election Results
National Map
Pollutant Emissions
Company Register
Transport Timetables
Location datasets (Zipcodes OR administrative boundaries)
Government procurement tenders (past and present)
Water Quality
Weather forecast
Health Performance data
Land Ownership
For the full definition of the datasets click here!
National Statistics
Key national statistics such as demographic and economic indicators (GDP, unemployment, population, etc). To satisfy this category, the following minimum criteria must be met:�
- GDP for the whole country updated at least quarterly
- Unemployment statistics updated at least monthly
- Population updated at least once a year
National Budget
National government budget at a high level. This category is looking at budgets, or the planned government expenditure for the upcoming year, and not the actual expenditure. To satisfy this category, the following minimum criteria must be met:
- Planned budget divided by government department and sub-department
- Updated once a year.
- The budget should include descriptions regarding the different budget sections.
National spending
Records of actual (past) national government spending at a detailed transactional level. A database of contracts awarded or similar will *not* be considered sufficient. This data category refers to detailed ongoing data on *actual* expenditure. Data submitted in this category should meet the following minimum criteria:
- Individual record of transactions
- Date of the transactions
- Government office which had the transaction
- Name of vendor
- Amount of the transaction
Legislation
This data category requires all national laws and statutes available to be available online, although it is not a requirement that information on legislative behaviour e.g. voting records is available. To satisfy this category, the following minimum criteria must be met:
- Content of the law / statutes
- If applicable, all relevant amendments to the law
- Date of last amendments
- Data should be updated at least quarterly
Elections results
This data category requires results by constituency / district for all major national electoral contests. To satisfy this category, the following minimum criteria must be met:
- Result for all major electoral contests
- Number of registered votes
- Number of invalid votes
- Number of spoiled ballots
- All data should be reported at the level of the polling station
National Map
This data category requires a high level national map. To satisfy this category, the following minimum criteria must be met:
- Scale of 1:250,000 (1 cm = 2.5km).
- Markings of national roads
- National borders
- Marking of streams, rivers, lakes, mountains.
- Updated at least once a year.
Pollutant Emissions
Aggregate data about the emission of air pollutants, especially those potentially harmful to human health (although it is not a requirement to include information on greenhouse gas emissions). Aggregate means national-level or available for at least three major cities. In order to satisfy the minimum requirements for this category, data must be available for the following pollutants and meet the following minimum criteria:
- Particulate matter (PM) Levels
- Sulphur oxides (SOx)
- Nitrogen oxides (NOx)
- Volatile organic compounds (VOCs)
- Carbon monoxide (CO)
- Updated at least once a week.
- Measured either at a national level by regions or at leasts in 3 big cities.
Company Register
List of registered (limited liability) companies. The submissions in this data category do not need to include detailed financial data such as balance sheet, etc. To satisfy this category, the following minimum criteria must be met:
- Name of company
- Unique identifier of the company
- Company address
- Updated at least once a month
Transport Timetables
Timetables of major government operated (or commissioned) *national-level* public transport services (specifically bus and train). The focus here is on national level services (not those which operate *only* at a municipal or city level and which are not controlled or regulated by the national government). A 'yes' in any question will refer to both types of transport. However, if there is no national level service operated or regulated by the government for a given type of transport (for instance busses), then this type is ignored in this data category. Data submitted in this category should meet the following minimum criteria:
- Time of operating
- Time of leaving first station and arriving to the last station
- Updated at least once a year
Location datasets
A database of postcodes/zipcodes and the corresponding spatial locations in terms of a latitude and a longitude (or similar coordinates in an openly published national coordinate system). If a postcode/zipcode system does not exist in the country, please submit a dataset of administrative borders. Data submitted in this category must satisfy the following minimum conditions:
- Zipcodes
* Address
* Coordinate (latitude longitude)
* national level
* updated once a year
- Administrative boundaries
* Boarders poligone
* name of poligone (city, neighborhood)
* national level
* updated once a year
Government procurement tenders
All tenders and awards of the national/federal government aggregated by office. Monitoring tenders can help new groups to participate in tenders and increase government compliance. Data submitted in this category must be aggregated by office, updated at least monthly & satisfy the following minimum criteria:
- Tenders
* tenders name
*tender description
*tender status
- Awards:
* Award title
* Award description
* value of the award
* suppliers name
Water Quality
Data, measured at the water source, on the quality of water is essential for both the delivery of services and the prevention of diseases. In order to satisfy the minimum requirements for this category, data should be available on level of the following chemicals by water source and be updated at least weekly:
- fecal coliform
- arsenic
- fluoride levels
- nitrates
- TDS (Total dissolved solids)
Weather forecast
5 days forecast of temperature, precipitation and wind as well as recorded data for temperature, wind and precipitation for the past year. In order to satisfy the minimum requirements for this category, data submitted should meet the following criteria:
- 5 days forecast of temperature updated daily
- 5 days forecast of wind updated daily
- 5 days forecast of precipitation updated daily
- Historical temperature data for the past year
Health Performance data
Geo location of public hospitals and health facilities with opening hours and infectious diseases rate, updated at least once a year. Data submitted in this category must include the following:
- Location of public hospitals and clinics.
- Data on infectious diseases rates in a country.
This is experimental dataset - Special attention here!
Land Ownership
Cadaster showing land ownership data on a map and include all metadata on the land. Cadaster data submitted in this category must include the following characteristics:
- Land borders
- Land owners name
- Land size
- National level
- Be updated yearly
The index questions
Survey flow and Conditions
Second step - Review
Third step - government response
We will try to send our results for government response, this will be done in order to see if we missed something. The final call however, is in the hand of the reviewers.
Fourth step publish
The role of the lndex coordinator
How to get people to be involved?
If I need help?
https://discuss.okfn.org/c/open-data-index
Other resources
Let’s work on a dataset together!