1 of 39

Global Open Data Index 2015

Coordinators workshop

2 of 39

Present yourself!

Before we start -

Who are you?

Where are you from?

What is your favourite sweet?

3 of 39

What will we cover today?

  • What is the index?
  • How does the Index work?
  • How to submit a result?
  • How to get people involved?
  • How to get help?

4 of 39

What is the Global Open Data Index?

The Global Open Data Index collects and presents information on the current state of open data release around the world. The Global Open Data Index is run by Open Knowledge with the assistance of volunteers from the Open Knowledge Network and around the world.

5 of 39

What makes the index?

  • PEOPLE!
    • It is by the community (public consultations, crowdsourced)
    • For the community (advocacy, education)
  • Easy
    • 15 datasets
    • 9 questions

6 of 39

The Index assumptions

Assumption 1: Open Data is defined by the Open Definition

“Open means anyone can freely access, use, modify, and share for any purpose(subject, at most, to requirements that preserve provenance and openness).”

Assumption 2: The role of government in publishing data

for the key datasets we survey, the national government has a responsibility to ensure the open publication of such data even if is it held and managed by a third-party. Therefore, even if the data is not produced by the government, we see it as responsible to ensure the open publication of the data.

Assumption 3: National government as aggregator of data �Federal (or national) government is accountable for the open publication by all its sub-governments.

7 of 39

8 of 39

How does the index work?

  • First step - survey
  • Second step - review
  • Third step - government response
  • Fourth step - publish

9 of 39

First step - Survey

Users can do the following:

  • Add information about new places and datasets
  • Update last year’s submissions
  • Comment on this year’s submission

10 of 39

Datasets

This year, we refined the definition of datasets:

  • Describe the dataset by at least 3 key data characteristics it must have.

Elections results has three qualities (“Results by constituency / district for all major national electoral contests”).

  1. The results
  2. Geographical data
  3. Candidate data
  4. Include how often the dataset needs to be updated.
  5. Aggregation. Mention which aggregation level the data needs to be in. Some datasets can be in more than one aggregation level and mentioning the aggregation level can help to avoid confusion between datasets.

11 of 39

This year’s datasets

National Statistics

Government Budget

Government Spending

Legislation

Election Results

National Map

Pollutant Emissions

Company Register

Transport Timetables

Location datasets (Zipcodes OR administrative boundaries)

Government procurement tenders (past and present)

Water Quality

Weather forecast

Health Performance data

Land Ownership

For the full definition of the datasets click here!

12 of 39

National Statistics

Key national statistics such as demographic and economic indicators (GDP, unemployment, population, etc). To satisfy this category, the following minimum criteria must be met:�

- GDP for the whole country updated at least quarterly

- Unemployment statistics updated at least monthly

- Population updated at least once a year

13 of 39

National Budget

National government budget at a high level. This category is looking at budgets, or the planned government expenditure for the upcoming year, and not the actual expenditure. To satisfy this category, the following minimum criteria must be met:

- Planned budget divided by government department and sub-department

- Updated once a year.

- The budget should include descriptions regarding the different budget sections.

14 of 39

National spending

Records of actual (past) national government spending at a detailed transactional level. A database of contracts awarded or similar will *not* be considered sufficient. This data category refers to detailed ongoing data on *actual* expenditure. Data submitted in this category should meet the following minimum criteria:

- Individual record of transactions

- Date of the transactions

- Government office which had the transaction

- Name of vendor

- Amount of the transaction

15 of 39

Legislation

This data category requires all national laws and statutes available to be available online, although it is not a requirement that information on legislative behaviour e.g. voting records is available. To satisfy this category, the following minimum criteria must be met:

- Content of the law / statutes

- If applicable, all relevant amendments to the law

- Date of last amendments

- Data should be updated at least quarterly

16 of 39

Elections results

This data category requires results by constituency / district for all major national electoral contests. To satisfy this category, the following minimum criteria must be met:

- Result for all major electoral contests

- Number of registered votes

- Number of invalid votes

- Number of spoiled ballots

- All data should be reported at the level of the polling station

17 of 39

National Map

This data category requires a high level national map. To satisfy this category, the following minimum criteria must be met:

- Scale of 1:250,000 (1 cm = 2.5km).

- Markings of national roads

- National borders

- Marking of streams, rivers, lakes, mountains.

- Updated at least once a year.

18 of 39

Pollutant Emissions

Aggregate data about the emission of air pollutants, especially those potentially harmful to human health (although it is not a requirement to include information on greenhouse gas emissions). Aggregate means national-level or available for at least three major cities. In order to satisfy the minimum requirements for this category, data must be available for the following pollutants and meet the following minimum criteria:

- Particulate matter (PM) Levels

- Sulphur oxides (SOx)

- Nitrogen oxides (NOx)

- Volatile organic compounds (VOCs)

- Carbon monoxide (CO)

- Updated at least once a week.

- Measured either at a national level by regions or at leasts in 3 big cities.

19 of 39

Company Register

List of registered (limited liability) companies. The submissions in this data category do not need to include detailed financial data such as balance sheet, etc. To satisfy this category, the following minimum criteria must be met:

- Name of company

- Unique identifier of the company

- Company address

- Updated at least once a month

20 of 39

Transport Timetables

Timetables of major government operated (or commissioned) *national-level* public transport services (specifically bus and train). The focus here is on national level services (not those which operate *only* at a municipal or city level and which are not controlled or regulated by the national government). A 'yes' in any question will refer to both types of transport. However, if there is no national level service operated or regulated by the government for a given type of transport (for instance busses), then this type is ignored in this data category. Data submitted in this category should meet the following minimum criteria:

- Time of operating

- Time of leaving first station and arriving to the last station

- Updated at least once a year

21 of 39

Location datasets

A database of postcodes/zipcodes and the corresponding spatial locations in terms of a latitude and a longitude (or similar coordinates in an openly published national coordinate system). If a postcode/zipcode system does not exist in the country, please submit a dataset of administrative borders. Data submitted in this category must satisfy the following minimum conditions:

- Zipcodes

* Address

* Coordinate (latitude longitude)

* national level

* updated once a year

- Administrative boundaries

* Boarders poligone

* name of poligone (city, neighborhood)

* national level

* updated once a year

22 of 39

Government procurement tenders

All tenders and awards of the national/federal government aggregated by office. Monitoring tenders can help new groups to participate in tenders and increase government compliance. Data submitted in this category must be aggregated by office, updated at least monthly & satisfy the following minimum criteria:

- Tenders

* tenders name

*tender description

*tender status

- Awards:

* Award title

* Award description

* value of the award

* suppliers name

23 of 39

Water Quality

Data, measured at the water source, on the quality of water is essential for both the delivery of services and the prevention of diseases. In order to satisfy the minimum requirements for this category, data should be available on level of the following chemicals by water source and be updated at least weekly:

- fecal coliform

- arsenic

- fluoride levels

- nitrates

- TDS (Total dissolved solids)

24 of 39

Weather forecast

5 days forecast of temperature, precipitation and wind as well as recorded data for temperature, wind and precipitation for the past year. In order to satisfy the minimum requirements for this category, data submitted should meet the following criteria:

- 5 days forecast of temperature updated daily

- 5 days forecast of wind updated daily

- 5 days forecast of precipitation updated daily

- Historical temperature data for the past year

25 of 39

Health Performance data

Geo location of public hospitals and health facilities with opening hours and infectious diseases rate, updated at least once a year. Data submitted in this category must include the following:

- Location of public hospitals and clinics.

- Data on infectious diseases rates in a country.

This is experimental dataset - Special attention here!

26 of 39

Land Ownership

Cadaster showing land ownership data on a map and include all metadata on the land. Cadaster data submitted in this category must include the following characteristics:

- Land borders

- Land owners name

- Land size

- National level

- Be updated yearly

27 of 39

28 of 39

The index questions

  1. Does the data exist?
  2. Is the data in digital form?
  3. Publicly available?
  4. Is the data available for free?
  5. Is the data available online?
  6. Is the data machine- readable?
  7. Available in bulk?
  8. Openly licensed?
  9. Is the data provided on a timely and up to date basis

29 of 39

Survey flow and Conditions

30 of 39

Second step - Review

  • Coordinators will look for pitfalls in submission (we will look at it later)
  • Reviewers will review all countries per theme (not country review)

31 of 39

Third step - government response

We will try to send our results for government response, this will be done in order to see if we missed something. The final call however, is in the hand of the reviewers.

32 of 39

Fourth step publish

  • First the Index will be published under embargo to journalist.
  • We will (hopefully) launch the GODI 2015 in OGP.
  • We will also promote it throughout the network.

33 of 39

The role of the lndex coordinator

  • Publicise the Index & Reach out to community members in region Solicit Contribution from local Open Data Communities/networks in target countries
  • Work closely with the Open Data Index team on regional outreach strategy as well as the expert reviewers
  • Helping to organise Open Data Index submission events/sessions at local/region events
  • Translation of final materials/press release etc.sh

34 of 39

35 of 39

How to get people to be involved?

  1. We will give each of you a list of leads that we had from last year.
  2. Look for people who are interested in Open Data, but also in thematic issues like environment. Remember, a place can have multiple submitters.
  3. Social Media - direct tweets are useful! Send a request on slack for tweeting from the OKFN account.

36 of 39

If I need help?

  1. Use the slack channel to ask questions.
  2. Use or direct submitters to the forum -

https://discuss.okfn.org/c/open-data-index

37 of 39

Other resources

38 of 39

39 of 39

Let’s work on a dataset together!