50+ Reasons Not to Release Open Data
For the background on this document, see this blog post.
Update: We have rolled out these rebuttals as blog posts, which you can view here: https://sunlightfoundation.com/taxonomy/term/why-open-data/
Thanks so much to all of you who contributed ideas for this project throughout its evolution! We couldn't have done it without the help of so many different perspectives, and we feel the end result is richer for it. Join the continuing conversation with #whyopendata.
Index of categories (click to jump to a topic):
Apathy/Confusion: The Problem of Newness
a. No one cares [1 - 3]
b. Why do this? [4 - 11]
The Challenges of Integrating New Systems (Or, Rage Against the Bureaucracy)
c. It's hard [12 - 18]
d. It's expensive [19 - 22]
People Power: Staff(ing) Concerns
e. It's not my job [23 - 25]
f. We don't have the people to do that [26 - 27]
Heavy Hitters: Legality, Security, Accuracy, and Privacy
g. We can't legally do that [28 - 31]
h. It will be misunderstood / misused / it's not accurate [32 - 37]
i. Privacy [38]
Open Data as a Work (Already?) in Progress (and Other Excuses)
j. It's already public [39 - 41]
k. Say what? / Misc. [42 - ##]
Apathy/Confusion: The Problem of Newness
No one cares
- Why would we do this?
- You mean people are actually interested in this?
- No one cares
- There is a lack of public interest
- There are few public requests for data to be open/for data in general
- Government is boring
- No one would be interested in it
- We don’t have community support
- We don’t have civic hackers
Why do this?
- There’s no clear use for {insert specific type of data here}
- I don't know what you could possibly need that for
- How would someone even use that kind of data?
- What kind of data do you want?
- We need more proven results before we can open data
- The old system works
- We don’t want to have you (academic researcher) doing research for research sake. We want your research to benefit us directly, otherwise we don’t want to give you our data.
- It would just confuse people.
- We should really be selling this.
The Challenges of Integrating New Systems (Or, Rage Against the Bureaucracy)
It's hard
- It’s hard
- I don’t know how to start
- I don’t know how to engage my government/my boss on this issue
- I don’t know who to talk to about it
- I don’t know how to organize on this issue
- We don't know how
- I honestly haven't ever heard of an API or (insert other data term here)
- You’ll need to write custom software / our system can’t do that (Getting access to Police incident logs from a computer aided dispatch system
- We don’t know how to address our backlog
- We don't know how to address our archives
- We won’t be able to keep up! / We don't have the time
- I don’t have time/no one will help me
- There will be unending data release
- … unending cost
- … exhaustion
- Our staff can’t handle new software/training/processes
- We don’t have the technology
- We don’t have a website
- We don't have the servers for this
- We would have to take our entire production system offline for a couple of days to program the system to get the data you want.
- There’s too much data
- It's unfamiliar
- I honestly don’t know what open data is
- We don’t have data
- I don’t know what data is available that can be opened
- It will take too much of our time.
It's expensive
- It’s expensive / We don't have the budget
- It requires expensive software
- It would be costly to provide copies of data in open formats
- It would require a lot of staff time
- It would require new processes and staff training
- Keeping everything updated would be costly
- We’re a small town/city/county/state/country
- We have a tight budget right now
- My Professional Incentives are to NOT Share
- When I share my work or products publically, I either have to take an additional time-consuming step or potentially end up creating new work that doesn’t actually lead to any personal rewards. Will I get paid more if I open government data? Will I get promoted faster? Will I be able to leave work earlier or receive good recognition from my supervisors? The answer to all of these questions is usually “No”. If open data is not part of a compensated job function, there shouldn’t be any reasonable expectation that it will get done.
People Power: Staff(ing) Concerns
It's not my job
- It's not my job
- That's someone else's job
- My boss hasn't told me to and you're not my boss
- My bosses/the politicians don’t like it
We don't have the people to do that
- We don’t have/can’t afford a CTO/CDO/CIO/other staffer to do that
- We don't have anyone who knows how to do that
Heavy Hitters: Legality, Security, Accuracy, and Privacy
We can't legally do that
- We don’t have the legal authority to make open data/records
- It's illegal
- It's proprietary
- We don't have the rights to do that
- It's not classified, but we don't think it would be good PR to open this
It will be misunderstood/misused
- If we put the data out there in bulk, people will alter it
- Hackers will do stuff with it
- People can steal it
- The data will be mishandled or misinterpreted
- We can’t make sure the government will still own the information if it’s open
- People might misuse it
- If we share our data/code, we’ll be hacked
- It might be presented in ways that result in people misunderstanding it
- The media will misreport
- People don’t understand my data.
- It’s complex/magical/for experts
- The data source is a mess
- It will be hard to parse and put online
- Releasing it like this will make us look bad
- The data might have errors or mistakes and could misinform the public
- We're not even sure how accurate the data is, so we don't see the point in sharing it
- There are problems with the data
Privacy
- PRIVACY!
- It's classified
- We're worried about the mosaic effect.
- We don't want any perception at all that the government is collecting information about the American people
It's already public
- It’s already public
- Anyone can access records during working hours at our government building
- It's available on LexisNexis or through another provider / You can buy it from X company for a fee
- That’s what FOIA is for
- We make enough available now
- We are an open government
Say what? / Misc.
- But we released APIs!
- We’re more interested in APIs
- Oh, you mean make a data portal? We have one of those.
- How can we verify the authenticity of the data?
- We spent a lot of [time, staff-power, resources, etc.] collecting this data
- Our agency is making money from selling the data
- Someone will repackage and sell the information
- That would really piss off industry
- We want to control who uses it and what gets done with it (even if it means nothing is done with it)
- Our competition will be able to do more with it than we do with it
- The federal government/other overlapping agency already has our data
- Another layer of government is already collecting this data -- get it from them, not us
- That would really piss off Congress
- It’s not a good time
- Moving too quickly may make us go too far and then be unable to return
- Data being used by privileged technologists and educated communities creates a further divide to access to information
- This just isn’t a priority now
- It's pending/not final/in progress
- We lost it (the data) / we don't have it
Additional reasons to potentially address:
From the document "Concerns about opening up data, and responses which have proved effective," available here: https://docs.google.com/document/d/1nDtHpnIDTY_G32EMJniXaOGBufjHCCk4VC9WGOf7jK4/mobilebasic?pli=1
We’ll get spam
- We don’t have to include email addresses in the data
- You already have those email addresses on the website
- We have a spam filter!
- We can make including your email opt in or opt out.
Our lawyers want to make a custom license[d]
- This usually comes from not explaining to the lawyers what you’re trying to do
- Often there’s chinese whispers via levels of management between techies and lawyers, see if you can talk with the lawyer directly, and may not be familiar with open licenses.
- Precedents can help. If you can show several peer organisations that have published under this open license you wish to use, people will be more confident.
If we publish this data, people might sue us
- If the concern is about information about people, make a policy and stick to it. eg. we’ll correct/remove things in 7 days. (This is already embodied in one of the Data Protection Principles in the UK)
- If the concern is about errors which impact business, case law may help, but there’s not much yet. (let us know when there is some!)
- Ensure you can state what data you would have been providing on any given day, eg. someone claims they couldn’t find the nearest hospital in your data and someone died. Very helpful to be able to prove/show what data you were providing at they day/time.
- If the data is already available on your website or via an app, then are you really increasing risk by opening the data?
We want people to come direct to us so we know why they want the data[g]
- This is similar to “People might misinterpret the data” if the intention is to help people satisfy their information need.
- Well, OK, can the *metadata* or some minimal subset at least be made open?
- If there’s the option to open up the data, chances are that the information would already be available under Freedom of Information rules, wherein requestors aren’t required to justify or explain themselves. This still allows the owner to find out on a “best efforts” basis, but it’s important to remember that they won’t be able to find out on every occasion.
- An example case is a comms department “find an expert” service for arranging media interviews with your staff. In this case, rather than provide direct contact information you can offer to make the contact information via a logged web-form or specific email address. This will be subverted if the people in the dataset is linked with your open phonebook dataset.
- This can be a tricky one as once you open your data you can’t track every instance of use. This can be very threatening to people who produce this data and need to be able to provide documented proof that they/their team are doing something of value, especially when staffing levels are being reduced. One option is to find someone in power to reassure them that their work is valued and that their management understand & accept that when the data is opened they are trading control for utility.