Agent-based modelling of the data economy
Lawrence Kay Senior Policy Advisor
lawrence.kay@
theODI.org
Sara Mahmoud Researcher
sara.mahmoud@
theODI.org
Nigel Shardlow
Planning Director
nigel@
sandtable.com
Lies Boelen
Data Scientist
lies@
sandtable.com
Agenda:
Agent-based modelling of the data economy
2
theODI.org
Agent-based modelling of the data economy: what is our aim?
To explore how agent-based models might help policy-makers understand the effects of data policy on the data economy.
Place your image over the grey box and crop accordingly
3
theODI.org
Agent-based modelling of the data economy: what will we produce?
A playable model of the data economy that helps policy-makers to explore some of its features, and guidance on how to develop an ABM as a non-specialist.
Place your image over the grey box and crop accordingly
4
theODI.org
Agent-based modelling of the data economy: what is the data economy?
Data is non-rivalrous and excludable, and innovative products and services are produced by a combination of data and human capital.
Place your image over the grey box and crop accordingly
5
theODI.org
Agent-based modelling of the data economy: what is the nature of policy-making for the data economy?
Policy questions for the data economy are probably in a complex environment, which has features of emergence and self-organization.
Place your image over the grey box and crop accordingly
6
theODI.org
Agent-based modelling of the data economy: which policies could we investigate?
Ethics
Organisational change
Portability
What are the effects of data being used more ethically?
What are the effects of organisations being able to better extract value from data?
What are the effects of portability and on competition and innovation, and what happens if we also develop data trusts?
7
theODI.org
Agent-based modelling of the data economy: why model?
We are building on the Blackett Review
Key themes
Source: Government Office for Science (2018) Computational Modelling: Technological Futures
Place your image over the grey box and crop accordingly
8
theODI.org
Agent-based modelling of the data economy: where are we starting with our model?
Source: Government Office for Science (2018) Computational Modelling: Technological Futures
Model purpose | Essential features | Risks |
Prediction | Anticipates unknown data | Conditions of application unclear |
Explanation | Uses plausible mechanisms to match outcome data in a well-defined manner | Model is ‘brittle’, so minor changes in the set-up result in a bad fit to explained data |
Understanding theory | Systematically maps out or establishes the consequences of some mechanisms | Mistakes in the model specification; inadequate coverage of possibilities |
Illustration | Shows an idea clearly | Over-interpretation to make theoretical or empirical claims |
Analogy | Maps to what is being modelled in a plausible but flexible way and provides new insights | Confusion between a way of thinking about something and the truth — this model gives no support to empirical claims |
9
theODI.org
Your help
We would like your insights, challenges, and questions on our model as it develops towards it final version in March 2019
Coming up...
11
Agent-Based Modelling: A Very Short Introduction
Thomas Schelling (1978) Micromotives and Macrobehaviour
Racial map of Baltimore
Image: Erik Fischer. Shared under Creative Commons Attribution-Sharealike 2.0 Generic license.
14
Segregation model
Image: Geraint Ian Palmer. Image shared with the permission of the author.
Each dot represents a family.
Families prefer to have at least x% of their neighbours the same colour as them
If this criterion isn’t met, they move until it is.
15
Segregation model
Each dot represents a family.
Families prefer to have at least 60% of their neighbours the same colour as them
If this criterion isn’t met, they move until it is.
Image: Geraint Ian Palmer. Image shared with the permission of the author.
16
Segregation model
Showing final states for a range of different values of same-colour percentage preference
Strong segregation emerges even at lower preference thresholds
Image: Geraint Ian Palmer. Image shared with the permission of the author.
17
Linking Macro and Micro
Source: Coleman, J (1990) Foundations of Social Theory
Micro-level decisions aggregate up to unexpected macro-level effects and patterns
Macro level phenomena can be explained in terms of the micromotives that are driving them
Increased levels of computer power and data availability make the study of the relationship between micro and macro tractable
18
Coleman’s Boat (or Bathtub)
Model Components (Boxology) - Schelling Model
19
ABM of the data economy
Goal of the model
To answer the question:
“What are the effects of data portability on competition and innovation?”
Boxology
Overview
Agent Attributes
Which attributes does a firm have?
Which attributes does a consumer have?
Behavioural rules
How can a firm’s attributes change?
Which rules affect consumer attributes?
Product quality
x
Value of data
Firms hold historical data from users, but this value depreciates over time.
The intuition here is that newly harvested consumer data is worth more than that harvested in the past.
Value of data (mathematically)
time
Example of the consumer utility function
?
?
?
Consumer
(Currently with Firm 1)
(Firm 1 portable with firms 2 and 3)
Consumer most likely to choose firm 2, since it has the highest net utility
How does a consumer choose a firm?
�����
How is data portability implemented? (1)
Firm A
Firm B
Current tick
time
How is data portability implemented? (2)
Firm A
Firm B
Current tick
time
Reminder ...
… data portability gives a boost to the utility function
So what does this look like?
See here for now
Consumers
Firms
Scenario
The privacy scenario
Example: World with two big firms
No privacy scenario
Privacy weight (Wpriv) = 4
Privacy scenario
tick 150
Privacy weight (Wpriv) = 4
Privacy scenario
tick 150
Privacy weight (Wpriv) = 6
Example: World with one big firm
No privacy scenario
Privacy weight (Wpriv) = 4
Privacy scenario
tick 150
Privacy weight (Wpriv) = 6
Privacy scenario
tick 150
Privacy weight (Wpriv) = 8
App
Possible future directions
What should go into the next iteration of the model?
(ctd)
Playtime
Group exercises
What’s next?
Get in touch
We would like your insights, challenges, and questions on our model as it develops towards it final version in March 2019. The next workshop will be in late February, or early March, 2019.
lawrence.kay@theodi.org
References
54