Features of the Data Landscape
(Big) Data & Data Assemblages
Ceilyn Boyd
ceilyn_boyd@harvard.edu
ceilyn.boyd@simmons.edu
NEASIST
11 January 2019
Navigating the Data Landscape - What features will you encounter?
2
Connecting data practice and research
3
Practice
Harvard Library
Research
Simmons University
(Big) Data &
Data Assemblages
?
Critical Data Studies & Philosophy of data
Research Data Management
Concepts
4
5 Key Concepts
(Big) Data
Measurements, pre-factual, pre-analytical pieces of information exhibiting 3Vs (volume, variety & velocity)
Assemblages
Non-hierarchical arrangements of components characteristized by extrinsic properties and relations
Data Assemblages
Sociotechnical infrastructures concerned with data
Research Data Management
Active management of data throughout the research and data lifecycle
Critical Data Studies
Interdisciplinary research area concerned with the critical, systematic investigation of data & data assemblages
5
Term: Critical Data Studies (CDS)
Interdisciplinary research area concerned with the critical, systematic investigation of data & data assemblages
6
Critical Data Studies (CDS)
7
Data
big data
Critical Theory
Discipline
small(er) data
Critical Data Studies
data assemblages
Social Theories
assemblage theory
Geography
(or LIS, RDM, etc.)
Foundation of Critical Data Studies
Origins
7 Guiding Principles
(Dalton & Thatcher, 2014)
(Oliphant, 2017)
8
Term: (Big) Data
9
Data are not pure or natural objects with an essence of their own. They exist in a context, taking on meaning from that context and the perspective of the beholder.
(Borgman, 2015, p. 18)
10
“
Big Data possess a suite of key traits: volume, velocity and variety (the 3Vs), but also exhaustivity, resolution, indexicality, relationality, extensionality and scalability...Our analysis reveals that the key definitional boundary markers are the traits of velocity and exhaustivity.
(Kitchin & McArdle, 2016, p. 1)
11
“
(Big) Data Characteristics
Data
Big Data
Data and datasets that exhibit some or all of the following properties:
(Kitchin & McArdle, 2016, p. 2)
12
Data: Small & Big
13
Characteristic | Small Data | Big Data |
Volume | Limited to large | Very large |
Exhaustivity | Samples | Entire populations |
Resolution & Indexicality | Coarse and weak, to tight and strong | Tight and strong |
Relationality | Weak to strong | Strong |
Velocity | Slow, freeze-framed, or bundled | Fast, continuous |
Variety | Limited to wide | Wide |
Extensibility & Scalability | Low to medium | High |
(Kitchin, 2014, p. 28)
(Kitchin & McArdle, 2016, p. 2)
(Bigger) Data in a (Harvard) Library Context
14
Characteristic | Examples |
Volume | Digitized & digital collections Born digital collections Electronic records Scientific research data |
Exhaustivity | Scientific research data |
Resolution & Indexicality | Scientific research data |
Relationality | Collections as Data Research data |
Velocity | Research data |
Variety | Born digital collections Digitized collections Metadata Research data |
Extensibility & Scalability | Born digital collections Collections as Data Research data |
Term: Assemblage
Non-hierarchical arrangement of components characteristized by extrinsic properties and relations
15
An assemblage is an arrangement of heterogeneous, autonomous components whose extrinsic properties and interactions give rise to a unique, enduring individual.
(Boyd, 2018, p. 5)
16
“
[T]he assemblage permits the researcher to speak of emergence, heterogeneity, the decentred and the ephemeral in nonetheless ordered social life.
(Marcus & Saka, 2006, p. 101)
17
“
Assemblage
Definition
Origins
Purpose & Use
Related Concepts & Theories
18
Assemblage Characteristics
Assemblages are...
Assemblages have...
19
Examples of Real-World Assemblages
20
21
Sesay, A., Oh, O.-O., & Ramirez, R. (2016). Understanding Sociomateriality through the Lens of Assemblage Theory: Examples from Police Body-Worn Cameras.
22
Diagram of an assemblage that emphasizes its machine-like, interlocking qualities
(Boyd, 2018)
23
Term: Data assemblage
Sociotechnical infrastructure concerned with data
24
Data Assemblage
Definition
Origins
Purpose & Use
Related Concepts & Theories
25
Data Assemblage Characteristics
Data assemblages (are)…
Data assemblages have…
26
27
28
Here’s How #Ferguson Exploded on Twitter Last Night (Oh, 2014) https://www.motherjones.com/politics/2014/11/ferguson-twitter-map
29
Amazon Doesn’t Consider the Race of Its Customers. Should It?
(Ingold & Soper, 2016)
https://www.bloomberg.com/graphics/2016-amazon-same-day/
30
Research
Build infrastructure to systematically describe, analyze & compare data assemblages
31
32
Me
CDS
Analyze & Compare
Data assemblages
Overview of Research Direction
Research Questions
Approach
33
Knowledge Organization Systems - Ontology is most complex
Pieterse, V., & Kourie, D. G. (2014)
34
Q: How can we systematically describe, analyze, and compare assemblage phenomena?
A: Build an ontology & glossary
A. Workflow to Develop Domain Ontology & Glossary
35
Statistics | # |
Total concepts | 71 |
Top-level concepts | 8 |
Descendents for each top-level concept | 36 |
assemblage | 6 |
capacity | 3 |
event | 0 |
mechanism | 1 |
population | 1 |
process | 5 |
property | 20 |
virtual diagram | 0 |
Visualization of Assemblage Theory Domain Ontology
36
Domain Ontology & Glossary
37
B. Workflow to Develop Assemblage Theory Conceptual Framework
38
Q: How can assemblage theory be made more accessible and useful to LIS researchers? How can we support empirical studies involving data assemblages?
A: Build a conceptual framework
B. Assemblage Theory Conceptual Framework Summary
8 Theory Propositions
12 Concepts
3 Relationships
3 Facets
39
Assemblage Theory Conceptual Framework Diagram
40
Assemblage
Probability
Causality
Structure
Virtual Diagram
Virtual Diagram
Virtual Diagram
Event
Mechanism
Process
Virtual diagram*
Capacity
Population
Virtual diagram*
Assemblage
Component
Property
Role
Scale
Scope
Virtual diagram*
41
Overview of Research Direction - Redux
Research Questions
Approach
42
Summary
43
Why Assemblage Theory & (Data) Assemblages?
44
Questions?
Thank you.
ceilyn_boyd@harvard.edu
ceilyn.boyd@simmons.edu
45
References
Borgman, C. (2015). Big data, little data, no data: scholarship in the networked world. Cambridge, Massachusetts: MIT Press.
boyd, danah, & Crawford, K. (2012). Critical questions for big data: Provocations for a cultural, technological, and scholarly phenomenon. Information, Communication & Society, 15(2), 662–679.
Boyd. C. M. (2018). A Prototype Domain Ontology for Neo-Assemblage Theory. Manuscript in preparation.
Dalton, C., & Thatcher, J. (2014). What does a critical data studies look like, and why do we care?
DeLanda, M. (2006a). A new philosophy of society: assemblage theory and social complexity. London: Continuum.
DeLanda, M. (2006b). Deleuzian Social Ontology and Assemblage Theory. In M. Fuglsang & B. Meier Sorensen (Eds.), Deleuze and the Social. Edinburgh University Press.
DeLanda, M. (2016). Assemblage theory. Edinburgh: Edinburgh University Press.
Ford, H. (2014). Big Data and Small: Collaborations between ethnographers and data scientists. Big Data & Society, 1(2).
Kitchin, R. (2014). The data revolution: big data, open data, data infrastructures & their consequences. Los Angeles, California: SAGE Publications.
Kitchin, R., & McArdle, G. (2016). What makes big data, big data? Exploring the ontological characteristics of 26 datasets. Big Data & Society, 1–6.
Marcus, G. E., & Saka, E. (2006). Assemblage. Theory, Culture & Society, 23(2–3), 101–106.
Oliphant, T. (2017). A case for critical data studies in library and information studies. Journal of Critical Library and Information Science Studies, 1(1).
Welles, B. F. (2014). On minorities and outliers: The case for making Big Data small. Big Data & Society, 1(1).
46
Credits
47