Storage devices
Lars Ailo Bongo (larsab@cs.uit.no)
inf-2201 Spring 2021
06.04.21
Big data sources
Data growth at EMBL-EBI
Source: Charles E. Cook et al. Nucl. Acids Res. 2016;44:D20-D26
What is big data?
< 8GB
< 1TB
TBs
PBs
What is big data?
What is big data?
What is big data?
<100ms
seconds
minutes
hours
weeks
Computation time:
Big data optimizations
Overview
Storage properties
Jiahua He, Arun Jagatheesan, Sandeep Gupta, Jeffrey Bennett, Allan Snavely, "DASH: a Recipe for a Flash-based Data Intensive Supercomputer," sc, pp.1-11, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2010
Punch cards
Magnetophone
Tape
Disk Sectors and Access
Chapter 6 — Storage and Other I/O Topics — 18
A Typical Magnetic Disk Controller
Disk Caching
Kryder’s vs. Moore’s Law
50 Years Later (Mark Kryder at SNW 2006)
Hard drive teardown
2021 numbers?
Disk Performance (2TB disk)
More on Performance
Block Size | % of Disk Transfer Bandwidth |
1Kbytes | 0.28% |
1Mbytes | 73.99% |
3.24Mbytes | 90% |
FCFS order
SSTF (Shortest Seek Time First)
Elevator (SCAN)
C-SCAN (Circular SCAN)
Storage System
RAID (Redundant Array of Independent Disks)
Flash Storage
Flash Memory
NAND Memory Organization
Flash Translation Layer
Hardware/Software Architecture
SSD
SSD
HD
Controller
Controller
Controller
Block Device Driver
Block Device Driver
I/O Scheduler
Generic Block Layer
File System
File System
Virtual File System
Device Driver
Flash Storage Layer
File System
SATA
PCI-E
HW
SW (OS)
Non-volatile DRAM (NVRAM)
(Netlist Nvvault)
Remote Direct Memory Access
Low latency remote memory access
10.000 Years Storage System
Summary