1 of 9

AWS Database Choices

2 of 9

Introduction

  • AWS offers over 15 purpose-built database services for OLTP.
  • For OLAP, AWS offers Redshift.
  • Supports diverse data models: relational, key-value, document, etc.
  • Selection depends on specific application requirements.

3 of 9

Choices

4 of 9

AWS Database Services Overview

  • Amazon RDS: Managed relational databases (e.g., MySQL, PostgreSQL).
  • Amazon Aurora: High-performance relational database compatible with MySQL and PostgreSQL.
  • Amazon DynamoDB: Fully managed NoSQL key-value and document database.
  • Amazon ElastiCache: In-memory data store for caching (Redis, Memcached).
  • Amazon Neptune: Graph database for connected data.
  • Amazon Timestream: Time series database for IoT and operational applications.
  • Amazon DocumentDB: Document database with MongoDB compatibility.
  • Amazon Keyspaces: Managed Cassandra-compatible database.
  • Amazon MemoryDB: Redis-compatible, durable in-memory database.​

  • Redshift: OLAP

5 of 9

Summary 1/3

Data model

When would you use it?

What is it optimized for?

Related database engines or services

Relational

Use when you're migrating or modernizing an on- premises relational workload, or if your workload has less predictable query patterns.

Optimized for structured data that is stored in tables, rows, and columns. Relational databases support complex queries through joins.

Key-value

Use for workloads such as session stores or shopping carts. Key-value databases can scale to large amounts of data and extremely high throughput of requests, while servicing millions of simultaneous users through distributed processing and storage.

Optimized to provide a serverless, NoSQL, fully managed database with single-digit millisecond performance at any scale.

6 of 9

Summary 2/3

In-memory

Use Amazon ElastiCache when you need a caching layer to improve read performance. Use Amazon MemoryDB when you need full data persistence, but still need sub-millisecond read latencies.

Optimized to support microsecond reads and sub-millisecond writes. MemoryDB supports microsecond reads and single-digit millisecond writes. ElastiCache is an ephemeral cache, while MemoryDB is an in-memory database.

Document

Use when you want to store JSON-like documents with rich querying abilities across the fields of the documents.

Optimized for storing semi-structured data as documents with multilayered attributes.

Wide-column

Use when you need to migrate your on-premises Cassandra workloads, or when you need to process data at high speeds for applications that require single-digit millisecond latency.

Optimized for workloads that require heavy reads/writes and high throughput, coupled with low latency and linear scalability.

7 of 9

Summary 3/3

Graph

Use when you have to model complex networks of objects, such as social networks, fraud detection, and recommendation engine use cases.

Optimized for traversing and evaluating large numbers of relationships, and identifying patterns with minimal latency.

Time series

Use when you have a large amount of time series data, potentially from a number of sources, such as Internet of Things (IoT) data, application metrics, and asset tracking.

Optimized for storing and querying data that is associated with timestamps and trend lines.

8 of 9

Conclusion

  • AWS offers databases for varied workloads.
  • Proper selection ensures performance and cost-efficiency.
  • Migration of data is a consideration
  • Other cloud providers like Azure and GCP offer similar products

9 of 9

Vector Databases for AI

  • AWS databases now support vector capabilities for AI applications.
  • Facilitates semantic search and context retrieval in generative AI workflows.
  • Integrates with existing data sources for enhanced AI model grounding.