
DynamoDB to ElastiCache Replication Example

DISCLAIMER

This project is provided as an example/proof of concept only and is not intended for production use. It demonstrates architectural patterns and implementation techniques but lacks the comprehensive error handling, security controls, and operational robustness required for production systems. Use this as a learning resource or starting point for your own implementation.

This project demonstrates a real-time data replication pattern from Amazon DynamoDB to Amazon ElastiCache using AWS CDK and TypeScript. It showcases how to maintain data consistency between a DynamoDB table and an ElastiCache cluster using DynamoDB Streams and Lambda functions.

Architecture Overview

The architecture implements a Change Data Capture (CDC) pattern using DynamoDB Streams to capture changes in the source DynamoDB table and replicate them to an ElastiCache Serverless cluster in real-time.

flowchart LR
    DDB[DynamoDB Table] -->|Streams| Stream[DynamoDB Stream]
    Stream -->|Triggers| Lambda[Lambda Function]
    Lambda -->|Updates| EC[ElastiCache Serverless]
    Simulator[DDB Changes Simulator] -->|Writes| DDB
    subgraph VPC
        Lambda
        EC
    end

Key Components

  • DynamoDB Table: Source of truth for data with stream enabled
  • DynamoDB Stream: Captures item-level changes in the table
  • Lambda Function: Processes stream events and updates ElastiCache (a sketch follows this list)
  • ElastiCache Serverless: Destination cache for high-performance data access
  • VPC Configuration: Secure network setup for ElastiCache and Lambda
  • DDB Changes Simulator: Generates sample data changes for demonstration
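
The repository's handler code isn't reproduced here, but the core of the replication Lambda can be sketched as follows. This is a minimal sketch, assuming ioredis as the Redis client, a partition key named id, and an item:<id> cache key scheme; all three are illustrative choices, not the project's confirmed implementation.

import type { DynamoDBStreamEvent } from "aws-lambda";
import { unmarshall } from "@aws-sdk/util-dynamodb";
import Redis from "ioredis";

// ElastiCache Serverless only accepts TLS connections; the endpoint is
// assumed to arrive via environment variables set by the CDK stack.
const redis = new Redis({
  host: process.env.ELASTICACHE_ENDPOINT,
  port: Number(process.env.ELASTICACHE_PORT ?? 6379),
  tls: {},
});

export const handler = async (event: DynamoDBStreamEvent): Promise<void> => {
  for (const record of event.Records) {
    // The aws-lambda and @aws-sdk attribute types differ slightly, hence the cast.
    const keys = unmarshall(record.dynamodb!.Keys as any);
    const cacheKey = `item:${keys.id}`;

    if (record.eventName === "REMOVE") {
      // Item deleted in DynamoDB: evict it from the cache.
      await redis.del(cacheKey);
    } else {
      // INSERT or MODIFY: write the latest item image to the cache.
      const item = unmarshall(record.dynamodb!.NewImage as any);
      await redis.set(cacheKey, JSON.stringify(item));
    }
  }
};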

Batch Size and CDC Ordering

Why Batch Size Matters

The project explicitly sets batchSize: 1 in the DynamoDB Streams to Lambda configuration. This is a critical design decision that directly impacts the ordering guarantees of the Change Data Capture (CDC) process.
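
In CDK terms, the wiring might look like this minimal sketch (the table and replicator variables stand in for the stack's actual constructs, and retryAttempts is an illustrative setting):

import * as dynamodb from "aws-cdk-lib/aws-dynamodb";
import * as lambda from "aws-cdk-lib/aws-lambda";
import { DynamoEventSource } from "aws-cdk-lib/aws-lambda-event-sources";

declare const table: dynamodb.Table;        // must have a stream enabled
declare const replicator: lambda.Function;  // the replication handler

replicator.addEventSource(
  new DynamoEventSource(table, {
    startingPosition: lambda.StartingPosition.TRIM_HORIZON,
    batchSize: 1,      // one record per invocation preserves strict ordering
    retryAttempts: 10, // retry in place instead of skipping ahead
  })
);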

Impact of Batch Size on Event Ordering:

  1. Batch Size = 1 (Current Implementation)

    • Pros:
      • Guarantees strict sequential processing of events
      • Ensures that each record is fully processed before moving to the next
      • Prevents partial batch failures that could lead to out-of-order updates
    • Cons:
      • Lower throughput due to processing one record at a time
      • Higher Lambda invocation costs
  2. Batch Size > 1

    • Pros:
      • Higher throughput and potentially lower costs
      • Fewer Lambda invocations for the same number of records
    • Cons:
      • Risk of partial batch failures
      • If a batch partially fails, some records might be processed out of order
      • Retries of failed batches can lead to duplicate processing
sequenceDiagram
    participant DDB as DynamoDB
    participant Stream as DynamoDB Stream
    participant Lambda as Lambda Function
    participant EC as ElastiCache
    
    Note over DDB,EC: Batch Size = 1 (Current Implementation)
    DDB->>Stream: Update Item A (v1)
    DDB->>Stream: Update Item A (v2)
    Stream->>Lambda: Process Update A (v1)
    Lambda->>EC: Update Cache A (v1)
    Stream->>Lambda: Process Update A (v2)
    Lambda->>EC: Update Cache A (v2)
    
    Note over DDB,EC: Batch Size > 1 (Potential Issues)
    DDB->>Stream: Update Item A (v1)
    DDB->>Stream: Update Item A (v2)
    Stream->>Lambda: Process Batch [A(v1), A(v2)]
    Lambda--xEC: Partial Failure
    Lambda->>Lambda: Retry Batch
    Lambda->>EC: Update Cache A (v1)
    Lambda->>EC: Update Cache A (v2)
    Note over EC: Correct final state but<br/>potential inconsistency during retry

DLQ Considerations

The project intentionally disables the SQS Dead Letter Queue (deploySqsDlqQueue: false) to preserve message ordering. If a DLQ were enabled, failed messages would be sent to the queue and potentially processed out of order when retried, breaking the sequential processing guarantee.
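
deploySqsDlqQueue is a project-level flag; it presumably maps to the onFailure option of the CDK event source. For contrast, here is a sketch of the configuration the project deliberately avoids (the dlq construct is illustrative):

import * as dynamodb from "aws-cdk-lib/aws-dynamodb";
import * as lambda from "aws-cdk-lib/aws-lambda";
import * as sqs from "aws-cdk-lib/aws-sqs";
import { DynamoEventSource, SqsDlq } from "aws-cdk-lib/aws-lambda-event-sources";

declare const table: dynamodb.Table;
declare const dlq: sqs.Queue;

// With a DLQ, a record that exhausts its retries is shunted to the queue and
// the stream moves on: throughput is protected, ordering is not.
const sourceWithDlq = new DynamoEventSource(table, {
  startingPosition: lambda.StartingPosition.TRIM_HORIZON,
  onFailure: new SqsDlq(dlq),
  retryAttempts: 3,
});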

Deployment and Usage

Prerequisites

  • AWS CLI configured with appropriate permissions
  • Node.js and npm installed
  • AWS CDK installed (npm install -g aws-cdk)

Setup and Deployment

  1. Clone the repository
  2. Install dependencies:
    npm install
    
  3. Build the project:
    npm run build
    
  4. Deploy the stack:
    npx cdk deploy
    

Testing the Solution

Once deployed, the DDB Changes Simulator will automatically generate random changes to the DynamoDB table every minute. These changes will be:

  • Captured by DynamoDB Streams
  • Processed by the Lambda function
  • Replicated to ElastiCache

You can monitor the process through CloudWatch Logs for the Lambda function.
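
For example, with AWS CLI v2 you can tail the function's log group live (replace the placeholder with the deployed function's name):

    aws logs tail /aws/lambda/<replication-function-name> --follow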

DDB Changes Simulator

The project includes a built-in simulator that generates random data changes to demonstrate the replication process without requiring manual input.

How the Simulator Works

  1. Scheduled Execution: The simulator runs as a Lambda function triggered by an EventBridge rule every minute
  2. Random Operations: It performs a configurable number of random operations (default: 10) on the DynamoDB table (see the sketch after this list):
    • Upsert operations (create or update items) - 2/3 probability
    • Delete operations - 1/3 probability
  3. Data Generation:
    • Creates items with random IDs (1-100)
    • Assigns random names from a predefined list
    • Adds timestamps to track when changes occurred
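
A minimal sketch of that loop, assuming the table's partition key is named id and using an illustrative name list (the repository's actual handler may differ):

import { DynamoDBClient } from "@aws-sdk/client-dynamodb";
import {
  DynamoDBDocumentClient,
  PutCommand,
  DeleteCommand,
} from "@aws-sdk/lib-dynamodb";

const ddb = DynamoDBDocumentClient.from(new DynamoDBClient({}));
const TABLE = process.env.DYNAMODB_TABLE!;
const COUNT = Number(process.env.CHANGES_COUNT ?? 10);
const NAMES = ["Alice", "Bob", "Carol", "Dave"]; // illustrative stand-in list

export const handler = async (): Promise<void> => {
  for (let i = 0; i < COUNT; i++) {
    const id = String(1 + Math.floor(Math.random() * 100)); // random ID 1-100

    if (Math.random() < 2 / 3) {
      // Upsert (2/3 probability): PutItem creates or replaces the item.
      await ddb.send(new PutCommand({
        TableName: TABLE,
        Item: {
          id,
          name: NAMES[Math.floor(Math.random() * NAMES.length)],
          updatedAt: new Date().toISOString(),
        },
      }));
    } else {
      // Delete (1/3 probability); deleting a nonexistent ID is a no-op.
      await ddb.send(new DeleteCommand({ TableName: TABLE, Key: { id } }));
    }
  }
};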

Simulator Configuration

The simulator behavior can be adjusted through environment variables:

  • DYNAMODB_TABLE: The target DynamoDB table name (automatically set during deployment)
  • CHANGES_COUNT: Number of random operations to perform in each execution (default: 10)

Implementation Details

The simulator is implemented as:

  • A CDK construct (DDBChangesSimulator) that sets up the required infrastructure
  • A Lambda function that generates and executes the random operations
  • An EventBridge rule that triggers the Lambda on a schedule

This approach provides a continuous stream of realistic data changes that exercise the entire replication pipeline, making it easy to observe and validate the system's behavior without manual intervention.
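
A condensed sketch of that wiring (construct IDs and variable names are illustrative; the real construct is DDBChangesSimulator):

import * as cdk from "aws-cdk-lib";
import * as events from "aws-cdk-lib/aws-events";
import * as targets from "aws-cdk-lib/aws-events-targets";
import * as lambda from "aws-cdk-lib/aws-lambda";

declare const stack: cdk.Stack;
declare const simulatorFn: lambda.Function; // generates the random operations

// Fire the simulator once per minute; EventBridge invokes it asynchronously.
new events.Rule(stack, "SimulatorSchedule", {
  schedule: events.Schedule.rate(cdk.Duration.minutes(1)),
  targets: [new targets.LambdaFunction(simulatorFn)],
});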

Useful Commands

  • npm run build: compile TypeScript to JavaScript
  • npm run watch: watch for changes and compile
  • npm run test: run the Jest unit tests
  • npx cdk deploy: deploy this stack to your default AWS account/region
  • npx cdk diff: compare the deployed stack with the current state
  • npx cdk synth: emit the synthesized CloudFormation template

Considerations for Production Use

  1. Error Handling: Implement more robust error handling and monitoring
  2. Scaling: Consider adjusting Lambda concurrency limits for higher throughput
  3. Monitoring: Add CloudWatch alarms for stream iterator age and Lambda errors (a sketch follows this list)
  4. Recovery: Implement a recovery mechanism for extended outages
  5. Security: Review IAM permissions to enforce the principle of least privilege
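
A minimal sketch of the alarms mentioned in item 3, assuming the CDK stack keeps a reference to the replication function (thresholds are illustrative):

import * as cdk from "aws-cdk-lib";
import * as cloudwatch from "aws-cdk-lib/aws-cloudwatch";
import * as lambda from "aws-cdk-lib/aws-lambda";

declare const stack: cdk.Stack;
declare const replicator: lambda.Function; // the stream-processing function

// IteratorAge (milliseconds) grows when replication falls behind the stream.
new cloudwatch.Alarm(stack, "IteratorAgeAlarm", {
  metric: replicator.metric("IteratorAge", { statistic: "Maximum" }),
  threshold: 60_000, // one minute of lag; tune to your freshness requirements
  evaluationPeriods: 3,
});

// Any function error is worth a look, since a failing record blocks its shard.
new cloudwatch.Alarm(stack, "ReplicatorErrorsAlarm", {
  metric: replicator.metricErrors(),
  threshold: 1,
  evaluationPeriods: 1,
});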
