Amazon Aurora MySQL Zero-ETL Integration set up with Amazon RedShift

Cloud/DevOps

Sep 23, 2024

7-8 min

Share blog

Introduction

AWS Zero-ETL Integration is a streamlined approach that eliminates the traditional Extract, Transform, Load (ETL) process for replicating data between services. Instead of manually replicating data between different databases or services, Zero-ETL enables seamless, near real-time data sharing and integration without requiring extensive data pipelines.

Amazon work station announced Amazon Aurora zero-ETL integration with Amazon Redshift at AWS re:Invent 2022 and is now available for Aurora MySQL 3.05.0 (compatible with MySQL 8.0.32) and higher versions in multiple regions.

Benefits of Zero-ETL

In the organization’s data strategy, zero-ETL offers many benefits.

Agility

Zero-ETL streamlines data architecture and minimizes the need for extensive data engineering. It enables the addition of new data sources without requiring the reprocessing of large datasets. This flexibility boosts agility, fostering data-driven decision-making and accelerating innovation.

Cost efficiency

Zero-ETL uses cloud native and scalable integration techniques which allows businesses to optimize cost based on their actual usage. It also reduces infrastructure costs, development efforts, and maintenance overheads.

Real time insights

Zero-ETL provides near-real-time data access, ensuring freshed data for analytics, AI/ML, and reporting. It guarantees accurate and timely insights for use cases like real-time dashboards, optimized gaming experience, data quality monitoring, and customer behavior analysis.

In this post, a step-by-step guidance is provided on how to set up zero-ETL between Amazon Aurora and Amazon RedShift.

You need an Amazon Aurora MySQL cluster up and running, if you haven’t set up, please follow our blog for Creating an Amazon Aurora DB Cluster

Configure the Aurora MySQL source with a customized DB cluster parameter group

Once Amazon Aurora MySQL cluster is setup, we need to create a custom parameter group with the following parameters,

On the Amazon RDS console, create a DB cluster parameter group called

javascript

zero-etl-database

Zero-ETL integrations require specific values for the Aurora DB cluster parameters that control binary logging (binlog). For example, enhanced binlog mode must be turned on

javascript

(aurora_enhanced_binlog=1).

2. Set the following binlog cluster parameter settings:

javascript

1. binlog_backup=0
2. binlog_replication_globaldb=0
3. binlog_format=ROW
4. aurora_enhanced_binlog=1
5. binlog_row_metadata=FULL
6. binlog_row_image=FULL

In addition, make sure that the binlog_transaction_compression parameter is not set to ON, and that the binlog_row_value_options parameter is not set to PARTIAL_JSON. By default, these parameters are not set.

3. Select Save changes.

Configure AWS RedShift destination with enabling case sensitive identifiers

We have set up the source DB cluster, now we need to configure Amazon RedShift as target data warehouse with the following requirements;

1
Must be an RA3 node type (ra3.16xlarge, ra3.4xlarge, or ra3.xlplus), or RedShift Serverless
2
Must be Encrypted, only in case of provisioned cluster

For our use case, we will be creating a Redshift Serverless data warehouse;

1
On the Amazon Redshift console, choose Serverless dashboard in the navigation pane.
2
Choose Create workgroup.

The following screenshot shows the default settings for Amazon Redshift Serverless.

3. For Workgroup name, enter custom-wg. Choose base RPU capacity in 8 to 512 range available in increments of 8.

4. For Namespace, select Create a new namespace and enter custom-ns, also provide database name, Admin username, and password of your own choice.

Keep all the rest as the default.

After setup completes, choose Continue to go to your Serverless dashboard. You can see that the serverless workgroup and namespace are available.

Create zero-ETL integration

To create the zero-ETL integration, complete the following steps:

1. On the Amazon RDS console, In the navigation pane choose Zero-ETL integrations and then on the Zero-ETL integration page, choose Create zero-ETL integration

2. For Integration identifier, enter a name, for example zero-etl-integration-rds.

3. For Source database, choose Browse RDS databases and choose the source cluster, in our case the cluster name is database-1 and choose Next.

4. Under Target, for Amazon Redshift data warehouse, choose Browse Redshift data warehouses and choose the Redshift Serverless destination namespace (custom-ns).

Add tags and encryption is an optional.

5. On the Review and create page, verify the integration name, source, target and other settings. When confirmed, choose Create zero-ETL integration.

Choose the integration to view the details and monitor its progress. It takes a few minutes to change the status from Creating to Active depending on the size of the dataset already available in the source.

Create a database from the integration in Amazon Redshift

To create your database, follow the steps:

1. On the Redshift Serverless dashboard, navigate to the custom-ns namespace.

2. Choose Query data and click on Query Editor v2 to open the query editor.

3. Connect to the Redshift Serverless data warehouse by choosing Create connection.

4. Run the following query to obtain the integration_id from the svv_integration system table:

javascript

$ select integration_id from svv_integration;

5. Use the integration_id from the previous step to create a new database from the integration:

javascript

$ CREATE DATABASE database-1-rs FROM INTEGRATION '';

The integration is now complete, and an entire snapshot of the source will reflect as is in the destination. Ongoing changes will also be synced in near-real time.

Blogs

Discover the latest insights and trends in technology with the Omax Tech Blog.

View All Blogs

Responsive web development illustration showing cross-device software design on laptop, tablet, and mobile screens.

Usman Baig

6-8 min

April 20, 2026

Our Proven Web Development Process That Delivers Real Results

In software development, success does not come from coding alone. Real results come from understanding business needs, planning the right workflow, building user-friendly designs...

Secure AWS Systems Manager connectivity illustration showing private cloud access to servers and databases without SSH exposure.

Umer Khan

6-8 min

April 20, 2026

Secure AWS Connectivity Using AWS Systems Manager (SSM)

In traditional cloud architectures, secure access to private resources such as databases and internal servers often relies on...

Cloud upload architecture illustration showing secure multi-account AWS infrastructure for enterprise environments.

Umer Khan

6-10 min

April 19, 2026

Building a Secure Multi-Account AWS Architecture for Enterprise Environments (Dev, STG, UAT, Prod)

In today’s cloud-first world, scalability and speed are no longer enough security, governance, and cost control are equally critical...

Friendly AI assistant robot beside a smartphone, representing adaptive AI agents for modern workflows.

Zohaib Anwar

6-8 min

April 15, 2026

Why You Should Use AI Agents Over Single Prompts: Unlocking the Power of Adaptive AI for Complex Workflows

In the world of artificial intelligence (AI), one of the biggest advancements has been the rise of AI agents that adapt dynamically to real-time data and complex workflows...

Data operations dashboard showing production quality checks, performance trends, and incident alerts across stores.

Yawar Khan

8-10 min

April 09, 2026

Production Ready ( Quality, performance, and the lessons learned shipping to 150 stores )

We chose dbt over custom scripts, built observability, optimized performance, and shipped to production...

Scalable data pipeline diagram highlighting dbt macros, reusable models, and multi-store analytics flow.

Yawar Khan

8-10 min

April 08, 2026

Scaling from 15 to 150 Stores ( When copy-paste becomes technical debt, macros become salvation )

We built a pipeline with observability, incremental models for performance, and snapshots for history. Our 15-store deployment ran smoothly...

Observability dashboard tracking source freshness, pipeline status, and real-time data quality alerts.

Yawar Khan

8-10 min

April 07, 2026

Keeping Your Data Fresh: ( The wake-up call at 3am that taught us about observability )

That morning taught us a crucial lesson: a successful dbt run doesn't mean your data is fresh, accurate, or complete. You need observability.

Retail data architecture visual showing fragmented store databases consolidated into a unified analytics pipeline.

Yawar Khan

8-10 min

April 06, 2026

Retail Data Chaos: How We Found Our Way Out ( When spreadsheets fail and databases multiply, where do you turn? )

Picture this: You're managing data for a growing retail chain. Store after store opens New York, San Francisco, Los Angeles—each with its own MySQL database...

Secure AI access workflow showing authentication, authorization, and protected enterprise operations.

Yawar Khan

8-10 min

April 07, 2026

Securing Your AI-Powered Future (How Authorization Ensures Safe and Appropriate Access)

Discover how authorization in MCP ensures secure, role-based access for AI-powered business workflows...

Software Development

Data Engineering & Analytics

Artificial Intelligence

IT Staff Augmentation

ERP/CRM Solutions

Cloud/DevOps

UI/UX Design

Custom Software Development

SaaS Development

Web Application Development

MVP Development Services

Quality Assurance & Testing

Amazon Aurora MySQL Zero-ETL Integration set up with Amazon RedShift

Share blog

Introduction

Benefits of Zero-ETL

Agility

Cost efficiency

Real time insights

Configure the Aurora MySQL source with a customized DB cluster parameter group

Configure AWS RedShift destination with enabling case sensitive identifiers

Create zero-ETL integration

Create a database from the integration in Amazon Redshift

Blogs

Our Proven Web Development Process That Delivers Real Results

Secure AWS Connectivity Using AWS Systems Manager (SSM)

Building a Secure Multi-Account AWS Architecture for Enterprise Environments (Dev, STG, UAT, Prod)

Why You Should Use AI Agents Over Single Prompts: Unlocking the Power of Adaptive AI for Complex Workflows

Production Ready ( Quality, performance, and the lessons learned shipping to 150 stores )

Scaling from 15 to 150 Stores ( When copy-paste becomes technical debt, macros become salvation )

Keeping Your Data Fresh: ( The wake-up call at 3am that taught us about observability )

Retail Data Chaos: How We Found Our Way Out ( When spreadsheets fail and databases multiply, where do you turn? )

Securing Your AI-Powered Future (How Authorization Ensures Safe and Appropriate Access)

Get In Touch