Correlated subqueries can cause performance challenges in Amazon Aurora PostgreSQL-Compatible Edition, often causing applications to experience degraded performance as data volumes grow. In this post, we explore the advanced optimization configurations available in Aurora PostgreSQL that can transform these performance challenges into efficient operations without requiring you to modify a single line of SQL code. The configurations we explore are the subquery transformation and subquery cache optimizations.
If you're experiencing performance impacts due to correlated subqueries, or planning a database migration where rewriting a query that uses a correlated subquery into a different form isn't feasible, the optimization capabilities of Aurora PostgreSQL can deliver the performance improvement you need.
Understanding correlated subqueries
A correlated subquery is a nested query that references columns from its outer query, creating a dependency that requires the inner query to execute once for each row processed by the outer query. This relationship is what makes them both powerful and potentially problematic for performance.
Anatomy of a correlated subquery
The following diagram shows a correlated subquery that finds the maximum order amount for each customer by looking up their orders in a separate table. For each customer row in the main query, the subquery executes once to find that specific customer's highest order total using the matching customer_id from the orders table.
Key components:
- Outer query: Processes the main dataset (the customers table)
- Inner query: The subquery that depends on outer query values
- Correlation condition: o.customer_id = c.customer_id links the queries
- Correlated column: c.customer_id from the outer query, used in the inner query
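A minimal sketch of the query the diagram describes might look like the following (the customers and orders schemas are assumed for illustration):

```sql
-- For each customer row, the inner query runs once, using the
-- correlated column c.customer_id to find that customer's largest order.
SELECT c.customer_id,
       (SELECT MAX(o.total_amount)
        FROM orders o
        WHERE o.customer_id = c.customer_id) AS max_order_amount
FROM customers c;
```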
The performance challenge
Traditional execution of correlated subqueries follows this pattern:
- Fetch one row from the outer query
- Execute the subquery with the correlated values fetched in step 1
- Return the subquery result
- Repeat for the next outer row
For 10,000 customers with 500 orders each, this could mean 10,000 separate subquery executions, resulting in suboptimal query performance.
Aurora PostgreSQL provides two powerful techniques to overcome the performance challenges of correlated subqueries: subquery transformation and subquery cache optimization.
Subquery transformation
The subquery transformation optimization automatically converts correlated subqueries into efficient join-based execution plans. Instead of running the same subquery repeatedly for each row in your outer table, this optimization runs the subquery just once and stores the results in a hash lookup table. The outer query then simply looks up the answers it needs from the hash table, which is much faster than recalculating the same thing over and over.
The subquery transformation feature delivers the greatest reduction in query execution time in several key scenarios. When your outer query is expected to return large result sets of more than 1,000 rows, the transformation can significantly improve performance. The feature excels with expensive subqueries that include aggregations, complex joins, and sorting operations within the subquery itself. It proves particularly useful when indexes are missing on correlation columns, because the transformation eliminates the need for those indexes. Migration scenarios where rewriting queries isn't feasible represent another ideal use case; you can use subquery transformation to improve performance without modifying existing SQL code. A best-case scenario for the subquery transformation optimization occurs when the table is large, indexes are missing, and the subquery result uses aggregate functions.
How to enable subquery transformation
Aurora PostgreSQL can automatically transform eligible correlated subqueries into efficient join operations. This can be enabled at either the session or parameter group level.
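The enable statement isn't reproduced in this extract; a session-level sketch would look like the following. The parameter name is an assumption for illustration only; confirm the exact name in the Aurora PostgreSQL parameter reference for your engine version.

```sql
-- Parameter name assumed for illustration; verify for your engine version.
SET apg_enable_correlated_scalar_transform = on;

-- Confirm the setting in the current session:
SHOW apg_enable_correlated_scalar_transform;
```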
Limitations and scope
The transformation applies only when the following conditions are met:
- Correlated columns appear only in the subquery WHERE clause
- Subquery WHERE conditions use AND operators only
- The subquery must return a scalar value using aggregate functions such as MAX, MIN, AVG, COUNT, or SUM
- No LIMIT, OFFSET, or ORDER BY operators can be present in the subquery
- There are no indexes present on the outer query or subquery join columns
The following are examples of queries that would not be optimized by subquery transformation based on these limitations:
A correlated field in a SELECT clause:
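The original example isn't shown in this extract; a sketch of this shape, using the illustrative customers/orders schema (c.discount is an assumed column), might look like:

```sql
-- Not eligible: the correlated column c.discount appears in the
-- subquery SELECT list, not only in the subquery WHERE clause.
SELECT c.customer_id,
       (SELECT MAX(o.total_amount + c.discount)
        FROM orders o
        WHERE o.customer_id = c.customer_id)
FROM customers c;
```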
OR conditions in a WHERE clause:
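A sketch of this shape (o.sales_rep_id is an assumed column for illustration):

```sql
-- Not eligible: the subquery WHERE clause combines conditions with OR.
SELECT c.customer_id,
       (SELECT MAX(o.total_amount)
        FROM orders o
        WHERE o.customer_id = c.customer_id
           OR o.sales_rep_id = c.customer_id)
FROM customers c;
```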
A LIMIT in the subquery:
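A sketch of this shape, again on the illustrative schema:

```sql
-- Not eligible: the subquery contains ORDER BY and LIMIT.
SELECT c.customer_id,
       (SELECT o.total_amount
        FROM orders o
        WHERE o.customer_id = c.customer_id
        ORDER BY o.total_amount DESC
        LIMIT 1)
FROM customers c;
```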
The subquery cache
The subquery cache optimization stores and reuses subquery results for repeated correlation values, reducing computational overhead and improving performance when the same subquery conditions are evaluated multiple times. The ideal scenario for enabling the subquery cache is when you have high correlation value repetition (many rows with the same attributes), expensive subquery computations, queries ineligible for plan transformation, and a finite cached result. A good example scenario occurs when the subquery condition is too complicated for the subquery transform to apply, and the cache hit rate is greater than 30%, which indicates there are repeated rows in the outer table.
The subquery cache optimization adds a PostgreSQL Memoize node, a caching layer between a nested loop and its inner subquery, storing results to avoid redundant computations. Instead of executing the subquery for every customer row, the Memoize node:
- Caches results based on the customer_id (the cache key)
- Reuses cached results when the same customer_id appears again
- Reduces redundant computations by storing previously calculated max(total_amount) values
How to enable subquery caching
Subquery caching complements transformation by storing and reusing subquery results. It can be enabled in two ways.
You can enable subquery caching at the session level:
You can also enable it in your cluster parameter group, using the following dynamic parameter:
Finally, you can verify that the change has been applied:
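The statements themselves aren't reproduced in this extract; a sketch of the session-level enable and the verification step would look like the following. The parameter name is an assumption for illustration; confirm the exact name for your Aurora PostgreSQL version (the same dynamic parameter can be set in the cluster parameter group).

```sql
-- Parameter name assumed for illustration; verify for your engine version.
-- Session level:
SET apg_enable_subquery_cache = on;

-- Verify the change has been applied:
SHOW apg_enable_subquery_cache;
```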
Limitations and scope
The following are subquery types that can be used with the subquery cache:
- Scalar subqueries with correlation
- Deterministic functions only
- Hashable correlation column types
The IN, ANY, and ALL operators can't be used with statements in the subquery cache.
When to use subquery transform or subquery cache
The subquery cache can be enabled independently of subquery transformation, which means you can use subquery caching as a standalone performance enhancement or together with subquery transformation for maximum optimization. Use subquery transformation on its own when you have large datasets with minimal repeated values and your subqueries meet the strict requirements, because it can significantly reduce execution time by converting nested loops into efficient joins. Use the subquery cache when your queries have complex conditions that prevent transformation but contain many repeated correlation values, allowing the cache to store and reuse results even when transformation isn't possible.
Where subqueries meet the requirements for both subquery transformation and caching because of repeated correlation values, you'll get maximum benefit by using both optimizations:
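A session-level sketch of enabling both together (both parameter names are assumptions for illustration; confirm them in the Aurora PostgreSQL documentation):

```sql
-- Parameter names assumed for illustration; verify for your engine version.
SET apg_enable_correlated_scalar_transform = on;
SET apg_enable_subquery_cache = on;
```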
In such cases, the optimizer performs the transform first, and the subquery cache is then used to build the query plan.
Subquery optimizations in action
Let's look at how to use these two optimizations in practice. For subquery transformation, we will measure the impact on query execution time before and after optimization. For the subquery cache, we will measure the impact on cache hit ratio (CHR).
Subquery transformation impact
To assess the impact of subquery transformation, we start by generating test data. The following data preparation SQL statements create two tables: the first, called inner_table, has three columns (id, a, and b) and is populated with 10,000 rows of data where column a cycles through the values 1–100 and column b contains random numbers. The code then fills the second table, outer_table, with 50,000 rows split between repeated values (1–100) and unique values (101–25100), followed by gathering statistics on both tables.
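The original script isn't reproduced in this extract; the following is a reconstruction that matches the description above (the exact value expressions are assumptions):

```sql
CREATE TABLE inner_table (id int, a int, b int);
INSERT INTO inner_table
SELECT g, (g % 100) + 1, (random() * 10000)::int  -- a cycles 1-100, b random
FROM generate_series(1, 10000) AS g;

CREATE TABLE outer_table (id int, a int, b int);
INSERT INTO outer_table
SELECT g,
       CASE WHEN g <= 25000 THEN (g % 100) + 1  -- repeated values 1-100
            ELSE g - 24900 END,                 -- unique values 101-25100
       (random() * 10000)::int
FROM generate_series(1, 50000) AS g;

ANALYZE inner_table;
ANALYZE outer_table;
```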
An index isn't needed in this example, because correlated subquery transformation works best without indexes. This is because subquery transformations can remove the need for indexes by converting multiple table scans into a single table scan, reducing query complexity and avoiding the storage overhead and Data Manipulation Language (DML) latency penalties associated with maintaining indexes.
Before enabling the transformation:
After enabling the transformation:
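The plan outputs aren't reproduced in this extract. A sketch of the test query shape implied by the data setup is shown below; before enabling the optimization, EXPLAIN ANALYZE shows a SubPlan executed once per outer row, and after enabling it, a hash-based lookup built from a single execution of the subquery.

```sql
EXPLAIN ANALYZE
SELECT o.id,
       (SELECT MAX(i.b)
        FROM inner_table i
        WHERE i.a = o.a) AS max_b  -- correlated on column a
FROM outer_table o;
```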
Sections indicating that a hash lookup table is being used, instead of repeated subquery executions for every row in the outer table, are highlighted in bold.
The following graph shows the execution time (in milliseconds) in a test case with an inner table of 10,000 rows and an outer table of 75,000 rows. By using the subquery transform, the query execution time is reduced by 99.95% compared to not using the feature, from 41 seconds down to 20.8 milliseconds. The percentage improvement realized is data-dependent, but this example demonstrates the scale of improvement that can be achieved.

Subquery cache impact
Just as with subquery transformation, the first step in assessing the impact of the subquery cache is to generate test data. The following SQL code creates two tables (outer_table and inner_table) with identical structures (id, a, and b columns) and populates inner_table with 10,000 sequential rows while filling outer_table with 100,000 rows split between repeated values (1–100) in the first half and unique values (101–50100) in the second half. The script concludes by updating the database statistics for both tables using the ANALYZE command.
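Again, the original script isn't reproduced here; the following reconstruction matches the description above (exact value expressions are assumptions):

```sql
CREATE TABLE inner_table (id int, a int, b int);
CREATE TABLE outer_table (id int, a int, b int);

INSERT INTO inner_table
SELECT g, g, (random() * 10000)::int  -- 10,000 sequential rows
FROM generate_series(1, 10000) AS g;

INSERT INTO outer_table
SELECT g,
       CASE WHEN g <= 50000 THEN (g % 100) + 1  -- repeated values 1-100
            ELSE g - 49900 END,                 -- unique values 101-50100
       (random() * 10000)::int
FROM generate_series(1, 100000) AS g;

ANALYZE inner_table;
ANALYZE outer_table;
```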
An index isn't needed because when indexes are present and usable, individual subquery executions might already be fast enough that caching provides diminishing returns.
Without the subquery cache:
With the subquery cache:
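The full EXPLAIN ANALYZE output isn't reproduced in this extract; an illustrative Memoize fragment consistent with this example's numbers would look like the following (exact node layout varies by version; these counts follow from 50,100 distinct correlation values across 100,000 outer rows):

```
->  Memoize
      Cache Key: o.a
      Hits: 49900  Misses: 50100  Evictions: 0  Overflows: 0
```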
Highlighted in bold are the cache hits and misses, showing a 49.9% CHR.
Understanding cache hit rate
The cache hit rate (CHR) determines cache effectiveness and is calculated as follows:
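Using the Hits and Misses counters reported by the Memoize node in EXPLAIN ANALYZE output:

```
CHR = Hits / (Hits + Misses)
    = 49900 / (49900 + 50100)
    = 49.9%
```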
From this output, we can determine the CHR in the preceding example to be 49.9%. Generally, for the subquery cache, a CHR above 70% is considered excellent. A CHR above 30% is good, and below 30%, the cache might be disabled because of limited benefit.
This is because, unlike shared buffer CHRs (which can reasonably reach 90% or more because the same data pages are frequently accessed), a 70% CHR for the subquery cache reflects realistic data distribution patterns where only a portion of customers have duplicate correlation values. A 50% CHR means half the customers in the outer query have duplicate values (such as repeat purchases), which represents normal business scenarios, whereas expecting 90% would imply unrealistic data patterns where nearly all customers have identical behaviors.
You can also set parameters related to the subquery cache that decide whether the cache gets used. For each cached subquery, after the number of cache misses defined by apg_subquery_cache_check_interval is exceeded, the system evaluates whether caching that particular subquery is beneficial by comparing the CHR against apg_subquery_cache_hit_rate_threshold. If the CHR falls below this threshold, the subquery is removed from the cache. The following are the defaults for these parameters.
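The default values aren't reproduced in this extract; you can inspect them in your own cluster:

```sql
SHOW apg_subquery_cache_check_interval;
SHOW apg_subquery_cache_hit_rate_threshold;
```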
The following graph shows that in this example, using the subquery cache reduces the query execution time to 50% of its original execution time. The improvement you achieve will vary from case to case. For example, the effectiveness of the cache is influenced by the number of repeated rows in the outer query.

When to consider alternatives
Consider using the original PostgreSQL plan when you have excellent indexes already present on correlation columns, queries execute infrequently, you have a small dataset for the outer query, or you're dealing with complex subqueries with multiple correlation conditions. Consider manually rewriting the query when the plan is too complex to transform or when the cache hit rate is less than 30% with the subquery cache.
Conclusion
In this post, you have seen how the correlated subquery optimizations available in Aurora PostgreSQL can significantly improve performance through two complementary techniques: automatic subquery transformation and intelligent subquery caching. These features work together to address slow correlated queries without requiring code rewrites. Subquery transformation optimizes queries behind the scenes for significant speed improvements, while caching remembers results from expensive subqueries that repeatedly process similar data patterns. You can implement these optimizations incrementally, monitor their effects through Amazon CloudWatch, and test safely using the Amazon Aurora cloning and blue/green deployment features before production rollout, reducing the need to spend time rewriting complex queries and allowing Aurora to automatically handle the performance optimization heavy lifting.
To get started implementing these optimizations in your own environment, see Optimizing correlated subqueries in Aurora PostgreSQL, and dive deeper into performance analysis techniques with the PostgreSQL documentation on reading EXPLAIN plans.
About the authors
