
Avoid OOM-killing query if result-level caching fails #17652

Open · wants to merge 1 commit into master from fix-oom-on-result-level-cache-population

Conversation

jtuglu-netflix (Contributor) commented Jan 22, 2025

Fixes #17651.

Description

Currently, result-level cache population may try to allocate a buffer for query results that grows past the Integer.MAX_VALUE capacity of ByteArrayOutputStream. ByteArrayOutputStream surfaces this as an OutOfMemoryError, which is not caught and terminates the node. This PR limits the buffer allocated for storing query results to the value configured in CacheConfig.getResultLevelCacheLimit().
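The intended failure mode can be sketched as a bounded stream (class and method names here are hypothetical, not the actual Druid classes): once the configured limit is reached, the writer gets a catchable IOException instead of a JVM-fatal OutOfMemoryError, so cache population can be skipped while the query still completes.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;

// Illustrative sketch: fail fast with a catchable IOException once the
// configured result-level cache limit is exceeded, instead of letting the
// backing ByteArrayOutputStream grow until it throws OutOfMemoryError.
class BoundedCacheStream extends OutputStream {
    private final ByteArrayOutputStream delegate = new ByteArrayOutputStream();
    private final int limit; // e.g. the value of CacheConfig.getResultLevelCacheLimit()
    private int written = 0;

    BoundedCacheStream(int limit) {
        this.limit = limit;
    }

    @Override
    public void write(int b) throws IOException {
        if (written + 1 > limit) {
            throw new IOException("Result-level cache limit exceeded: " + limit);
        }
        delegate.write(b);
        written++;
    }

    byte[] toByteArray() {
        return delegate.toByteArray();
    }
}
```

The caller can catch the IOException, log a warning, and simply skip caching for that query rather than terminating the node.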

Important Note

I opted to use LimitedOutputStream here, as it is already used with ByteArrayOutputStream. While this is fine inside a QueryRunner (single-threaded), it is less than ideal in the general case because it doesn't guarantee strict consistency between overflow-exception delivery and the ordering of writes to the buffer (see the example below). As such, this class is *not* thread-safe in general, and I think it should be refactored to account for this. Since every existing use of LimitedOutputStream already wraps a ByteArrayOutputStream, whose methods are already synchronized, we should suffer no performance hit by synchronizing the LimitedOutputStream::write methods. This is in the spirit of future-proofing: given that we're already paying for locks, we might as well avoid as many future races as we can : ). Because this would require changes to the LimitedOutputStream API (it currently extends ByteArrayOutputStream directly), I've opted not to change those APIs here, but in a separate PR.
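The synchronization argument above can be sketched like this (illustrative names, not Druid's actual LimitedOutputStream): because the delegate ByteArrayOutputStream already synchronizes internally, putting the whole check-then-write sequence behind one lock makes overflow delivery consistent with write ordering at little extra cost.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;

// Illustrative sketch of the proposed refactor: one synchronized block
// covers both the limit check and the write, so the check-then-act
// sequence can no longer be interleaved by another thread.
class SynchronizedLimitedStream extends OutputStream {
    private final ByteArrayOutputStream out = new ByteArrayOutputStream();
    private final long limit;
    private long written = 0;

    SynchronizedLimitedStream(long limit) {
        this.limit = limit;
    }

    @Override
    public synchronized void write(int b) throws IOException {
        if (written + 1 > limit) {
            throw new IOException("limit of " + limit + " bytes exceeded");
        }
        out.write(b);
        written++;
    }
}
```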

Changes to LimitedOutputStream

  • Expose a public OutputStream get() method that returns the wrapped output stream for stream-specific operations.
  • Make `wrapped` atomic. This isn't a complete fix for the thread-safety concerns above, but it at least prevents a simple future race in which multiple writing threads can produce an uncaught buffer overflow:
T1: write(): reads written = INT_MAX - 1
T2: write(): reads written = INT_MAX - 1
T2: writes written += 1 (written = INT_MAX)
T2: write() succeeds
T1: writes written += 1 (a write past the limit, but no overflow is raised)
T1: write() succeeds
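A sketch of how an atomic counter closes this interleaving (hypothetical class, not the PR's actual code): getAndIncrement reserves the byte before writing, so two threads can never both observe written = INT_MAX - 1 and both succeed.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.concurrent.atomic.AtomicLong;

// Illustrative sketch: AtomicLong.getAndIncrement makes the
// read-modify-write on the counter a single atomic step, so at most
// `limit` writes can ever succeed, regardless of interleaving.
class AtomicLimitedStream extends OutputStream {
    private final ByteArrayOutputStream out = new ByteArrayOutputStream();
    private final long limit;
    private final AtomicLong written = new AtomicLong();

    AtomicLimitedStream(long limit) {
        this.limit = limit;
    }

    @Override
    public void write(int b) throws IOException {
        // Reserve a slot first; on failure the counter stays past the
        // limit, which is fine since the stream is unusable after overflow.
        if (written.getAndIncrement() >= limit) {
            throw new IOException("limit of " + limit + " bytes exceeded");
        }
        out.write(b);
    }
}
```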

Release note

Avoid OOM-killing the node if large result-level cache population fails for a query.


Key changed/added classes in this PR
  • processing/src/main/java/org/apache/druid/io/LimitedOutputStream.java
  • server/src/main/java/org/apache/druid/query/ResultLevelCachingQueryRunner.java
  • server/src/test/java/org/apache/druid/query/ResultLevelCachingQueryRunnerTest.java

This PR has:

  • been self-reviewed.
  • added documentation for new or modified features or behaviors.
  • a release note entry in the PR description.
  • added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
  • added or updated version, license, or notice information in licenses.yaml
  • added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
  • added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.
  • added integration tests.
  • been tested in a test Druid cluster.

jtuglu-netflix force-pushed the fix-oom-on-result-level-cache-population branch from 5f7c019 to 551d891 on January 22, 2025 06:34
jtuglu-netflix force-pushed the fix-oom-on-result-level-cache-population branch from 551d891 to 8bc7891 on January 22, 2025 06:38
Successfully merging this pull request may close these issues.

ResultLevelCache Population OOM