Reduce memory usage in writer with more memory efficient output buffer implementation #24913

chenyangfb · 2025-04-14T15:11:05Z

Description

Currently ChunkedSliceOutput is used for storing compressed output in writer. It managed list of buffers with size of power of 2 (e.g. 8k, 16k, 32k), and reuse buffers after flushing. It could leads to extra memory usage and OOM due to 1) mismatch in compressed output size and buffer size, 2) reusing buffers and not freeing buffers leads to extra memory usage by design.

Common scenario which leads to OOM includes

large number of streams with small amount of data (100k stream with 1k compressed bytes), each using minimal buffer size (e.g. 8k)
Each stream is wasting half of largest buffer (e.g. 8M out of 16M buffer)
Writer memory usage is high even after flushing (Reduce memory usage in writer by freeing unused buffers #23724 support freeing unused buffer in chunk supplier during reset)

This PR introduce OrcLazyChunkedOutputBuffer which focus on avoiding used memory.

Create buffer size based on the size of compressed output, this avoid the issue 1) and 2) mentioned above
lazy initialization in OrcLazyChunkedOutputBuffer and OrcOutputBuffer
Reset all the closed buffers, only keep the active buffer.

This behavior is controlled by lazyOutputBuffer in OrcWriterOptions, and it's disabled by default.

Impact

Reduce memory usage in writer.

Test Plan

Tested with Spark workload with high memory usage.
~10% improvement in run time and resource usage (memory reservation time), reduced GC time.
Tested with general Spark workload
No change in cpu time, slight reduction in run time and GC time.

Release Notes

General change
Reduce memory usage in writer with more memory efficient output buffer implementation

Support reset all readers in startStripe()

b09bbaf

chenyangfb force-pushed the orc_output_buffer branch 2 times, most recently from b400461 to b75ec9b Compare April 14, 2025 16:21

Support free unused buffer in output buffer chunk supplier

2d5d698

chenyangfb force-pushed the orc_output_buffer branch 2 times, most recently from b88e026 to 04a0df7 Compare April 14, 2025 17:13

chenyangfb changed the title ~~Add OrcLazyChunkedOutputBuffer which is more memory efficient~~ Reduce memory usage in writer with more memory efficient output buffer implementation Apr 14, 2025

Add OrcLazyChunkedOutputBuffer which is more memory efficient

4d7394d

chenyangfb force-pushed the orc_output_buffer branch from 04a0df7 to 4d7394d Compare April 14, 2025 18:15

chenyangfb marked this pull request as ready for review April 14, 2025 18:18

chenyangfb requested review from sdruzkin and a team as code owners April 14, 2025 18:18

chenyangfb requested a review from presto-oss April 14, 2025 18:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce memory usage in writer with more memory efficient output buffer implementation #24913

Reduce memory usage in writer with more memory efficient output buffer implementation #24913

chenyangfb commented Apr 14, 2025 •

edited

Loading

Reduce memory usage in writer with more memory efficient output buffer implementation #24913

Are you sure you want to change the base?

Reduce memory usage in writer with more memory efficient output buffer implementation #24913

Conversation

chenyangfb commented Apr 14, 2025 • edited Loading

Description

Impact

Test Plan

Release Notes

chenyangfb commented Apr 14, 2025 •

edited

Loading