Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: support streaming read on append-only iceberg source #20074

Open
chenzl25 opened this issue Jan 8, 2025 · 0 comments
Open

feat: support streaming read on append-only iceberg source #20074

chenzl25 opened this issue Jan 8, 2025 · 0 comments
Assignees
Milestone

Comments

@chenzl25
Copy link
Contributor

chenzl25 commented Jan 8, 2025

Is your feature request related to a problem? Please describe.

Currently, Risingwave supports Iceberg as a source, which is limited to usage in batch queries. In contrast, both Spark and Flink allow streaming reads from append-only Iceberg sources. It is crucial for Risingwave to enhance its capabilities by supporting streaming reads on Iceberg, matching the functionality offered by other engines. The implementation of append-only Iceberg streaming reads is relatively straightforward, as it requires handling only the append commits while ignoring delete commits. This can be achieved by managing the streaming state table to track snapshots at a coarse-grained level or by monitoring the files within the snapshot at a finer granularity.

Moreover, the Iceberg engine table stands to gain from this enhancement. Presently, we maintain two copies of data for the Iceberg engine table. For append-only tables, we can simplify this by storing exclusively the Iceberg table data and eliminating the Hummock copy. Once the Hummock copy is removed, it would be essential to rely on Iceberg streaming reads to construct materialized views on top of the Iceberg engine table.

Describe the solution you'd like

No response

Describe alternatives you've considered

No response

Additional context

No response

@github-actions github-actions bot added this to the release-2.3 milestone Jan 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants