Buffering and data loss prevention on k8s #8563
Hi, I'm looking to configure buffering for my sink, with the goal of zero data loss. I saw in the docs that you suggest not using AWS gp2 (which is what we are using) for the disk buffer. Are there any suggestions for what I should use instead? Also, the disk buffer docs don't specify which volume the buffer is stored on. Is there a way to use a persistent volume rather than the node volume for this? Finally, is there any information on how large the performance hit is when using a disk buffer? I know it depends on a bunch of things like disk type, CPU, etc., but is the difference on the scale of milliseconds, seconds, or minutes? Thanks.
Replies: 1 comment 9 replies
Hey @asafdl great question! We're working on a set of comprehensive guides that'll cover questions like this and help with going to production with large deployments.
That said, could you provide a little more info about your data sources and sinks? In general, disk buffers should be a last resort for situations that do not offer alternative durability strategies. I would not use disk buffers if you absolutely cannot lose data. Better strategies include:
I hope that helps.