Skip to content

Have separate fetch operators for rerender and revbased updates

DCausse requested to merge two-separate-fetch-operators into main

Revision based updates should be applied in order to ensure better consitencies, this unfortunately introduce a throughput penalty because the asyncIO operator must be waiting for all preceding events to be enriched to fill its buffer. Rerenders could on the other be applied in any order and could benefit from a more optimized asyncIO operator that runs un-ordered. The stream is now separated to these two discting fetch operators and unioned after the fetch. This might also help to have more granular back-pressure metrics because flink will have to create separate tasks for these.

The capacity is shared for the two operators with a new config option that tune the ratio the revision based updates will get from this global capacity (15% by default).

Merge request reports