@dcausse @otto @tchin @lbowmaker
Closes: T311084
Refactor the Mediawiki Stream Enrichment job to consume events from the consolidated page-change stream. The following change have been introduced:
- Replace schemas defined by cases classes with a
Row
object. - Integrates WMF boilerplate for stream configuration.
- Simplifies the Async enrichment function logic.
An entry point to review code could be enrichment/src/test/scala/EnrichmentSuite.scala
.
Dependencies on pending work
-
events should be produced in the page-change stream -
wikimedia-event-utilities 1.2 should be released with https://gerrit.wikimedia.org/r/c/wikimedia-event-utilities/+/810387 merged -
Page change schema has been finalised -
Enriched page change schema has been finalized.
TODO:
-
Implement a checkpointing and offset management strategy,