Add DAG and table maintenance for wmf_content.mediawiki_content_current_v1
(Depends on repos/data-engineering/dumps/mediawiki-content-dump!62 (merged)).
In this MR we:
- Implement a daily DAG to calculage and merge changes from
wmf_content.mediawiki_content_history_v1
intowmf_content.mediawiki_content_current_v1
. This is done via a MERGE INTO, which is under review separately. - Implement Iceberg table maintenance for
wmf_content.mediawiki_content_current_v1
, including a call torewrite_position_delete_files()
which allows us to run the MERGE INTO pipeline withmerge-on-read
(faster, and more stable!). - Modified folder hierarchy a bit to accomodate the set of
history
content jobs vs thecurrent
jobs.
Bug: T391283