T419865 Add DAG for calculating Wikipedia reference groups
Note
Reviewers: Please see REVIEW_AIRFLOW_MRS.md for directions on what to check for.
Contributor checklist
- I have written structure tests and generated fixtures for this DAG that will be merged into data-engineering/airflow-dags/tests/wmde
- I have ran the above tests and code quality checks locally or with Docker as outlined in the tests section of the Airflow DAGs project readme
- I have tested the jobs for this DAG in my local database using queries generated with wmde/analytics/hql/gen_hql_test_scripts.py or by passing parameters to the production queries
- I have tested the included DAGs using the process outlined in TEST_AIRFLOW_DAGS.md and the JSON test variable files provided for each DAG
-
All tables that are needed by the included DAG jobs have been created and are accessible by the
analytics-wmdeAirflow user -
All changes from the
mainbranch have been rebased into this branch
Description
Bug: T419865
-
wikipedia_newcomer_ve_reference_groups_monthly- Jobs: wmde/analytics/hql/airflow_jobs/wikipedia_newcomer_ve_reference_groups/
- The first of two needed DAGs for T419865, with this focusing on the grouping of users based on their reference edit behavior (using Reference Check or not).
Test outputs
Please describe the outputs of the tests that were ran.
Destination tables summary
If applicable, include sanitized outputs of DAG jobs so that the results can be compared against expected outputs.
-
wmde.TABLE_NAME- Description
Test screenshots
Include screenshots of the DAGs in the test Airflow UI as verification of the tests.
DAG_ID