Adding support for "produced_by" configuration of datasets
Implementation of the proposal outlined in https://phabricator.wikimedia.org/T372647
Airflow producer dataset annotation
NOTE: Automatic configuration of execution_delta based on the target DAG's schedule is not yet implemented.
Example:
    produced_by:
      airflow:
        instance: search
        dag_id: dummy_dag
        task_group_id: dummy_grouped_tasks
- If a `produced_by` configuration is present for any `Dataset` implementation, `get_sensor_for` returns a configured external task sensor.
- Depending on whether the `produced_by` configuration refers to the Airflow instance the DAG code is running on or to a different one, `get_sensor_for` returns either the basic `ExternalTaskSensor` or a `RestExternalTaskSensor`, respectively.
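
The selection described above can be sketched roughly as follows. This is a minimal illustration, not the code in this change: `AirflowProducer`, `parse_produced_by`, `choose_sensor_name`, and the `current_instance` argument are hypothetical names, and the sketch only returns the name of the sensor class instead of instantiating a real Airflow sensor.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AirflowProducer:
    """Hypothetical container for the `produced_by.airflow` block."""
    instance: str
    dag_id: str
    task_group_id: Optional[str] = None


def parse_produced_by(dataset_config: dict) -> Optional[AirflowProducer]:
    """Return the Airflow producer annotation, or None if it is absent."""
    airflow_cfg = (dataset_config.get("produced_by") or {}).get("airflow")
    if airflow_cfg is None:
        return None
    return AirflowProducer(
        instance=airflow_cfg["instance"],
        dag_id=airflow_cfg["dag_id"],
        task_group_id=airflow_cfg.get("task_group_id"),
    )


def choose_sensor_name(producer: AirflowProducer, current_instance: str) -> str:
    """Pick the sensor type depending on where the producer DAG runs.

    Assumption: a producer on the same instance is watched with the basic
    ExternalTaskSensor, a producer on another instance with
    RestExternalTaskSensor.
    """
    if producer.instance == current_instance:
        return "ExternalTaskSensor"
    return "RestExternalTaskSensor"


if __name__ == "__main__":
    config = {
        "produced_by": {
            "airflow": {
                "instance": "search",
                "dag_id": "dummy_dag",
                "task_group_id": "dummy_grouped_tasks",
            }
        }
    }
    producer = parse_produced_by(config)
    print(choose_sensor_name(producer, current_instance="search"))
    print(choose_sensor_name(producer, current_instance="analytics"))
```

The instance name `analytics` is used here only as an example of "some other instance"; a dataset without a `produced_by` block yields `None` from the parser, in which case no producer-based sensor would be configured.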