Hi all,
I am using Airflow, and the tasks in my DAG depend on data sources with different latencies.
I want to run the DAG over all the rows in one main dataset, with each task computing a value (from a different source dataset) and writing it to one column of the main dataset (say, task 4 writes column 4). How can I efficiently track which rows have been completely processed, and re-run the tasks for the rows whose source data was not yet available on the first pass?
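To make the setup concrete, here is a minimal sketch of the pattern I have in mind (all names are hypothetical, and I'm using pandas just for illustration): each task's output column starts as NaN, NaN doubles as the "not yet processed" marker, and each run a task only touches the rows still pending for its column, so late-arriving data is retried automatically on the next run.

```python
import numpy as np
import pandas as pd

# Hypothetical layout: task N is responsible for filling column "col_N".
TASK_COLUMNS = ["col_1", "col_2", "col_3", "col_4"]

def make_main_dataset(n_rows):
    """Main dataset with one output column per task, initially unfilled."""
    df = pd.DataFrame({"row_id": range(n_rows)})
    for col in TASK_COLUMNS:
        df[col] = np.nan  # NaN marks "not yet computed"
    return df

def pending_rows(df, col):
    """Row ids a task still has to (re-)process for its column."""
    return df.loc[df[col].isna(), "row_id"].tolist()

def run_task(df, col, compute):
    """Fill a task's column for its pending rows only.

    `compute(row_id)` stands in for the per-task computation; it returns
    None when the upstream data for that row isn't available yet, so the
    row stays NaN and is picked up again on the next DAG run.
    """
    for rid in pending_rows(df, col):
        value = compute(rid)
        if value is not None:
            df.loc[df["row_id"] == rid, col] = value

def fully_processed(df):
    """Rows where every task has written its value."""
    return df[df[TASK_COLUMNS].notna().all(axis=1)]
```

With this shape, "which rows are done" is just a null-check across the task columns, and re-running a task is idempotent because it skips rows already filled.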