How do you calculate a running total for the past 7 days?
Sigiloso
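For the last 7 days, we need a range-based window ordered by date. If order_date is a DateType column, casting it straight to long does not give Unix seconds (Spark returns null), so cast it to timestamp first:

    from pyspark.sql import Window
    from pyspark.sql import functions as F

    # Order by the date as Unix seconds so the frame bounds can be given in seconds.
    # 6 days back + today = 7 days.
    window = (
        Window.orderBy(F.col("order_date").cast("timestamp").cast("long"))
        .rangeBetween(-6 * 86400, 0)
    )

    result = df.withColumn("running_total_7d", F.sum("amount").over(window))
    result.show()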
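To sanity-check what a range-based 7-day frame should return, here is a minimal plain-Python sketch that needs no Spark session. The function name `running_total_7d` and the sample orders are made up for illustration; it simply sums every amount whose date falls between 6 days before the current row's date and that date, inclusive.

```python
from datetime import date, timedelta

def running_total_7d(rows):
    """rows: list of (order_date, amount) tuples.

    Returns (order_date, amount, total) where total sums all amounts dated
    within the 7-day range ending on that row's date (6 days back + today).
    """
    out = []
    for d, amt in rows:
        start = d - timedelta(days=6)
        total = sum(a for other, a in rows if start <= other <= d)
        out.append((d, amt, total))
    return out

# Hypothetical sample data, not taken from the question.
orders = [
    (date(2024, 1, 1), 10),
    (date(2024, 1, 3), 20),
    (date(2024, 1, 7), 30),
    (date(2024, 1, 8), 40),
    (date(2024, 1, 8), 5),
]

totals = running_total_7d(orders)
for row in totals:
    print(row)
```

Note that both rows dated 2024-01-08 get the same total: a range frame includes all rows that share the current row's ordering value, so several orders on one day are counted together rather than one by one.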