I would like to iterate over a column of my DataFrame and compute a cumulative (running) calculation on it, but I can't get it to work. Can you help me?
Here is how I create my DataFrame. I would like to compute the accumulated value of the blglast column and store it in a new column.
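from pyspark import SparkContext
from pyspark.sql import HiveContext

# Reuse the existing SparkContext (the shell's sc) or create one.
sc = SparkContext.getOrCreate()
hive_context = HiveContext(sc)
tab = hive_context.table("table")
# df is the Spark DataFrame holding the blglast column.
df = hive_context.sql("SELECT blglast FROM tab_temp AS b LIMIT 50")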
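
To make the goal concrete, here is a minimal sketch of the result I am after, written in plain Python rather than Spark. The sample values and the names `running_total` and `blglast_cum` are hypothetical, not taken from the real table.

```python
from itertools import accumulate


def running_total(values):
    """Return the cumulative sum of values, one entry per input row."""
    return list(accumulate(values))


# Hypothetical sample values standing in for the blglast column.
blglast = [10, 20, 5, 15]

# Each entry is the sum of the current row and all rows before it.
blglast_cum = running_total(blglast)

for value, total in zip(blglast, blglast_cum):
    print(value, total)
```

In Spark itself this would presumably be a windowed sum over an ordered window. That needs a column defining the row order, and the query above does not select one.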