Operations - Transformations and Actions

Transformations - Return a new RDD derived from an existing RDD (for example, map or filter).

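Actions - Compute a result from an RDD and either return it to the driver program or write it to external storage (for example, count or collect).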

Lazy Evaluation - Transformations are not computed when they are defined. Spark only records them, and runs them when an action is called on the RDD.

As you derive new RDDs from each other using transformations, Spark keeps track of the set of dependencies between the different RDDs, called the lineage graph. It uses this information to compute each RDD on demand and to recover lost data if part of a persistent RDD is lost.
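The idea of lazy evaluation and lineage can be sketched without a Spark cluster. The block below is a toy model in plain Python, not Spark's API: the class ToyRDD, its methods, and the helper square are hypothetical names invented for illustration. It assumes a single linear chain of dependencies, whereas a real lineage graph can branch. Transformations only record what to do, and the action walks the recorded chain to produce a result.

```python
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark's class).

    It stores a parent and a function instead of computed data.
    """

    def __init__(self, source=None, parent=None, op=None, name="source"):
        self._source = source  # only set on the root dataset
        self._parent = parent  # the ToyRDD this one was derived from
        self._op = op          # function applied to the parent's iterator
        self.name = name

    # Transformations: return a new ToyRDD and compute nothing.
    def map(self, f):
        return ToyRDD(parent=self, op=lambda it: (f(x) for x in it), name="map")

    def filter(self, pred):
        return ToyRDD(parent=self,
                      op=lambda it: (x for x in it if pred(x)),
                      name="filter")

    def lineage(self):
        """Names of the steps this ToyRDD depends on, root first."""
        chain = [] if self._parent is None else self._parent.lineage()
        return chain + [self.name]

    def _compute(self):
        # Rebuild the data by replaying the lineage from the root.
        if self._parent is None:
            return iter(self._source)
        return self._op(self._parent._compute())

    # Actions: walk the lineage and produce a result.
    def collect(self):
        return list(self._compute())

    def count(self):
        return sum(1 for _ in self._compute())


calls = []


def square(x):
    calls.append(x)  # record when the function really runs
    return x * x


numbers = ToyRDD(source=[1, 2, 3, 4])
evens = numbers.map(square).filter(lambda x: x % 2 == 0)

calls_before_action = list(calls)  # snapshot taken before any action
result = evens.collect()           # the action triggers the computation
```

In this sketch calls_before_action is empty because square has not run when only map and filter have been called. Because each ToyRDD keeps a reference to its parent, the result can be recomputed from the root at any time, which is the same property Spark relies on to rebuild lost partitions.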

Transformations:

Actions:
