Apache Spark What is the difference between requiredChildDistribution and outputPartitioning?

58 views Asked by At

In Apache Spark, each physical operator in the physical plan has 4 properties:

  • outputPartitioning
  • outputOrdering​
  • requiredChildDistribution
  • requiredChildOrdering

But aren't outputPartioning and requiredChildDistribution the same? How are they different and what do they fundamentally represent? Same for outputOrdering and requiredChildOrdering?

0

There are 0 answers