Set explicit guidelines when running large jobs #459

Open
williambrandler opened this issue Dec 7, 2021 · 0 comments
Spark jobs with a large number of partitions can crash if the driver is too small, failing with errors such as:

Caused by: org.apache.spark.SparkException: Job aborted due to stage failure: Total size of serialized results of 731259 tasks (4.0 GiB) is bigger than spark.driver.maxResultSize 4.0 GiB.
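As a reference point, here is a minimal PySpark sketch of what an explicit driver setup could look like for a job like this; the configuration values and input path are illustrative assumptions, not project defaults.

```python
from pyspark.sql import SparkSession

# Sketch of explicit driver sizing for a large job.
# Values are illustrative, not recommendations from this repo.
spark = (
    SparkSession.builder
    .appName("glow-large-job")
    # Raise the cap on serialized results returned to the driver
    # (setting it to 0 disables the limit, which is riskier than a sized value).
    .config("spark.driver.maxResultSize", "8g")
    # Give the driver enough heap to actually hold those results.
    .config("spark.driver.memory", "16g")
    .getOrCreate()
)

# Reducing the partition count before any action that pulls results to the
# driver also keeps the number of serialized task results down.
df = spark.read.parquet("/path/to/input")  # hypothetical path
df = df.coalesce(1024)
```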

We should set explicit cluster-setup guidelines for each job (driver size, spark.driver.maxResultSize, and partitioning).

This will come once the continuous integration pipeline is running on multi-task jobs with a different cluster setup for each use case (ingest vs. ETL vs. regression tests, etc.).
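To make the idea concrete, a hypothetical sketch of per-use-case guidelines that such a pipeline could apply is below; the use-case names mirror the ones above, but the settings and helper are assumptions, not existing code in this repository.

```python
from pyspark.sql import SparkSession

# Hypothetical per-use-case cluster guidelines; values are placeholders.
CLUSTER_GUIDELINES = {
    "ingest": {
        "spark.driver.memory": "8g",
        "spark.driver.maxResultSize": "4g",
        "spark.sql.shuffle.partitions": "2000",
    },
    "etl": {
        "spark.driver.memory": "16g",
        "spark.driver.maxResultSize": "8g",
        "spark.sql.shuffle.partitions": "4000",
    },
    "regressions": {
        "spark.driver.memory": "32g",
        "spark.driver.maxResultSize": "16g",
        "spark.sql.shuffle.partitions": "8000",
    },
}

def build_session(use_case: str) -> SparkSession:
    """Create a SparkSession with the explicit settings for one job type."""
    builder = SparkSession.builder.appName(f"glow-{use_case}")
    for key, value in CLUSTER_GUIDELINES[use_case].items():
        builder = builder.config(key, value)
    return builder.getOrCreate()

# Example: spark = build_session("etl")
```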
