
PYSEC-2019-114

Details

Prior to Spark 2.3.3, in certain situations Spark would write user data to local disk unencrypted, even if spark.io.encryption.enabled=true. The affected code paths are: cached blocks that are fetched to disk (controlled by spark.maxRemoteBlockSizeFetchToMem); parallelize in SparkR; broadcast and parallelize in PySpark; and Python UDFs.


Affected packages

PyPI / pyspark
Introduced in: 0
Fixed in: 2.3.3
Fix: pip install --upgrade 'pyspark>=2.3.3'
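The affected range above (introduced in 0, fixed in 2.3.3) can be checked programmatically. The following is a minimal sketch, not an official tool: the helper names parse_version and is_affected are hypothetical, and the parsing is a simplification that only looks at the leading numeric release components, so a pre-release such as 2.3.3rc1 would be treated the same as 2.3.3.

```python
import re

# From the advisory: every pyspark release before 2.3.3 is affected.
FIXED_VERSION = (2, 3, 3)


def parse_version(version):
    """Return the leading numeric release components as a tuple.

    Simplification (assumption): suffixes such as 'rc1' or '.dev0'
    are ignored, and short versions are padded with zeros.
    """
    match = re.match(r"(\d+(?:\.\d+)*)", version.strip())
    if match is None:
        raise ValueError("unrecognized version string: %r" % version)
    parts = tuple(int(p) for p in match.group(1).split("."))
    # Pad '2.3' to (2, 3, 0) so comparisons against FIXED_VERSION are fair.
    return parts + (0,) * (3 - len(parts))


def is_affected(version):
    """True if the given pyspark version falls in the affected range."""
    return parse_version(version) < FIXED_VERSION


if __name__ == "__main__":
    # Check the locally installed pyspark, if there is one.
    from importlib.metadata import PackageNotFoundError, version

    try:
        installed = version("pyspark")
    except PackageNotFoundError:
        print("pyspark is not installed in this environment")
    else:
        status = "affected" if is_affected(installed) else "not affected"
        print("pyspark %s: %s" % (installed, status))
```

For example, is_affected("2.3.2") returns True, while is_affected("2.3.3") and is_affected("2.4.0") return False. For strict PEP 440 handling of pre-releases, a dedicated version parser would be a safer choice than this regex.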

References