Data Engineering: Java gateway error on server

21 Nov 2017

This post documents an error we encountered on the CDS server when creating a Spark session.

Traceback (most recent call last):
  File "", line 22, in <module>
    .config("spark.driver.maxResultSize", "4g") \
  File "/home/serverteam_1/anaconda3/lib/python3.6/site-packages/pyspark/sql/", line 169, in getOrCreate
    sc = SparkContext.getOrCreate(sparkConf)
  File "/home/serverteam_1/anaconda3/lib/python3.6/site-packages/pyspark/", line 334, in getOrCreate
    SparkContext(conf=conf or SparkConf())
  File "/home/serverteam_1/anaconda3/lib/python3.6/site-packages/pyspark/", line 115, in __init__
    SparkContext._ensure_initialized(self, gateway=gateway, conf=conf)
  File "/home/serverteam_1/anaconda3/lib/python3.6/site-packages/pyspark/", line 283, in _ensure_initialized
    SparkContext._gateway = gateway or launch_gateway(conf)
  File "/home/serverteam_1/anaconda3/lib/python3.6/site-packages/pyspark/", line 95, in launch_gateway
    raise Exception("Java gateway process exited before sending the driver its port number")
Exception: Java gateway process exited before sending the driver its port number
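
The traceback does not say why the gateway exited, and the root cause on this server is not recorded here. In general, this exception means the JVM that PySpark tries to launch never started, and a frequent culprit is a missing Java install or a `JAVA_HOME` that points at the wrong place. Below is a minimal sketch of a pre-flight check to run before building the session, assuming a POSIX-style layout where the executable lives at `$JAVA_HOME/bin/java`. The function name `diagnose_java_environment` is hypothetical, not part of PySpark.

```python
import os
import shutil


def diagnose_java_environment(env=None, which=shutil.which):
    """Return a list of problems that could stop the JVM from launching.

    Hypothetical helper for illustration; `env` and `which` are injectable
    so the check can be exercised without touching the real environment.
    """
    env = os.environ if env is None else env
    problems = []

    java_home = env.get("JAVA_HOME")
    if java_home:
        # Assumption: POSIX layout, executable at $JAVA_HOME/bin/java.
        java_bin = os.path.join(java_home, "bin", "java")
        if not os.path.isfile(java_bin):
            problems.append(
                "JAVA_HOME is set to %r but %r does not exist" % (java_home, java_bin)
            )
    elif which("java") is None:
        # Without JAVA_HOME, the launcher falls back to `java` on PATH.
        problems.append("no `java` executable on PATH and JAVA_HOME is unset")

    return problems


if __name__ == "__main__":
    for problem in diagnose_java_environment():
        print("WARNING:", problem)
```

An empty list only rules out these two causes; the gateway can also fail for other reasons (for example, invalid submit arguments or an incompatible Java version), so treat this as a first check rather than a full diagnosis.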

Anyway, we outlined what needs to be done and summarized it in our final writeup: Performance and Costly Functions on Server

Basically, up to this point, Linnea and I have investigated
