He is Budi

Enabling and using Pyspark with Jupyter and Anaconda

09/02/2018 07/03/2018 Budi Wibowo (Author)

I remember it took me some time to get this configured when I first started trying out Jupyter and Spark. Hopefully this is helpful for others. This works for Hadoop 2.6.0-CDH5.9.1 and Spark 1.6.0, using Python 2.7 and Python 3. For other versions, you need to adjust the path accordingly. Basically, you just need to tell Spark 4 things: […]


My Work and Things I Can't Find Googling!

Category: spark
