Instructions tested with Windows 10 64-bit. It is highly recommended that you use Mac OS X or Linux for this course; these instructions are only for people who cannot run Mac OS X or Linux on their computer. Spark provides APIs in Scala, Java, Python (PySpark) and R. We use PySpark and Jupyter, previously known as IPython Notebook, as the development environment. There are many articles online that talk about Jupyter and what a great tool it is, so we won’t introduce it in detail here. This guide assumes you already have Anaconda and Gnu On Windows installed.

Get the Spark pre-built package from the downloads page of the Spark project website.
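
To confirm that the prerequisites mentioned here are in place, a quick check from Python can help. The following is a minimal sketch using only the standard library, assuming the Python interpreter that ships with Anaconda; `check_environment` is a hypothetical helper name, not part of Spark or Jupyter. It only reports what it finds and installs nothing.

```python
import importlib.util
import platform
import shutil


def check_environment():
    """Collect a few facts about the local setup as a plain dict.

    Hypothetical helper for illustration; it only inspects the machine.
    """
    return {
        # "Windows", "Linux", or "Darwin" (Mac OS X)
        "os": platform.system(),
        "os_release": platform.release(),
        # CPU architecture string as reported by the operating system
        "machine": platform.machine(),
        "python": platform.python_version(),
        # find_spec returns None when a top-level package cannot be imported
        "pyspark_importable": importlib.util.find_spec("pyspark") is not None,
        # shutil.which returns None when no such executable is on PATH
        "jupyter_on_path": shutil.which("jupyter") is not None,
    }


if __name__ == "__main__":
    for name, value in check_environment().items():
        print(f"{name}: {value}")
```

If `pyspark_importable` is reported as `False` at this stage, that is expected until Spark has been downloaded and made visible to Python.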