A frequently recommended distribution is often that of Anaconda as it ships with a lot of useful libraries and has binary installers for Windows.
#Install spark ubuntu command line install#
If you like to use the Spark Python API via PySpark, you will need to install Python. You can do so by following instructions from on Microsoft/BashOnWindows/Unable to install Java JDK or Runtime #49. In case you like to use Bash, you will need to install Java on the Linux subsystem for Windows separately as it is a distinct OS. You can check which version of Java you are running on the CMD via java -version Note that the path to the JRE may be different when using a JDK. To run Spark, you only require a Java runtime environment (JRE) but you may also download the Java development kit (JDK) which includes the JRE. Install Javaīinary installers for Java can be downloaded from Oracle Java SE. For more details, see Do not change Linux files using Windows apps and tools. This includes using any Windows editor to edit Linux files (e.g. It is very important that you do not use any Windows apps/tools to perform any operation on Linux files as these files may get corrupted. you will eventually edit configuration files of the Linux on Windows subsystem. If you plan to set up Bash on Windows to work with Spark etc. Python (in case you want to use PySpark)Ī general note of caution for Bash on Windows.Java runtime environment (JRE) or Java Development Kit (JDK) version >= 8.
#Install spark ubuntu command line how to#
We show how to install Spark on Windows 10 (see exact system info below) but generally it should also work on Windows 7. We will demonstrate a basic example how to use Scala and Python (via PySpark) from the command line. It is assumed that the reader has basic knowledge of the Windows command line (CMD) and, if Bash should be used, a basic familiarity with Bash on Ubuntu on Windows (part of Linux Subsystem on Windows). This is a quick tutorial how to install ApacheSpark on Windows via the pre-built on Apache Hadoop. Download/clone windows utilities for corresponding Hadoop version.A general note of caution for Bash on Windows.