How to run Spark locally on Windows using Eclipse in Java


I'm trying to test MLlib's implementation of SVM. I want to run their Java example locally on Windows, using Eclipse. I've downloaded Spark 1.3.1 pre-built for Hadoop 2.6. When I try to run the example code, I get:

15/06/11 16:17:09 ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

What should I change in order to be able to run the example code in this setup?


There are 3 answers

vishal rathod
  1. Create the directory E:\hadoop_home\bin

  2. Download the winutils.exe file from the hadoop-x.x.x/bin directory matching your Hadoop version in the following GitHub repo: https://github.com/steveloughran/winutils

  3. Place the downloaded winutils.exe file into the E:\hadoop_home\bin directory created in step 1.

  4. Set the hadoop.home.dir system property in your code, for example:

object QuesCount {

  def main(args: Array[String]): Unit = {
    // Point Hadoop at the directory that contains bin\winutils.exe.
    // This must run before any Spark or Hadoop code executes.
    System.setProperty("hadoop.home.dir", "E:\\hadoop_home")

    // ... create your SparkContext and run your Spark code here ...
  }
}
  5. Right-click your Scala file and choose Run As > Scala Application.
snesneros
  1. Create the following directory structure: "C:\hadoop_home\bin" (or replace "C:\hadoop_home" with whatever you like)

  2. Download the following file: http://public-repo-1.hortonworks.com/hdp-win-alpha/winutils.exe

  3. Put the file from step 2 into the "bin" directory from step 1.

  4. Set the "hadoop.home.dir" system property to "C:\hadoop_home" (or whatever directory you created in step 1, without the "\bin" at the end), as shown in the sketch below.
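
Put together, a minimal Java sketch of these steps might look like the following (the class name, app name, and local[*] master are illustrative assumptions, not part of the original answer):

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

public class WinutilsExample {
    public static void main(String[] args) {
        // Must be set before any Spark/Hadoop class is used, so that
        // Hadoop can find C:\hadoop_home\bin\winutils.exe.
        System.setProperty("hadoop.home.dir", "C:\\hadoop_home");

        SparkConf conf = new SparkConf()
                .setAppName("WinutilsExample") // hypothetical app name
                .setMaster("local[*]");        // run Spark locally inside Eclipse

        JavaSparkContext sc = new JavaSparkContext(conf);
        System.out.println("Spark started, version " + sc.version());
        sc.stop();
    }
}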

AudioBubble

To run Spark in Windows Eclipse with a Maven project:

  1. Create a Scala project and declare a Scala object. Then convert the project to Maven (you need the m2eclipse plugin installed for this; you can find it in the Eclipse Marketplace).
  2. A pom.xml will be created. Add the dependency below:

<dependency>
      <groupId>org.apache.spark</groupId>
      <artifactId>spark-core_2.10</artifactId>
      <version>1.6.0</version>
</dependency>
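
Since the original question is about testing MLlib's SVM, spark-core alone will likely not be enough; you would also need the matching MLlib artifact (this addition is an assumption, the answer itself lists only spark-core):

<dependency>
      <groupId>org.apache.spark</groupId>
      <artifactId>spark-mllib_2.10</artifactId>
      <version>1.6.0</version>
</dependency>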

  3. Now build your project with some sample Spark code (there should not be any errors).
  4. Now follow the setup below:

    • Create the following directory structure: "C:\hadoop_home\bin" (or replace "C:\hadoop_home" with whatever you like)
    • Download the following file: http://public-repo-1.hortonworks.com/hdp-win-alpha/winutils.exe
    • Put the downloaded winutils.exe into the "bin" directory you just created.
    • Set the "hadoop.home.dir" system property to "C:\hadoop_home" (or whatever directory you created, without the "\bin" at the end). Note: declare this property at the very beginning of your Spark code, as in the line below.

System.setProperty("hadoop.home.dir", "C:\\hadoop_home") // must match the directory from the setup above, without "\bin"
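
With the dependency and winutils in place, a minimal MLlib SVM smoke test in Java could look like the sketch below (the class name and the toy training data are illustrative; this assumes the spark-mllib artifact suggested earlier is on the classpath):

import java.util.Arrays;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.mllib.classification.SVMModel;
import org.apache.spark.mllib.classification.SVMWithSGD;
import org.apache.spark.mllib.linalg.Vectors;
import org.apache.spark.mllib.regression.LabeledPoint;

public class SvmLocalTest {
    public static void main(String[] args) {
        // Step from the answer above: must come before any Spark code.
        System.setProperty("hadoop.home.dir", "C:\\hadoop_home");

        SparkConf conf = new SparkConf().setAppName("SvmLocalTest").setMaster("local[*]");
        JavaSparkContext sc = new JavaSparkContext(conf);

        // A tiny hand-made training set: label 1.0 vs 0.0 on one feature.
        JavaRDD<LabeledPoint> training = sc.parallelize(Arrays.asList(
                new LabeledPoint(1.0, Vectors.dense(2.0)),
                new LabeledPoint(1.0, Vectors.dense(3.0)),
                new LabeledPoint(0.0, Vectors.dense(-2.0)),
                new LabeledPoint(0.0, Vectors.dense(-3.0))));

        // Train a linear SVM for 100 iterations and try one prediction.
        SVMModel model = SVMWithSGD.train(training.rdd(), 100);
        System.out.println("Prediction for 4.0: " + model.predict(Vectors.dense(4.0)));

        sc.stop();
    }
}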