I am using Cascalog in Eclipse. It looks like the Hadoop dependency is declared in the project's project.clj file, like this:
:profiles { :dev {:dependencies [[org.apache.hadoop/hadoop-core "1.1.2"]]}}
If I need to depend on a locally installed single-node Hadoop cluster, or on an external Hadoop cluster, how should I do that? If it is local, should I simply put the path to Hadoop in place of "org.apache.hadoop"? Your ideas would be appreciated.
Best Regards, Sindhu
Sindhu, your cluster's location doesn't belong in project.clj.
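project.clj is to Clojure what pom.xml is to Java/Maven: it declares dependencies as artifact coordinates (group/artifact plus a version), not as filesystem paths, so replacing "org.apache.hadoop" with a path to Hadoop won't work. Check out the tutorial on Leiningen's dependency management here. You should make sure that the Hadoop version you declare a dependency on matches the version of the cluster you'll be running against.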
Which cluster you end up running on is controlled by the Hadoop conf files, specifically by the job tracker location set with "mapred.job.tracker" in mapred-site.xml. You can read about them here.
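As a rough sketch of what that setting might look like in mapred-site.xml on a Hadoop 1.x install: the host and port below (localhost:9001) are only an assumption for a single-node setup, so substitute whatever address your job tracker actually listens on.

```xml
<?xml version="1.0"?>
<!-- mapred-site.xml: sketch only; the value below is an assumed address -->
<configuration>
  <property>
    <!-- host:port of the job tracker that jobs are submitted to -->
    <name>mapred.job.tracker</name>
    <!-- assumption: single-node cluster on this machine; use your cluster's host:port for an external one -->
    <value>localhost:9001</value>
  </property>
</configuration>
```

For an external cluster, only the value should need to change; project.clj stays the same as long as the declared Hadoop version matches the cluster.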