I am new to Spark and was trying to run the example mentioned in SparkR page. With some effort, I was able to install sparkR into my machine and was able to run the basic wordcount example. However, when I try to run:
library(SparkR) #works fine - loads the package sc <- sparkR.init() #works fine sqlContext <- sparkRSQL.init(sc) #fails
It says, there is no package called ‘sparkRSQL’. As per documentation sparkRSQL.init is a function in sparkR package. Please let me know if I am missing anything here.
Thanks in advance.
I have already faced this problem when trying to test sparkR. There is a lack of documentation on this part. The problem is that "sparkRSQL" and "sparkRHive" are not included in the master branch, so you have to install sparkR package from the "sparkr-sql" branch using this command:
There was a hint in Amplab website