Spark HiveContext : Spark Engine OR Hive Engine?

Question

999 views Asked by Rohan Nayak At 14 September 2017 at 06:00

I am trying to understand Spark's HiveContext. When we write a query using HiveContext like this:

import org.apache.spark.sql.hive.HiveContext
// sc is an existing SparkContext
val sqlContext = new HiveContext(sc)
sqlContext.sql("select * from TableA inner join TableB on (a = b)")

Is it using the Spark engine or the Hive engine? I believe the above query gets executed by the Spark engine. But if that's the case, why do we need DataFrames?

We could blindly copy all our Hive queries into sqlContext.sql("") and run them without using DataFrames.

By DataFrames, I mean something like TableA.join(TableB, a === b). We can even perform aggregations using SQL commands. Could anyone please clarify the concept? Is there any advantage to using DataFrame joins rather than a sqlContext.sql() join? The join is just an example. :)

There is 1 answer

**Arush Kharbanda** · Accepted Answer · 2017-09-14T07:28:21+00:00

The Spark HiveContext uses the Spark execution engine underneath; see the Spark source code.

Parser support in Spark is pluggable, and HiveContext uses Spark's HiveQL parser.

Functionally, you can do everything with SQL, so DataFrames are not strictly needed. But DataFrames provide a convenient way to achieve the same results: the user doesn't need to write a SQL statement.
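One way to check this yourself is to run the same join both ways and compare the physical plans that Spark prints. The following is only a minimal sketch, assuming a Spark 1.x style setup with the spark-hive module on the classpath; the object name, the buildJoins helper, the local master, and the sample rows standing in for TableA and TableB are all made up for illustration. Note that sqlContext.sql() itself returns a DataFrame, so both variants end up as the same kind of object.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.hive.HiveContext

object SqlVsDataFrameJoin {

  // Hypothetical helper: builds the same inner join once via SQL
  // and once via the DataFrame API, and returns both results.
  def buildJoins(sqlContext: HiveContext): (DataFrame, DataFrame) = {
    import sqlContext.implicits._

    // Made-up sample data standing in for TableA and TableB.
    val tableA = Seq((1, "x"), (2, "y")).toDF("a", "labelA")
    val tableB = Seq((1, "p"), (3, "q")).toDF("b", "labelB")

    // Register the DataFrames so the SQL text can refer to them by name.
    tableA.registerTempTable("TableA")
    tableB.registerTempTable("TableB")

    val viaSql = sqlContext.sql("SELECT * FROM TableA INNER JOIN TableB ON (a = b)")
    val viaApi = tableA.join(tableB, $"a" === $"b")
    (viaSql, viaApi)
  }

  def main(args: Array[String]): Unit = {
    // Local master and app name are assumptions for a quick local run.
    val conf = new SparkConf().setMaster("local[*]").setAppName("sql-vs-dataframe")
    val sc = new SparkContext(conf)
    val sqlContext = new HiveContext(sc)

    val (viaSql, viaApi) = buildJoins(sqlContext)

    // Both calls print a Spark physical plan; no Hive execution engine is involved.
    viaSql.explain()
    viaApi.explain()

    sc.stop()
  }
}
```

If the two printed plans look alike, that suggests the choice between SQL strings and the DataFrame API is mostly about convenience: the DataFrame version has its method calls and structure checked by the Scala compiler, while the SQL string is only parsed at runtime. On Spark 2.x and later, registerTempTable and HiveContext are deprecated, so the sketch would need adapting there.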
