I'm wondering if anyone knows of a good tutorial for installing HCatalog on top of Hive so I can load data in via Hive, query via Impala and also run some scripts using Pig?
I'm wondering if downloading the VM from Cloudera would be the best option but I haven't seen many tutorials (that are clear with config steps) to get HCatalog running with Hive!
Thanks in advance :)
 
                        
Starting from Hive 0.12, HCatalog comes with hive. You do not need separate installation or start hcat server. Use the appropriate mechanisms to access hcat managed tables from different mapreduce tools (Pig, java map-reduce programs)