Generating TPCH-SF300 and SF1000 data

282 views Asked by Netme121 At 11 October 2020 at 10:26

I am trying to generate SF300 and SF1000 TPCH data on Databricks. However, my scripts have been running for over 24hrs now and I am guessing I did something wrong.

I followed the instructions the instructions on: https://github.com/databricks/spark-sql-perf. Then I used the notebook(tpcds_datagen.scala) in their repository to generate data. Of course, I modified the parameters to change TPC-DS to TPC-H. But it's extremely slow.

Could someone suggest a quicker way and help me out? Thanks in advance.

Original Q&A

TechQA.

Generating TPCH-SF300 and SF1000 data

There are 0 answers

Related Questions in DATABASE

Related Questions in APACHE-SPARK

Related Questions in DATABRICKS

Related Questions in DATA-GENERATION

Related Questions in TPC

Popular Questions

Popular Tags

Trending Questions