Created 11-01-2017 06:52 AM
I am executing the tpcds-build.sh file before running tpcds-setup.sh file to generate data for hive benchmarking.
The error after executing the command ./tpcds-build.sh :
Building TPC-DS Data Generator curl http://dev.hortonworks.com.s3.amazonaws.com/hive-testbench/tpcds/README curl: (7) couldn't connect to host make: *** [tpcds_kit.zip] Error 7 TPC-DS Data Generator built, you can now use tpcds-setup.sh to generate data.
Please help in this regard.
Thanks in advance.
Created 11-01-2017 07:00 AM
Created 11-01-2017 07:15 AM
Can you please try cloning this repo and run the tpcds-build.sh
https://github.com/hortonworks/hive-testbench (hive14 branch)
Also, please check that maven and gcc are installed. Make sure that you are able to ping dev.hortonworks.com. If you are not able to ping then your firewall might be blocking it. You should disable your firewall in that case or allow this url to pass.
Thanks,
Aditya
Created 11-01-2017 08:51 AM
Hi Aditya,
Thanks for the answer. Both maven and gcc are installed. Is there any way I can build this and generate data without internet connection??? since the cluster I am using has no connection to internet.
Thanks in advance.
Created 11-02-2017 08:50 AM
I didn't try this out. But I guess you can run tpcds-build.sh where you have internet connectivity build all the artifacts required. Copy them to the box where you need to run the setup.sh and run the setup script. Make sure to use same java version in both the boxes.