Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Why cann't I access the url which is "http://util-1.ent.cloudera.com/impala-test-data/"?

avatar
Explorer

where can i to get impala test data?

1 ACCEPTED SOLUTION

avatar
Explorer

Impala TPC-DS test data generation by this project https://github.com/cloudera/impala-tpcds-kit    

View solution in original post

3 REPLIES 3

avatar
Explorer

set -e
set -u

echo "Copying data files from the share. If the file already exists locally, the files"\
"will not be copied. It's not check summing the files or anything like that, if"\
"you need to force a copy, delete the local directory:"\
"IMPALA_HOME/testdata/impala-data"

DATAsrc="http://util-1.ent.cloudera.com/impala-test-data/"
DATADST=${IMPALA_HOME}/testdata/impala-data

mkdir -p ${DATADST}
pushd ${DATADST}

# Download all .tar.gz files from the source, excluding the hostname and directory name.
# If the file already exists locally, skip the download.
wget -q --cut-dirs=1 --no-clobber -r --no-parent -nH --accept="*.tar.gz" ${DATASRC}
for filename in *.tar.gz
do
echo "Extracting: ${filename}"
tar -xzf ${filename}
done
popd

echo "Test data download successful."

avatar
Explorer

avatar
Explorer

Impala TPC-DS test data generation by this project https://github.com/cloudera/impala-tpcds-kit