Support Questions

Find answers, ask questions, and share your expertise

Why cann't I access the url which is "http://util-1.ent.cloudera.com/impala-test-data/"?

avatar
Explorer

where can i to get impala test data?

1 ACCEPTED SOLUTION

avatar
Explorer

Impala TPC-DS test data generation by this project https://github.com/cloudera/impala-tpcds-kit    

View solution in original post

3 REPLIES 3

avatar
Explorer

set -e
set -u

echo "Copying data files from the share. If the file already exists locally, the files"\
"will not be copied. It's not check summing the files or anything like that, if"\
"you need to force a copy, delete the local directory:"\
"IMPALA_HOME/testdata/impala-data"

DATAsrc="http://util-1.ent.cloudera.com/impala-test-data/"
DATADST=${IMPALA_HOME}/testdata/impala-data

mkdir -p ${DATADST}
pushd ${DATADST}

# Download all .tar.gz files from the source, excluding the hostname and directory name.
# If the file already exists locally, skip the download.
wget -q --cut-dirs=1 --no-clobber -r --no-parent -nH --accept="*.tar.gz" ${DATASRC}
for filename in *.tar.gz
do
echo "Extracting: ${filename}"
tar -xzf ${filename}
done
popd

echo "Test data download successful."

avatar
Explorer

avatar
Explorer

Impala TPC-DS test data generation by this project https://github.com/cloudera/impala-tpcds-kit