I am trying to execute below
./bin/tool.sh ./lib/accumulo-examples-simple.jar $PKG.BulkIngestExample $ARGS -t test_sample --inputDir /tmp/bulk -workDir /tmp/bulkWork
Here is the error
Accumulo is not properly configured.
Try running $ACCUMULO_HOME/bin/bootstrap_config.sh and then editing
After that I tried running "$ACCUMULO_HOME/bin/bootstrap_config.sh" in my shell
It's asking more questions.
Choose the heap configuration:
Choose the Accumulo memory-map type:
Choose the Apache Hadoop version:
1) HADOOP 1
2) HADOOP 2
Please help me on this.
You should not use bootstrap_config.sh on a system that already has a configured instance of Accumulo. The error you get is because bin/tool.sh has to be able to find Accumulo client configs via ACCUMULO_CONF_DIR.
Presuming your Accumulo instance is managed by CM you should make sure the node you are running commands on is listed as an Accumulo client gateway node and then use the accumulo-tool command in place of the bin/tool.sh script.
Presuming you have a gateway role configured and ACCUMULO_HOME points at the expected location:
$> accumulo-tool $ACCUMULO_HOME/lib/accumulo-examples-simple.jar $PKG.BulkIngestExample $ARGS -t test_sample --inputDir /tmp/bulk --workDir /tmp/bulkWork
Which host are you running commands on?
From your pasted assignments, it looks like only the host "DN1" has a Gateway role defined. Is that correct?
On a host with the gateway role defined can you copy/paste the output of running "accumulo info"? (It'll render better if you place it in a code block.)
The error indicates that Accumulo cannot find one of its configuration files. How did you install Accumulo - via parcels or packages?
Also, please provide the value of your ACCUMULO_CONF_DIR environment variable, e.g.:
When managed by CM, ACCUMULO_CONF_DIR gets handled by the gateway role. Accessing it is rolled into the scripts that are exposed on your path. Please move this discussion to the other thread I mentioned for follow up.