Member since
09-25-2015
356
Posts
382
Kudos Received
62
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2440 | 11-03-2017 09:16 PM | |
1920 | 10-17-2017 09:48 PM | |
3820 | 09-18-2017 08:33 PM | |
4512 | 08-04-2017 04:14 PM | |
3459 | 05-19-2017 06:53 AM |
10-28-2015
09:15 PM
2 Kudos
The docs generally look in good shape. One minor correction: Instead of "hive.server2.enable.SSL" use "hive.server2.use.SSL" It can be turned ON in both binary and http mode.
... View more
10-27-2015
07:43 PM
1 Kudo
Have you tried symlink as workaround ln -s /usr/lib64/libsasl2.so.3 /usr/lib64/libsasl2.so
... View more
10-24-2015
01:09 AM
Can you add more details? What is your workflow xml like? I am guessing you have already tried embedding the Sqoop command with args in exec tag.
... View more
10-23-2015
11:33 PM
1 Kudo
Have you tried passing the sqoop command with arguments <action name="sqoopAction">
<sqoop xmlns="uri:oozie:sqoop-action:0.2">
<command>SQOOP COMMAND AND ARGS</command>
...
</sqoop>
...
</action
... View more
10-23-2015
02:47 PM
This looks like a classpath issue. I suspect your phoenix-client.jar is packaged with classes compiled against a different jackson jar then what it finds in the classpath.
... View more
10-23-2015
02:09 AM
That should work I think.
... View more
10-22-2015
11:41 PM
4 Kudos
You can use the sqljdbc4.jar that ships under the windows install of HDP at <OOZIE_HOME>/extra_libs.
... View more
10-22-2015
08:18 PM
1 Kudo
Looking at the command line options for Oozie, there is an option to the oozie job to specify localtime, see the documentation. -localtime use local time (same as passing your time zone to -timezone). Overrides -timezone option
... View more
10-22-2015
05:33 PM
15 Kudos
Various open-source tools and applications are available to do performance, scalability and reliability testing for various popular HDP components. Here is a list of some of the popular ones: HDFS TestDFSIO Measure the I/O performance of HDFS in your cluster. Source code for the tool can be found here. NameNode Benchmark Applies load on the namenode by performing continuous read, write, rename and delete operations on small files. Source code for this tool can be found here. Synthetic Load Generator The synthetic load generator (SLG) is a tool for testing NameNode behavior under different client loads. The user can generate different mixes of read, write, and list requests by specifying the probabilities of read and write. The user controls the intensity of the load by adjusting parameters for the number of worker threads and the delay between operations. More information on the tool can be found here. YARN/MR TeraSort Measure performance by measuring time to sort 1TB of data. The test runs in three steps. First one is TeraGen to generate the dataset, second one is the TeraSort to sort the generated data and third one is TeraValidate to verify the sort order is correct. You can change to use a different data size. MapReduce Benchmark Runs a job multiple times and takes average of all runs. Source code for the tool can be found here. GridMix GridMix submits a mix of synthetic jobs, modeling a profile mined from production loads. More information on the tool can be found here. HBase YCSB Performance evaluation of HBase under pre-defined workloads. More information can be found here. HBase Performance Evaluation Script used for evaluating HBase performance and scalability. Runs a HBase
client that steps through one of a set of hardcoded tests or 'experiments'
(e.g. a random reads test, a random writes test, etc.). More information can be found here. LoadTest Tool A command-line utility that reads, writes, and verifies data. Unlike
PerformanceEvaluation, this tool validates the data written, and supports
simultaneously writing and reading the same set of keys. Source for the tool can be found here. ChaosMonkey A utility to injects faults in a running cluster. More information can be found here. Hive TPC Benchmarks (TPCDS & TPCH) TPCDS and TPCH are analytic benchmarks that model generally applicable aspects of decision support system. Automated scripts to run TPC benchmarks at scale including the converted queries can be found here. Pig PigMix PigMix is a set of queries used to test pig performance. More information can be found here.
... View more