Member since
07-31-2013
1924
Posts
462
Kudos Received
311
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 2128 | 07-09-2019 12:53 AM | |
| 12446 | 06-23-2019 08:37 PM | |
| 9560 | 06-18-2019 11:28 PM | |
| 10523 | 05-23-2019 08:46 PM | |
| 4894 | 05-20-2019 01:14 AM |
08-23-2018
07:40 PM
If you are using Cloudera Manager and have re-added the host/role after deleting it, Cloudera Manager should've marked your HBase service with a 'Stale Configuration' icon, indicating a restart is required for your HBase service (and client configuration deployment for its gateways) to see the changes in the ZK client configuration. There's no way to live-refresh the configuration at runtime, but you can consider performing a rolling restart to eliminate availability issues.
... View more
08-23-2018
07:38 PM
1 Kudo
There will not be any operational problems such as crashes or errors when running a HDFS balancer on a cluster with HBase running, but there can potentially be a performance impact depending on what the balancer decides to move based on its space thresholds. The performance impact would come from loss of locality - the RegionServers' required HFiles may find their blocks to be remote, so a slightly higher network usage can be observed until the next major compaction rewrites a block replica locally. If you'd like to narrow down the time-frame of impact, you can run the HDFS balancer with the desired balancing threshold, and then once it is complete, immediately follow up with a major compaction command on your latency-sensitive HBase tables.
... View more
08-23-2018
05:12 PM
That worked, thanks very much. I'd missed this difference between HBase and HDFS.
... View more
08-17-2018
11:11 AM
I have this same problem, I too tried follow the method of @Vinod369 and not resolved
... View more
08-11-2018
02:16 AM
Since it wasn't really described how exactly did you resolve it... The point is that on the client side (it's important that it's not on the server side), set "dfs.datanode.use.datanode.hostname" in the org.apache.hadoop.conf.Configuration object to value "true". If the Configuration object isn't created by your code (like if Spark creates it, in my case), then it depends on what creates it... see its documentation. But some guesses: Attempt 1: Set it inside $HADOOP_HOME/etc/hadoop/hdfs-site.xml. Hadoop command line tools use that, your Java application though... maybe not. Attempt 2: Put $HADOOP_HOME/etc/hadoop/ into the Java classpath (or pack hdfs-site.xml into your project under /src/main/resources/, but that's kind of dirty...). This works with Spark. Spark only: SparkSession.builder().config("spark.hadoop.dfs.client.use.datanode.hostname", "true").[...] Of course, you may also need to add the domain name of the DataNode-s (as the NameNode knows it) into the /etc/hosts on the computer running your application.
... View more
08-11-2018
01:10 AM
1 Kudo
The documentation of CM metrics for HBase Master states the below: """ The number of times the balancer was invoked outside a balance cluster operation. The balancer might be invoked when a new table is created, for example, to place the newly created regions. """ - https://www.cloudera.com/documentation/enterprise/latest/topics/cm_metrics_master.html If you want more granularity on when this metric receives an increment, look for the method calls named 'incrMiscInvocations' inside the balancer base class within HBase Master: https://github.com/cloudera/hbase/blob/cdh5.15.0-release/hbase-server/src/main/java/org/apache/hadoop/hbase/master/balancer/BaseLoadBalancer.java -- The _across_masters suffix just averages the value across all HBase Masters running in the cluster. The _across_roletype generally applies the same way to all other role type metrics in CM. You can also query them without the across suffixes. The total_ prefix sums instead of averaging the value, across all relevant role types for your chosen metric. The rate bit basically appears for all counter style metrics. It represents the change over time (dx/dt). You can perform an integral(…) over any rate-named metrics to access actual value growth. These are tsquery specifics documented further at https://www.cloudera.com/documentation/enterprise/latest/topics/cm_dg_metric_aggregation.html#cmug_topic_11_8
... View more
08-03-2018
12:10 PM
Hi Harsh, I am on cloudera 5.14 and i am also this issue. From version 5.2.X to the current version and this is stlil an issue? Do hue or cloudera manager have a configuration to take care of these issues?
... View more
08-01-2018
10:47 PM
I am facing an issue mentioned while inserting data to hive table in oozie workflow and my oozie-site.xml is having the 'oozie.action.launcher.mapreduce.job.ubertask.enable' as false. Can anyone share the solution how to get rid of this error? Thanks, Vineela 14096 [uber-SubtaskRunner] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: IllegalArgumentException java.net.URISyntaxException: Relative path in absolute URI: file:./tmp/yarn/637aaa69-8734-4f8b-96f0-37e4a6c71e89/hive_2018-08-01_06-30-40_248_2693357230662796925-1 java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: file:./tmp/yarn/637aaa69-8734-4f8b-96f0-37e4a6c71e89/hive_2018-08-01_06-30-40_248_2693357230662796925-1 at org.apache.hadoop.fs.Path.initialize(Path.java:206) at org.apache.hadoop.fs.Path.<init>(Path.java:197) at org.apache.hadoop.hive.ql.Context.getScratchDir(Context.java:267) at org.apache.hadoop.hive.ql.Context.getLocalScratchDir(Context.java:302) at org.apache.hadoop.hive.ql.optimizer.physical.CommonJoinTaskDispatcher.processCurrentTask(CommonJoinTaskDispatcher.java:539) at org.apache.hadoop.hive.ql.optimizer.physical.AbstractJoinTaskDispatcher.dispatch(AbstractJoinTaskDispatcher.java:179) at org.apache.hadoop.hive.ql.lib.TaskGraphWalker.dispatch(TaskGraphWalker.java:111) at org.apache.hadoop.hive.ql.lib.TaskGraphWalker.walk(TaskGraphWalker.java:180) at org.apache.hadoop.hive.ql.lib.TaskGraphWalker.startWalking(TaskGraphWalker.java:125) at org.apache.hadoop.hive.ql.optimizer.physical.CommonJoinResolver.resolve(CommonJoinResolver.java:79) at org.apache.hadoop.hive.ql.optimizer.physical.PhysicalOptimizer.optimize(PhysicalOptimizer.java:107) at org.apache.hadoop.hive.ql.parse.MapReduceCompiler.optimizeTaskPlan(MapReduceCompiler.java:273) at org.apache.hadoop.hive.ql.parse.TaskCompiler.compile(TaskCompiler.java:225) at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:10348) at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:10141) at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:222) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:421) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:305) at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:1110) at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1158) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1047) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1037) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:207) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:159) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:370) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:305) at org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:403) at org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:419) at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:708) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:675) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:615) at org.apache.oozie.action.hadoop.HiveMain.runHive(HiveMain.java:321) at org.apache.oozie.action.hadoop.HiveMain.run(HiveMain.java:298) at org.apache.oozie.action.hadoop.LauncherMain.run(LauncherMain.java:49) at org.apache.oozie.action.hadoop.HiveMain.main(HiveMain.java:69) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:236) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343) at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler.runSubtask(LocalContainerLauncher.java:378) at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler.runTask(LocalContainerLauncher.java:296) at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler.access$200(LocalContainerLauncher.java:181) at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler$1.run(LocalContainerLauncher.java:224) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) Caused by: java.net.URISyntaxException: Relative path in absolute URI: file:./tmp/yarn/637aaa69-8734-4f8b-96f0-37e4a6c71e89/hive_2018-08-01_06-30-40_248_2693357230662796925-1 at java.net.URI.checkPath(URI.java:1804) at java.net.URI.<init>(URI.java:752) at org.apache.hadoop.fs.Path.initialize(Path.java:203) ... 51 more
... View more
07-31-2018
07:09 AM
@Harsh J No, we rarely run balancer in this environment. I'll set it to 3 for now and observe for a while for any reoccurence of those WARNs if any . (CM recommends to set it between a value equal or greater than replication factor and lesser than number of DNs) Regards
... View more
07-30-2018
07:48 PM
Have you followed the solution made above? Depending on where you are trying to write into your cluster, unless you have full access to communicating with all your DataNode hosts and its ports, you will face this error.
... View more