About Harsh J

Harsh J · 08-23-2018

If you are using Cloudera Manager and have re-added the host/role after deleting it, Cloudera Manager should've marked your HBase service with a 'Stale Configuration' icon, indicating a restart is required for your HBase service (and client configuration deployment for its gateways) to see the changes in the ZK client configuration. There's no way to live-refresh the configuration at runtime, but you can consider performing a rolling restart to eliminate availability issues.

Harsh J · 08-23-2018

There will not be any operational problems such as crashes or errors when running an HDFS balancer on a cluster with HBase running, but there can be a performance impact, depending on what the balancer decides to move based on its space thresholds. The performance impact would come from loss of locality: the blocks of the HFiles a RegionServer needs may become remote, so slightly higher network usage can be observed until the next major compaction rewrites a block replica locally. If you'd like to narrow down the time-frame of impact, you can run the HDFS balancer with the desired balancing threshold, and then, once it is complete, immediately follow up with a major compaction command on your latency-sensitive HBase tables.
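The two-step sequence described above (balance first, then major-compact the latency-sensitive tables) could be scripted roughly as follows. This is only a sketch: the helper names, the threshold value, and the table names are hypothetical, and it assumes the `hdfs` and `hbase` commands are on the PATH of a host with cluster access. By default it only prints what it would run.

```python
import shlex
import subprocess


def balancer_command(threshold_percent):
    """Build the `hdfs balancer` call for a given utilization threshold (percent)."""
    return ["hdfs", "balancer", "-threshold", str(threshold_percent)]


def major_compact_script(tables):
    """Build HBase shell input that requests a major compaction for each table."""
    return "".join("major_compact '{}'\n".format(table) for table in tables)


def rebalance_then_compact(threshold_percent, tables, dry_run=True):
    """Run the balancer to completion, then request the major compactions."""
    balancer = balancer_command(threshold_percent)
    script = major_compact_script(tables)
    if dry_run:
        # Show the commands without touching the cluster.
        print(" ".join(shlex.quote(arg) for arg in balancer))
        print(script, end="")
        return
    subprocess.run(balancer, check=True)
    # The HBase shell reads the commands from standard input.
    subprocess.run(["hbase", "shell"], input=script, text=True, check=True)


if __name__ == "__main__":
    # Hypothetical threshold and table names, for illustration only.
    rebalance_then_compact(10, ["orders", "sessions"])
```

Note that a major compaction request returns before the compaction finishes, so locality is restored some time after the script exits.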

Evan · 08-23-2018

That worked, thanks very much. I'd missed this difference between HBase and HDFS.

Tirabassi · 08-17-2018

I have the same problem. I also tried following @Vinod369's method, but it did not resolve the issue.

ddekany · 08-11-2018

Since it wasn't really described how exactly you resolved it... The point is that on the client side (it's important that it's not on the server side), set "dfs.client.use.datanode.hostname" in the org.apache.hadoop.conf.Configuration object to the value "true". If the Configuration object isn't created by your code (for example, Spark creates it in my case), then how to set it depends on what creates it; see its documentation. But some guesses:

Attempt 1: Set it inside $HADOOP_HOME/etc/hadoop/hdfs-site.xml. The Hadoop command-line tools use that file; your Java application, though, maybe not.

Attempt 2: Put $HADOOP_HOME/etc/hadoop/ on the Java classpath (or pack hdfs-site.xml into your project under /src/main/resources/, but that's kind of dirty...). This works with Spark.

Spark only: SparkSession.builder().config("spark.hadoop.dfs.client.use.datanode.hostname", "true").[...]

Of course, you may also need to add the domain names of the DataNodes (as the NameNode knows them) to /etc/hosts on the computer running your application.
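For the hdfs-site.xml route, the entry would look roughly like what the sketch below prints. It uses the client-side property name from the Spark example above (dfs.client.use.datanode.hostname); the helper function is hypothetical and only generates the XML text, so merging it into an existing client hdfs-site.xml is left to you.

```python
import xml.etree.ElementTree as ET


def hdfs_site_with_property(name, value):
    """Return a minimal Hadoop <configuration> document holding one property."""
    configuration = ET.Element("configuration")
    prop = ET.SubElement(configuration, "property")
    ET.SubElement(prop, "name").text = name
    ET.SubElement(prop, "value").text = value
    return ET.tostring(configuration, encoding="unicode")


if __name__ == "__main__":
    # Client-side setting: connect to DataNodes by hostname rather than by IP.
    print(hdfs_site_with_property("dfs.client.use.datanode.hostname", "true"))
```

In a real client configuration the file normally holds other properties too, so the `<property>` element would be added alongside them rather than replacing the file.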

Harsh J · 08-11-2018

The documentation of CM metrics for HBase Master states the below:

"""
The number of times the balancer was invoked outside a balance cluster operation. The balancer might be invoked when a new table is created, for example, to place the newly created regions.
"""
- https://www.cloudera.com/documentation/enterprise/latest/topics/cm_metrics_master.html

If you want more granularity on when this metric receives an increment, look for the method calls named 'incrMiscInvocations' inside the balancer base class within HBase Master: https://github.com/cloudera/hbase/blob/cdh5.15.0-release/hbase-server/src/main/java/org/apache/hadoop/hbase/master/balancer/BaseLoadBalancer.java

--

The _across_masters suffix just averages the value across all HBase Masters running in the cluster. The _across_roletype suffix generally applies the same way to all other role type metrics in CM. You can also query them without the across suffixes.

The total_ prefix sums the value, instead of averaging it, across all relevant role types for your chosen metric.

The rate bit basically appears for all counter-style metrics. It represents the change over time (dx/dt). You can perform an integral(…) over any rate-named metric to access the actual value growth.

These are tsquery specifics documented further at https://www.cloudera.com/documentation/enterprise/latest/topics/cm_dg_metric_aggregation.html#cmug_topic_11_8
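To make the naming pieces concrete, here is a small numeric sketch of what "rate", "integral", the across average, and the total sum mean for a counter. It is not how CM computes them internally; the function names and the sample values are made up for illustration.

```python
def rates(times, values):
    """Per-interval rate of change (dx/dt) of a counter sampled at `times`."""
    return [
        (values[i + 1] - values[i]) / (times[i + 1] - times[i])
        for i in range(len(values) - 1)
    ]


def integral(times, rate_values):
    """Sum rate * dt over each interval, recovering the counter's growth."""
    return sum(r * (times[i + 1] - times[i]) for i, r in enumerate(rate_values))


def across(per_role_values):
    """What an _across_ suffix does: average over the roles."""
    return sum(per_role_values) / len(per_role_values)


def total(per_role_values):
    """What the total_ prefix does: sum over the roles."""
    return sum(per_role_values)


if __name__ == "__main__":
    times = [0, 60, 120, 180]        # sample timestamps in seconds (made up)
    master_a = [10, 10, 13, 14]      # counter samples on one Master (made up)
    master_b = [4, 4, 4, 6]          # counter samples on another Master (made up)

    growth_a = integral(times, rates(times, master_a))
    growth_b = integral(times, rates(times, master_b))
    print("growth per master:", growth_a, growth_b)
    print("averaged across masters:", across([growth_a, growth_b]))
    print("total over masters:", total([growth_a, growth_b]))
```

Integrating the rate over the window gives back the counter's growth in that window (last sample minus first sample), which is why integral(…) over a rate-named metric yields the actual increase.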

Onedile · 08-03-2018

Hi Harsh, I am on Cloudera 5.14 and I am also facing this issue. Is this still an issue from version 5.2.x through the current version? Do Hue or Cloudera Manager have a configuration to take care of these issues?

Vineela · 08-01-2018

I am facing the issue below while inserting data into a Hive table in an Oozie workflow, and my oozie-site.xml has 'oozie.action.launcher.mapreduce.job.ubertask.enable' set to false. Can anyone share a solution for getting rid of this error?

Thanks,
Vineela

14096 [uber-SubtaskRunner] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: IllegalArgumentException java.net.URISyntaxException: Relative path in absolute URI: file:./tmp/yarn/637aaa69-8734-4f8b-96f0-37e4a6c71e89/hive_2018-08-01_06-30-40_248_2693357230662796925-1
java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: file:./tmp/yarn/637aaa69-8734-4f8b-96f0-37e4a6c71e89/hive_2018-08-01_06-30-40_248_2693357230662796925-1
    at org.apache.hadoop.fs.Path.initialize(Path.java:206)
    at org.apache.hadoop.fs.Path.<init>(Path.java:197)
    at org.apache.hadoop.hive.ql.Context.getScratchDir(Context.java:267)
    at org.apache.hadoop.hive.ql.Context.getLocalScratchDir(Context.java:302)
    at org.apache.hadoop.hive.ql.optimizer.physical.CommonJoinTaskDispatcher.processCurrentTask(CommonJoinTaskDispatcher.java:539)
    at org.apache.hadoop.hive.ql.optimizer.physical.AbstractJoinTaskDispatcher.dispatch(AbstractJoinTaskDispatcher.java:179)
    at org.apache.hadoop.hive.ql.lib.TaskGraphWalker.dispatch(TaskGraphWalker.java:111)
    at org.apache.hadoop.hive.ql.lib.TaskGraphWalker.walk(TaskGraphWalker.java:180)
    at org.apache.hadoop.hive.ql.lib.TaskGraphWalker.startWalking(TaskGraphWalker.java:125)
    at org.apache.hadoop.hive.ql.optimizer.physical.CommonJoinResolver.resolve(CommonJoinResolver.java:79)
    at org.apache.hadoop.hive.ql.optimizer.physical.PhysicalOptimizer.optimize(PhysicalOptimizer.java:107)
    at org.apache.hadoop.hive.ql.parse.MapReduceCompiler.optimizeTaskPlan(MapReduceCompiler.java:273)
    at org.apache.hadoop.hive.ql.parse.TaskCompiler.compile(TaskCompiler.java:225)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:10348)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:10141)
    at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:222)
    at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:421)
    at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:305)
    at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:1110)
    at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1158)
    at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1047)
    at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1037)
    at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:207)
    at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:159)
    at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:370)
    at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:305)
    at org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:403)
    at org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:419)
    at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:708)
    at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:675)
    at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:615)
    at org.apache.oozie.action.hadoop.HiveMain.runHive(HiveMain.java:321)
    at org.apache.oozie.action.hadoop.HiveMain.run(HiveMain.java:298)
    at org.apache.oozie.action.hadoop.LauncherMain.run(LauncherMain.java:49)
    at org.apache.oozie.action.hadoop.HiveMain.main(HiveMain.java:69)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:236)
    at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
    at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler.runSubtask(LocalContainerLauncher.java:378)
    at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler.runTask(LocalContainerLauncher.java:296)
    at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler.access$200(LocalContainerLauncher.java:181)
    at org.apache.hadoop.mapred.LocalContainerLauncher$EventHandler$1.run(LocalContainerLauncher.java:224)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
Caused by: java.net.URISyntaxException: Relative path in absolute URI: file:./tmp/yarn/637aaa69-8734-4f8b-96f0-37e4a6c71e89/hive_2018-08-01_06-30-40_248_2693357230662796925-1
    at java.net.URI.checkPath(URI.java:1804)
    at java.net.URI.<init>(URI.java:752)
    at org.apache.hadoop.fs.Path.initialize(Path.java:203)
    ... 51 more

Prav · 07-31-2018

@Harsh J No, we rarely run the balancer in this environment. I'll set it to 3 for now and watch for a while for any recurrence of those WARNs. (CM recommends setting it to a value equal to or greater than the replication factor and less than the number of DNs.) Regards

Harsh J · 07-30-2018

Have you followed the solution suggested above? Depending on where you are writing into your cluster from, you will face this error unless that machine can communicate with all of your DataNode hosts and their ports.
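One rough way to check that requirement from the machine doing the write is to attempt a TCP connection to every DataNode. The sketch below assumes hypothetical hostnames and assumes 50010 as the DataNode data-transfer port (the Hadoop 2 default for dfs.datanode.address); substitute the hosts and port your cluster actually uses.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and hostname lookup failures.
        return False


def unreachable(datanodes, port):
    """Return the DataNode hosts that could not be reached on the given port."""
    return [host for host in datanodes if not can_connect(host, port)]


if __name__ == "__main__":
    # Hypothetical hostnames; use the names the NameNode reports for your DataNodes.
    hosts = ["dn1.example.com", "dn2.example.com"]
    print("unreachable DataNodes:", unreachable(hosts, 50010))
```

A successful connection only shows that the port is reachable; it does not prove that the write itself will succeed.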

Member Since	07-31-2013 07:21 AM
Posts	1,924
Kudos received	461

Cloudera Community

Re: S3Guard Suggested to help fix Consistency

Re: Failed to start namenode. java.io.FileNotFound...

Re: sqoop import issue

Re: Efficient ways to store many images files

Re: S3 loading into HDFS

Re: How to refresh hbase's zookeeper quorum

Re: Can I run the balancer for hdfs

Re: HBase simple authorization group permissions n...

Re: Sqoop with Oozie error

Re: HDFS put failing due to internal IP address us...

Re: total_balancer_misc_invocations_rate_across_ma...

Re: Why does /tmp/hive/admin/ take up so much spac...

Re: Oozie Hive action failed with wrong tmp path

Re: YARN - occasional Error message

Re: only be replicated to 0 nodes instead of minRe...