Member since: 04-11-2016
Posts: 535
Kudos Received: 147
Solutions: 77
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 4096 | 09-17-2018 06:33 AM |
| | 891 | 08-29-2018 07:48 AM |
| | 1488 | 08-28-2018 12:38 PM |
| | 944 | 08-03-2018 05:42 AM |
| | 982 | 07-27-2018 04:00 PM |
07-05-2016
07:12 AM
1 Kudo
@Daniela Boamba Can you try running the insert query after changing the execution engine to MR? set hive.execution.engine=mr; Thanks and Regards, Sindhu
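A minimal sketch of the suggested session, assuming hypothetical table names my_table and my_staging (placeholders, not from the original question):

-- Switch only this session to the MapReduce engine, then re-run the insert.
set hive.execution.engine=mr;
INSERT INTO TABLE my_table SELECT * FROM my_staging;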
07-01-2016
08:15 AM
1 Kudo
@Simran Kaur If you have a Hive metastore associated with your HDFS cluster, --hive-import and --hive-overwrite always write to the Hive warehouse directory. Arguments like --warehouse-dir <dir>, --as-avrodatafile, --as-sequencefile, --target-dir, etc. are not honoured. Thanks and Regards, Sindhu
07-01-2016
07:38 AM
@Simran Kaur --target-dir is used while importing table data into HDFS with the Sqoop import tool and might not work with --hive-import. As @Dileep Kumar Chiguruvada explained earlier, the value of the Hive warehouse directory will be picked up from hive-site.xml. Thanks and Regards, Sindhu
07-01-2016
06:49 AM
1 Kudo
@Simran Kaur Can you please check the Hive table created using describe formatted <hivetablename> and check the location of the Hive data? It seems like the data is being written to a different directory, with --warehouse-dir not taking effect. Thanks and Regards, Sindhu
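A quick sketch of that check, assuming a hypothetical table name customers; the Location row in the output shows where the data actually lives:

-- Look for the 'Location:' row in the detailed table information.
DESCRIBE FORMATTED customers;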
06-29-2016
10:46 AM
@Jan Kytara Can you please share the table definition?
06-28-2016
06:54 AM
@ARUNKUMAR RAMASAMY The jars need to be present on the HiveServer machine. It is not necessary to have them on all the nodes. You can refer to the link below: http://stackoverflow.com/questions/31802908/adding-hive-jars-permanently/31808016 Thanks and Regards, Sindhu
06-27-2016
11:39 AM
@Ethan Hsieh Have you run import-hive.sh? From where were the tables created?
06-27-2016
09:39 AM
1 Kudo
@ARUNKUMAR RAMASAMY You can import data into the underlying HBase table of Phoenix, but the data cannot be seen from Phoenix. The Sqoop JIRA SQOOP-2649 for Sqoop-Phoenix integration is addressed in Sqoop 1.4.7. https://issues.apache.org/jira/browse/SQOOP-2649 Thanks and Regards, Sindhu
06-27-2016
04:44 AM
@Ethan Hsieh The issue seems to be a missing gson.jar in the AUX path. Please check and download the gson jar from the link below: http://www.java2s.com/Code/Jar/g/Downloadgson222jar.htm Hope this helps. Thanks and Regards, Sindhu
06-24-2016
01:15 PM
@jihed chokri Please try the following: log in to the Hive CLI, run set mapred.job.queue.name=<queue_name>;, then run set hive.execution.engine=mr;, and finally run the query. Thanks and Regards, Sindhu
06-24-2016
10:54 AM
@jihed chokri Set the queue name in the Hive CLI and Beeline as below: set mapred.job.queue.name=<queue_name>; Or set it in hive-site.xml as a property: <property>
<name>mapred.job.queue.name</name>
<value>queue_name</value>
</property> Thanks and Regards, Sindhu
06-23-2016
08:32 AM
@rahul jain You can use the Hive View from Ambari and run queries on the Hive table. As a first step, a Hive table needs to be created on top of the HDFS file. Thanks and Regards, Sindhu
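A minimal sketch of such a table, assuming a hypothetical comma-delimited file under /data/input (the path, columns, and table name are illustrative only):

-- External table laid over the existing HDFS directory; the file stays in place if the table is dropped.
CREATE EXTERNAL TABLE my_hdfs_table (
  id INT,
  name STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/data/input';

Once created, the table can be queried from the Hive View like any other table.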
06-22-2016
01:42 PM
2 Kudos
@alain TSAFACK You can load the data from the csv file into a temp Hive table with the same structure as the ORC table, then insert the data into the ORC table as: insert into table table_orc select * from table_textfile; Thanks and Regards, Sindhu
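An end-to-end sketch of that approach, assuming a hypothetical two-column CSV already uploaded to /tmp/data.csv (columns and path are illustrative):

-- 1. Staging table matching the CSV layout.
CREATE TABLE table_textfile (id INT, name STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE;
-- 2. Load the raw file into the staging table.
LOAD DATA INPATH '/tmp/data.csv' INTO TABLE table_textfile;
-- 3. ORC table with the same structure, then copy the rows across.
CREATE TABLE table_orc (id INT, name STRING) STORED AS ORC;
INSERT INTO TABLE table_orc SELECT * FROM table_textfile;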
06-21-2016
10:05 AM
2 Kudos
@Michel Sumbul CBO is mainly for optimization decisions which reduce the cost of query execution, and it is independent of storage formats like ORC. Below are some of the decisions based on CBO:
- How to order joins
- What algorithm to use for a given join
- Whether an intermediate result should be persisted or recomputed on operator failure
- The degree of parallelism at any operator (specifically the number of reducers to use)
- Semi-join selection
For details, please refer to the link below: https://cwiki.apache.org/confluence/display/Hive/Cost-based+optimization+in+Hive Thanks and Regards, Sindhu
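A brief sketch of enabling CBO for a session and giving it the statistics it relies on, assuming a hypothetical table my_table (these are the standard Hive CBO and statistics properties):

-- Turn on the cost-based optimizer and let it use column/partition statistics.
set hive.cbo.enable=true;
set hive.compute.query.column.stats=true;
set hive.stats.fetch.column.stats=true;
set hive.stats.fetch.partition.stats=true;
-- CBO decisions are only as good as the statistics available.
ANALYZE TABLE my_table COMPUTE STATISTICS;
ANALYZE TABLE my_table COMPUTE STATISTICS FOR COLUMNS;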
06-21-2016
07:16 AM
2 Kudos
@ARUNKUMAR RAMASAMY MySQL might be rejecting connections to extract the data from the tables from a remote host. We need to grant privileges for the IPs of the data nodes at the database end, as below: GRANT ALL PRIVILEGES ON *.* TO 'user'@'ipaddress'; Thanks and Regards, Sindhu
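A short sketch of those grants, assuming a hypothetical Sqoop user sqoop_user with password sqoop_pwd and illustrative data-node IPs (run in the MySQL client as an administrative user):

-- Allow connections from each data node that will run Sqoop map tasks.
GRANT ALL PRIVILEGES ON *.* TO 'sqoop_user'@'192.168.1.11' IDENTIFIED BY 'sqoop_pwd';
GRANT ALL PRIVILEGES ON *.* TO 'sqoop_user'@'192.168.1.12' IDENTIFIED BY 'sqoop_pwd';
FLUSH PRIVILEGES;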
06-21-2016
06:43 AM
@ARUNKUMAR RAMASAMY The communication between the data nodes and MySQL needs to be open. Make sure telnet <mysql_server> <port> works on all the nodes in the cluster. Also, verify the bind address at the MySQL end to confirm connectivity. You can refer to the link below for more debugging at the MySQL end: http://stackoverflow.com/questions/2121829/com-mysql-jdbc-exceptions-jdbc4-communicationsexceptioncommunications-link-fail Hope this helps. Thanks and Regards, Sindhu
06-20-2016
09:17 AM
1 Kudo
@Pradeep Bhadani For the manual MySQL install on the machine itself, remove the conflicting mysql-libs package first: rpm -e --nodeps mysql-libs Hope this helps. Thanks and Regards, Sindhu
06-20-2016
06:14 AM
Could you please share the steps that resolved the issue and mark as best answer? Thanks, Sindhu
06-17-2016
06:51 AM
1 Kudo
@Simran Kaur Seems like TotalRecords is a keyword. Try using TotalRecords_1 and see if it helps. Thanks and Regards, Sindhu
06-16-2016
04:59 PM
1 Kudo
@khushi kalra You can also use RJDBC as below to connect to Hive:
library("DBI")
library("rJava")
library("RJDBC")
hive.class.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T);
hadoop.lib.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T);
hadoop.class.path = list.files(path=c("/usr/hdp/2.4.0.0-169/hadoop"), pattern="jar", full.names=T);
cp = c(hive.class.path, hadoop.lib.path, hadoop.class.path, "/usr/hdp/2.4.0.0-169/hadoop-mapreduce/hadoop-mapreduce-client-core.jar")
.jinit(classpath=cp)
drv <- JDBC("org.apache.hive.jdbc.HiveDriver", "hive-jdbc.jar", identifier.quote="`")
url.dbc <- paste0("jdbc:hive2://ironhide.hdp.local:10000/default");
conn <- dbConnect(drv, url.dbc, "hive", "redhat");
dbListTables(conn);
The log4j warnings about missing appenders (No appenders could be found for logger org.apache.hadoop.util.Shell) can be ignored; they only mean log4j was not initialized. Thanks and Regards, Sindhu
06-16-2016
07:43 AM
2 Kudos
@Roberto Sancho Please refer to the Hortonworks blog below for ways to improve Hive query performance: http://hortonworks.com/blog/5-ways-make-hive-queries-run-faster/ Hope this helps. Thanks and Regards, Sindhu
06-15-2016
06:18 AM
2 Kudos
@Ethan Hsieh
Looks like the Hive metastore is using MySQL in your case; add the MySQL client jar to <atlas package>/bridge/hive/. That should work. Ideally, import-hive.sh should use the Hive classpath so that all Hive dependencies are included. Currently, we bundle the Hive dependencies as well, hence this issue arises if Hive uses a non-default driver. Details: https://issues.apache.org/jira/browse/ATLAS-96 Hope this helps. Thanks and Regards, Sindhu
06-14-2016
02:14 PM
2 Kudos
@Venkat Chinnari The issue seems to be with the cast from text to Parquet. Try creating a sample table, say table3, without SerDe properties but just 'stored as parquet', and check whether insert overwrite works. Thanks and Regards, Sindhu
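A minimal sketch of that check, assuming hypothetical columns and a source text table named table_text (adjust to the real schema):

-- Plain Parquet table without custom SerDe properties.
CREATE TABLE table3 (id INT, name STRING) STORED AS PARQUET;
-- If this works, the problem is with the original table's SerDe properties rather than the data.
INSERT OVERWRITE TABLE table3 SELECT id, name FROM table_text;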
06-14-2016
12:07 PM
5 Kudos
@Shihab The temp tables are created during the application run as intermediate data. These intermediate tables will not be removed if the application fails and cleanup does not happen. Please check whether any applications are running that are generating this data. Meanwhile, you can also try compressing the intermediate data by setting the property "hive.exec.compress.intermediate" to true in hive-site.xml. The related compression codec and other options are determined from the Hadoop configuration variables mapred.output.compress*. Hope this helps. Thanks and Regards, Sindhu
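A small sketch of those settings for a single session; the Snappy codec is an illustrative choice and any codec installed on the cluster works:

-- Compress the intermediate data written between Hive job stages.
set hive.exec.compress.intermediate=true;
set mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
set mapred.output.compression.type=BLOCK;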
06-10-2016
10:39 AM
@Tajinderpal Singh You can refer to the Spark documentation below: http://spark.apache.org/docs/latest/streaming-kafka-integration.html Thanks and Regards, Sindhu
06-10-2016
10:29 AM
@a kumar This can be related to Hive JIRA HIVE-12349:
https://issues.apache.org/jira/browse/HIVE-12349 Can you please share the query being run? Thanks and Regards, Sindhu
06-10-2016
10:26 AM
@Varun Kumar Chepuri Initializing the Metastore means initializing the metastore database. Refer to the link below for manual configuration: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/set_up_hive_hcat_configuration_files.html Hope this helps. Thanks and Regards, Sindhu
06-10-2016
09:15 AM
1 Kudo
@alain TSAFACK You can also make use of the --query option during sqoop import to cast the smalldatetime to timestamp: sqoop import ...other options.... --query "select cast(col1 as datetime) from table_name" Hope this helps. Thanks and Regards, Sindhu