Member since: 10-28-2020
Posts: 622
Kudos Received: 47
Solutions: 40
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
|  | 1999 | 02-17-2025 06:54 AM |
|  | 6713 | 07-23-2024 11:49 PM |
|  | 1337 | 05-28-2024 11:06 AM |
|  | 1891 | 05-05-2024 01:27 PM |
|  | 1268 | 05-05-2024 01:09 PM |
06-26-2023 06:17 AM
@aafc Could you please share the CDP (Hive) version you are using? Also, have you tried the latest Cloudera JDBC driver for Hive?
06-24-2023 11:53 AM
@rahuledavalath In a 3-node ZK cluster, we can only afford to lose one node at a time. Spreading this cluster across only two geo-locations is risky: whichever location hosts two of the three nodes becomes a single point of failure, since losing it costs the quorum. It is therefore a good idea to build the cluster across three regions, with one instance in each.
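Below is a minimal zoo.cfg sketch for such a 3-node ensemble, one server per region. The hostnames and dataDir are hypothetical placeholders, not from the original thread; with this layout, losing any single region still leaves a 2-of-3 quorum.

    # Write a minimal ZooKeeper config with one server in each of three
    # regions (hostnames and dataDir are placeholder assumptions).
    cat > /etc/zookeeper/conf/zoo.cfg <<'EOF'
    tickTime=2000
    initLimit=10
    syncLimit=5
    dataDir=/var/lib/zookeeper
    clientPort=2181
    server.1=zk-region-a.example.com:2888:3888
    server.2=zk-region-b.example.com:2888:3888
    server.3=zk-region-c.example.com:2888:3888
    EOF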
06-21-2023 06:39 AM
@Choolake Try: entries=$((count2 - count1)) This should work provided both variables hold valid integer values.
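A short bash sketch of the suggestion, assuming count1 and count2 come from line counts of two files (the file names are hypothetical):

    # Derive the two counts; these sources are placeholder assumptions.
    count1=$(wc -l < before.txt)
    count2=$(wc -l < after.txt)
    # Arithmetic expansion subtracts the two integers.
    entries=$((count2 - count1))
    echo "entries added: $entries"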
06-14-2023 10:47 PM
@xiamu This error can appear when the datanodes are not healthy. Does the job fail repeatedly, or does it succeed at times? Have you tried running it with a different user? This is where it is failing:

    private void setupPipelineForAppendOrRecovery() throws IOException {
      // Check number of datanodes. Note that if there is no healthy datanode,
      // this must be internal error because we mark external error in striped
      // outputstream only when all the streamers are in the DATA_STREAMING stage
      if (nodes == null || nodes.length == 0) {
        String msg = "Could not get block locations. " + "Source file \""
            + src + "\" - Aborting..." + this;
        LOG.warn(msg);
        lastException.set(new IOException(msg));
        streamerClosed = true;
        return;
      }
      setupPipelineInternal(nodes, storageTypes, storageIDs);
    }
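To check datanode health before re-running the job, something like the following can help (a general sketch, not from the original thread; the file path is hypothetical):

    # Summarize live/dead datanodes as seen by the namenode.
    hdfs dfsadmin -report | grep -E 'Live datanodes|Dead datanodes'
    # Inspect the block locations of the affected file.
    hdfs fsck /path/to/source/file -files -blocks -locations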
06-14-2023 10:21 PM
@haihua Do you mean it works from beeline but not from the Hive CLI? If it works with beeline, why don't we run it with that instead? beeline ... -f query.hql Also, could you try granting the required privileges to the role WITH GRANT OPTION? Refer to https://docs.cloudera.com/documentation/enterprise/latest/topics/sg_hive_sql.html#grant_privilege_with_grant
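A hedged sketch of both suggestions (the connection URL, user names, database, and role name are hypothetical):

    # Run the script through beeline instead of the Hive CLI.
    beeline -u 'jdbc:hive2://hs2-host:10000/default' -n etl_user -f query.hql
    # Grant the privilege to the role with GRANT OPTION (syntax per the
    # linked Cloudera doc).
    beeline -u 'jdbc:hive2://hs2-host:10000/default' -n admin_user \
      -e "GRANT SELECT ON DATABASE sales_db TO ROLE etl_role WITH GRANT OPTION;"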
06-09-2023 08:47 AM
@snm1523 From beeline:

    use sys;
    -- this returns the column count for each table
    select cd_id, count(cd_id) as column_count
    from columns_v2
    group by cd_id
    order by cd_id asc;

Every individual table will have a unique cd_id. To map the table names to cd_id, try the following:

    select t.tbl_name, s.cd_id
    from tbls t
    join sds s on t.sd_id = s.sd_id
    order by s.cd_id asc;

You could also merge the two queries to get the output together, as in the sketch below.
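One way to merge them (a sketch only; the connection URL is hypothetical):

    # Join table names to their column counts in a single query
    # against the sys database.
    beeline -u 'jdbc:hive2://hs2-host:10000/sys' -e "
    select t.tbl_name, c.cd_id, count(c.cd_id) as column_count
    from tbls t
    join sds s on t.sd_id = s.sd_id
    join columns_v2 c on s.cd_id = c.cd_id
    group by t.tbl_name, c.cd_id
    order by c.cd_id asc;"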
06-09-2023 05:55 AM
@BrianChan Does the AD user have permission to view the Hive table? This privilege needs to be set under the Hadoop SQL policies in Ranger. If this is already done, you also need to set the required Atlas policies. Refer to https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/atlas-securing/topics/atlas-configure-ranger-authorization.html Make sure that the entity-type is set correctly.
06-09-2023 05:36 AM
@snm1523 It should be SET FILEFORMAT INPUTFORMAT... Please try it as follows:

    ALTER TABLE alter_file_format_test SET FILEFORMAT
      INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
      OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
      SERDE 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe';
05-20-2023 11:37 PM
@mwblee I am not sure whether you are using a Cloudera Hive distribution; if you are, consider upgrading to the latest CDP version, where many issues around the compactor initiator/worker/cleaner have been fixed (for the initiator, see the upstream Jiras HIVE-21917, HIVE-22568, and HIVE-22081). For this specific issue, look at multiple factors, such as the Hive metastore being overloaded, or a slow/large metastore database (certain txn-related tables). You may enable DEBUG logging in the Hive metastore; this will provide more information on why/where the compactor is stuck. If you are using open-source Hive, upgrade to Hive 4.x; you will have a much better experience w.r.t. compaction.
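Before digging into logs, the compaction queue itself often shows where things are stuck. A general sketch, not from the original thread (the connection URL is hypothetical):

    # List compactions and their states (initiated, working, ready for
    # cleaning, failed); a long-lived "initiated" state points at the
    # initiator, a stuck "working" state at the worker.
    beeline -u 'jdbc:hive2://hs2-host:10000' -e "SHOW COMPACTIONS;"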
05-12-2023 02:10 AM
@quangbilly79 The cloudera-manager-server, cloudera-manager-daemons, and cloudera-manager-agent packages need to be installed on the node that will host your Cloudera Manager UI. Please refer to the installation guide and follow the step-by-step procedure. You also need to set up the Manager DB, as you mentioned above. Start the Manager and access the UI. Once the Manager is accessible, you may add the other nodes by specifying their IP addresses as described here: https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/install_software_cm_wizard.html
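A condensed sketch of those steps on a RHEL/CentOS host (assumes the Cloudera Manager repo is already configured; the database name, user, and password are hypothetical):

    # Install the Cloudera Manager packages on the CM host.
    sudo yum install -y cloudera-manager-daemons cloudera-manager-agent cloudera-manager-server
    # Prepare the Cloudera Manager database (PostgreSQL example).
    sudo /opt/cloudera/cm/schema/scm_prepare_database.sh postgresql scm scm scm_password
    # Start the server, then open http://<cm-host>:7180 and add the
    # remaining hosts through the wizard.
    sudo systemctl start cloudera-scm-server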