Member since: 10-04-2016
Posts: 243
Kudos Received: 280
Solutions: 43
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 909 | 01-16-2018 03:38 PM |
| | 5292 | 11-13-2017 05:45 PM |
| | 2306 | 11-13-2017 12:30 AM |
| | 1227 | 10-27-2017 03:58 AM |
| | 27397 | 10-19-2017 03:17 AM |
08-31-2017
01:31 AM
2 Kudos
When I am logged in as a non-hdfs user, I am able to set storage policies but not able to get the storage policy for a path I just set. I am able to get storage policies only when I log in as the hdfs user. Is this expected behavior, or is there a solution for this? I am using HDP 2.6.1.
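For reference, a minimal sketch of the commands involved; the path and policy name are illustrative, not taken from the original post:

```bash
# Setting a policy succeeds for the non-hdfs user:
hdfs storagepolicies -setStoragePolicy -path /tmp/mydata -policy COLD
# Reading it back is where the permission issue was observed:
hdfs storagepolicies -getStoragePolicy -path /tmp/mydata
```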
Labels:
- Apache Hadoop
08-31-2017
01:26 AM
2 Kudos
@Sagar Shimpi As we have moved into 2017, we can now remove the storage policy for a given path: `hdfs storagepolicies -unsetStoragePolicy -path <path>`
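A minimal sketch of the full cycle, assuming an HDFS release that supports `-unsetStoragePolicy` as described above; the path and policy name are illustrative:

```bash
hdfs storagepolicies -setStoragePolicy -path /tmp/mydata -policy COLD
hdfs storagepolicies -unsetStoragePolicy -path /tmp/mydata
# The path should now fall back to the policy inherited from its parent (or the default):
hdfs storagepolicies -getStoragePolicy -path /tmp/mydata
```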
08-30-2017
07:07 PM
5 Kudos
@Viswa
Here are the two major aspects in which they differ:
1. Connection: The Hive CLI connects directly to HDFS and the Hive Metastore, and can be used only on a host with access to those services. Beeline connects to HiveServer2 and requires access to only one JAR file: hive-jdbc-<version>-standalone.jar.
2. Authorization: The Hive CLI supports only storage-based authorization, while Beeline (through HiveServer2) supports SQL standard-based authorization and Ranger-based authorization, which provides greater security. For these reasons it is better to use Beeline than the Hive CLI (I believe the Hive CLI will soon be deprecated). Read here for a deeper understanding of Beeline: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.1/bk_data-access/content/beeline-vs-hive-cli.html
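As an illustration, a minimal Beeline connection sketch; the host, port, and credentials are placeholders, not values from this thread:

```bash
beeline -u "jdbc:hive2://<hiveserver2-host>:10000/default" -n <username> -p <password>
# Only hive-jdbc-<version>-standalone.jar is needed on the client host.
```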
08-30-2017
03:02 PM
4 Kudos
@Freemon Johnson Log in to the host that has Ambari. The following command will list all components from every HDP version present on your system: `yum list installed | grep @HDP-` You can filter on a specific version, e.g. for HDP 2.6.1.0: `yum list installed | grep @HDP-2.6.1.0`
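A minimal sketch of the commands; the version string is from the answer above, while the extra component filter is a hypothetical refinement added for illustration:

```bash
yum list installed | grep @HDP-                    # all HDP components, all versions
yum list installed | grep @HDP-2.6.1.0             # only components from HDP 2.6.1.0
yum list installed | grep @HDP- | grep -i hadoop   # narrow further to one component (illustrative)
```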
08-28-2017
09:13 PM
This is one of the really useful things out here!
08-24-2017
10:06 PM
2 Kudos
For redundancy, dfs.namenode.name.dir has paths on two different disks. Thus there are FsImage files on two different disks, so that if one disk fails, the FsImage is still available from the other disk and the Failover Controller is not triggered just because of a single disk failure. Typically, the HDP upgrade will create a backup of the FsImages from the paths listed in dfs.namenode.name.dir. In this case it will back up the redundant copy as well, which is expensive in terms of storage and time. Is there a way to guide the Ambari upgrade process to back up only one of the redundant copies instead of all of them?
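A minimal sketch for confirming which directories the NameNode is configured to write its FsImage to; the example output is purely illustrative:

```bash
hdfs getconf -confKey dfs.namenode.name.dir
# e.g. /hadoop/hdfs/namenode,/mnt/disk2/hdfs/namenode
```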
08-24-2017
08:38 PM
6 Kudos
You can use the collect_set UDAF in Hive: `SELECT name, collect_set(city) FROM people GROUP BY name;` collect_set removes all duplicates; if you need to keep duplicates, use collect_list instead. Read here for the official documentation and details.
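A minimal sketch of both variants, assuming a hypothetical people(name, city) table:

```bash
# collect_set() drops duplicate cities per name:
hive -e "SELECT name, collect_set(city) AS distinct_cities FROM people GROUP BY name;"
# collect_list() keeps duplicates:
hive -e "SELECT name, collect_list(city) AS all_cities FROM people GROUP BY name;"
```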
08-23-2017
04:57 PM
@Jay SenSharma - Thank you for the prompt response. I see this only works for Beeline version 2.3.0 and later. I am currently using 1.2.1, so this will have no effect. However, I appreciate your help. I will look for a workaround for version 1.2.1; if nothing else works I will upgrade to 2.3.0 and accept your answer.
08-23-2017
04:21 PM
1 Kudo
When using the Hive CLI, if we run `SET hive.cli.print.current.db=true;` then the Hive CLI prompt displays the current database name for the rest of the session, as shown below: `hive (mydb)>` I tried using the same property in Beeline, but it had no effect. Is there a different way of achieving this in Beeline?
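A minimal sketch of the Hive CLI behavior, plus one possible Beeline approach; the `--showDbInPrompt` flag is an assumption to verify against your Beeline version, and the connection URL is a placeholder:

```bash
# Interactive Hive CLI session:
#   hive> SET hive.cli.print.current.db=true;
#   hive (default)> USE mydb;
#   hive (mydb)> ...
hive

# Newer Beeline releases reportedly support showing the database in the prompt:
beeline -u "jdbc:hive2://<hiveserver2-host>:10000" --showDbInPrompt=true
```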
Labels:
- Apache Hive