About csguna

csguna · ‎12-02-2017

what file format are using ? does your table stats being collected ? do you use partitioning or bucketing in your table ? select* from table without limit clause will hurt the performance as it is bad query do you really need to pull in all the record ?

csguna · ‎12-02-2017

Hi @AlexMulti @syamsri @UjjwalRana @Suribharu are you still looking for solution ? what version of hive are you using ? what file format does your table has ? what client tool are using hiveshell or beeline ? did you perform the pre-requsites before performing the hive delete operation?

csguna · ‎11-27-2017

below is the example let me know if that works for you. You can use Range or hash partition , also we can perform Range as well as hash partition together or just hash partition by using bucket . Below is the table that has primary key that coloum id we are using for partition (that is a good practice ) CREATE TABLE customersDetails ( state STRING, PRIMARY KEY (state, name) ) PARTITION BY RANGE (state) ( PARTITION VALUE = 'al', PARTITION VALUE = 'ak', PARTITION VALUE = 'wv', PARTITION VALUE = 'wy' ) STORED AS KUDU;

csguna · ‎11-18-2017

were are you firing this query ? beeline or impala shell . just dumb question but i am asking you anyways your syntax is correct could be some editor issue or space. try the below format CREATE TABLE foo1 (id INT PRIMARY KEY, col1 STRING ,col2 STRING ) PARTITION BY HASH(id) PARTITIONS 3 STORED AS KUDU;

csguna · ‎10-19-2017

there are couple of places that needsd tuining in the query level 1 . stats for the table is must for good performance 2. when user is joining two tables make sure there are using the large table in the last and the first table is smaller 3. you can also use HINTS to imporve query performance. 4. hive table's file format is big a factor 5. choosing when to use paritioning vs bucketing. 6.allocate good memory to hiveserver2 and metastore 7.heapsize 8 .load balancer on the host https://www.cloudera.com/documentation/enterprise/5-9-x/topics/admin_cm_ha_hosts.html#concept_qkr_bfd_pr

csguna · ‎10-19-2017

1. check your cloudera-scm-agent status 2 . check your server_host value in /etc/cloudera-scm-agent/config.ini , it should point to your cloduera manager's host.

csguna · ‎10-09-2017

@Nevo Its costly but could you fire the below command and see if that fixes REFRESH [db_name.]table_name REFRESH DATABASE_NAME.TABLE_NAME

csguna · ‎10-04-2017

could you fire the below commands in master and slave netstat -anp | grep 50060 also see if you can ping your slave from master and vice versa looks like issue between them

csguna · ‎10-04-2017

whats your firewall or iptable status ?

csguna · ‎10-04-2017

You can allocate 4GB kick off this command in the terminal . that should bring you a cloudera manager express edition sudo /home/cloudera/cloudera-manager --force --express Once everything is started in the terminal you wll see a Cloudera manager url along with the credentials . Let me know if this is suffice

Online	Offline
Last Visited	‎10-28-2024 06:24 AM

Member Since	‎05-16-2016 09:33 PM
Last Visited	‎10-28-2024 06:24 AM
Posts	785
Kudos received	112

Cloudera Community

Re: Kerberos / Sentry Integration

Re: How to upgrade Hive from 2.1 to 3.0 via CDH 6....

Re: How does nameservice id works for HA, how does...

Re: What license does the express edition fall und...

Re: Sqoop2 over Sqoop1 in CDH6

Re: Getting connection reset exception when the hi...

Re: Update and Delete are not working in Hive ?

Re: Error during CREATE KUDU table using IMPALA

Re: Error during CREATE KUDU table using IMPALA

Re: Adding nodes will improve performance ?

Re: Cloudera Manager distribution error

Re: Create Impala table from existing Parquet file

Re: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-ou...

Re: Cloudera Manager distribution error

Re: Where is exactly cloudera manager? How to find...