Member since
05-16-2016
785
Posts
114
Kudos Received
39
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 2328 | 06-12-2019 09:27 AM | |
| 3579 | 05-27-2019 08:29 AM | |
| 5725 | 05-27-2018 08:49 AM | |
| 5244 | 05-05-2018 10:47 PM | |
| 3113 | 05-05-2018 07:32 AM |
12-02-2017
10:38 PM
what file format are using ? does your table stats being collected ? do you use partitioning or bucketing in your table ? select* from table without limit clause will hurt the performance as it is bad query do you really need to pull in all the record ?
... View more
12-02-2017
10:05 PM
Hi @AlexMulti @syamsri @UjjwalRana @Suribharu are you still looking for solution ? what version of hive are you using ? what file format does your table has ? what client tool are using hiveshell or beeline ? did you perform the pre-requsites before performing the hive delete operation?
... View more
11-27-2017
08:12 PM
below is the example let me know if that works for you. You can use Range or hash partition , also we can perform Range as well as hash partition together or just hash partition by using bucket . Below is the table that has primary key that coloum id we are using for partition (that is a good practice ) CREATE TABLE customersDetails (
state STRING,
PRIMARY KEY (state, name)
)
PARTITION BY RANGE (state)
(
PARTITION VALUE = 'al',
PARTITION VALUE = 'ak',
PARTITION VALUE = 'wv',
PARTITION VALUE = 'wy'
)
STORED AS KUDU;
... View more
11-18-2017
09:57 AM
were are you firing this query ? beeline or impala shell . just dumb question but i am asking you anyways your syntax is correct could be some editor issue or space. try the below format CREATE TABLE foo1 (id INT PRIMARY KEY, col1 STRING ,col2 STRING )
PARTITION BY HASH(id) PARTITIONS 3 STORED AS KUDU;
... View more
10-19-2017
07:15 PM
there are couple of places that needsd tuining in the query level 1 . stats for the table is must for good performance 2. when user is joining two tables make sure there are using the large table in the last and the first table is smaller 3. you can also use HINTS to imporve query performance. 4. hive table's file format is big a factor 5. choosing when to use paritioning vs bucketing. 6.allocate good memory to hiveserver2 and metastore 7.heapsize 8 .load balancer on the host https://www.cloudera.com/documentation/enterprise/5-9-x/topics/admin_cm_ha_hosts.html#concept_qkr_bfd_pr
... View more
10-19-2017
07:03 PM
1. check your cloudera-scm-agent status 2 . check your server_host value in /etc/cloudera-scm-agent/config.ini , it should point to your cloduera manager's host.
... View more
10-09-2017
09:59 PM
@Nevo Its costly but could you fire the below command and see if that fixes REFRESH [db_name.]table_name
REFRESH DATABASE_NAME.TABLE_NAME
... View more
10-04-2017
09:25 PM
could you fire the below commands in master and slave netstat -anp | grep 50060 also see if you can ping your slave from master and vice versa looks like issue between them
... View more
10-04-2017
07:23 PM
2 Kudos
You can allocate 4GB kick off this command in the terminal . that should bring you a cloudera manager express edition sudo /home/cloudera/cloudera-manager --force --express Once everything is started in the terminal you wll see a Cloudera manager url along with the credentials . Let me know if this is suffice
... View more