Member since
11-11-2019
610
Posts
33
Kudos Received
25
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1498 | 02-28-2023 09:32 PM | |
2429 | 02-27-2023 03:33 AM | |
25061 | 12-24-2022 05:56 AM | |
2013 | 12-05-2022 06:17 AM | |
5227 | 11-25-2022 07:37 AM |
12-13-2022
06:25 AM
@raghu9raghavend I think I caught the issue, you are using binary and proving httppath jdbc:hive2://ZK1:2181,ZK2:2181,ZK3:2181/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveServer2;transportMode=binary;httpPath=cliservice; Please check,if below set to http or binary hive.server2.transport.mode Change the url: http: jdbc:hive2://ZK1:2181,ZK2:2181,ZK3:2181/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver2;transportMode=http;httpPath=cliservice; binary: jdbc:hive2://ZK1:2181,ZK2:2181,ZK3:2181/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver2 Change hiveSever2 ==> hiveserver2
... View more
12-05-2022
06:17 AM
Please "Accept as solution" , if this has worked
... View more
12-05-2022
06:17 AM
@Elbehwash 1. Please navigate to CM ==> Hive on tez ==> COnfiguration ==> Hiveserver2 safety valve for hive-site.xml 2. Add hive.server2.tez.initialize.default.sessions = false; 3. restart and check
... View more
12-04-2022
11:04 PM
@Mars001 I see you are using EXTERNAL jdbc driver. COuld you please try with beeline and provide below if this fails: 1. query and queryID 2. hiveserver2 logs 3. Output of show create table <tablename>;
... View more
11-25-2022
07:37 AM
Please find the difference between hive.tez.container.size and tez.task.resource.mb hive.tez.container.size This property specifies tez container size. Usually value of this property should be the same as or a small multiple (1 or 2 times that) of YARN container size yarn.scheduler.minimum-allocation-mb and should not exceed value of yarn.scheduler.maximum-allocation-mb. As a general rule don't put value higher than memory per processor as you want 1 processor per container and you want to spun up multiple containers. You can find very detailed answer and a great architecture diagram on Hortonworks community answer here tez.task.resource.memory.mb Amount of memory used by launched task in TEZ container. tez.task.resource.memory.mb should be set < hive.tez.container.size This will be recalculated. Run the job without setting.
... View more
11-25-2022
02:03 AM
hi @pankshiv1809 DOnt include set tez.task.resource.memory.mb=10240;
... View more
11-24-2022
11:59 PM
@pankshiv1809 Please increase the container size: set hive.tez.container.size=10240; set tez.runtime.io.sort.mb=4096; ==> 40% of hive.tez.container.size Keep on increasing the container. Please also collect table and column stats too https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.5/bk_cloud-data-access/content/hive-analyzing-tables.html Please mark it "Accept As Solution". if your query is answered.
... View more
11-24-2022
12:46 AM
@drgenious Can you please provide below: 1. oozie wf logs(oozie application logs) ==> i can find the exact hive query executed. 2. Hive application logs. Please collect it using: yarn logs -applicationId <appid> > <appid>.txt ==> FOr bith Oozie and hive
... View more
11-23-2022
01:00 AM
@drgenious 1. Impala is always faster. Impala does not use yarn. Impala stores catalog data locally which fetches information faster. Impala backend gthread is built on C++ which is very fast. 2. Impala is not fault tolerant , it is best suited for adhoc queries and ETL is best suited for Hive as Hive is fault tolerant. If the query fails due to network/disk failure,hive will retry but Impala would fail. 3. For stemaming/ingestion like Kafka flow you need to put it in EXTERNAL tables not in Managed(ACID) tables. Managed tabled can be used,if you want to perform alteration of the data like Update/Delete . Please let me know,if you have any queries. Please click "Accept As Solution" , if your query is answered.
... View more
11-22-2022
01:46 AM
Hi @Raed_Faly Could you please share the DDL's for the tables, based on that I can update.
... View more