Member since
11-04-2015
261
Posts
44
Kudos Received
33
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 9134 | 05-16-2024 03:10 AM | |
| 4210 | 01-17-2024 01:07 AM | |
| 3641 | 12-11-2023 02:10 AM | |
| 7062 | 10-11-2023 08:42 AM | |
| 4097 | 09-07-2023 01:08 AM |
03-21-2023
04:03 AM
1 Kudo
Hi @ThomasCloudeara , First, the JDBC driver v2.6.4 is very old. Kindly use the latest one, download it from our website: https://www.cloudera.com/downloads/connectors/impala/jdbc/ Preferably use the "JDBC4.2" specification driver, with that use "com.cloudera.impala.jdbc.Driver" instead. Second, please follow the guide https://docs.cloudera.com/documentation/other/connectors/impala-jdbc/2-6-30/Cloudera-JDBC-Connector-for-Apache-Impala-Install-Guide.pdf and verify the JDBC connection string: - if the LDAP authentication is really enabled (AuthMech=3 tells that) or if you have Kerberos authentication only in your cluster (which needs AuthMech=1) - if LDAP auth is really enabled, can you connect to Impala with impala-shell with the same username/password pair? Thanks Miklos
... View more
03-06-2023
04:30 AM
Hi @KaimingGu , the translation might have been incorrect, so I assume you're actually facing "Client Fetch Wait Time" taking the most of the time. At the same time the "Client Fetch Wait Time Percentage" is also close to 100%. This means that while the query has been executed quicky, the client was too slow to fetch the results. This is usually a sign that the network may be slow or the client is facing other slowness, for example it does some processing between the "result.next()" calls, or hitting GC pauses (if it's a Java application). See also the following blog post which explains the different query profile metrics: https://www.ericlin.me/2020/02/impala-query-profile-explained-part-5-query-metrics/ Best regards Miklos Szurap Customer Operations Engineer, Cloudera
... View more
01-19-2023
08:20 AM
Hi @StuartM , I know it's not a direct answer, but this requirement sounds more like a good call for Kafka - which inherently supports the idea of "consumer offsets".
... View more
10-13-2022
08:25 AM
Hi @MichaelPlet , yes, sure, 7.1.7 SP1 is definitely a stable and also a long term support release with lots of bugfixes over 7.1.7. Also check the additional cumulative hotfixes which were release on top of 7.1.7 SP1: https://docs.cloudera.com/cdp-private-cloud-base/7.1.7/runtime-release-notes/topics/chf-pvcb-sp1-overview.html#chf-pvcb-sp1-overview If you have some security vulnerability questions, then kindly raise those questions through a support case. Thank you Miklos Szurap Customer Operations Engineer, Cloudera
... View more
10-13-2022
06:13 AM
These are pretty long GC pauses, I assume they are from the HMS logs. With long GC pauses of course every operation will suffer and will be slow, eventually the SMON's request will time out . Kindly review the HMS heap size and consider to increase it until you get a stable performance (without such GC pauses).
... View more
10-13-2022
03:29 AM
The Canary is just testing whether the basic operatins are working in Hive Metastore. If that shows "unhealthy" it does not necessarily mean that the jobs are failing due to the Hive Metastore not functioning (it may be just slow for example), it is however indeed a warning sign for you that something is not proper. Please connect with beeline to the HiveServer2 and verify what is working and what is failing, then check the HiveServer2 logs and HiveMetastore logs. You can file a support case (where you can share much more details) if this is an urgent issue.
... View more
10-13-2022
01:20 AM
Hi @hanumanth , I assume this is a CDH 6 cluster. Do you have Sentry enabled as well? Is this always happening, or just at some times? Have you tested in beeline how long does it take to drop an example database? Does it also fail with a timeout? I guess it is taking more than 60 seconds (that's the service monitor's default timeout), and since the default timeout for HS2 to HMS is 5 minutes it actually succeeds. Thanks, Miklos
... View more
10-05-2022
01:44 AM
Hi @Jarinek , Yes, in CDH/CDP every service which depends on HDFS will inherit the HDFS configuration "auth-to-local rules", in CM in HDFS Configuration see "Additional Rules to Map Kerberos Principals to Short Names". Kafka does not need HDFS so that's why it has a separate such configuration. See the documentation how to set it: https://docs.cloudera.com/cdp-private-cloud-base/7.1.7/security-kerberos-authentication/topics/cm-security-kerberos-authentication-auth-to-local-isolate.html Best regards Miklos
... View more
08-22-2022
01:02 AM
Hi @Shaswat , Without reviewing completely what (else) may be the problem, the "port=21000" is definitely not correct. Impala has two "frontend" ports to which the clients can connect: - Port 21000 is used only for "impala-shell" - Port 21050 is used for all the other client applications using JDBC, ODBC, Hue or other Python based applications using Impyla - which is also used in the above example. Please see Impyla docs for more. Best regards Miklos
... View more
08-02-2022
01:16 AM
Hi @Neel_Sharma , The message suggests that the query tried to read the table's datafiles as if the table was parquet file based. (and parquet might be the default table format in Hive in CDP - of course only if the table format is not specified during creation) However the table creation script you've shared suggests the table should be text (CSV) based. Can you please verify it with checking what is the table format, with: DESCRIBE FORMATTED GeoIP2_ISP_Blocks_IPv4; Are you in the right database? For the second issue - how do you create the external tables from tab delimited files? How are the files uploaded to hdfs? Thanks Miklos
... View more