Member since 03-06-2020
406 Posts
56 Kudos Received
37 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 403 | 08-29-2025 12:27 AM |
| | 1026 | 11-21-2024 10:40 PM |
| | 984 | 11-21-2024 10:12 PM |
| | 3066 | 07-23-2024 10:52 PM |
| | 2164 | 05-16-2024 12:27 AM |
10-17-2022
08:58 PM
@coco, has the above issue been resolved for you? Was @ChethanYM's response helpful? If you have found a solution, can you please share it with our community?
09-29-2022
05:24 PM
How do I create the SURROGATE_KEY() function? I understand that a package would need to be installed to use this function in Impala.
09-24-2022
10:03 PM
1 Kudo
Hi, It looks like it is waiting on the data insert, which may finish after a few minutes. Did it work, or does it still hang for hours? Creating a table by uploading a CSV file usually takes more time. Regards, Chethan YM
09-08-2022
04:56 AM
Thanks, Chethan. The solution here https://community.cloudera.com/t5/Support-Questions/Invalid-resource-request-requested-resource-type-yarn-io-gpu/m-p/243629#M205427 is apparently no longer valid. I switched back to the Capacity Scheduler, increased yarn.nodemanager.resource.memory-mb, and everything seems to be OK now.
09-07-2022
11:20 PM
Hi @vz, If both columns are strings, I think you can use concat or concat_ws. Can you check the articles below and see if they help? https://community.cloudera.com/t5/Support-Questions/HIVE-Concatenates-all-columns-easily/td-p/180208 https://blog.fearcat.in/a?ID=01600-32e80587-5a71-411e-835b-ed905cb1b61a https://stackoverflow.com/questions/51211278/concatenate-multiple-columns-into-one-in-hive Note: If my reply answers your question, please give a thumbs up and accept it as a solution. Regards, Chethan YM
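As a rough illustration of the concat_ws suggestion, here is a small Python helper that only builds the Hive query text. The helper and the table and column names (`customers`, `first_name`, `last_name`) are hypothetical and not taken from the thread. The one Hive-specific detail it relies on is that `concat_ws` takes the separator as its first argument.

```python
def build_concat_query(table, columns, separator="-", alias="combined"):
    """Return a Hive SELECT that joins string columns with concat_ws.

    Hypothetical helper for illustration; it only formats SQL text.
    """
    if not columns:
        raise ValueError("at least one column is required")
    if "'" in separator:
        # Keep the sketch simple: refuse separators that would need escaping.
        raise ValueError("separator must not contain a single quote")
    column_list = ", ".join(columns)
    return (
        f"SELECT concat_ws('{separator}', {column_list}) "
        f"AS {alias} FROM {table}"
    )


# Hypothetical table and columns, joined with a space.
print(build_concat_query("customers", ["first_name", "last_name"], separator=" "))
```

The generated statement can then be run in Beeline or Hue against your own table and columns.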
09-07-2022
08:04 AM
@chethan Thanks for sharing the information.
09-05-2022
05:16 AM
Hi, Please review the documentation below: https://docs.cloudera.com/HDPDocuments/DAS/DAS-1.4.5/index.html According to this, it looks like it only works with Hive and needs a PostgreSQL database. Regards, Chethan YM
09-05-2022
05:05 AM
1 Kudo
Hi @Iga21207, The way it works in catalogd is that when you run refresh commands, they are executed sequentially: once one completes, catalogd moves on to the next. They do not run in parallel, because catalogd handles this as a single-threaded operation. The catalogd thread takes a lock in getCatalogObjects(). So while your refreshes were still running (they had not all completed in sequence), a new request came in, and the catalog threw the error on that table because it could not get the lock, which was still held for the previous table whose refresh was running. I am not sure of your CDH version; this may be resolved in a higher version of CDP/CDH. Note: If I answered your question, please give a thumbs up and accept it as a solution. Regards, Chethan YM
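To make the locking behaviour described above more concrete, here is a toy Python model. It is not Impala's code: `ToyCatalog`, `CatalogBusyError`, the table names, and the timeout value are all made up for illustration. It only shows the general pattern of one lock serialising refreshes, with a request failing when it cannot get the lock in time.

```python
import threading


class CatalogBusyError(Exception):
    """Raised when a refresh cannot get the catalog lock in time."""


class ToyCatalog:
    """Hypothetical stand-in for a catalog that serialises refreshes."""

    def __init__(self):
        self._lock = threading.Lock()
        self.refreshed = []

    def refresh(self, table, timeout_s=0.05):
        # Only one refresh may hold the lock; others wait up to timeout_s.
        if not self._lock.acquire(timeout=timeout_s):
            raise CatalogBusyError(f"lock busy, cannot refresh {table}")
        try:
            self.refreshed.append(table)
        finally:
            self._lock.release()


catalog = ToyCatalog()

# Back-to-back refreshes succeed because each one releases the lock.
catalog.refresh("table_a")
catalog.refresh("table_b")

# Simulate a long-running refresh that still holds the lock.
catalog._lock.acquire()
try:
    catalog.refresh("table_c")
except CatalogBusyError as exc:
    print(exc)
finally:
    catalog._lock.release()
```

In this sketch the third request fails only because the lock is still held, which mirrors the situation described for the table that hit the error.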
08-29-2022
05:21 AM
2 Kudos
Hi, After you run the query, you need to look at the query profile to analyse its overall memory usage. Look for “Per node peak memory usage” in the profile to understand how much memory each host or Impala daemon used to run this query. From the snippet you shared, it looks like this query has a 3 GB maximum memory limit; this can be set at the session level or in the Impala admission control pool. If you provide the complete query profile, I think we can get more details. Regards, Chethan YM
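If you want to pull those per-node figures out of a saved profile, a minimal Python sketch could look like the one below. It assumes the profile contains a single line of the form `Per Node Peak Memory Usage: host:port(1.50 GB) host:port(512.00 MB)`; the exact layout may differ between Impala versions, and the sample text and host names here are invented for illustration.

```python
import re

# Binary multipliers for the units this sketch understands.
UNIT_BYTES = {"B": 1, "KB": 1024, "MB": 1024**2, "GB": 1024**3}

# One "host:port(value unit)" entry, under the assumed line format.
ENTRY = re.compile(r"(\S+?)\(([\d.]+) (B|KB|MB|GB)\)")


def per_node_peak_memory(profile_text):
    """Return {host: peak bytes} from the first matching profile line."""
    for line in profile_text.splitlines():
        if "per node peak memory usage" in line.lower():
            _, _, rest = line.partition(":")
            return {
                host: float(value) * UNIT_BYTES[unit]
                for host, value, unit in ENTRY.findall(rest)
            }
    return {}


# Invented sample, not output from a real cluster.
sample_profile = """\
Query (id=hypothetical):
    Per Node Peak Memory Usage: host-1:22000(1.50 GB) host-2:22000(512.00 MB)
"""

for host, peak in per_node_peak_memory(sample_profile).items():
    print(host, int(peak), "bytes")
```

Comparing the largest value against the query's memory limit gives a quick idea of how close each daemon came to it.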
08-29-2022
05:07 AM
1 Kudo
Hi, Yes, the Hive metastore is the component that stores all the structural information (metadata) for objects such as tables and partitions in the warehouse, including column and column type information. Regards, Chethan YM Note: If this answered your question, please accept the reply as a solution.