Member since: 03-06-2020
Posts: 398
Kudos Received: 54
Solutions: 35
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 146 | 11-21-2024 10:12 PM
 | 1001 | 07-23-2024 10:52 PM
 | 1143 | 05-16-2024 12:27 AM
 | 3247 | 05-01-2024 04:50 AM
 | 1416 | 03-19-2024 09:23 AM
09-07-2022
11:20 PM
Hi @vz, if both columns are of string type, I think you can use concat or concat_ws. Can you check the articles below and see if they help?
https://community.cloudera.com/t5/Support-Questions/HIVE-Concatenates-all-columns-easily/td-p/180208
https://blog.fearcat.in/a?ID=01600-32e80587-5a71-411e-835b-ed905cb1b61a
https://stackoverflow.com/questions/51211278/concatenate-multiple-columns-into-one-in-hive
Note: If my reply answers your question, please give a thumbs up and accept it as a solution.
Regards, Chethan YM
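For example, a minimal Hive sketch of both functions, assuming two string columns named first_name and last_name in a hypothetical customers table (all names here are illustrative only):

```sql
-- concat() simply joins the values; concat_ws() joins them with a separator
-- and skips NULL arguments.
SELECT CONCAT(first_name, last_name)          AS joined,
       CONCAT_WS('-', first_name, last_name)  AS joined_with_separator
FROM customers;
```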
09-07-2022
11:08 PM
1 Kudo
Hi @gocham,
In CDP 7.1.7, Capacity Scheduler is the default and only supported scheduler; Fair Scheduler is not supported. You must transition from Fair Scheduler to Capacity Scheduler when upgrading your cluster to CDP Private Cloud Base. The related Cloudera Jira is CLR-106983.
Note: If I answered your question, please give a thumbs up and accept it as a solution.
Regards, Chethan YM
09-07-2022
07:18 AM
1 Kudo
Hi @Ramesh_hdp,
I do not think we have an option to check how many users are logged into Hive, but we can check how many connections are made to HiveServer2.
> You can refer to the article below:
https://community.cloudera.com/t5/Support-Questions/How-to-get-number-of-live-connections-with-HiveServer2-HDP-2/td-p/106284
> If you enable the HiveServer2 Web UI, then under Active Sessions you can see which user is connected from which IP.
Regards, Chethan YM
09-05-2022
05:16 AM
Hi,
Please review the documentation below:
https://docs.cloudera.com/HDPDocuments/DAS/DAS-1.4.5/index.html
As per this, it looks like DAS only works with Hive and needs a PostgreSQL database.
Regards, Chethan YM
09-05-2022
05:05 AM
1 Kudo
Hi @Iga21207,
Here is how it works in catalogd: when you run any REFRESH commands, they are executed sequentially, and only once one completes does catalogd move on to the next. They do not run in parallel, because this part of catalogd is a single-threaded operation; the catalogd thread takes a lock in getCatalogObjects(). So while earlier refreshes are still running (that is, they have not yet completed sequentially) and a new request comes in, the Catalog throws the error on that table because it cannot acquire the lock; the lock is still held for the previous table whose REFRESH is in progress. I am not sure of your CDH version; this may be resolved in a higher version of CDP/CDH.
Note: If I answered your question, please give a thumbs up and accept it as a solution.
Regards, Chethan YM
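As a hypothetical illustration (the table names below are not from the original post), two REFRESH statements submitted at roughly the same time from different sessions are still processed one after another by catalogd:

```sql
-- Session 1: a long-running refresh that holds the catalogd lock.
REFRESH sales_db.large_fact_table;

-- Session 2, issued while the first is still running: it queues behind the
-- lock, and in affected versions it can fail with a lock-acquisition error
-- on that table instead of simply waiting.
REFRESH sales_db.small_dim_table;
```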
09-05-2022
04:49 AM
Hi,
> When you say the workflow runs for more than 7 days, does it keep running for the entire 7 days and then fail? Can you provide the script you are running?
> Does the shell script work outside of Oozie without issues?
> Please provide the complete error stack trace you are seeing.
Regards, Chethan YM
08-29-2022
05:21 AM
2 Kudos
Hi,
After you run the query, you need to look at the query profile to analyse the memory usage. Look for "Per Node Peak Memory Usage" in the profile to understand how much memory each host or Impala daemon used to run this query. From the snippet you shared, it looks like this query has a 3 GB memory limit; this can be set at the session level or in the Impala admission control pool. If you provide the complete query profile, I think we can get more details.
Regards, Chethan YM
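For example, a minimal impala-shell sketch of where such a limit can come from and how to pull the profile afterwards (the table name is a placeholder, not from the original post):

```sql
-- Session-level cap: each Impala daemon may use at most 3 GB for this query.
SET MEM_LIMIT=3g;

-- Hypothetical query used only for illustration.
SELECT COUNT(*) FROM sales_db.orders;

-- Print the profile of the last statement; search it for
-- "Per Node Peak Memory Usage" to see actual per-host consumption.
PROFILE;
```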
08-29-2022
05:07 AM
1 Kudo
Hi,
Yes, the Hive metastore is a component that stores all the structural information (metadata) of objects like tables and partitions in the warehouse, including column and column type information, etc.
Note: If this answered your question, please accept the reply as a solution.
Regards, Chethan YM
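As a quick, hypothetical illustration of the kind of metadata the metastore serves back to Hive (the table name is only an example):

```sql
-- Columns, types, storage location, SerDe, and table parameters all come
-- from the Hive metastore.
DESCRIBE FORMATTED web_logs;

-- Partition metadata is tracked in the metastore as well.
SHOW PARTITIONS web_logs;
```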
08-29-2022
03:52 AM
Hi,
I do not think we have such a configuration to validate the data; we need to ensure that the data matches the table definition that we have created.
Regards, Chethan YM
08-29-2022
03:43 AM
1 Kudo
Hi,
There is no limit on the number of databases in the Hive metastore. As a good practice, we do not recommend creating tables with more than 10,000 partitions. In my opinion, you shouldn't have a problem with 2,000 tables. You can expect some performance issues if the total number of objects is greater than 500,000, and there is no hard limit on the number of Hive/Impala databases/tables that you can have in the cluster.
Regards, Chethan YM
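As a rough, hypothetical way to check how a cluster compares against those numbers (the database and table names are placeholders):

```sql
-- Lists every partition the metastore tracks for one table; the row count is
-- the partition count to compare against the guidance above.
SHOW PARTITIONS sales_db.events;

-- Lists databases and the tables in one database, useful for a quick tally of
-- how many objects the metastore is holding.
SHOW DATABASES;
SHOW TABLES IN sales_db;
```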