Member since: 04-11-2016
Posts: 535
Kudos Received: 147
Solutions: 77
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 4097 | 09-17-2018 06:33 AM
 | 892 | 08-29-2018 07:48 AM
 | 1488 | 08-28-2018 12:38 PM
 | 944 | 08-03-2018 05:42 AM
 | 983 | 07-27-2018 04:00 PM
02-15-2022
08:00 AM
Hi @CN As this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.
01-24-2022
02:38 AM
Hi, when I run the Hive query it shows the error below: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask. This error does not occur every time; the query succeeds for some users and fails for others. Could you please suggest the reason and how to overcome it? This is urgent; could you please help us?
01-19-2021
03:38 AM
Thank you so much, Subha. It worked like magic.
12-31-2020
10:11 AM
@amol_08 Can you let me know the fix for this? Why are the logs not visible?
10-22-2020
04:59 AM
I did as @ssubhas said, setting the attributes to false:
spark.sql("SET hive.enforce.bucketing=false")
spark.sql("SET hive.enforce.sorting=false")
spark.sql("SET spark.hadoop.hive.exec.dynamic.partition = true")
spark.sql("SET spark.hadoop.hive.exec.dynamic.partition.mode = nonstrict")
newPartitionsDF.write.mode(SaveMode.Append).format("hive").insertInto(this.destinationDBdotTableName)
Spark can create the bucketed table in Hive with no issues, and it inserted the data into the table, but it totally ignored the fact that the table is bucketed, so when I open a partition I see only one file. When inserting, we should set hive.enforce.bucketing = true, not false, but then you will face the following error in the Spark logs:
org.apache.spark.sql.AnalysisException: Output Hive table `hive_test_db`.`test_bucketing` is bucketed but Spark currently does NOT populate bucketed output which is compatible with Hive.;
This means that Spark doesn't support insertion into bucketed Hive tables. The first answer in this Stack Overflow question explains that what @ssubhas suggested is a workaround that doesn't guarantee bucketing.
07-30-2020
08:55 AM
1 Kudo
Brutal, I know, but a one-liner: cd $(cat /etc/ambari-server/conf/ambari.properties | grep -i mpack|awk -F'=' '{print$2}') ; ls -l|grep -v cache |grep -v mpacks_replay.log |grep -v total |awk '{print$9}' |xargs  The last bit is handy if you want to create a Ruby fact out of the data.
06-07-2020
11:36 PM
@oudaysaada As this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also give you the opportunity to include details specific to your environment that could help others give a more accurate answer to your question.
05-02-2020
09:35 AM
@ssubhas This did not work either. Can you help me out? I am unable to connect to the Hive service from PuTTY.
04-20-2020
09:47 AM
hdfs dfs -ls -R <directory> |grep part-r* |awk '{print $8}' |xargs hdfs dfs -cat | wc -l
04-19-2020
04:44 AM
Hi Manoj, We are using text files with '|' as the separator character, but the problem is that we have embedded newlines within column values, which results in empty data in Hive because it treats them as new records. The rest of the data migrates perfectly fine. Could you please suggest how to avoid newline characters within column data? Thanks & Regards, Sreeja
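For reference, one possible cleanup is to strip the embedded line breaks in a staging step before the data reaches the final Hive table, using Hive's regexp_replace. The sketch below is only an illustration under assumed names: staging_table, final_table, col1 and col2 are hypothetical placeholders, and the pattern simply replaces carriage returns/newlines with a space.
-- Hypothetical example: copy from a raw/staging table into the final table,
-- replacing embedded line breaks inside string columns with a space.
INSERT OVERWRITE TABLE final_table
SELECT
  col1,
  regexp_replace(col2, '[\\r\\n]', ' ') AS col2
FROM staging_table;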
03-06-2020
06:06 PM
@sri_man
Since this thread was marked 'Solved' back in 2016, you would have a better chance of receiving a relevant response by posting a new question. This will also give you the opportunity to include details specific to your environment that could help other members provide a more tailored answer to your issue.
02-19-2020
09:20 AM
Writing this so that it can help someone in the future: I was installing Hive and getting an error that the Hive metastore wasn't able to connect, and I resolved it by recreating the Hive metastore database. Somehow the user that was created in the MySQL Hive metastore wasn't working properly and could not authenticate. So I dropped the metastore DB, dropped the user, recreated the metastore DB, recreated the user, granted all privileges, and then it worked without issues.
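For anyone following the same steps, here is a minimal sketch of the MySQL commands involved. The database name (metastore), user (hive), host pattern ('%') and password are assumptions and must match what is configured in your hive-site.xml / Ambari settings.
-- Assumed names and password; adjust to your environment.
DROP DATABASE IF EXISTS metastore;
DROP USER IF EXISTS 'hive'@'%';   -- DROP USER IF EXISTS needs MySQL 5.7+
CREATE DATABASE metastore;
CREATE USER 'hive'@'%' IDENTIFIED BY 'StrongPasswordHere';
GRANT ALL PRIVILEGES ON metastore.* TO 'hive'@'%';
FLUSH PRIVILEGES;
After recreating the database, the metastore schema usually has to be re-initialized (for example with Hive's schematool -initSchema -dbType mysql) before restarting the metastore service.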
01-05-2020
07:07 AM
This solution is not working for me; please tell me where I am going wrong:
sqoop-import -Dmapreduce.job.user.classpath.first=true -Dhadoop.security.credential.provider.path=jceks://x.jceks \
--connect="jdbc:mysql://quickstart.cloudera:3306/retail_db" \
--username retail_dba \
--password cloudera \
--table=departments \
--hive-import \
--target-dir=/departments \
--as-avrodatafile
10-27-2019
12:11 AM
Why has Kylo changed, and what is the future roadmap for Kylo? Will it no longer be a good fit for enterprise data flow management like NiFi? Why Kylo when we have NiFi?
09-17-2019
11:58 AM
@arun_cherlareac Were you able to fix this?
10-09-2018
05:57 AM
Hi Sindhu, I have not made any configuration changes while installing; I followed all the Hortonworks docs to install Druid. Can you tell me which config needs to be changed? Thanks.
09-26-2018
12:07 PM
Yes @Sindhu, I can see the datasource "Sterlingtest" in Superset, but surprisingly, when I logged into the MySQL backend and queried it, I do not see any data source there.
mysql> select * from druid_dataSource;
Empty set (0.00 sec)

mysql> show databases;
+--------------------+
| Database           |
+--------------------+
| information_schema |
| druid              |
+--------------------+
2 rows in set (0.00 sec)

mysql> show tables;
+-----------------------+
| Tables_in_druid       |
+-----------------------+
| druid_audit           |
| druid_config          |
| druid_dataSource      |
| druid_pendingSegments |
| druid_rules           |
| druid_segments        |
| druid_supervisors     |
| druid_tasklocks       |
| druid_tasklogs        |
| druid_tasks           |
+-----------------------+
10 rows in set (0.00 sec)

mysql> select * from druid_audit;
Empty set (0.00 sec)

mysql> select * from druid_pendingSegments;
Empty set (0.01 sec)

mysql> select * from druid_rules;
+-----------------------------------+------------+--------------------------+-----------------------------------------------------------------+
| id                                | dataSource | version                  | payload                                                         |
+-----------------------------------+------------+--------------------------+-----------------------------------------------------------------+
| _default_2018-08-29T09:47:21.779Z | _default   | 2018-08-29T09:47:21.779Z | [{"tieredReplicants":{"_default_tier":2},"type":"loadForever"}] |
+-----------------------------------+------------+--------------------------+-----------------------------------------------------------------+
1 row in set (0.00 sec)

mysql>
09-17-2018
10:48 AM
@Vikash Kumar The 'mapreduce.job.*' properties are only applicable to MR jobs. In Tez, the number of mappers is controlled by the parameters below:
tez.grouping.max-size (default 1073741824, which is 1 GB)
tez.grouping.min-size (default 52428800, which is 50 MB)
tez.grouping.split-count (not set by default)
And reducers are controlled in Hive with these properties:
hive.exec.reducers.bytes.per.reducer (default 256000000)
hive.exec.reducers.max (default 1009)
hive.tez.auto.reducer.parallelism (default false)
For more details, refer to the link.
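As a hedged illustration only (the property names come from the list above; the numeric values are example assumptions, not recommendations), these settings are typically overridden per session before running the query:
-- Illustrative session-level overrides for a Hive-on-Tez query.
SET tez.grouping.min-size=134217728;                  -- 128 MB lower bound per split group
SET tez.grouping.max-size=536870912;                  -- 512 MB upper bound per split group
SET hive.exec.reducers.bytes.per.reducer=134217728;   -- roughly one reducer per 128 MB of input
SET hive.tez.auto.reducer.parallelism=true;           -- let Tez adjust the reducer count at runtime
Smaller grouping sizes generally mean more mappers, and fewer bytes per reducer generally mean more reducers.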
03-04-2019
07:51 AM
You can modify the hive.distro script and have the login authentication entered within the script itself.
08-16-2018
11:02 AM
@Sudharsan Ganeshkumar You are not seeing anything because you are running the command as the root user! You will have to switch to the hive user and use hive or beeline:
# su - hive
$ hive
Then at the prompt run the create statement:
hive> CREATE TABLE IF NOT EXISTS emp ( eid int, name String,
salary String, destination String)
COMMENT 'Employee details'
ROW FORMAT DELIMITED
FIELDS TERMINATED BY '\t'
LINES TERMINATED BY '\n'
STORED AS TEXTFILE;
And then run:
hive> show tables;
HTH
08-13-2018
12:59 PM
@rinu shrivastav The split size is calculated by the formula:
max(mapred.min.split.size, min(mapred.max.split.size, dfs.block.size))
Say the HDFS block size is 64 MB and the minimum split size is set to 128 MB; then the split size will be 128 MB:
split size = max(128, min(256, 64)) = 128 MB
To read 256 MB of data, there will be two mappers. To increase the number of mappers, you could decrease the minimum split size down to the HDFS block size.
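As a minimal sketch (using the classic property names from the formula above; the newer equivalents are mapreduce.input.fileinputformat.split.minsize and .maxsize, and the 64 MB values are example assumptions), forcing smaller splits from a Hive session could look like this:
-- Example only: cap splits at the 64 MB block size so the 256 MB input above yields four mappers instead of two.
SET mapred.min.split.size=67108864;   -- 64 MB
SET mapred.max.split.size=67108864;   -- 64 MB
With both bounds at the block size, split size = max(64, min(64, 64)) = 64 MB.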
08-06-2018
12:14 PM
@abcwt112
Can you check whether the Hive metastore process is running with the command 'ps -ef | grep -i metastore'? If it is not running, check for errors under /var/log/hive/hivemetastore.log.
08-03-2018
07:25 AM
2 Kudos
Thank you @Sindhu and @Rakesh S. I did a root cause analysis and found that our server is hosted in AWS, which is a public cloud, and we had not set up Kerberos or firewalls. On the nodes I can find the process w.conf running: yarn 21775 353 0.0 470060 12772 ? Ssl Aug02 5591:25 /var/tmp/java -c /var/tmp/w.conf Within /var/temp I can see a config.json which contains: {
"algo": "cryptonight", // cryptonight (default) or cryptonight-lite
"av": 0, // algorithm variation, 0 auto select
"background": true, // true to run the miner in the background
"colors": true, // false to disable colored output
"cpu-affinity": null, // set process affinity to CPU core(s), mask "0x3" for cores 0 and 1
"cpu-priority": null, // set process priority (0 idle, 2 normal to 5 highest)
"donate-level": 1, // donate level, mininum 1%
"log-file": null, // log all output to a file, example: "c:/some/path/xmrig.log"
"max-cpu-usage": 95, // maximum CPU usage for automatic mode, usually limiting factor is CPU cache not this option.
"print-time": 60, // print hashrate report every N seconds
"retries": 5, // number of times to retry before switch to backup server
"retry-pause": 5, // time to pause between retries
"safe": false, // true to safe adjust threads and av settings for current CPU
"threads": null, // number of miner threads
"pools": [
{
"url": "158.69.133.20:3333", // URL of mining server
"user": "4AB31XZu3bKeUWtwGQ43ZadTKCfCzq3wra6yNbKdsucpRfgofJP3YwqDiTutrufk8D17D7xw1zPGyMspv8Lqwwg36V5chYg", // username for mining server
"pass": "x", // password for mining server
"keepalive": true, // send keepalived for prevent timeout (need pool support)
"nicehash": false // enable nicehash/xmrig-proxy support
},
{
"url": "192.99.142.249:3333", // URL of mining server
"user": "4AB31XZu3bKeUWtwGQ43ZadTKCfCzq3wra6yNbKdsucpRfgofJP3YwqDiTutrufk8D17D7xw1zPGyMspv8Lqwwg36V5chYg", // username for mining server
"pass": "x", // password for mining server
"keepalive": true, // send keepalived for prevent timeout (need pool support)
"nicehash": false // enable nicehash/xmrig-proxy support
},
{
"url": "202.144.193.110:3333", // URL of mining server
"user": "4AB31XZu3bKeUWtwGQ43ZadTKCfCzq3wra6yNbKdsucpRfgofJP3YwqDiTutrufk8D17D7xw1zPGyMspv8Lqwwg36V5chYg", // username for mining server
"pass": "x", // password for mining server
"keepalive": true, // send keepalived for prevent timeout (need pool support)
"nicehash": false // enable nicehash/xmrig-proxy support
}
],
"api": {
"port": 0, // port for the miner API https://github.com/xmrig/xmrig/wiki/API
"access-token": null, // access token for API
"worker-id": null // custom worker-id for API
}
}
which clearly shows that a mining attack has affected our system. Worse, all the files were created and the processes were running with root permissions. Even though I could not confirm the root cause, I guess some attacker got access to our unprotected/unrestricted port 8088, identified that the cluster is not Kerberized, tried some brute force, and cracked our root password. He thus logged in to our AWS cluster and gained full access to it. Conclusion:
1. Enable Kerberos, add Knox, and secure your servers.
2. Try to enable a VPC.
3. Refine the security groups to whitelist only the needed IPs and ports for HTTP and SSH.
4. Use high-security passwords on public clouds.
5. Change the default static user in Hadoop: Ambari > HDFS > Configurations > Custom core-site > Add Property hadoop.http.staticuser.user=yarn
08-03-2018
05:56 AM
@Moises Silva There is a failed query with respect to the ORDER BY. Can you check for the error in the application log? Also, the running queries are not assigned to any DAG. Did you check whether you have enough resources in the RM UI?
10-08-2018
03:38 PM
Same problem but more complex. Can you help me?
The oozie DB is created, the user/pass and privileges are set OK, and the connection test is OK. I can connect through the command line from the same server, emulating the JDBC connector with sqlline: # java -Djava.ext.dirs=/home/user/jline_sqlline__mysql_connector/ sqlline.SqlLine
sqlline version 1.0.2 by Marc Prud'hommeaux
sqlline> !connect jdbc:mysql://pro-hadoop-ambari/oozie oozie XXXXXXX
Connecting to jdbc:mysql://pro-hadoop-ambari/oozie
Connected to: MySQL (version 5.7.23)
Driver: MySQL-AB JDBC Driver (version mysql-connector-java-5.1.17-SNAPSHOT ( Revision: ${bzr.revision-id} ))
Autocommit status: true
Transaction isolation: TRANSACTION_REPEATABLE_READ
0: jdbc:mysql://pro-hadoop-ambari/oozie>
But ... the service doesn't start due to a JDBC error 😞
Validate DB Connection
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/hdp/2.6.5.0-292/oozie/libserver/slf4j-log4j12-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hdp/2.6.5.0-292/oozie/lib/slf4j-simple-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
DONE
DB schema does not exist
Check OOZIE_SYS table does not exist
DONE
Create SQL schema
Error: A connection could not be obtained for driver class "com.mysql.jdbc.Driver" and URL "jdbc:mysql://pro-hadoop-ambari/oozie". You may have specified an invalid URL.
Stack trace for the error was (for debug purposes):
--------------------------------------
<openjpa-2.4.1-r422266:1730418 fatal user error> org.apache.openjpa.util.UserException: A connection could not be obtained for driver class "com.mysql.jdbc.Driver" and URL "jdbc:mysql://pro-hadoop-ambari/oozie". You may have specified an invalid URL.
at org.apache.openjpa.jdbc.schema.DataSourceFactory.newConnectException(DataSourceFactory.java:272)
at org.apache.openjpa.jdbc.schema.DataSourceFactory.installDBDictionary(DataSourceFactory.java:258)
at org.apache.openjpa.jdbc.conf.JDBCConfigurationImpl.getConnectionFactory(JDBCConfigurationImpl.java:733)
at org.apache.openjpa.jdbc.conf.JDBCConfigurationImpl.getDataSource(JDBCConfigurationImpl.java:878)
at org.apache.openjpa.jdbc.conf.JDBCConfigurationImpl.getDataSource2(JDBCConfigurationImpl.java:920)
at org.apache.openjpa.jdbc.schema.SchemaTool.<init>(SchemaTool.java:132)
at org.apache.openjpa.jdbc.meta.MappingTool.newSchemaTool(MappingTool.java:314)
at org.apache.openjpa.jdbc.meta.MappingTool.record(MappingTool.java:495)
at org.apache.openjpa.jdbc.meta.MappingTool.run(MappingTool.java:1095)
at org.apache.openjpa.jdbc.meta.MappingTool.run(MappingTool.java:1006)
at org.apache.openjpa.jdbc.meta.MappingTool$1.run(MappingTool.java:939)
at org.apache.openjpa.lib.conf.Configurations.launchRunnable(Configurations.java:762)
at org.apache.openjpa.lib.conf.Configurations.runAgainstAllAnchors(Configurations.java:752)
at org.apache.openjpa.jdbc.meta.MappingTool.main(MappingTool.java:934)
at org.apache.oozie.tools.OozieDBCLI.createUpgradeDB(OozieDBCLI.java:1191)
at org.apache.oozie.tools.OozieDBCLI.createDB(OozieDBCLI.java:198)
at org.apache.oozie.tools.OozieDBCLI.run(OozieDBCLI.java:131)
at org.apache.oozie.tools.OozieDBCLI.main(OozieDBCLI.java:79)
Caused by: org.apache.commons.dbcp.SQLNestedException: Cannot create PoolableConnectionFactory (Access denied for user 'oozie'@'pro-hadoop-ambari' (using password: YES))
at org.apache.commons.dbcp.BasicDataSource.createPoolableConnectionFactory(BasicDataSource.java:1549)
at org.apache.commons.dbcp.BasicDataSource.createDataSource(BasicDataSource.java:1388)
at org.apache.commons.dbcp.BasicDataSource.getConnection(BasicDataSource.java:1044)
at org.apache.openjpa.jdbc.schema.DBCPDriverDataSource.getDBCPConnection(DBCPDriverDataSource.java:74)
at org.apache.openjpa.jdbc.schema.AutoDriverDataSource.getConnection(AutoDriverDataSource.java:42)
at org.apache.openjpa.jdbc.schema.SimpleDriverDataSource.getConnection(SimpleDriverDataSource.java:76)
at org.apache.openjpa.lib.jdbc.DelegatingDataSource.getConnection(DelegatingDataSource.java:118)
at org.apache.openjpa.lib.jdbc.DecoratingDataSource.getConnection(DecoratingDataSource.java:92)
at org.apache.openjpa.jdbc.schema.DataSourceFactory.installDBDictionary(DataSourceFactory.java:250)
... 16 more
Caused by: java.sql.SQLException: Access denied for user 'oozie'@'pro-hadoop-ambari' (using password: YES)
at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1078)
at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:4187)
at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:4119)
at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:927)
at com.mysql.jdbc.MysqlIO.proceedHandshakeWithPluggableAuthentication(MysqlIO.java:1709)
at com.mysql.jdbc.MysqlIO.doHandshake(MysqlIO.java:1252)
at com.mysql.jdbc.ConnectionImpl.coreConnect(ConnectionImpl.java:2488)
at com.mysql.jdbc.ConnectionImpl.connectOneTryOnly(ConnectionImpl.java:2521)
at com.mysql.jdbc.ConnectionImpl.createNewIO(ConnectionImpl.java:2306)
at com.mysql.jdbc.ConnectionImpl.<init>(ConnectionImpl.java:839)
at com.mysql.jdbc.JDBC4Connection.<init>(JDBC4Connection.java:49)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at com.mysql.jdbc.Util.handleNewInstance(Util.java:411)
at com.mysql.jdbc.ConnectionImpl.getInstance(ConnectionImpl.java:421)
at com.mysql.jdbc.NonRegisteringDriver.connect(NonRegisteringDriver.java:350)
at org.apache.commons.dbcp.DriverConnectionFactory.createConnection(DriverConnectionFactory.java:38)
at org.apache.commons.dbcp.PoolableConnectionFactory.makeObject(PoolableConnectionFactory.java:582)
at org.apache.commons.dbcp.BasicDataSource.validateConnectionFactory(BasicDataSource.java:1556)
at org.apache.commons.dbcp.BasicDataSource.createPoolableConnectionFactory(BasicDataSource.java:1545)
... 24 more
--------------------------------------
07-11-2018
12:13 PM
@Anjali Shevadkar You are right; that's why I asked you to check the Hive CLI. So it seems to be some configuration issue in your Ranger. Did you try to connect using the ZK hosts in your connection string? I suggest you check the following document and verify the permissions on HDFS. Let me know if this works for you. Make sure the user that you configure is the same as the Unix user (or LDAP, whatever). Try configuring another user to test. https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_security/content/configure_ranger_authentication.html Another important thing: check the permissions on your HDFS, because when you are using Ranger you need to change the owner/group and permissions. https://br.hortonworks.com/blog/best-practices-in-hdfs-authorization-with-apache-ranger/
01-16-2019
08:28 AM
@Sindhu Hi Sindhu, what is the connection string for the HTTP mode of Hive with a Kerberized cluster? I am unable to connect via the SQLAlchemy URI; for binary mode it is working fine. Please help me out. I am getting the error below when connecting to the HTTP mode of Hive (Knox): ERROR: {"error": "Connection failed!\n\nThe error message returned was:\nTSocket read 0 bytes"} Thanks in advance.
07-02-2018
07:35 AM
@Asom Alimdjanov Can you verify whether the httpclient*.jar is the same under the <installation>/hive/lib location?
07-27-2018
06:17 PM
@Gayathri Devi Sample R script:
library(DBI)
library(rJava)
library(RJDBC)
hadoop.class.path = list.files(path=c("/usr/hdp/2.4.0.0-169/hadoop"), pattern="jar", full.names=T);
hive.class.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T);
hadoop.lib.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T);
mapred.class.path = list.files(path=c("/usr/hdp/current/hadoop-mapreduce-client/lib"), pattern="jar", full.names=T);
cp = c(hive.class.path, hadoop.lib.path, mapred.class.path, hadoop.class.path)
.jinit(classpath=cp)   # initialise the JVM with the Hadoop/Hive jars on the classpath
drv <- JDBC("org.apache.hive.jdbc.HiveDriver", "hive-jdbc.jar", identifier.quote="`")
conn <- dbConnect(drv, "jdbc:hive2://ixxx:10000/default", "hive", "hive")
show_databases <- dbGetQuery(conn, "show databases")
(OR)
library("DBI")
library("rJava")
library("RJDBC")
hive.class.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T);
hadoop.lib.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T);
hadoop.class.path = list.files(path=c("/usr/hdp/2.4.0.0-169/hadoop"), pattern="jar", full.names=T);
cp = c(hive.class.path, hadoop.lib.path, hadoop.class.path, "/usr/hdp/2.4.0.0-169/hadoop-mapreduce/hadoop-mapreduce-client-core.jar")
.jinit(classpath=cp)
drv <- JDBC("org.apache.hive.jdbc.HiveDriver", "hive-jdbc.jar", identifier.quote="`")
url.dbc <- paste0("jdbc:hive2://xxx:10000/default");
conn <- dbConnect(drv, url.dbc, "hive", "hive")
dbListTables(conn)
04-06-2018
07:20 AM
@Mohd Azhar What is the version of Ambari in use?