Member since: 02-29-2024
Posts: 8
Kudos Received: 5
Solutions: 1
My Accepted Solutions
Title | Views | Posted |
---|---|---|
 | 1324 | 10-08-2024 03:57 AM |
10-08-2024 03:57 AM
Yes, upgrading Spark to the newest version, SPARK3-3.3.2.3.3.7190.5-2-1.p0.54391297, fixed the issue.
10-01-2024 05:38 AM
For example, when I run the following via Hue:

import pgeocode
dist = pgeocode.GeoDistance('pl')

I get a timeout error, because I cannot access the internet directly and need to go through a proxy. How can this proxy be configured in Hue, or be set as part of the code run in Hue?
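One option that might apply here, sketched under assumptions: the traceback in this thread shows the download going through `urllib.request`, and `urllib` honours the standard proxy environment variables. Setting them inside the same Python session before the library is called may be enough. The proxy address `proxy.example.com:3128` and the helper name `configure_proxy` are hypothetical placeholders, and this has not been confirmed against Hue or Zeppelin in this thread.

```python
import os
import urllib.request

# Hypothetical proxy address; replace with the real proxy of your network.
PROXY_URL = "http://proxy.example.com:3128"


def configure_proxy(proxy_url: str) -> dict:
    """Set the standard proxy environment variables for this process.

    urllib.request reads these variables when it builds its default opener,
    so they must be set before the first download is attempted.
    """
    for name in ("HTTP_PROXY", "HTTPS_PROXY", "http_proxy", "https_proxy"):
        os.environ[name] = proxy_url
    # Return what urllib will actually use, as a quick sanity check.
    return urllib.request.getproxies()


proxies = configure_proxy(PROXY_URL)
print(proxies)

# Afterwards, in the same interpreter session:
#   import pgeocode
#   dist = pgeocode.GeoDistance('pl')
```

The variables only affect the process in which they are set, so in Hue or Zeppelin this would have to run in the same interpreter session (for Spark, on the driver) as the `pgeocode` call.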
10-01-2024 05:25 AM
Dear Friends,

My Cloudera cluster has no access to the internet, which matters, for example, when I need to fetch a parcel or any other file via wget. But what if I want to import a Python library in Hue or Zeppelin that requires dynamic access to the internet?

I have installed the pgeocode library via pip3, and now, when I call that library dynamically in Zeppelin or Hue, I get a timeout:

File "/usr/lib64/python3.9/urllib/request.py", line 1349, in do_open
    raise URLError(err)
urllib.error.URLError: <urlopen error [Errno 110] Connection timed out

How can I use a proxy server in such cases, so that Hue or Zeppelin can access the internet when required and the Python library modules can be reached?
Labels: Apache Zeppelin, Cloudera Hue
07-04-2024 06:46 AM
1 Kudo
I think the time zone setting has nothing to do with CURRENT_TIMESTAMP() after all. The actual need is to use CURRENT_TIMESTAMP() or TIMESTAMP 'now' and have the function return local time by default in Spark SQL when the query is run through Hue. Right now these functions return a time that is 2 hours behind, which means UTC, and I would like the default to be 2 hours later, for example Europe/Berlin time.
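A possible workaround, offered only as a sketch and not confirmed in this thread: set the session time zone inside the query session itself and format the timestamp as a string, so that whatever renders the result cannot shift it back to UTC. `SET TIME ZONE` and `date_format` are Spark SQL 3.x features; the helper name `local_now_statements` is hypothetical, and whether Hue passes these statements through unchanged is an assumption.

```python
from zoneinfo import ZoneInfo


def local_now_statements(tz: str) -> list:
    """Build Spark SQL statements that return the current time as a string in tz."""
    # Validate the zone name; raises ZoneInfoNotFoundError for unknown zones.
    ZoneInfo(tz)
    return [
        # Applies to the current Spark SQL session only.
        f"SET TIME ZONE '{tz}'",
        # date_format renders the timestamp in the session time zone.
        "SELECT date_format(current_timestamp(), 'yyyy-MM-dd HH:mm:ss') AS local_now",
    ]


for statement in local_now_statements("Europe/Berlin"):
    print(statement + ";")
```

The printed statements would be run one after another in the Spark SQL editor.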
07-03-2024 03:06 AM
1 Kudo
Hello,

Thank you for this hint. However, I have already checked that configuration and it was already set to Europe/Warsaw, which is the desired value. I think there must be some other place where this is specified.
07-01-2024 11:10 PM
1 Kudo
Hello,

I am struggling to set the time zone for Spark SQL sessions when submitting queries through Hue. Maybe somebody has already had this problem and knows how to solve it; I would be grateful for the help.

So far I have checked that in a pyspark session running on Linux the time zone is the one I want:

spark.conf.get("spark.sql.session.timeZone")
'Europe/Warsaw'

However, when I submit a TIMESTAMP query in Hue against Spark SQL, I get the UTC timestamp:

SELECT TIMESTAMP 'now';
TIMESTAMP '2024-07-02 08:01:06.157849' (my current time)
1 2024-07-02T06:01:06Z (the UTC time)

I have also added the variable to the spark-defaults.conf file:

spark.sql.session.timeZone=Europe/Warsaw

But after restarting Spark3 and Hue nothing has changed.

Thank you in advance for your help.

Regards,
Bart
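As a quick sanity check on the two values quoted in this post, the short sketch below (plain Python, no Spark involved) converts the UTC value to Europe/Warsaw. It suggests that both values denote the same instant and differ only in how they are rendered, which is an interpretation of the pasted output rather than something stated in the post.

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo

# The UTC value from the result row: 2024-07-02T06:01:06Z
utc_value = datetime(2024, 7, 2, 6, 1, 6, tzinfo=timezone.utc)

# Render the same instant in the desired session time zone.
local_value = utc_value.astimezone(ZoneInfo("Europe/Warsaw"))

print(local_value.isoformat())   # 2024-07-02T08:01:06+02:00
print(local_value == utc_value)  # True: same instant, different rendering
```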
Labels: Apache Spark, Cloudera Hue
05-16-2024 02:59 AM
2 Kudos
Yes, this is it. Thank you so much for the prompt response.
05-15-2024 05:34 AM
Dear Friends,

The Cloudera services concerned are: Spark 3, Livy for Spark 3, and Hue.

After adding the Spark SQL interpreter in Hue, I am struggling to get more than 1000 rows of results when querying the database. Do you know which parameter should be used to change the default limit?

I have already tried spark.livy.sql.max-result: in CM --> Livy for Spark3 --> search for "Livy for Spark 3 Client Advanced Configuration Snippet (Safety Valve) for livy3-conf/livy-client.conf" and add the above parameter with a value:

spark.livy.sql.max-result=2000

However, after restarting the affected services, nothing has changed. Below you can see what this is about:

I would like to raise this limit to, e.g., 5000, but I am not sure on which of the mentioned services this should be adjusted. I really appreciate your help with this.

Thank you
Labels: Apache Spark, Cloudera Hue