Member since
02-14-2022
3
Posts
0
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1892 | 06-16-2022 07:00 AM |
05-17-2023
08:23 AM
Hi @stephen_obrien As discussed via Cloudera case, there is a performance bottleneck when connecting via knox ( tracked in internal jira ) than directly from phoenix-sqlline from the edge node. You can test the same when the runtime version 7.2.17 is released.
... View more
06-16-2022
07:00 AM
To provide a module with custom Python functions that are declared as UDFs, one must specify: spark_session.sparkContext.addPyFile("/app/mount/python_utils.py") This file should be included in a resource attached to the job. See this post for further examples: https://blog.cloudera.com/managing-python-dependencies-for-spark-workloads-in-cloudera-data-engineering/
... View more