Member since
09-08-2022
7
Posts
2
Kudos Received
0
Solutions
01-30-2024
04:07 AM
1 Kudo
I'd like to write Scala/Java or Python code which would monitor Home Directories (e.g., /Users/phil) for: - files larger than, say, 10GB - files containing private data, such as bank accounts, email addresses, etc.
... View more
01-30-2024
01:02 AM
1 Kudo
Hello, is it possible to monitor a Hue directory using, say, a User Defined Function? If so, can you provide an example? Many thanks.
... View more
Labels:
- Labels:
-
Cloudera Hue
01-05-2024
04:24 AM
Hello, I've noticed how the CML Terminal lets you install Python packages on the 'Net using pip. (A gcc compiler is even available!) Isn't that a security risk? Is there a way to only allow package installation from an in-house repository? Many thanks.
... View more
Labels:
01-05-2024
04:16 AM
Hi Cloude, Thank your for your reply. What do you mean by "usage of storage Quota and Resource allocation on the queue level"? What's the Queue Level?
... View more
01-02-2024
01:48 PM
Hello, let’s say a Datalake is used by several departments within a company. What are recommended ways to measure Compute and Storage by department, for billing purposes? Many thanks.
... View more
Labels:
- Labels:
-
Cloudera Data Platform (CDP)
09-08-2022
01:04 PM
Hi, thank you for your suggestion, but I’m looking for example Java or Scala code that uses the Hadoop or Cloudera API, instead of scripting the « hdfs » tool on Linux.
... View more
09-08-2022
09:58 AM
Hello, Is there a way to retrieve all HDFS groups and respective users, in Cloudera, using Java or Scala/Spark code? In other words, I’m looking for Java/Scala code equivalent to "hdfs groups <user1>, <user2>, ...". Ideally, the code would run in Zeppelin. Many thanks.
... View more
Labels:
- Labels:
-
Apache Hadoop
-
HDFS