Member since
05-15-2023
12
Posts
2
Kudos Received
0
Solutions
08-21-2024
12:49 PM
If you import the Hive table through a script, the lineage data will not be visible. To view the lineage data, your metadata must sync automatically.
... View more
06-18-2024
01:22 AM
1 Kudo
To troubleshoot the issue of ranger policies not getting synced, you can check the following log files in HDFS: 1. ranger_admin.log: This log file contains the logs related to the Ranger Admin service. It can be found in the Ranger Admin node at the location: `/var/log/ranger/ranger-admin`. 2. ranger_admin_audit.log: This log file contains the logs for auditing actions performed by Ranger Admin. It can be found at the same location as ranger_admin.log.= 3. hdfs.log: This log file contains the logs for HDFS operations. It can be found in the Hadoop log folder, which is usually located at: `/var/log/hadoop/hdfs` or `/var/log/hadoop-hdfs`
... View more
02-16-2024
02:54 AM
1 Kudo
Thank you, my friend. A week ago, I read through your configurations in the official documentation and experimented with them. However, I encountered an error along the lines of 'class not found.' Currently, I've identified the root cause: I'm using HDP 3.1.0, which includes PySpark 2.3.2.3.1.0.0-78. Therefore, I upgraded it to PySpark 3, while still using the standalone-metastore-1.21.2.3.1.0.0-78-hive3.jar file by default. That's the reason why, when using the configuration, I encountered the 'class not found' error. Now, I've replaced that JAR file with hive-metastore-2.3.9.jar. Everything is working fine now. Once again, thank you, my friend.
... View more
08-20-2023
08:54 AM
To calculate the DataFrame size, you can use SizeEstimator class.
... View more
07-05-2023
09:03 PM
It works for me
... View more
05-19-2023
01:50 AM
Hi @sonnh You can go ahead the raise the hdfs case by uploading all required logs like NM logs and DN logs.
... View more