Member since
10-28-2020
622
Posts
47
Kudos Received
40
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1956 | 02-17-2025 06:54 AM | |
6690 | 07-23-2024 11:49 PM | |
1330 | 05-28-2024 11:06 AM | |
1880 | 05-05-2024 01:27 PM | |
1260 | 05-05-2024 01:09 PM |
07-25-2024
06:12 AM
@AhXian 1. Are you able to read data from the external table in Prod(step 4)? 2. In step 5, I meant for you to insert data into a new ACID table, which can be created using similar DDL as the original table. 3. After INSERT(step 5), see if you could read the data from the new table. Ideally it should work
... View more
07-24-2024
05:43 AM
Yes, 5 steps mentioned above
... View more
07-24-2024
12:15 AM
In Hive ACID tables, the base/delta files are associated with a writeID. My assumption is the write ID has changed or got removed in the production HMS, so hive does not recognize the data file. So, we will have to follow the workaround that I mentioned in my last update.
... View more
07-23-2024
11:49 PM
@AhXian Ideally it is not supported to copy files into and out of an ACID table location. In this case we can use a workaround: 1. Create an external table similar(same columns) as the original ACID table. 2. Upload the data file into the external table location. 3. MSCK REPAIR table <external table name> 4. See if you can read the data from the external table now. 5. If [4] works, insert the data into the managed table by using: insert into table managed_table_name partition(xx,yy) select * from external_table_name;
... View more
07-23-2024
05:25 AM
@AhXian the metadata is still not updated, as I see the following: ,numFiles ,0 ,numRows ,0 ,rawDataSize ,0 ,totalSize ,0 Let's try the following commands one after the other: MSCK REPAIR raw_zone_sit.jxtemp SYNC PARTITIONS; ANALYZE TABLE raw_zone_sit.jxtemp PARTITION (dl_created_yr='2023', dl_created_mth='12') COMPUTE STATISTICS; ANALYZE TABLE raw_zone_sit.jxtemp PARTITION (dl_created_yr='2023', dl_created_mth='12') COMPUTE STATISTICS for COLUMNS; See if the describe command still reflects 0 files. Also, could you collect the output of the following command from the backend metastore database? select * from NEXT_WRITE_ID where NWI_TABLE='jxtemp';
... View more
07-22-2024
08:45 AM
@AhXian These ORC tables we are talking about, are they ACID? If we copy data files to ACID table location, and run ANALYZE...COMPUTE STATISTICS, that alone might not resolve the issue. We would like for you to share an example. Please share the following command output: describe formatted table_name partition(dl_created_yr='2023', dl_created_mth='12') &HDFS command o/p hdfs dfs -ls -R <partition location from the above command>
... View more
07-09-2024
05:45 AM
@ldylag You may find the DDLs under /opt/cloudera/parcels/CDH/lib/hive/scripts/metastore/upgrade/<db type>
... View more
06-28-2024
03:33 AM
1 Kudo
@zhuodongLi It will be difficult to find the cause of the issue from the error message in the screenshot. It will be best if you could create a support case with the YARN app log of application_1719542914905_0002, and HS2 logs covering the job run period.
... View more
06-27-2024
11:23 PM
1 Kudo
@kaif This is non-Cloudera Hive you are testing. But let me share my two cents. You are using 'org.apache.hadoop.hive.ql.security.authorization.plugin.sqlstd.SQLStdConfOnlyAuthorizerFactory' as the auth manager. You may want to use the following auth provider that enables you to use GRANT/REVOKE statements. hive.security.authorization.manager=org.apache.hadoop.hive.ql.security.authorization.plugin.sqlstd.SQLStdHiveAuthorizerFactory ...and use: GRANT ROLE admin TO USER kaif; SET ROLE admin;
... View more
06-27-2024
11:02 PM
@kaif Hive does have information_schema database, that you can access from beeline or Hue. You also have SYS db that fetches different information from the RDBMS that is used to store hive metadata. Here's the DDLs of those tables, and this will also give you some idea on what data they store.
... View more