Member since: 10-28-2020
Posts: 500
Kudos Received: 31
Solutions: 35
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 124 | 03-28-2024 09:51 AM
 | 213 | 03-20-2024 03:54 AM
 | 378 | 03-14-2024 06:29 AM
 | 605 | 03-12-2024 04:00 AM
 | 1095 | 02-18-2024 09:59 AM
03-12-2024
07:46 AM
@Leopold It is disabled for external tables because data in HDFS can change without Hive knowing about it. Unfortunately, I do not see a way to enforce a fetch task for a query with an aggregate function.
03-12-2024
04:00 AM
1 Kudo
@Leopold I just checked, and your observation is correct: for external tables, it does not use a fetch task. In the logs, I see the following message:
2024-03-12 10:48:37,247 INFO org.apache.hadoop.hive.ql.optimizer.StatsOptimizer: [b226e7aa-9a42-4af3-b99b-be4a6592fb7f HiveServer2-Handler-Pool: Thread-31145]: Table t7 is external. Skip StatsOptimizer.
Enabling "hive.fetch.task.aggr=true" will still help, though, by avoiding the Reducer phase used for the final aggregation; the query becomes a Map-only job.
03-12-2024
02:38 AM
@Leopold Provided we have column stats available, Hive can use a fetch task to perform a simple aggregation such as max() instead of launching a Map job. Try hive.fetch.task.aggr=true. This property is disabled by default.
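As a minimal sketch of trying this from a Beeline session (the sales table and amount column are hypothetical placeholders, not from the original thread):
set hive.fetch.task.aggr=true;
-- gather column statistics so the optimizer has them available
analyze table sales compute statistics for columns;
-- a simple aggregate with no GROUP BY, such as max(), may now be served
-- by a fetch task instead of launching a Map job
select max(amount) from sales;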
02-26-2024
05:21 AM
@frbelotto It does not require any installation. You just need to provide the path to the driver JAR file.
02-24-2024
05:31 PM
1 Kudo
My bad. If you are using the Cloudera JDBC jar, the driver class should be com.cloudera.hive.jdbc.HS2Driver. As we are talking about Kerberos authentication, you should first obtain a Kerberos ticket on the client machine (e.g. with kinit), and use a jdbc_url as follows:
import jaydebeapi
# Path to the Cloudera Hive JDBC driver JAR
jar_file = '/path/to/hive-jdbc.jar'
# Fill in {server}, {port} and {principal} with your HiveServer2 host, port and Kerberos principal
jdbc_url = 'jdbc:hive2://{server}:{port}/default;principal={principal}'
# Connect to Hive; username/password are left empty because Kerberos handles authentication
conn = jaydebeapi.connect('com.cloudera.hive.jdbc.HS2Driver', jdbc_url, ['', ''], jar_file)
cursor = conn.cursor()
02-18-2024
09:59 AM
1 Kudo
@frbelotto I have not tried pyhive; I think it requires additional modules if you want to connect using a ZooKeeper quorum. But you could use the jaydebeapi Python module to connect to Hive 3. It works with any type of connection string (Knox/ZK). You would need the Hive JDBC driver, which you can download from here. An example of how to use the jaydebeapi module to connect to Hive:
import jaydebeapi
# Connection parameters
jdbc_url = 'jdbc:hive2://knox.host:8443/default' # JDBC URL for HiveServer2
username = 'your-username'
password = 'your-password'
jar_file = '/path/to/hive-jdbc-driver.jar' # Path to the Hive JDBC driver JAR file
# Establish connection to Hive
conn = jaydebeapi.connect(
'org.apache.hive.jdbc.HiveDriver',
jdbc_url,
[username, password],
jar_file
)
# Create cursor
cursor = conn.cursor()
# Execute Hive query
cursor.execute('SELECT * FROM hive_table')
# Fetch results
result = cursor.fetchall()
# Close cursor and connection
cursor.close()
conn.close()
01-21-2024
06:45 AM
@narasimha8177 Is it happening for all the jobs? Could you check the utilization of the yarn.nodemanager.local-dirs (YARN NodeManager Local Directories) directories? You must have defined a path for this under the YARN configuration; the localized resources get stored under that location. Try deleting all the contents of the usercache directories on all DataNodes and resubmit the job. While deleting the contents of the usercache directories, make sure that no job is in a running state; otherwise, take a downtime to perform this. We need to understand why the localization fails: either the source file is missing, or the target location is not in good shape.
01-16-2024
10:55 AM
@narasimha8177 I think HADOOP-12252 fixes this issue, but that fix is not available in HDP 2.5. Do check the disk usage on all your NodeManager hosts: df -Th. If you notice any directory 100% utilized, clear some files and make sure all the directories are readable/writable.
01-16-2024
07:49 AM
@bulbcat Hive has a KILL QUERY <queryId> command to achieve what you requested here. The feature was added through the upstream JIRAs HIVE-17483 and HIVE-20549. However, this command is not present in the Hive version you mentioned above; it was added in Hive 3.0, which is available in Cloudera CDP clusters.
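For illustration, a minimal sketch of the syntax (the query ID below is a made-up placeholder; the real one can be taken from the HiveServer2 web UI or the query's log output):
-- terminate a running query from any Beeline session connected to the same HiveServer2
KILL QUERY "hive_20240116074900_0000aaaa-bbbb-cccc-dddd-eeeeffff0000";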
12-27-2023
09:48 AM
2 Kudos
Hive 3.0 introduced an option to re-attempt a failed Hive query in case the first run fails. It only makes sense if whatever caused the previous failure has been fixed, so below we discuss ways to configure this once, without having to intervene after each failure event. The following Hive properties enable query re-execution and should be enabled out of the box:
hive.query.reexecution.enabled=true;
hive.query.reexecution.strategies=overlay,reoptimize,recompile_without_cbo,reexecute_lost_am;
Re-execution strategies:
Overlay
Using this method, we can set a Hive property that should be applied on the re-execution. It works by adding a configuration subtree as an overlay to the actual Hive settings (reexec.overlay.*):
set reexec.overlay.{hive_property}=new_value
Every Hive setting that has the prefix "reexec.overlay" will be set for all re-executions. For example, if our Hive queries fail with OOM while performing map joins, which can occur when we do not have correct stats for the tables, we could try disabling hive.auto.convert.join for the next attempt:
set reexec.overlay.hive.auto.convert.join=false;
set hive.query.reexecution.strategies=overlay;
Reoptimize
Throughout the execution of a query, the system actively monitors the real count of rows passing through each operator. This recorded information is leveraged in subsequent re-planning stages, potentially leading to a more optimized query plan. Instances where this becomes essential include:
- Absence of statistics.
- Inaccurate statistics.
- Scenarios involving numerous joins.
To enable this, use:
set hive.query.reexecution.strategies=overlay,reoptimize
set hive.query.reexecution.stats.persist.scope=query
hive.query.reexecution.stats.persist.scope provides an option to persist the runtime stats at different levels:
- query: only used during the re-execution
- hiveserver2: persisted in HS2 until it is restarted
- metastore: persisted in the metastore and loaded on HiveServer2 startup
Avoid setting it to "metastore" due to the bug discussed in HIVE-26978.
recompile_without_cbo
When CBO fails during the compilation phase, Hive falls back to the legacy optimizer, but in many cases it is unable to correctly recreate the AST. HIVE-25792 helps recompile the query without CBO in case it fails.
reexecute_lost_am
Re-executes the query if it failed because the Tez AM node got decommissioned.
Some relevant configurations:
Configuration | Default | Description
---|---|---
hive.query.reexecution.always.collect.operator.stats | false | Enable to gather runtime statistics for all queries.
hive.query.reexecution.enabled | true | Feature enabler.
hive.query.reexecution.max.count | 1 | Number of re-executions that may happen.
hive.query.reexecution.stats.cache.batch.size | -1 | If runtime stats are stored in the metastore, the maximal batch size per round during load.
hive.query.reexecution.stats.cache.size | 100000 | Size of the runtime statistics cache, in OperatorStat entries; a query plan consists of ~100.
hive.query.reexecution.stats.persist.scope | query | Scope at which runtime statistics are persisted: query (only used during the re-execution), hiveserver (persisted during the lifetime of the HiveServer2), or metastore (persisted in the metastore and loaded on HiveServer2 startup).
hive.query.reexecution.strategies | overlay,reoptimize,recompile_without_cbo,reexecute_lost_am | Re-execution plugins; currently overlay and reoptimize are supported.
runtime.stats.clean.frequency | 3600s | Frequency at which the timer task runs to remove outdated runtime stat entries.
runtime.stats.max.age | 3days | Stat entries older than this are removed.
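As a minimal sketch of how these pieces fit together in a Beeline session (the orders/customers join is a hypothetical query that might fail with an OOM on its first attempt; all properties are the ones discussed above):
set hive.query.reexecution.enabled=true;
set hive.query.reexecution.strategies=overlay,reoptimize;
set hive.query.reexecution.stats.persist.scope=query;
-- applied only to the re-execution: disable map-join auto conversion
set reexec.overlay.hive.auto.convert.join=false;
-- if the first attempt fails, Hive re-runs the query with the overlay settings applied
select o.order_id, c.name
from orders o
join customers c on o.customer_id = c.customer_id;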