Member since: 06-11-2018 | Posts: 9 | Kudos Received: 1 | Solutions: 0
08-08-2019 09:50 AM
Hi @Leonardo Araujo, I don't know whether you already did this. About the side effects on HDP 2.6.4, there is one to take into account: enabling Hive ACID changes the transaction manager from hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DummyTxnManager to hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager. The big impact is the way Hive then manages locks, even for non-transactional tables, as described in the Hive documentation on the LockManager. This can dramatically increase the processing times of your Hive jobs. You can use the parameter hive.txn.strict.locking.mode to control the LockManager's behavior; even though the documentation says it is only available since Hive 2.2, running set -v shows that the parameter is present in Hive 1.2.1000. You could also hit two bugs:
HIVE-17900: Compaction of ACID table might fail if table is partitioned by more than one column
HIVE-18293: Compaction of ACID table failed when using doAs
Hope this helps. Best regards, Eric.
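If you want to verify which transaction manager is active before and after enabling ACID, here is a minimal sketch over Hive JDBC; the HiveServer2 URL and credentials are placeholders, and it assumes the hive-jdbc driver is on the classpath. In Hive, SET <property> with no value echoes the current setting.

    import java.sql.Connection;
    import java.sql.DriverManager;
    import java.sql.ResultSet;
    import java.sql.Statement;

    public class HiveTxnSettingsCheck {
        public static void main(String[] args) throws Exception {
            // Hypothetical HiveServer2 endpoint; adjust host, port, and credentials.
            String url = "jdbc:hive2://localhost:10000/default";
            try (Connection conn = DriverManager.getConnection(url, "hive", "");
                 Statement stmt = conn.createStatement()) {
                String[] props = {
                        "hive.txn.manager",
                        "hive.txn.strict.locking.mode",
                        "hive.support.concurrency"
                };
                for (String prop : props) {
                    // "SET <prop>" with no value returns the current key=value pair.
                    try (ResultSet rs = stmt.executeQuery("SET " + prop)) {
                        while (rs.next()) {
                            System.out.println(rs.getString(1));
                        }
                    }
                }
            }
        }
    }

Running this before and after the change should show the switch from DummyTxnManager to DbTxnManager, and whether hive.txn.strict.locking.mode exists on your build.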
09-18-2018 05:37 AM
None of us is using HDP 3.0; what about HDP 2.6? ACID has always been "off" for the last half year, and 100 Hive tables (text, Avro, ORC) have been created in that time. If we change to ACID as the default, what effects other than performance do we need to consider? Are old tables converted to ACID tables? Will old tables still work as expected?
06-12-2018 02:26 PM
With Java you can add each Delete to a List<Delete> and delete them all at once by passing the list to Table.delete(List<Delete>). If you mean you're deleting a large volume of data, you may want to run the jar that performs this delete as a cron job. If you mean through the HBase shell, I think it only supports deleting a whole row or a single cell; I don't think it has a range-delete function unless there is a way to combine a scan and a delete. A sketch of that scan-and-batch-delete pattern follows below.
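Here is a minimal sketch of the scan-and-batch-delete pattern with the HBase Java client. The table name "mytable" and the start/stop row keys are placeholders, and it assumes a client version where Scan.withStartRow/withStopRow exist (HBase 1.4+/2.x).

    import java.io.IOException;
    import java.util.ArrayList;
    import java.util.List;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;
    import org.apache.hadoop.hbase.TableName;
    import org.apache.hadoop.hbase.client.Connection;
    import org.apache.hadoop.hbase.client.ConnectionFactory;
    import org.apache.hadoop.hbase.client.Delete;
    import org.apache.hadoop.hbase.client.Result;
    import org.apache.hadoop.hbase.client.ResultScanner;
    import org.apache.hadoop.hbase.client.Scan;
    import org.apache.hadoop.hbase.client.Table;
    import org.apache.hadoop.hbase.util.Bytes;

    public class BatchDelete {
        public static void main(String[] args) throws IOException {
            Configuration conf = HBaseConfiguration.create();
            try (Connection conn = ConnectionFactory.createConnection(conf);
                 Table table = conn.getTable(TableName.valueOf("mytable"))) {
                // Scan the key range to delete; start/stop rows are placeholders.
                Scan scan = new Scan()
                        .withStartRow(Bytes.toBytes("row-0000"))
                        .withStopRow(Bytes.toBytes("row-9999"));
                List<Delete> batch = new ArrayList<>();
                try (ResultScanner scanner = table.getScanner(scan)) {
                    for (Result r : scanner) {
                        batch.add(new Delete(r.getRow()));
                        // Flush periodically so the client doesn't buffer millions of Deletes.
                        if (batch.size() >= 10_000) {
                            table.delete(batch);
                            batch.clear();
                        }
                    }
                }
                if (!batch.isEmpty()) {
                    table.delete(batch);  // submit the remaining Deletes
                }
            }
        }
    }

Batching keeps the number of RPC round trips down compared with deleting row by row, which matters when removing a large key range.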
06-11-2018 11:31 AM
Hi ducna, how are you? Regarding your MapReduce job to mass-delete rows in HBase: can you share this job with me? I've been facing the same problem and I'm looking for examples of the best approach to massively delete rows from HBase.
02-10-2017 05:06 PM
@Leonardo Araujo Check this link: https://wiki.apache.org/hadoop/HowManyMapsAndReduces Target one map task per block. If the file you are reading has five blocks distributed across three nodes (or four or five nodes) on five disks, then you should have five mappers, one per block (and here, one per disk).
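To make the arithmetic concrete, here is a small sketch using the Hadoop FileSystem API; the input path /data/input.txt is a placeholder. With default FileInputFormat behavior, the number of mappers is roughly one per HDFS block, i.e. ceil(fileLength / blockSize).

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class EstimateMappers {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            // Hypothetical input path; replace with your own file.
            FileStatus status = fs.getFileStatus(new Path("/data/input.txt"));
            long blockSize = status.getBlockSize();  // e.g. 128 MB by default
            long length = status.getLen();
            // One input split (and thus one mapper) per block, rounded up.
            long estimatedMappers = (length + blockSize - 1) / blockSize;
            System.out.printf("%d bytes / %d-byte blocks -> ~%d mappers%n",
                    length, blockSize, estimatedMappers);
        }
    }

A 600 MB file with 128 MB blocks would therefore get about five mappers, matching the five-blocks example above.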