- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
HIVE insert/update/delete
- Labels:
-
Apache Hive
Created ‎05-12-2017 05:51 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi All,
I tried HIVE insert/update/delete operations, I see that it launches a tez job and bit slow. I have a couple of questions -
1. when we use hive streaming processors (in nifi, streamsets) does it insert record by record? I believe no. just to confirm.
2. it seems bit risky to design an acid hive table, is there an approach where it can be used (such as IOT ) safely without having locking/concurrency issues etc.
Thanks,
Avijeet
Created ‎06-08-2017 05:28 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
In Nifi, you can use the PutHiveStreaming processor, and it is designed to commit transactions in batch, which is configurable.
I think risky is not the correct term for using an ACID table, but careful may be better; that is, with careful design and configuration, you can avoid locking issues. Be sure to review the Hive documentation on this:
https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions
Created ‎06-08-2017 05:28 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
In Nifi, you can use the PutHiveStreaming processor, and it is designed to commit transactions in batch, which is configurable.
I think risky is not the correct term for using an ACID table, but careful may be better; that is, with careful design and configuration, you can avoid locking issues. Be sure to review the Hive documentation on this:
https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions
