We are trying to use the Hive Streaming Mutation API. We need to update rows that are already present in our Hive tables.
While trying to connect to Hive, we get the error below:
Exception in thread "main" org.apache.hive.hcatalog.streaming.mutate.client.TransactionException: Not connected - cannot create transaction
Our requirement is bulk insertion with no duplicate rows. If there is a better solution or sample code, please share.
Any help is much appreciated!
Have you considered using the MERGE statement? For example: https://community.hortonworks.com/articles/97113/hive-acid-merge-by-example.html
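To make the MERGE suggestion concrete, here is a minimal sketch of an upsert (update the row if the key exists, insert it otherwise) issued from Java over JDBC. It is an illustration, not tested against your cluster. The table names `customer` and `customer_staging`, the key column `id`, and the columns `name` and `state` are all hypothetical. It assumes the target is an ACID (transactional) table on a Hive version that supports MERGE (2.2 or later, as far as I know), that the bulk data has already been loaded into the staging table, and that the target's columns are ordered key first, then the value columns.

```java
import java.sql.Connection;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class HiveMergeSketch {

    // Builds a MERGE statement: rows whose key already exists in the target
    // are updated, all other staging rows are inserted.
    // Assumption: target column order is (key, valueCols...).
    public static String buildMerge(String target, String staging,
                                    String key, List<String> valueCols) {
        if (valueCols.isEmpty()) {
            throw new IllegalArgumentException("need at least one non-key column");
        }
        // Left-hand side of SET uses unqualified target column names.
        String updates = valueCols.stream()
                .map(c -> c + " = s." + c)
                .collect(Collectors.joining(", "));
        String inserts = valueCols.stream()
                .map(c -> "s." + c)
                .collect(Collectors.joining(", "));
        return "MERGE INTO " + target + " AS t"
             + " USING " + staging + " AS s"
             + " ON t." + key + " = s." + key
             + " WHEN MATCHED THEN UPDATE SET " + updates
             + " WHEN NOT MATCHED THEN INSERT VALUES (s." + key + ", " + inserts + ")";
    }

    // Runs the statement on an already opened JDBC connection,
    // e.g. one obtained from the HiveServer2 JDBC driver.
    public static void runMerge(Connection conn, String sql) throws SQLException {
        try (Statement stmt = conn.createStatement()) {
            stmt.execute(sql);
        }
    }

    public static void main(String[] args) {
        // Hypothetical table and column names, for illustration only.
        String sql = buildMerge("customer", "customer_staging", "id",
                                Arrays.asList("name", "state"));
        System.out.println(sql);
    }
}
```

Because the staging table is matched on the key, re-running the same load updates existing rows instead of duplicating them. Note that the staging data itself should contain at most one row per key, otherwise a target row could match several source rows.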