
How to do multiple parallel inserts into hive external table


I could not find a way to run multiple inserts into a Hive table in parallel, because of the "000000_0" file naming that Hive uses internally when writing to an external table. Is there a way to set this filename prefix so that we can do multiple parallel inserts?

Any other way to do parallel inserts into a Hive external table would also be welcome.

Thanks in advance


Re: How to do multiple parallel inserts into hive external table


As indicated in an existing answer by @hduraiswamy, there are a few things you can do:

1. Issue multiple insert commands in parallel; they will automatically be executed sequentially.

2. Write multiple files to a directory and then create a Hive table on top of that folder; see the aforementioned answer.
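
To illustrate the second option, here is a minimal Python sketch of several writers dropping files into the same folder without clashing on names. It is only an illustration under assumptions: a local directory stands in for the table's storage location, the CSV layout is made up, and `write_part` and `write_batches_in_parallel` are hypothetical helper names, not part of Hive. Each writer picks a random file name instead of relying on "000000_0", and writes under a temporary name first so that a half-written file never appears under its final name.

```python
import csv
import uuid
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def write_part(table_dir, rows):
    """Write one batch of rows to its own uniquely named file (hypothetical helper)."""
    table_dir = Path(table_dir)
    table_dir.mkdir(parents=True, exist_ok=True)

    # A random name avoids the clash that a fixed name such as 000000_0 would cause.
    name = f"part-{uuid.uuid4().hex}.csv"
    tmp = table_dir / f".{name}.tmp"

    # Write to a temporary name first, then rename, so the final name
    # only ever refers to a complete file.
    with tmp.open("w", newline="") as handle:
        csv.writer(handle, lineterminator="\n").writerows(rows)

    final = table_dir / name
    tmp.rename(final)
    return final


def write_batches_in_parallel(table_dir, batches, max_workers=4):
    """Write every batch concurrently; returns the paths of the created files."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda rows: write_part(table_dir, rows), batches))
```

In a real setup the folder would live on the cluster's file system rather than on local disk, and the external table would be declared over that folder once the files are in place, as described in the answer referenced above.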

If neither option works for you, you can of course also use a non-external Hive table.