Support Questions

Find answers, ask questions, and share your expertise

How to do multiple parallel inserts into hive external table

avatar
New Contributor

I could not find any way to do multiple inserts into hive table because of the naming "000000_0" that hive internally follows while creating external table. I was wondering if there is a way we can set this prefix filename so that we can do multiple parallel inserts.

Anyother way we can do parallel inserts into hive external table is also being looked for

Thanks in advance

1 REPLY 1

avatar

As indicated in an existing answer by @hduraiswamy there are some things you can do:

1. Give multiple insert commands in parallel, and they will automatically be executed sequentially

2. Writing multiple files to a directory and then creating a hive table on top of the folder, see the aforementioned answer

If this does not work for you, you can of course also work with a non-external hive table.


- Dennis Jaheruddin

If this answer helped, please mark it as 'solved' and/or if it is valuable for future readers please apply 'kudos'.