Support Questions

Find answers, ask questions, and share your expertise
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

How to do multiple parallel inserts into hive external table

New Contributor

I could not find any way to do multiple inserts into hive table because of the naming "000000_0" that hive internally follows while creating external table. I was wondering if there is a way we can set this prefix filename so that we can do multiple parallel inserts.

Anyother way we can do parallel inserts into hive external table is also being looked for

Thanks in advance


Super Collaborator

As indicated in an existing answer by @hduraiswamy there are some things you can do:

1. Give multiple insert commands in parallel, and they will automatically be executed sequentially

2. Writing multiple files to a directory and then creating a hive table on top of the folder, see the aforementioned answer

If this does not work for you, you can of course also work with a non-external hive table.

- Dennis Jaheruddin

If this answer helped, please mark it as 'solved' and/or if it is valuable for future readers please apply 'kudos'.
Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.