can we apply the partitioning on the already existing Hive table

Re: can we apply the partitioning on the already existing Hive table

http://www.slideshare.net/BenjaminLeonhardi/hive-loading-data

Are you using CDH or HDP? On HDP I would suggest the ORC format. It's very similar to Parquet, just better supported and tested.

If your load from SQL Server is slow, it's most likely not the Hive table creation but Sqoop. You could increase the number of mappers, but there might not be an easy fix. If the problems are in the INSERT INTO, you can look into the PPT for tips (specifically the distribution methods near the end).
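To make both suggestions concrete, here is a rough sketch rather than a tested recipe. It only assembles the Sqoop command line and the HiveQL statements as strings so you can review them before running anything. The host, database, table and column names are all made up for illustration, and it assumes the partitioned target table has already been created. `--num-mappers`, `--split-by`, `--hive-import` and `--hive-table` are standard Sqoop import options, and the two `SET` lines are the usual Hive settings for dynamic partition inserts.

```python
import shlex


def sqoop_import_command(jdbc_url, table, split_by, num_mappers=8, hive_table=None):
    """Build a `sqoop import` argument list with an explicit mapper count."""
    if num_mappers < 1:
        raise ValueError("num_mappers must be at least 1")
    args = [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--table", table,
        # Sqoop splits the source rows across mappers on this column,
        # so it should be evenly distributed (a numeric key, for example).
        "--split-by", split_by,
        "--num-mappers", str(num_mappers),
    ]
    if hive_table:
        args += ["--hive-import", "--hive-table", hive_table]
    return args


def repartition_statements(source, target, partition_col, columns):
    """Build HiveQL that copies an existing table into a partitioned one.

    Assumes `target` already exists and is partitioned by `partition_col`.
    In a dynamic partition insert the partition column must come last
    in the SELECT list.
    """
    select_list = ", ".join(list(columns) + [partition_col])
    return [
        "SET hive.exec.dynamic.partition=true",
        "SET hive.exec.dynamic.partition.mode=nonstrict",
        f"INSERT INTO TABLE {target} PARTITION ({partition_col}) "
        f"SELECT {select_list} FROM {source} "
        # DISTRIBUTE BY sends all rows of one partition to the same reducer,
        # which keeps the number of files written per partition small.
        f"DISTRIBUTE BY {partition_col}",
    ]


if __name__ == "__main__":
    # Hypothetical connection details, for illustration only.
    cmd = sqoop_import_command(
        "jdbc:sqlserver://dbhost:1433;databaseName=sales",
        "orders", "order_id", num_mappers=16, hive_table="staging.orders",
    )
    print(shlex.join(cmd))
    for stmt in repartition_statements(
        "staging.orders", "warehouse.orders", "order_date",
        ["order_id", "customer_id", "amount"],
    ):
        print(stmt + ";")
```

More mappers only help if SQL Server can handle the extra parallel connections, so raise the number gradually.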

Re: can we apply the partitioning on the already existing Hive table

We are on CDH. I will have a look at the PPT. Can you also answer my other comment here: https://community.hortonworks.com/questions/14313/facing-issues-while-ingesting-data-into-hive.html