Created 12-03-2015 05:10 PM
Hi, I'm looking for some guidance on setting hive.exec.max.created.files. Is there a formula or ratio I can follow ahead of time to find the right number for my queries?
Created 12-04-2015 02:05 AM
The main purpose of this setting is to prevent overloading HDFS. If a query generates more files than the default limit of 100000, it is better for the user to examine the query and find out why. Maybe the query is generating too many small files.
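If it helps, here is a minimal sketch of how the limit can be inspected and raised for a single Hive session. The value 200000 is only an arbitrary example, not a recommendation. Raising the limit only hides the file-count problem, so it is probably best treated as a stopgap while the query is investigated.

```sql
-- Print the current value of the limit (the default is 100000).
SET hive.exec.max.created.files;

-- Raise the limit for this session only.
-- 200000 is an arbitrary example value, not a tuned number.
SET hive.exec.max.created.files=200000;
```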
Created 12-04-2015 01:23 AM
Created 12-15-2015 08:55 PM
Hitting this limit is usually a symptom of an underlying problem rather than the problem itself. The general recommendation is to turn on
hive.optimize.sort.dynamic.partition=true;
to prevent partitioning plus bucketing from blowing up HDFS file counts.
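As a rough sketch of how this might look in practice, the session below enables the setting before a dynamic-partition insert. The table names (events, events_staging) and column names are hypothetical and only for illustration. The two hive.exec.dynamic.partition settings are assumed to be needed because every partition column in the insert is dynamic.

```sql
-- Sort rows by the dynamic partition columns before writing, so each
-- reducer keeps only one partition's file writer open at a time.
SET hive.optimize.sort.dynamic.partition=true;

-- Allow an insert in which all partition columns are dynamic.
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;

-- Hypothetical tables: events_staging is the source, and events is
-- partitioned by event_date. The partition column comes last in the SELECT.
INSERT OVERWRITE TABLE events PARTITION (event_date)
SELECT user_id, payload, event_date
FROM events_staging;
```

Without the sort, each task can end up writing a separate file for every partition it sees, which is where the file count tends to multiply.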
Created 12-09-2015 05:07 PM
Are you facing this issue while trying to load data into a large table?
Created 02-03-2016 03:34 PM
@schauhan are you still having issues with this? Can you accept the best answer or provide your own solution?