djdillon

Community
Training
Partners
Support

Cloudera Community

Frequent Visitor

Member since

My Accepted Solutions

Title

Views

Posted

Re: LOAD DATA time for parquet file depends on par...

4744

‎01-22-2019 09:05 AM

This affected us as well. Here are some potentially related bugs. My understanding of the internals is that it is doing a full table refresh. If you are doing LOAD DATA with partition, ideally it would only do an incremental refresh instead of full and you would get closer to flat times. https://issues.apache.org/jira/browse/IMPALA-7330 https://issues.apache.org/jira/browse/IMPALA-7854 What version of Impala are you using?

Community Statistics

Member Since	‎01-12-2017 07:16 AM
Last Visited	‎01-23-2019 04:25 PM
Posts	2
Kudos received	2