Created 10-21-2016 02:07 PM
@Alena Melnikova Good to hear that you are happy with the results:)
Answers:
1. You can go as low as 1k. Choose a balanced option on the average number of rows you query.
2. The usage of function to_date I believe will cause the orc index to stop working (Haven't tested that). Google "why function based index?"
Created 10-22-2016 09:11 AM
got it, thanks!
Created 10-24-2016 09:17 PM
Great job @Alena Melnikova! Nice work with the data and visualization. Really helpful, confirms some longstanding assumptions I've had.
Created 01-04-2018 11:54 AM
Hey everyone,
I have a somewhat similar question, which I posted here:
https://community.hortonworks.com/questions/155681/how-to-defragment-hdfs-data.html
I would really appreciate any ideas.