Member since
10-08-2015
2
Posts
7
Kudos Received
0
Solutions
04-01-2016
10:27 PM
5 Kudos
HDFS caching helps, however it helps only a bit since you are saving only the cost of moving bytes off disk and are still paying the cost of de-serialization, don't get JVM JIT etc. So, with technologies like Hive LLAP (coming in hive-2) you will get significantly better performance because LLAP caches de-serialized vectors in memory-efficient formats (2 bits for certain integer ranges - rather than 4 bytes), cpu-efficient filters (vectorized query processing via filters etc.) removes JVM startup cost for tasks (100s of ms), provides JIT-enhanced CPU performance etc. Rather excited about it!
... View more
10-20-2015
10:20 PM
2 Kudos
https://github.com/hortonworks/gohadoop gohadoop is native go clients for Apache Hadoop YARN. It includes an early version of Hadoop IPC client and requisite YARN client libraries to implement YARN applications completely in go (both YARN application-client and application-master).
... View more
Labels: