- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
What is Impala alternative in HDP?
- Labels:
-
Apache Ambari
-
Apache Impala
Created 10-18-2016 01:44 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi
We have one product which runs on CDH and uses Impala in on of its module. I need to know what are alternatives of Impala in HDP. Also ,we need to connect to Pentaho using that.
Ankit
Created 10-18-2016 01:46 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hive with LLAP - Please check out this recent post: http://hortonworks.com/blog/apache-hive-vs-apache-impala-query-performance-comparison/
- Luis
Created 10-18-2016 01:46 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hive with LLAP - Please check out this recent post: http://hortonworks.com/blog/apache-hive-vs-apache-impala-query-performance-comparison/
- Luis
Created 10-18-2016 01:54 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Ankit Jindal LLAP which comes with HDP 2.5. Or Apache HAWQ, which is also known as HDB. Both are fast sql engines and faster then impala. Both run on yarn. LLAP has terabyte scale. HDB is virtually limitless by adding nodes based on your usage. Lastly for known query patterns or access patterns, apache phoenix should be considered. Using primary row key along with secondary index is simply fast. Here is article on how to use phoenix secondary indexes. https://community.hortonworks.com/content/kbentry/61705/art-of-phoenix-secondary-indexes.html
Created 10-19-2016 07:31 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Created 10-19-2016 08:00 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Apache Phoenix
https://phoenix.apache.org/faq.html
SparkSQL is probably what you are looking for.
Hortonworks is 100% open source apache.
It can run Apache Impala, Apache Drill, Hive LLAP, Phoenix, Apache HAWQ, SparkSQL, Presto.
Created 10-24-2016 10:02 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Good point Tim.
Each "SQL on Hadoop" implementation obviously has pros and cons... general rule of thumbs:
- SparkSQL --> good for iterative processing, access existing Hive tables, given results fits in memory
- HAWQ --> good for "traditional" BI-like queries, star schemas, cubes OLAP
- HIVE-LLAP --> good for petabyte scale mixed with smaller tables requiring sub-second queries.
- Phoenix --> A good way to interact with HBase tables, good with time series, good indexing
- Drill, Presto --> Query federation-like capabilities but limited SQL syntax. Performance varies quite a bit.
Created 10-24-2016 10:03 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Good point Tim.
Each "SQL on Hadoop" implementation obviously has pros and cons... general rule of thumbs:
- SparkSQL --> good for iterative processing, access existing Hive tables, given results fits in memory
- HAWQ --> good for "traditional" BI-like queries, star schemas, cubes OLAP
- HIVE-LLAP --> good for petabyte scale mixed with smaller tables requiring sub-second queries.
- Phoenix --> A good way to interact with HBase tables, good with time series, good indexing
- Drill, Presto --> Query federation-like capabilities but limited SQL syntax. Performance varies quite a bit.