Created 12-17-2017 04:50 PM
I just want to ask what are the available SQL On Hadoop engines in Hortonworks Data Platform?
Thanks,
Created 12-18-2017 02:09 PM
Hi @Mohammed Syam,
HDP leverages Apache Hive. Hive is proven to work at the petabyte scale as well as providing ACID capabilities and the ability to run all 99 TPC-DS queries. It also provides in-memory LLAP execution for ad-hoc business intelligence workloads. It fully integrated with YARN and provides a robust cost-based optimizer called Apache Calcite.
Hope this helps.
Created 12-19-2017 01:45 PM
Hi Scott,
Are any of the following SQL On Hadoop engines available within HDP:
Concurrent Lingual,
Hadapt,
InfiniDB,
MammothDB,
MemSQL,
Pivotal HawQ,
Progress DataDirect,
ScleraDB,
Simba
Splice Machine
Spark SQL
FacebookPresto. *
LinkedIn Tajo
Stinger, *
Cloudera Impala, **
Platfora,
JethroData,
HAWQ,
CitusDB,
Rainstor,
MapR
Apache Drill
Microsoft PolyBase,
TeraData
SQL-H,
Spark/Shark *
Hadoop++/HAIL
Thanks,
Mohammed
Created 12-19-2017 02:44 PM
I have use Microsoft PolyBase with HDP
HAWQ is available and supported by Pivotal
SparkSQL comes with HDP and is fully supported
JethroData is a major partner and runs well on HDP
Apache Drill works with HDP, but is not supported
TeraData is a partner and works with HDP
Shark is old and deprecated
Stinger is a name for Hive
Presto works with HDP
Progress has drivers for HDP
IBM BigSQL is well integrated with HDP
Hortonworks HDP is standard unmodified Apache Hadoop, Apache Hive, Apache Phoenix, Apache Spark SQL and can run anything that works with regular Hadoop.
The built-in engines are Apache Hive, Apache Phoenix and Apache Spark SQL. BigSQL and Jethro are well integrated partners.
Apache Calcite is everywhere.