Member since 06-20-2016
488 Posts
433 Kudos Received
118 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 3602 | 08-25-2017 03:09 PM |
| | 2505 | 08-22-2017 06:52 PM |
| | 4195 | 08-09-2017 01:10 PM |
| | 8972 | 08-04-2017 02:34 PM |
| | 8946 | 08-01-2017 11:35 AM |
09-22-2016
01:41 AM
Thank you for confirming.
09-21-2016
06:37 PM
Syncsort is a third-party partner of ours. We work very closely with them, but we do not support them. I suggest you head over to their support site: http://bigdatakb.syncsort.com/Default.htm#KB/FAQ/297_Creating_DMX-h_Job_Tutorial.htm
09-21-2016
10:48 AM
1 Kudo
@Gaurab D Just to be sure ... are you using DMX-h or simply DMX? The "h" in the first one stands for Hadoop. That is the one that integrates with Hadoop and YARN.
09-20-2016
07:15 PM
3 Kudos
In the HDP 2.5 release notes it says that Hive 2.1 is TP: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_release-notes/content/tech_previews.html

Hive
New in this release: Stored Procedures Client-Side LLAP Daemons (HIVE-7193), LLAP / Spark security interface, Hive 2.1

In the HDP 2.5 Release email to customers it was stated that Hive 2.1 is TP but that Hive ACID is certified for production with Hive 1.2.1:

Apache Hive
- Includes Apache Hive 1.2.1 for production and Hive 2.1 (Technical Preview) for cutting-edge performance
- Hive LLAP (Technical Preview): Persistent query servers and optimized in-memory caching for blazing fast SQL. Up to 25x faster for BI workloads. 100% compatible with existing Hive workloads
- Hive ACID and Streaming Ingest certified for production use with Hive 1.2.1
- Dynamic user-based security policies for data masking and filtering
- HPL/SQL: Procedural programming within Hive
- Hive View v1.5.0, improved robustness and security
- Parquet format fully certified with Hive 1.2.1 / 2.1

In the Hortonworks.com Hive Overview section it states (confusingly) that ACID is GA in Hive 2.1 (though it originated in 0.14): http://hortonworks.com/apache/hive/#section_3
09-20-2016
05:38 PM
(Am using VirtualBox and do not encounter this problem)
09-20-2016
04:33 PM
1 Kudo
Awesome, thanks Michael. Just had the same problem. (Had to start HBase as well ... any way to know the sequence of dependencies? Understood that Atlas has many.)
09-20-2016
02:00 PM
@Saumitra Buragohain Thank you for putting this into proper perspective before parachuting into the weeds!
09-20-2016
01:36 PM
3 Kudos
Syncsort DMX-h is an excellent ETL tool that is YARN-enabled: http://www.syncsort.com/en/Products/BigData/DMXh
09-20-2016
12:37 PM
2 Kudos
You need to take three approaches, in this order:
1. Minimize your data before the join (e.g. load only the columns needed for the join and the output, and filter before the join).
2. Optimize your joins.
3. Optimize settings (including compressing intermediate results).

For 1, see: https://pig.apache.org/docs/r0.7.0/cookbook.html
For 1 and 2, see: https://pig.apache.org/docs/r0.9.1/perf.html
After performing these optimizations, for 3 see: http://chimera.labs.oreilly.com/books/1234000001811/ch08.html#pig_tuning

Also, be sure you are running Pig on Tez.
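To make the first approach concrete, here is a minimal sketch in plain Python (not Pig Latin) of why filtering rows and dropping unused columns before a join helps. The `orders` and `users` data and all function names are hypothetical, made up for illustration. In a Pig script the same idea would roughly correspond to applying FILTER and FOREACH ... GENERATE ahead of the JOIN.

```python
from collections import defaultdict

def hash_join(left, right, key):
    """Inner-join two lists of dicts on `key` using an in-memory hash table."""
    index = defaultdict(list)
    for row in right:
        index[row[key]].append(row)
    return [{**l, **r} for l in left for r in index.get(l[key], [])]

# Hypothetical sample data: wide rows with columns the output never uses.
orders = [
    {"user_id": 1, "amount": 30, "status": "paid", "notes": "gift wrap"},
    {"user_id": 2, "amount": 15, "status": "cancelled", "notes": ""},
    {"user_id": 1, "amount": 99, "status": "paid", "notes": "express"},
]
users = [
    {"user_id": 1, "name": "ann", "bio": "long text ..."},
    {"user_id": 2, "name": "bob", "bio": "long text ..."},
]

def join_then_filter(orders, users):
    # Joins every row with every column, then throws most of it away.
    joined = hash_join(orders, users, "user_id")
    return [{"name": r["name"], "amount": r["amount"]}
            for r in joined if r["status"] == "paid"]

def filter_then_join(orders, users):
    # Filter rows and keep only the join key plus output columns first,
    # so the join itself handles fewer and narrower rows.
    slim_orders = [{"user_id": o["user_id"], "amount": o["amount"]}
                   for o in orders if o["status"] == "paid"]
    slim_users = [{"user_id": u["user_id"], "name": u["name"]} for u in users]
    joined = hash_join(slim_orders, slim_users, "user_id")
    return [{"name": r["name"], "amount": r["amount"]} for r in joined]
```

Both functions return the same result; the second simply passes less data through the join, which is the part that gets shuffled across the cluster in a real Pig job.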