Member since
12-09-2015
37
Posts
28
Kudos Received
4
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
5088 | 12-15-2015 07:47 AM | |
2594 | 12-11-2015 07:51 PM | |
3571 | 12-10-2015 10:06 PM | |
2573 | 12-09-2015 09:17 PM |
04-21-2016
08:02 PM
We've just made a lot of tests: we do not have null value and we already use Hive 1.2.1. We also tried to delete partition from the tables involved in the queries but we always got the same error.
... View more
04-21-2016
07:59 PM
Hi , it could be a compliance issue posting the DDL and query in public. If you can send me an email I can send you the DDL in private. Thanks, Andrea
... View more
04-21-2016
09:16 AM
1 Kudo
@Kuldeep Kulkarni Thanks! Do you know when Tez 1.2.1 will be released? It's planned for HDP 2.4.2?
... View more
04-20-2016
08:28 PM
1 Kudo
We're working on a data preparation phase with Hive and Tez. We're experiencing the following error: Error while processing statement: FAILED: Execution
Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Session
stats:submittedDAGs=0, successfulDAGs=0, failedDAGs=0, killedDAGs=0
(state=08S01,code=2) We're using HDP 2.3.2 with Hive 1.2.1 using ORC format and Tez engine. Does anyone already found this error?
... View more
Labels:
- Labels:
-
Apache Hive
04-20-2016
08:19 PM
If you had some time to spend you can use Falcon to orchestrate data replication from one cluster to another: you had to create Hive table on the second cluster but then you can compare easily the two tables directly from Beeline.
... View more
04-20-2016
08:15 PM
2 Kudos
Hi @Rendiyono Wahyu Saputro, I'll add only one last thing to what @Davide Isoardi wrote: we're converting our demo to run inside Hortonworks Sandbox and we'll push all to GitHub. I'll ping you when we'll finish the job: it could be a good starting point to build your own app. Meanwhile we're also planning a text analysis algorithm to analyze tweets to understand the right rilevance for what we're searching. Stay tuned.
... View more
04-16-2016
09:13 PM
@Davide Vergari just released a custom Ambari service to install Apache Drill thru Ambari on HDP 2.4. Feel free to take a look at it.
... View more
04-16-2016
09:06 PM
Right now I was able to enable SSL in Ranger 0.6.0 downloaded from the Apache Foundation but not in Ranger 0.5.0 included in HDP 2.4.0. Hope in the next release Hortonworks will upgrade Ranger to 0.6.0.
... View more
04-16-2016
08:50 PM
Yes, it's possible. I've already done such a config in a lab environment with HDP cluster on CentOS 7 and Hue node on CentOS 6. It could be a pain if you had a kerberized cluster.
... View more
12-15-2015
07:47 AM
2 Kudos
If not configured properly networking could be a real pain in Hadoop. All nodes in an Hadoop cluster need to see each other and they requires DNS (with reverse) and NTP. You can choose to deploy your cluster inside "cluster" network and use a multi-homed edge node as a bridge between "cluster" and "public" network. But you need to understand how you would access your cluster data (e.g.: JDBC thru Hive, WebHDFS for HDFS files and so on...). An edge nodes doesn't grant you access to Ambari Web UI, Ambari API, etc.. so if you deploy such a config you need to open specific TCP/IP ports in order to grant users on the "public" network to access such a service (e.g. Ambari Views). On the edge node you can deploy all clients (HDFS, YARN, Oozie, Hive, etc...) and let users to access edge node using SSH. It really depends how you want to manage access to Hadoop services and which services you need to give access to your end users. You can use NFS gateway or Knox on the edge node but what are your needs? Please take a look at this links: Hadoop TCP/IP ports: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-... Hadoop IDC and firewalls: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-... NFS Gateway: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-... Knox: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-...
... View more