Member since: 09-24-2015
Posts: 105
Kudos Received: 82
Solutions: 9
My Accepted Solutions
Views | Posted |
---|---|
2120 | 04-11-2016 08:30 PM |
1745 | 03-11-2016 04:08 PM |
1744 | 12-21-2015 09:51 PM |
1021 | 12-18-2015 10:43 PM |
8629 | 12-08-2015 03:01 PM |
02-12-2016
05:56 PM
4 Kudos
Can Phoenix JDBC connections be routed through Knox for security purposes? If so, does anyone have tutorials or examples of this? Thanks.
Labels: Apache Knox, Apache Phoenix
01-05-2016
03:24 PM
@Aidan Condron Is temp_batting an external or internal table? My assumption is that it's an internal table, so when you run LOAD DATA INPATH, Hive tries to move the data into /apps/warehouse/database(default)/temp_batting, and the admin user doesn't have permission to move the file. Can you please try running:
hdfs dfs -chmod -R 777 /user/admin/elecMonthly_Orc
and then running your LOAD DATA INPATH command again?
01-05-2016
02:38 PM
1 Kudo
@Aidan Condron What about the directory above it? What's the output of: hdfs dfs -ls /user/admin
It looks like the files are owned by the user 'Spark'. Which user is running the Hive statement?
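To make the ownership check easier to read, here is a minimal sketch that pulls the owner, group, and permissions out of an `hdfs dfs -ls` listing. The function name `list_owners` is just something I made up for illustration, and it assumes the usual eight-column listing layout (permissions, replication, owner, group, size, date, time, path) and paths without spaces.

```shell
# Print "owner group permissions path" for each entry in an
# `hdfs dfs -ls` listing read from stdin.
# Assumption: the usual 8-column layout
#   perms replication owner group size date time path
# The "Found N items" header has fewer fields and is skipped.
# list_owners is a hypothetical helper name, not an HDFS command.
list_owners() {
  awk 'NF >= 8 { print $3, $4, $1, $8 }'
}

# Usage (needs a configured HDFS client):
#   hdfs dfs -ls /user/admin | list_owners
```

If the owner shown there differs from the user running the Hive statement, that would be consistent with the permissions problem suspected above.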
01-05-2016
02:29 PM
1 Kudo
@Aidan Condron Which user are you running the Hive statement as?
01-05-2016
02:26 PM
@Aidan Condron
Can you please take a screenshot of the output of hdfs dfs -ls /user/admin/elecMonthly_Orc
12-21-2015
09:51 PM
@Bhupendra Mishra Depending on your hardware availability for the POC, I would also consider running the POC in the cloud (e.g. Microsoft Azure, AWS, GCP). You can use Cloudbreak to deploy a fully fledged distributed cluster running Spark, YARN, the whole nine yards, in the cloud in a matter of minutes. Here is the documentation on how to do so:
Cloudbreak Overview - http://hortonworks.com/hadoop/cloudbreak/
Cloudbreak Docs - http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-...
12-21-2015
09:45 PM
@Mudit Kumar That is an enterprise decision. You can have one NIC that serves both the internal Hadoop IP address and the public IP address. Typically we see clients adopt some sort of dual-firewall setup, where the edge nodes (and potentially some or all master nodes) have access to the DMZ or at least the corporate network. The data nodes (and remaining master nodes) sit behind another firewall and can only communicate with other data nodes, edge nodes, and master nodes.
12-21-2015
09:39 PM
@Suresh Raju To add some color to what Neeraj provided: to get sandbox.hortonworks.com:4200 to resolve, you will need to update your hosts file to map the IP address the VM starts on to sandbox.hortonworks.com (see https://www.petri.com/easily-edit-hosts-file-windo... ). Alternatively, as Neeraj suggested, you can just reference the IP address the VM starts up on, which in this case is 127.0.0.1 (localhost).
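For anyone on a Linux or macOS host, the mapping described above could be added with something like the sketch below. The helper name `add_hosts_entry` is hypothetical, and the script assumes a standard hosts-file layout (IP address first, hostnames after it). It appends the entry only if no non-comment line already lists the hostname.

```shell
# Append "<ip><TAB><name>" to a hosts file unless the hostname
# is already mapped on a non-comment line.
# add_hosts_entry is a hypothetical helper name for illustration.
add_hosts_entry() {
  hosts_file=$1
  ip=$2
  name=$3
  # awk exits 0 only when some non-comment line lists the name
  # as one of its hostname fields (field 2 onwards).
  if awk -v n="$name" '
       $1 !~ /^#/ { for (i = 2; i <= NF; i++) if ($i == n) found = 1 }
       END { exit !found }
     ' "$hosts_file" 2>/dev/null; then
    return 0
  fi
  printf '%s\t%s\n' "$ip" "$name" >> "$hosts_file"
}

# Usage (editing /etc/hosts itself requires root privileges):
#   add_hosts_entry /etc/hosts 127.0.0.1 sandbox.hortonworks.com
```

On Windows the file lives at C:\Windows\System32\drivers\etc\hosts and is easiest to edit by hand in an editor run as Administrator; the resulting line is the same: the IP address followed by sandbox.hortonworks.com.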
12-21-2015
03:56 PM
@flwong What exactly do you mean by gate node? An 'edge node'? Typically in smaller clusters like the 5-node cluster you have laid out, you could use a master node as an edge node too. Once your cluster grows, you can then separate it out onto its own physical server.
12-21-2015
03:54 PM
@Mudit Kumar That is up to you. Some deployments use multiple NICs for redundancy or higher throughput.