Member since
10-01-2015
3933
Posts
1150
Kudos Received
374
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3373 | 05-03-2017 05:13 PM | |
2801 | 05-02-2017 08:38 AM | |
3082 | 05-02-2017 08:13 AM | |
3011 | 04-10-2017 10:51 PM | |
1527 | 03-28-2017 02:27 AM |
11-21-2016
11:22 PM
We strongly advise to upgrade your Ambari at least to 2.2 as long as your current HDP allows in place upgrade. I can't confirm whether standaloe mode is supported in such an old release but I did try using standalone Ambari 2.4 connect to cluster with Ambari 2.2 and deploy views, worked well.
... View more
11-21-2016
04:03 PM
@Larisa Agishtein please confirm that you followed this guide to setup storm-kafka http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_storm-component-guide/content/storm-kafka-kerb.html
... View more
11-20-2016
11:24 PM
@Peter Kim from the user guide By default sqoop will use query select min(<split-by>), max(<split-by>) from <table name> to find out boundaries for creating splits. In some cases this query is not the most optimal so you can specify any arbitrary query returning two numeric columns using --boundary-query argument. If the actual values for the primary key are not uniformly distributed across its range, then this can result in unbalanced tasks. You should explicitly choose a different column with the --split-by argument. For example, --split-by employee_id . Sqoop cannot currently split on multi-column indices. If your table has no index column, or has a multi-column key, then you must also manually choose a splitting column. If a table does not have a primary key defined and the --split-by <col> is not provided, then import will fail unless the number of mappers is explicitly set to one with the --num-mappers 1 option or the --autoreset-to-one-mapper option is used. The option --autoreset-to-one-mapper is typically used with the import-all-tables tool to automatically handle tables without a primary key in a schema. There's also a whole section on Oracle connector in Sqoop user guide v. 1.4.6 The Data Connector for Oracle and Hadoop generates queries for the mappers of the form: SELECT *
FROM sometable
WHERE rowid >= dbms_rowid.rowid_create(1, 893, 1, 279, 0) AND
rowid <= dbms_rowid.rowid_create(1, 893, 1, 286, 32767) The Data Connector for Oracle and Hadoop queries ensure that:
No two mappers read data from the same Oracle block. This minimizes redundant IO. The table does not require indexes. The Sqoop command line does not need to specify a --split-by column.
... View more
11-20-2016
11:11 PM
@Satish Bomma it would make an awesome follow up to this article, I am not sure whether Ranger authorization can be applied to tables driven by HBaseStorageHandlers, native Atlas/Ranger integration is coming in the next release.
... View more
11-20-2016
11:08 PM
@zhao xiaojing confirm the gateway properties set correctly and if you are using a proxy, in SmartSense 1.3 you can add proxy settings. If all else fails, open a case with support. Here's relevant page in our docs https://docs.hortonworks.com/HDPDocuments/SS1/SmartSense-1.3.0/bk_installation/content/portflow_smartsense_gateway_to_hortonworks.html Server at hadoopXX.XXX.com Port 443
... View more
11-19-2016
05:41 PM
1 Kudo
try to reinstall Ambari after you uninstall it, delete any ambari-server directories and especially remove the following directory /usr/lib/python2.6/site-packages/ambari_server
... View more
11-18-2016
08:38 PM
@Enis can you comment on this?
... View more
11-16-2016
12:39 AM
You can leverage hdfs quotas, here's my answer from another thread https://community.hortonworks.com/questions/11446/hdfs-quota-is-there-a-gui-to-control-hdfs-name-and.html
... View more
11-15-2016
07:59 PM
@Marcy I'm glad you were able to resolve the problem, please use the comments section for non-solution specific dialog and only use "answer" section for solutions and anything that does not fit in comments, usually that's 600 characters or more. It is hard to follow steps to resolution if you add too many answers.
... View more
11-14-2016
08:48 PM
okay, make sure you open port for the RStudio, find out what it is from their documentation and then go to your VM provider application (VirtualBox or VMWare) and open a port, usually in the port forward menu.
... View more