Member since: 06-17-2015
Posts: 61
Kudos Received: 20
Solutions: 4
My Accepted Solutions
Title | Views | Posted
---|---|---
| 1919 | 01-21-2017 06:18 PM
| 2312 | 08-19-2016 06:24 AM
| 1693 | 06-09-2016 03:23 AM
| 2769 | 05-27-2016 08:27 AM
09-19-2016
10:41 AM
1 Kudo
Please see the two options below. NOTE: for both options (CopyTable and Export/Import), since the cluster is up, there is a risk that edits could be missed during the copy.

http://hbase.apache.org/0.94/book/ops_mgt.html#copytable

CopyTable is a utility that can copy part of or all of a table, either to the same cluster or to another cluster. The usage is as follows:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable [--starttime=X] [--endtime=Y] [--new.name=NEW] [--peer.adr=ADR] tablename

http://hbase.apache.org/0.94/book/ops_mgt.html#export

Export is a utility that will dump the contents of a table to HDFS as a sequence file. Invoke via:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.Export <tablename> <outputdir> [<versions> [<starttime> [<endtime>]]]

Note: caching for the input Scan is configured via hbase.client.scanner.caching in the job configuration.

Import is a utility that will load data that has been exported back into HBase. Invoke via:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.Import <tablename> <inputdir>
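For illustration, a hypothetical end-to-end run: the table name usertable, the remote quorum zk1.example.com:2181:/hbase, and the HDFS path /backups/usertable below are made-up placeholders, not values from this thread.

$ # Copy a time-bounded slice of the table to a peer cluster
$ bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable \
    --starttime=1471000000000 --endtime=1471600000000 \
    --peer.adr=zk1.example.com:2181:/hbase usertable

$ # Alternatively, dump the table to HDFS, move the files, then re-import on the target
$ bin/hbase org.apache.hadoop.hbase.mapreduce.Export usertable /backups/usertable
$ bin/hbase org.apache.hadoop.hbase.mapreduce.Import usertable /backups/usertable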
08-24-2016
09:20 AM
@da li Hey, please have a look at the link below; if it helps, please accept the answer: https://community.hortonworks.com/questions/153/impersonation-error-while-trying-to-access-ambari.html

You need to create the proxy settings for 'root', since Ambari runs as root. This allows it to impersonate the user in HDFS. You need to do a similar thing for the oozie user, like it is done for root:

hadoop.proxyuser.root.groups=*
hadoop.proxyuser.root.hosts=*
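As a sketch, the corresponding oozie entries would presumably look like the following, set in core-site.xml (via Ambari's HDFS configs); the wildcard values are just the most permissive example:

# core-site.xml: allow the oozie user to impersonate other users
hadoop.proxyuser.oozie.groups=*
hadoop.proxyuser.oozie.hosts=*

A restart of HDFS (and dependent services) is typically needed for proxyuser changes to take effect.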
08-22-2016
04:16 PM
@Scott Shaw Thanks, but please see the questions below:
1. Can I get the same performance as in my optimized, purpose-built HDP cluster? Since the data lake is central, can I tune it specifically for one application?
2. How can I manage different HDP versions in a data lake?
3. If something goes wrong with security or configuration because of one application, will my whole data lake be impacted?
08-22-2016
03:54 PM
1 Kudo
Hi, I have a small application that generates some reports without using any MapReduce code, and I want to understand the real benefits of using a data lake. I think it is useful for an enterprise when many products write data to various Hadoop clusters, in order to have a unified view of the various issues and a common data store. Apart from this, what are the other real benefits?

How does a data lake work if I want a particular HDP version? I think it is easier to switch to a particular HDP version in a separate cluster from Ambari, but what about a data lake?

Also, if multiple applications use the data lake and just one application requires frequent changes, like an HBase coprocessor for testing various things, is it advisable to go for a data lake?

We get HA in a cluster as well, so what are the main technical advantages if we don't consider cost?
Labels:
- Apache Hadoop
- Apache HBase
08-19-2016
11:02 AM
1 Kudo
Hi Team, is anyone aware of issues during installation, i.e., why do we get so many broken symlink issues during installation? I faced this issue with HDP 2.3.4 and ambari-2.2.2.0. Please see: https://community.hortonworks.com/questions/33492/hdp-234-failed-parent-directory-usrhdpcurrenthadoo.html

I was installing a 3-node HDP 2.4.0.0 cluster, where at the "Install, Start and Test" step the installation went fine on one node, but on the other 2 nodes there were random symlink issues. I had to fix the broken symlinks manually most of the time, and finally, after spending so much time, I was able to successfully install HDP 2.4.0.0. The issues were like the ones below and as shown in the image:

2016-08-18 21:20:17,474 - Directory['/etc/hive'] {'mode': 0755}
2016-08-18 21:20:17,474 - Directory['/usr/hdp/current/hive-client/conf'] {'owner': 'hive', 'group': 'hadoop', 'recursive': True}
2016-08-18 21:20:17,474 - Creating directory Directory['/usr/hdp/current/hive-client/conf'] since it doesn't exist

I had the proper prerequisites available before starting the installation, as given in http://docs.hortonworks.com/HDPDocuments/Ambari-2.2.2.0/bk_Installing_HDP_AMB/content/_hdp_24_repositories.html. Also, randomly doing retries works 😞

Please advise if you think I am doing something wrong, or share any good best practices for installation and debugging. Thanks,
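For reference, a sketch of the kind of manual checks involved (hdp-select is the tool that manages these symlinks; the version string 2.4.0.0-169 is just an example build, substitute the one actually installed):

# Show which build each component symlink under /usr/hdp/current points to
hdp-select status

# Find dangling symlinks left behind under /usr/hdp/current
find /usr/hdp/current -xtype l

# Repoint one component (e.g. the Hive client) at the installed build
hdp-select set hive-client 2.4.0.0-169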
Labels:
- Apache Ambari
- Apache Hadoop
- Apache Hive
08-19-2016
06:24 AM
2 Kudos
@Ted Yu @emaxwell @Josh Elser Thanks all for your confirmation; that's why I asked if the RPM is relocatable 🙂 So the bottom line is that the Hortonworks installation directories cannot be changed: all binary and config files of HDP go into /usr and /etc, since the paths are hardcoded in the RPM and the RPM is not relocatable. I will close this thread.

But I believe it should support relocatability from a corporate IT policy point of view, where we often have issues putting files in /usr and /etc. I also suggest that at RPM-creation time Hortonworks make the RPMs relocatable, in order to allow installing binary and config files in directories other than /usr and /etc. I understand HDP consists of other software, but ultimately Hortonworks can customize this bundle to support user-specific needs. I should open this as an idea; WDYT?
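As a side note, relocatability can be checked directly with rpm (a sketch; output formatting varies by rpm version):

# '(not relocatable)' in the Relocations field means install paths are fixed at build time
rpm -qi hdp-select | grep -i relocations

# A relocatable package could instead be installed under /opt with:
# rpm -ivh --prefix=/opt <package>.rpm   (this fails for non-relocatable RPMs)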
08-19-2016
06:23 AM
@Ted Yu @emaxwell @Josh Elser Thanks all for your confirmation; that's why I asked if the RPM is relocatable 🙂 I will close this thread. But I believe it should support this from a corporate IT policy point of view, where we often have issues putting files in /usr and /etc. I should open this as an idea; WDYT?
08-18-2016
07:19 PM
1 Kudo
Hi Team, I see that HDP stores its lib files and packages in /usr/hdp and maintains different versions there. Can we control the HDP installation packages or RPMs and make the installation relocatable to other directories like /opt? If my IT team does not permit installation inside /usr, then what should I do?

# ls /usr/hdp/
2.4.0.0-169  2.4.2.0-258  current

Please advise.

rpm -ql hdp-select-2.4.2.0-258.el6.noarch
/usr/bin/conf-select
/usr/bin/hdp-select
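For context, the 'current' directory is a set of symlinks into the versioned directories, which is what hdp-select manages; roughly like this (illustrative output, the component list varies by install):

# ls -l /usr/hdp/current
hadoop-client -> /usr/hdp/2.4.2.0-258/hadoop
hbase-client  -> /usr/hdp/2.4.2.0-258/hbase
...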
Labels:
- Apache Hadoop
- Apache HBase
08-18-2016
06:35 AM
2 Kudos
Hi Team, we see that the logs of various Hadoop services are stored in /var/log. Can we change this to a customized location if we don't want to store logs in the locations below?

/var/log/ambari-agent/
/var/log/ambari-metrics-monitor/
/var/log/ambari-server/
/var/log/hbase
/var/log/zookeeper

Also, I see that changing the log location in Ambari is disabled?
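For what it's worth, the daemon-side log directories usually come from environment variables in each service's *-env template; a sketch under the assumption of a custom /data/logs base path (paths are illustrative, and whether Ambari exposes these fields as editable depends on the Ambari version):

# hbase-env.sh — HBase daemons write their logs here
export HBASE_LOG_DIR=/data/logs/hbase

# hadoop-env.sh — HDFS daemon logs
export HADOOP_LOG_DIR=/data/logs/hadoop

# zookeeper-env.sh — ZooKeeper server logs
export ZOO_LOG_DIR=/data/logs/zookeeper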
Labels:
- Apache Hadoop
- Apache HBase
08-03-2016
06:26 AM
1 Kudo
Hi Team, my Hadoop NameNode servers are without HBA, but they do use RAID 10. Do I also need an NFS mount point to store the NameNode metadata (fsimage, edits, etc.), given that I have an active NameNode in the cluster? Also, if my hardware has no HBA storage but has RAID 10, can I connect to an NFS mount from such hardware? Basically, what are the recommendations for NameNode HA?
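For reference, the classic shared-storage variant of this uses multiple entries in dfs.namenode.name.dir, so the fsimage and edits are written to every listed directory; a sketch with made-up paths, where /mnt/nn-nfs is assumed to be an NFS mount (hdfs-site.xml, shown here in property form):

# fsimage/edits get written to each directory in the comma-separated list
dfs.namenode.name.dir=/hadoop/hdfs/namenode,/mnt/nn-nfs/namenode

With QJM-based NameNode HA (active plus standby NameNode and a JournalNode quorum), a shared NFS edits directory is generally not needed.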
Labels:
- Apache Hadoop
- Apache HBase