Member since: 06-22-2017
Posts: 10
Kudos Received: 0
Solutions: 0
02-27-2018
10:03 AM
import org.apache.hadoop.fs.Path
import org.apache.hadoop.hbase.HBaseConfiguration
import org.apache.hadoop.hbase.client.Scan
import org.apache.hadoop.hbase.filter.{FilterList, FirstKeyOnlyFilter, KeyOnlyFilter}
import org.apache.hadoop.hbase.mapreduce.{TableInputFormat, TableMapReduceUtil}

// named "scan" so it does not shadow the shell's SparkContext (sc)
// KeyOnlyFilter + FirstKeyOnlyFilter strip values and extra columns, keeping the count cheap
val scan = new Scan()
scan.setFilter(new FilterList(new KeyOnlyFilter(), new FirstKeyOnlyFilter()))

val conf = HBaseConfiguration.create()
conf.addResource(new Path("/usr/hdp/current/hbase-client/conf/hbase-site.xml"))
conf.set("hbase.zookeeper.quorum", "xxx")
conf.set(TableInputFormat.INPUT_TABLE, "TEST_TABLE")
conf.set(TableInputFormat.SCAN, TableMapReduceUtil.convertScanToString(scan))

val hBaseRDD = sc.newAPIHadoopRDD(conf,
  classOf[org.apache.hadoop.hbase.mapreduce.TableInputFormat],
  classOf[org.apache.hadoop.hbase.io.ImmutableBytesWritable],
  classOf[org.apache.hadoop.hbase.client.Result])
hBaseRDD.count()
02-23-2018
01:42 PM
I am using the Phoenix Spark connector to load data into a DataFrame, but it does not return any rows. I checked by using saveAsTextFile, but it only creates the output folder and empty part files.
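Roughly what I am doing (a minimal sketch, not my exact code; the table name and zkUrl are placeholders):

import org.apache.spark.sql.SQLContext

val sqlContext = new SQLContext(sc)
// load a Phoenix table through the phoenix-spark DataFrame source
val df = sqlContext.read
  .format("org.apache.phoenix.spark")
  .option("table", "TEST_TABLE")                  // placeholder Phoenix table
  .option("zkUrl", "zk-host:2181:/hbase-secure")  // placeholder ZooKeeper URL
  .load()
println(df.count())   // this comes back 0, and saveAsTextFile likewise writes only empty part files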
02-16-2018
04:41 AM
We have to read a large dataset from an HBase table and then deduplicate it against a small CSV file. It seems we are not using the optimal method to read the table. Can anyone help?
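For reference, the kind of approach we have in mind is the following (a rough sketch, not our exact code; the table name, quorum, CSV path, and the choice of the row key / first CSV column as the dedup key are placeholders or assumptions):

import org.apache.hadoop.hbase.HBaseConfiguration
import org.apache.hadoop.hbase.client.Result
import org.apache.hadoop.hbase.io.ImmutableBytesWritable
import org.apache.hadoop.hbase.mapreduce.TableInputFormat
import org.apache.hadoop.hbase.util.Bytes

// full scan of the HBase table, keeping only the row keys
val conf = HBaseConfiguration.create()
conf.set("hbase.zookeeper.quorum", "zk-host")         // placeholder quorum
conf.set(TableInputFormat.INPUT_TABLE, "BIG_TABLE")   // placeholder table
val hbaseKeys = sc.newAPIHadoopRDD(conf,
    classOf[TableInputFormat],
    classOf[ImmutableBytesWritable],
    classOf[Result])
  .map { case (key, _) => Bytes.toString(key.copyBytes()) }

// the CSV is small, so collect its keys and broadcast them, then filter the big side
val csvKeys = sc.broadcast(
  sc.textFile("/tmp/small.csv").map(_.split(",")(0)).collect().toSet)
val deduped = hbaseKeys.filter(k => !csvKeys.value.contains(k))
println(deduped.count())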
02-06-2018
06:31 AM
I am using spark-shell in yarn-client mode. I wanted to stop my current SparkContext and deploy a new one. I then created a new RDD and ran a reduce job, and got the error below (the exact steps are sketched after the stack trace):
Caused by: java.lang.ClassNotFoundException: $iwC$iwC$iwC$iwC$iwC$iwC$iwC$iwC$anonfun$1
at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:348)
at org.apache.spark.serializer.JavaDeserializationStream$anon$1.resolveClass(JavaSerializer.scala:68)
at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1858)
at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1744)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:2032)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1566)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2277)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:2201)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:2059)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1566)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2277)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:2201)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:2059)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1566)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2277)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:2201)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:2059)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1566)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:426)
at org.apache.spark.serializer.JavaDeserializationStream.readObject(JavaSerializer.scala:76)
at org.apache.spark.serializer.JavaSerializerInstance.deserialize(JavaSerializer.scala:115)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61)
at org.apache.spark.scheduler.Task.run(Task.scala:89)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:227)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
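The steps I ran were roughly the following (a minimal sketch; the app name is a placeholder and the exact settings differ):

import org.apache.spark.{SparkConf, SparkContext}

// stop the context the shell started, then build a fresh one
sc.stop()
val conf = new SparkConf().setAppName("shell-restart")   // placeholder app name
val sc2 = new SparkContext(conf)

// a new RDD plus a reduce job; the reduce's anonymous function is the
// $anonfun$1 that the executors fail to load in the trace above
val rdd = sc2.parallelize(1 to 100)
println(rdd.reduce(_ + _))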
10-13-2017
12:03 PM
OK, so suppose I submit a jar with spark-submit and that jar contains the code that connects to HBase; how do I pass this in that case?
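To make the question concrete: the code inside the jar connects to HBase roughly like this (a minimal sketch with placeholder names), and the open part is where the Kerberos credentials come in when it runs through spark-submit:

import org.apache.hadoop.hbase.{HBaseConfiguration, TableName}
import org.apache.hadoop.hbase.client.ConnectionFactory

val hconf = HBaseConfiguration.create()
hconf.set("hbase.zookeeper.quorum", "zk-host")   // placeholder quorum
val connection = ConnectionFactory.createConnection(hconf)
val table = connection.getTable(TableName.valueOf("TEST_TABLE"))   // placeholder table
// question: how do the principal and keytab get passed to this when it is launched via spark-submit?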
10-13-2017
11:48 AM
It is saying permission denied:
klist -kte /etc/security/keytabs/hbase.headless.keytab
Keytab name: FILE:/etc/security/keytabs/hbase.headless.keytab
klist: Permission denied while starting keytab scan
10-13-2017
11:36 AM
What is that <princ>? Sorry, I am new to this.
10-13-2017
11:36 AM
What is this file, and what is that <princ> tag?
10-13-2017
11:23 AM
I am trying to schedule a script: a folder is uploaded to HDFS and then downloaded to the machine on which Oozie schedules the job. From there the script is run, e.g. sh thatdownloadedscript.sh. In that script I connect to the HBase shell and just list the tables, but it gives me the following error:
HBase Shell; enter 'help<RETURN>' for list of supported commands.
Type "exit<RETURN>" to leave the HBase Shell
Version 1.1.2.2.5.0.0-1245, r53538b8ab6749cbb6fdc0fe448b89aa82495fb3f, Fri Aug 26 01:32:27 UTC 2016
count 'DRM_Email1'
ERROR: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)
Here is some help for this command:
Count the number of rows in a table. Return value is the number of rows.
This operation may take a LONG time (Run '$HADOOP_HOME/bin/hadoop jar
hbase.jar rowcount' to run a counting mapreduce job). Current count is shown
every 1000 rows by default. Count interval may be optionally specified. Scan
caching is enabled on count scans by default. Default cache size is 10 rows.
If your rows are small in size, you may want to increase this
parameter. Examples:
06-22-2017
12:17 PM
Hi all,
I want to bulk-load data into HBase, and I am doing this with MapReduce.
I use HFileOutputFormat2.configureIncrementalLoad for this, and instead of the normal input format I created my own custom input format by extending CombineFileInputFormat.
When I build a fat jar and try to run it, I get a ClassNotFoundException for this custom class; maybe the HBase API is not able to find it.
How can I fix it? Please help. (A rough sketch of my driver setup follows the stack trace.)
java.lang.RuntimeException: java.lang.ClassNotFoundException: Class com.rupesh.practice.main.WholeFileInputFormat not found
at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2208)
at org.apache.hadoop.mapreduce.task.JobContextImpl.getInputFormatClass(JobContextImpl.java:174)
at org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil.addDependencyJars(TableMapReduceUtil.java:779)
at org.apache.hadoop.hbase.mapreduce.HFileOutputFormat2.configureIncrementalLoad(HFileOutputFormat2.java:452)
at org.apache.hadoop.hbase.mapreduce.HFileOutputFormat2.configureIncrementalLoad(HFileOutputFormat2.java:410)
at org.apache.hadoop.hbase.mapreduce.HFileOutputFormat2.configureIncrementalLoad(HFileOutputFormat2.java:391)
at com.abnamro.drm.hortonworks.metadataloader.BulkLoader.runforMapperExtended(BulkLoader.java:181)
at com.abnamro.drm.hortonworks.metadataloader.BulkLoader.runForMapper(BulkLoader.java:152)
at com.abnamro.drm.hortonworks.metadataloader.BulkLoader.run(BulkLoader.java:94)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76)
at com.abnamro.drm.hortonworks.metadataloader.BulkLoader.loadMetadata(BulkLoader.java:72)
at com.abnamro.drm.hortonworks.metadataloader.BulkLoadMain.main(BulkLoadMain.java:26)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.apache.hadoop.util.RunJar.run(RunJar.java:233)
at org.apache.hadoop.util.RunJar.main(RunJar.java:148)
Caused by: java.lang.ClassNotFoundException: Class com.rupesh.practice.main.WholeFileInputFormat not found
at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:2114)
at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2206)
... 17 more
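For reference, my driver setup is roughly the following (sketched in Scala for brevity here; my real code is Java, and everything except WholeFileInputFormat is a placeholder):

import org.apache.hadoop.fs.Path
import org.apache.hadoop.hbase.{HBaseConfiguration, TableName}
import org.apache.hadoop.hbase.client.{ConnectionFactory, Put}
import org.apache.hadoop.hbase.io.ImmutableBytesWritable
import org.apache.hadoop.hbase.mapreduce.HFileOutputFormat2
import org.apache.hadoop.mapreduce.Job
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat
import com.rupesh.practice.main.WholeFileInputFormat

val conf = HBaseConfiguration.create()
val job = Job.getInstance(conf, "hbase-bulk-load")
job.setJarByClass(classOf[WholeFileInputFormat])         // tells Hadoop which jar to ship with the job
job.setInputFormatClass(classOf[WholeFileInputFormat])   // the custom CombineFileInputFormat subclass
// job.setMapperClass(...)                               // my mapper, emitting (ImmutableBytesWritable, Put)
job.setMapOutputKeyClass(classOf[ImmutableBytesWritable])
job.setMapOutputValueClass(classOf[Put])

val connection = ConnectionFactory.createConnection(conf)
val tableName = TableName.valueOf("TEST_TABLE")          // placeholder table name
// this is the call that ends in the ClassNotFoundException above
HFileOutputFormat2.configureIncrementalLoad(job,
  connection.getTable(tableName),
  connection.getRegionLocator(tableName))

FileOutputFormat.setOutputPath(job, new Path("/tmp/hfiles"))   // placeholder HFile output dir
job.waitForCompletion(true)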