Member since
07-17-2019
738
Posts
433
Kudos Received
111
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 3478 | 08-06-2019 07:09 PM |
| | 3678 | 07-19-2019 01:57 PM |
| | 5208 | 02-25-2019 04:47 PM |
| | 4674 | 10-11-2018 02:47 PM |
| | 1772 | 09-26-2018 02:49 PM |
04-16-2017
01:37 AM
1 Kudo
Use Scan.setBatch(int) to control the number of records fetched per RPC with the Java API. The API call you are making only wraps calls to ResultScanner.next(); it does not affect the underlying RPCs. You may also have to increase hbase.client.scanner.max.result.size, as this caps the number of records returned in a single RPC (default 2MB). The Thrift and REST servers do NOT cache results; please disregard the comment which asserts this.
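A minimal sketch of wiring these settings up in the Java client (table name and sizes are placeholder values; setCaching is shown as well, since it governs how many rows come back in each scanner RPC):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.client.Table;

public class ScannerTuningSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        // Raise the per-RPC result size cap if the 2MB default is too small.
        conf.setLong("hbase.client.scanner.max.result.size", 8L * 1024 * 1024);
        try (Connection conn = ConnectionFactory.createConnection(conf);
             Table table = conn.getTable(TableName.valueOf("my_table"))) { // placeholder table
            Scan scan = new Scan();
            scan.setCaching(500); // rows fetched per scanner RPC
            scan.setBatch(100);   // max cells returned per Result (useful for very wide rows)
            try (ResultScanner scanner = table.getScanner(scan)) {
                for (Result result : scanner) {
                    // Each Result here is one chunk of a row, bounded by the batch size.
                    System.out.println(result);
                }
            }
        }
    }
}
```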
04-16-2017
01:28 AM
Use `jstack` to identify why the init process is hanging. Most likely your accumulo-site.xml is not configured correctly, or ZooKeeper or HDFS is not running.
04-13-2017
03:58 PM
Have you performed a sanity check that the RegionServer which gave this error can connect to that IP+port? You can easily use telnet to perform this check (e.g. `telnet 172.16.3.196 16000`). If you get a connection refused error, either the HBase master is not running or there is a network issue preventing this node from talking to the Master.
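If telnet is not installed on that node, a minimal Java sketch performs the same connectivity check (host and port taken from the example above; adjust for your cluster):

```java
import java.net.InetSocketAddress;
import java.net.Socket;

public class MasterPortCheck {
    public static void main(String[] args) throws Exception {
        String host = "172.16.3.196"; // Master host from the error message
        int port = 16000;             // default HBase Master RPC port
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), 5000); // 5 second timeout
            System.out.println("Connected to " + host + ":" + port);
        }
    }
}
```

A ConnectException here tells you the same thing as telnet's "connection refused".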
04-11-2017
03:21 PM
1 Kudo
You can also use standard cron on Linux, e.g.:
echo "major_compact 'FOO'" | hbase shell -n
You could schedule the above to run on a specific node at your off-peak time. Be sure to monitor the output so that you can react to any failures.
04-11-2017
04:58 AM
3 Kudos
The data is base64 encoded because raw HBase data is arbitrary bytes and might otherwise produce invalid XML. Base64-decode the rowkey, column, and cell data in your client application.
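For example, a minimal sketch of decoding those fields on the client side (the encoded values here are made-up samples, not real REST output):

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

public class RestDecodeSketch {
    public static void main(String[] args) {
        String encodedRow = "cm93MQ==";    // base64 for "row1" (sample value)
        String encodedColumn = "Y2Y6cTE="; // base64 for "cf:q1" (sample value)
        String encodedValue = "dmFsdWUx";  // base64 for "value1" (sample value)

        System.out.println(decode(encodedRow));
        System.out.println(decode(encodedColumn));
        System.out.println(decode(encodedValue));
    }

    private static String decode(String b64) {
        byte[] raw = Base64.getDecoder().decode(b64);
        // The raw bytes may not be text at all; only interpret them as UTF-8
        // if you know the data you stored was textual.
        return new String(raw, StandardCharsets.UTF_8);
    }
}
```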
04-05-2017
04:27 PM
4 Kudos
The Phoenix Query Server is an HTTP server which expects very specific request data. Sometimes, in the process of connecting different clients, the various configuration options of both client and server can create confusion about what data is actually being sent over the wire. This confusion leads to questions like "did my configuration property take effect" and "is my client operating as I expect".
Linux systems often have a number of tools available for analyzing network traffic on a node. We can use one of these tools, ngrep, to analyze the traffic flowing into the Phoenix Query Server. From a host running the Phoenix Query Server, the following command will dump all traffic from any source to the Phoenix Query Server:
$ sudo ngrep -t -d any port 8765
The above command listens to any incoming network traffic on the current host and filters out any traffic which is not to port 8765 (the default port for the Phoenix Query Server). A specific network interface (e.g. eth0) can be provided instead of "any" to further filter traffic. When connecting a client to the server, you should be able to see the actual HTTP requests and responses sent between client and server:
T 2017/04/05 12:49:07.041213 127.0.0.1:60533 -> 127.0.0.1:8765 [AP]
POST / HTTP/1.1..Content-Length: 137..Content-Type: application/octet-stream..Host: localhost:8765..Connection: Keep-Alive..User-Agent: Apache-HttpClient/4.5.2 (Java/1.8.0_45)..Accept-Encoding: gzip,deflate.....?org.apache.calcite.avatica.proto.Requests$OpenConnectionRequest.F.$2ba8e796-1a29-4484-ac88-6075604152e6....password..none....user..none
##
T 2017/04/05 12:49:07.052011 127.0.0.1:8765 -> 127.0.0.1:60533 [AP]
HTTP/1.1 200 OK..Date: Wed, 05 Apr 2017 16:49:07 GMT..Content-Type: application/octet-stream;charset=utf-8..Content-Length: 91..Server: Jetty(9.2.z-SNAPSHOT).....Aorg.apache.calcite.avatica.proto.Responses$OpenConnectionResponse......hw10447.local:8765
##
The data above is in Protocol Buffers, which is not a fully human-readable format; however, "string" data is stored as-is, which makes reading it a reasonable task.
04-05-2017
02:35 PM
Want to add some logs? Or a thread dump, as the message instructs you to do? 🙂
04-04-2017
06:38 PM
You're probably running into leftover tombstones from a delete: https://hbase.apache.org/book.html#_delete Compact your table and then run the import.
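If you prefer the Java Admin API over the shell, a minimal sketch of triggering that compaction (the table name is a placeholder):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;

public class CompactBeforeImport {
    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        try (Connection conn = ConnectionFactory.createConnection(conf);
             Admin admin = conn.getAdmin()) {
            // Request a major compaction so delete markers (tombstones) are purged.
            // Note: this call is asynchronous; the compaction runs in the background.
            admin.majorCompact(TableName.valueOf("my_table"));
        }
    }
}
```

Wait for the compaction to finish (e.g. watch the RegionServer UI) before running the import.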
04-04-2017
05:46 PM
"What is the way to export to file system directory?"
You cannot do this. The MapReduce job can only write to HDFS, as it runs across many nodes. Use the HDFS CLI to copy the files to the local filesystem if you have this requirement.
"Will import work only if the table is empty of data?"
No, the import job does not require the table to be empty.
"Is it possible to append data with import ie table already has some data and we want to add more data with import."
This is essentially how the Import job works. The original timestamp on the exported data is preserved, so if your destination table has a key with a newer timestamp, you would not see the older data after import.
"Also is there any way to extract the table schemas including 'create
table' from hbase - or is 'describe <table>' the only way?"
Describe is the only way. However, you may want to consider using HBase Snapshots instead of these Export/Import MapReduce jobs. Snapshots implicitly hold onto the schema but, upon restore, would reset the table to the exact state at the time of the snapshot (as opposed to Import's "merge").
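If you go the snapshot route, a minimal sketch of the Java Admin API calls looks like this (table and snapshot names are placeholders):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;

public class SnapshotSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        try (Connection conn = ConnectionFactory.createConnection(conf);
             Admin admin = conn.getAdmin()) {
            // Take a snapshot; it captures both the data and the table schema.
            admin.snapshot("my_table_snapshot", TableName.valueOf("my_table"));
            // Clone the snapshot into a brand-new table without touching the original.
            admin.cloneSnapshot("my_table_snapshot", TableName.valueOf("my_table_copy"));
        }
    }
}
```

The same operations are available in the hbase shell via the snapshot, clone_snapshot, and restore_snapshot commands.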
04-04-2017
03:34 PM
Sounds like the output of a MapReduce job that you ran. HBase does not store anything outside of the value of "hbase.rootdir" in hbase-site.xml. On HDP, this defaults to "/apps/hbase/data". As long as you have not changed this configuration property, you can rest assured that HBase is not actively referring to files stored in "/tmp".
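To double-check, a minimal sketch that prints the effective value from the hbase-site.xml on your classpath:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class PrintRootDir {
    public static void main(String[] args) {
        // Loads hbase-default.xml and hbase-site.xml from the classpath.
        Configuration conf = HBaseConfiguration.create();
        // On HDP this typically resolves to a path ending in /apps/hbase/data on HDFS.
        System.out.println(conf.get("hbase.rootdir"));
    }
}
```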