Member since
10-25-2016
14
Posts
0
Kudos Received
0
Solutions
10-01-2020
04:29 AM
HBase stores data as a map sorted by key. It is usually described as a persistent, multidimensional, sorted map in which each cell is indexed by a row key and a column key (family and qualifier). A row key is immutable and uniquely identifies a row, and the data for one row usually spans multiple HFiles. Row keys are treated as byte arrays (byte[]) and are kept in sorted byte order. When you look up a row key, HBase can identify the node (the region server hosting the region whose key range contains that key) where the data lives. Hadoop tries to run its computation on the same node where the data is stored, which is why performance with technologies like Spark is good. This is called data locality.
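To make the "sorted map by byte keys" idea concrete, here is a minimal sketch in plain Python. It is a toy model, not HBase code: the class name `SortedByteMap` and its methods are made up for illustration, and it ignores timestamps, versions, regions and HFiles. It only shows that keys are compared as raw bytes, so a range scan returns rows in byte order.

```python
import bisect


class SortedByteMap:
    """Toy model (hypothetical, not HBase code) of a map sorted by byte keys."""

    def __init__(self):
        self._keys = []    # sorted list of (row, column) tuples of bytes
        self._values = {}  # (row, column) -> cell value

    def put(self, row: bytes, column: bytes, value: bytes) -> None:
        key = (row, column)
        if key not in self._values:
            bisect.insort(self._keys, key)  # keep keys in byte order
        self._values[key] = value

    def scan(self, start_row: bytes, stop_row: bytes):
        """Yield cells with start_row <= row < stop_row, in sorted order."""
        i = bisect.bisect_left(self._keys, (start_row, b""))
        while i < len(self._keys) and self._keys[i][0] < stop_row:
            key = self._keys[i]
            yield key[0], key[1], self._values[key]
            i += 1


table = SortedByteMap()
table.put(b"row10", b"cf:a", b"1")
table.put(b"row2", b"cf:a", b"2")
table.put(b"row1", b"cf:b", b"3")

# Byte order, not numeric order: b"row10" sorts before b"row2".
rows = [row for row, _, _ in table.scan(b"row1", b"row2")]
print(rows)
```

Because comparison is byte-wise, numeric row keys are often zero-padded when numeric ordering matters.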
01-08-2018
01:58 AM
Hello, can you help me? I have a problem importing data into an HBase table. I've tried importtsv, but my file has a very large number of columns (1000). Do I have to write out all the columns, or is there a way to pick up the columns automatically from the file? Thank you.
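One possible workaround, offered as a sketch rather than a confirmed answer: importtsv takes its column mapping from -Dimporttsv.columns, but that string does not have to be typed by hand. The script below builds it from a header line. It assumes the file has a tab-separated header whose first field is the row key, and it uses a single column family named "cf"; the family name, the sample header and the function name `build_importtsv_columns` are all hypothetical.

```python
def build_importtsv_columns(header_line: str, family: str = "cf",
                            separator: str = "\t") -> str:
    """Build the value for -Dimporttsv.columns from a header line.

    Assumes the first field is the row key and all other fields go
    into one column family (the name "cf" is an assumption).
    """
    names = header_line.rstrip("\r\n").split(separator)
    if not names or not names[0]:
        raise ValueError("header line is empty")
    # importtsv uses the special name HBASE_ROW_KEY for the row key field.
    columns = ["HBASE_ROW_KEY"]
    columns += [f"{family}:{name.strip()}" for name in names[1:]]
    return ",".join(columns)


# Hypothetical three-column header; a real file would have ~1000 fields.
header = "id\tname\tcity\n"
spec = build_importtsv_columns(header)
print(f"-Dimporttsv.columns={spec}")
```

The printed option can then be passed to `hbase org.apache.hadoop.hbase.mapreduce.ImportTsv` together with the table name and input directory. The header line itself should be kept out of the data that gets imported.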
11-29-2016
11:57 PM
You might try adding the following to core-site.xml, since the error seems to be about the root group:
hadoop.proxyuser.root.groups=*
hadoop.proxyuser.root.hosts=*
If you are running Cloudera Manager, you need to add these in Cloudera Manager itself. The active config is not in the traditional location; it lives in a separate folder that Cloudera manages for each individual service. Confusingly, the config in the traditional location is for the "gateway".
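The two proxyuser settings are written here in key=value shorthand, while core-site.xml stores each setting as a property element with a name and a value. As a small sketch, this Python snippet renders both settings in that form; the helper name `proxyuser_properties` is made up for illustration, and the wildcard value is the one suggested in this post, not a general recommendation.

```python
import xml.etree.ElementTree as ET


def proxyuser_properties(user: str, value: str = "*") -> str:
    """Render the groups/hosts proxyuser settings as <property> elements."""
    fragments = []
    for suffix in ("groups", "hosts"):
        prop = ET.Element("property")
        ET.SubElement(prop, "name").text = f"hadoop.proxyuser.{user}.{suffix}"
        ET.SubElement(prop, "value").text = value
        # encoding="unicode" returns a str without an XML declaration.
        fragments.append(ET.tostring(prop, encoding="unicode"))
    return "\n".join(fragments)


snippet = proxyuser_properties("root")
print(snippet)
```

The printed elements would go inside the configuration element of core-site.xml, or into the matching safety-valve field when the cluster is managed by Cloudera Manager.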
10-25-2016
09:35 AM
Many of the health issues could be related to free space issues. If you fix the free space issues (you can just change the threshold that triggers the free space alert, or suppress the alerts) and wait a few minutes, most of your other issues may go away.