Reply
Highlighted
New Contributor
Posts: 3
Registered: ‎08-22-2018

How can I improve performance of inserting data to Hbase?

I try to insert data to the Hbase in the shortest possible time using both methods shown below. Unfortunately the efficiency is very low. Do you have any ideas how can I improve performance?

 

 

hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.separator=',' -Dimporttsv.columns="HBASE_ROW_KEY,message" messages hdfs://ip:9000/tmp/file.csv

sudo -u hdfs ./hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.separator=',' -Dimporttsv.bulk.output=hdfs://ip:9000/tmp/example_output -Dimporttsv.columns="HBASE_ROW_KEY,message" messages hdfs://ip:9000/tmp/file.csv
sudo -u hdfs ./hbase org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles hdfs://ip:9000/tmp/example_output logs
Announcements