Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

How can I improve performance of inserting data to Hbase?

How can I improve performance of inserting data to Hbase?

New Contributor

I try to insert data to the Hbase in the shortest possible time using both methods shown below. Unfortunately the efficiency is very low. Do you have any ideas how can I improve performance?

 

 

hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.separator=',' -Dimporttsv.columns="HBASE_ROW_KEY,message" messages hdfs://ip:9000/tmp/file.csv

sudo -u hdfs ./hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.separator=',' -Dimporttsv.bulk.output=hdfs://ip:9000/tmp/example_output -Dimporttsv.columns="HBASE_ROW_KEY,message" messages hdfs://ip:9000/tmp/file.csv
sudo -u hdfs ./hbase org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles hdfs://ip:9000/tmp/example_output logs
Don't have an account?
Coming from Hortonworks? Activate your account here