What is the best consolidated guide for HBase tuning?

In your opinion, what is the best consolidated resource for performance tuning an HBase instance?

Are the HBase guides the best reference we have for performance tuning or are there others which are more comprehensive?

1 ACCEPTED SOLUTION

Hey Wes,

Here are some of my notes on HBase tuning.

  • Compression (Gzip, Snappy, ...)
  • How is data retrieved?
    • Random lookups => a smaller block size may make more sense
    • Sequential scans => a larger block size may make more sense
  • Check whether you have hot spots in your HBase environment
  • Metrics to check
    • BlockCache hit ratio and hit/miss counts
    • Compaction queue
    • MemStore size
    • Flush queue
    • Call queue
    • CPU load and I/O wait
    • Memory usage
    • Latency
    • IOPS
    • JVM metrics (GC, logs, ...)
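
Two of the items above can be made concrete with a small sketch. This is only an illustration in plain Python, not HBase client code: the function names are hypothetical, the hit and miss counts are assumed to come from whatever BlockCache counters your monitoring exposes, and key salting is just one common way to reduce hot spots caused by sequential row keys.

```python
import hashlib


def block_cache_hit_ratio(hits: int, misses: int) -> float:
    """Fraction of block reads served from the BlockCache.

    `hits` and `misses` are assumed to be counters read from your
    monitoring system over the same time window.
    """
    total = hits + misses
    return hits / total if total else 0.0


def salted_row_key(row_key: str, buckets: int = 16) -> str:
    """Prefix a row key with a stable bucket number.

    The bucket is derived from a hash of the key, so the same key always
    gets the same prefix, while sequential keys (timestamps, counters)
    are spread over `buckets` key ranges instead of hitting one region.
    """
    digest = hashlib.md5(row_key.encode("utf-8")).digest()
    bucket = digest[0] % buckets
    return f"{bucket:02d}-{row_key}"


if __name__ == "__main__":
    print(block_cache_hit_ratio(hits=900, misses=100))
    for key in ("20240101-0001", "20240101-0002", "20240101-0003"):
        print(salted_row_key(key))
```

The trade-off with salting is on the read side: a range scan over the original keys has to be issued once per bucket and merged, so it tends to fit random-lookup workloads better than scan-heavy ones.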

Links:

Not a consolidated guide, but it might help anyway 🙂

Let me know if you need more information about metrics or specific tuning methods (RAM, block size, etc.). I might have some more documents on my drive.

Jonas

2 REPLIES

Hey Wes,

These are a good start. The HBase SME wiki also has a couple of links on HBase tuning and staffing. This link has more information as well: http://fr.slideshare.net/lhofhansl/h-base-tuninghbasecon2015ok

Then there are element-specific considerations, such as cache algorithms, number of versions, and number of HFiles, that vary depending on your usage.
