- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
specifying the number of reducers for Phoenix bulk load tool
- Labels:
-
Apache HBase
-
Apache Phoenix
Created ‎11-10-2016 10:11 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
IS there a way to specify the number of reducers for the phoenix CSV bulk load utility that uses the Mapreduce method to load data into hbase.
Created ‎11-10-2016 10:56 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
No. During the start of the MR job you may see message like:
mapreduce.MultiHfileOutputFormat: Configuring 20 reduce partitions to match current region count
That's exactly the number of reducers that will be created. How many of them will be running in parallel depends on the MR engine configuration.
Created ‎11-10-2016 10:21 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
MR job creates 1 reducer per region. So if you loading data to an empty table you may presplit table from HBase shell or use salting during table creation.
Created ‎11-10-2016 10:33 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
hi @ssoldatov, Thanks for your reply. So it wont honour the command line argument if i prvide like this
hadoop jar phoenix-4.8.1-HBase-1.1-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool -Dmapreduce.job.reduces=4 (not the full command)
though i specified 4 reducers, it just considered only.
Created ‎11-10-2016 10:56 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
No. During the start of the MR job you may see message like:
mapreduce.MultiHfileOutputFormat: Configuring 20 reduce partitions to match current region count
That's exactly the number of reducers that will be created. How many of them will be running in parallel depends on the MR engine configuration.
