- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
How to share data between different oozie actions?
- Labels:
-
Apache Oozie
-
Cloudera Hue
-
MapReduce
Created on 07-16-2015 06:34 AM - edited 09-16-2022 02:34 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
I have a requirement where I need to share data between multiple mapreduce actions in a single workflow and between different workflows within single coordinator.
I am creating and running my workflows using Hue.
I have read about capture-output element, but this element is not available under mapreduce action. Even if i use Java action to run my mapreduce programs, The capture-element has size constraint of 2KB by default. Could you please let me know how much I can increase the size?
Thanks
Created 07-16-2015 06:38 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The capture-output size is limited cause we store it inside the Oozie DB before we transfer it over to the next action instance. The max size is therefore limited by the size RDBMS supports for CHAR/VARCHAR columns, 64k for MySQL for example.
Created 07-16-2015 06:38 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The capture-output size is limited cause we store it inside the Oozie DB before we transfer it over to the next action instance. The max size is therefore limited by the size RDBMS supports for CHAR/VARCHAR columns, 64k for MySQL for example.
