
Oozie Workflow: How to handle the distributed cache using Oozie

Hi,

I have specified the path of the distributed cache file in my Oozie workflow using the file tag. The path is the file's location in HDFS:

<file>/user/hadoopuser/distributed_cache_file.txt#dimension_cache_file.txt</file>

 

I use the following code to access the cache files in my mapper:

URI[] CACHE_FILES = context.getCacheFiles();

 

However, instead of reading the cache file, the mapper reads the application jar file. That jar is in the lib directory under the application directory in HDFS, the same application directory that holds workflow.xml.
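
One thing that might explain this: the array returned by context.getCacheFiles() may contain the jars from the lib directory as well as the file from the file tag, so taking an entry by position could pick up the jar. Below is a minimal sketch of selecting the entry by the symlink name after the # instead. It assumes the jar is simply another entry in the same array; CacheFileLocator and findByName are hypothetical names for illustration, not part of Hadoop or Oozie.

```java
import java.net.URI;

// Hypothetical helper (not a Hadoop or Oozie class): picks one entry out of
// the URI[] returned by context.getCacheFiles() by name instead of by index.
final class CacheFileLocator {

    private CacheFileLocator() {
    }

    // Returns the first URI whose fragment (the text after '#') equals name.
    // If no fragment matches, falls back to the last segment of the path.
    // Returns null when nothing matches or the array is null.
    static URI findByName(URI[] cacheFiles, String name) {
        if (cacheFiles == null || name == null) {
            return null;
        }
        // First pass: match on the symlink name given after '#'.
        for (URI uri : cacheFiles) {
            if (name.equals(uri.getFragment())) {
                return uri;
            }
        }
        // Second pass: match on the file name at the end of the path.
        for (URI uri : cacheFiles) {
            String path = uri.getPath();
            if (path != null) {
                String fileName = path.substring(path.lastIndexOf('/') + 1);
                if (name.equals(fileName)) {
                    return uri;
                }
            }
        }
        return null;
    }
}
```

In the mapper this would be called as CacheFileLocator.findByName(context.getCacheFiles(), "dimension_cache_file.txt"), with a null check on the result before opening the file.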


Do I need to place the distributed cache file in the application directory along with the workflow.xml file? Could you please help me resolve this issue?

 

Thanks