01-17-2014 06:00 AM - edited 01-17-2014 07:18 AM
we are using Hadoop 2.0.0-cdh4.4.0 in our company. I am trying to use distributed cache feature. Getting the files from cache in mapper/reducer involves one of these methods:
Each of them returns Path or URI. Sometimes you need to store more then one files. The problem is that you need to be able to say which one is which. Example:
URI uris = context.getCacheFiles();
//uris - setA or setB?
Thank you in advance!
edit: moreover I have found out that method job.addCacheFiles which is only non deprecated for adding files to cache gives me NoSuchMethodException on server even though server cdh version and maven dependencies are of same version 2.0.0-cdh4.4.0. and maven builds it without error. I am going to read it directly from hdfs for now...