Created 01-20-2016 08:41 AM
Hi,
I created UDF that reads local file, which is passed into UDF as a parameter. Im running everything in local mode.
cd pig-test; touch input_file; pig -x local xx = my.udf('input_file')
When i run pig in mapreduce mode,, who/how will copy that input_file to appropriate nodes?
Thank you!
Created 01-20-2016 01:20 PM
You will be copying the file in hdfs and read the file from hdfs. Please see this tutorial
Created 01-20-2016 01:45 PM
@John Smith you will need to specify the full path to hdfs i.e. hdfs://namenode:8020/user/username as your hdfs path. For actual copy commands refer to the docs.
Created 01-21-2016 11:57 PM
@John Smith do you have enough information? Can you choose one of the answers to close out the thread?
Created 01-25-2018 11:26 AM
You can check out following tutorials
Created 06-21-2018 08:00 AM
<a href ="http://www.bisptrainings.com">Go through the tutorial</a>