@petri koski,
UDFs (for either Hive or Pig) run during the map-reduce stage, regardless of whether the execution engine is M/R or Tez. In other words, your println is executed as part of the distributed computation. The code that prints your output does not run in the shell you launched the job from (unless you are running in local mode).
How can you see your printed lines? There are a couple of ways:
- Use the job tracker UI: find your job and click on logs. Go through the containers one by one until you find the output (it will be in every one of them if your code runs for each record of the processed data).
- Use yarn to get the aggregated logs:
yarn logs -applicationId <applicationID>
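
Whichever way you choose, the printed lines are easier to find if they carry a distinctive marker. Below is a minimal sketch of that idea in plain Java. The class `UdfDebug`, its methods, and the marker string `UDF_DEBUG` are all hypothetical names made up for illustration and are not part of Hive or Pig. It assumes your UDF can call a small static helper, and that stderr of the task JVM ends up in the container logs.

```java
// Hypothetical helper, not part of Hive or Pig: tags debug lines so they
// are easy to spot among everything else in the container logs.
final class UdfDebug {
    // Assumed marker; pick any string unlikely to appear in normal log output.
    static final String MARKER = "UDF_DEBUG";

    private UdfDebug() {}

    // Builds the tagged line. Kept separate from printing so it can be checked.
    static String tag(String message) {
        return MARKER + " " + message;
    }

    // Writes the tagged line to stderr of whichever JVM is running the UDF.
    static void log(String message) {
        System.err.println(tag(message));
    }

    // Stand-alone demo; inside a real UDF you would call log(...) from the
    // method that processes each record instead.
    public static void main(String[] args) {
        log("value=" + 42);
    }
}
```

With a marker like this you can search the container logs in the UI for it, or filter the aggregated logs, for example: yarn logs -applicationId <applicationID> | grep UDF_DEBUG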