Member since
11-02-2016
5
Posts
1
Kudos Received
0
Solutions
12-09-2016
03:25 PM
1 Kudo
You cannot do this directly from the export command and must do some separate processing. I feel the best way to do this is to run this pig script on export result raw = load 'data.csv' using PigStorage(',');
nonull = foreach raw generate
REPLACE($0, '\\\\N', ''),
REPLACE($1, '\\\\N', ''),
REPLACE($2, '\\\\N', ''),
REPLACE($3, '\\\\N', '');
store nonull into 'nonull/data.csv' using PigStorage(','); Keep in mind this will result in output in the m-r format in hdfs data.csv
data.csv/_SUCCESS
data.csv/part-m-00000
data.csv/part-m-00001
... If you want to process this file in hadoop, just point to data.csv If you want to pull this to edge node with command line use hdfs dfs -getmerge <localpath> nonull/data.csv If you want to download it using Ambari Files View, just double click on nonull/data.csv the click Select All then Concatenate and it will download as a single file
... View more
11-25-2016
11:09 AM
Thanks Bryan. Works like a charm
... View more