Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Outer-Join two CSV

Outer-Join two CSV

New Contributor

book_1.csv

field_1, field_2, gender
aaa,bbb, 0
aaa,bbb, 1

book_2.csv

id, gender
0, Female
1, Male

new_book.csv

field_1, field_2 , gender
aaa,bbb, Female
aaa,bbb, Male


My fileflow:

GetFile -> SplitText ->Extract Text


I have two CSV files (book_1 and book_2) and I want to create a new csv ( new_book) that contains the gender in Male Female instead of 1 and 2.

But I have no idea on how to do it.

1 REPLY 1
Highlighted

Re: Outer-Join two CSV

Super Guru

@João Bernardo

You can use either QueryRecord processor with SQL kind of case statements

(or)

LookUp record processor with simplekeyvaluelookup service

(or)

ReplaceTextwithMapping processor


Please refer to this link for more details usage of these processors.

Don't have an account?
Coming from Hortonworks? Activate your account here