Support Questions

Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

File filter in Nifi UnpackContent

New Contributor

I have a tar file with .csv, .xml, and .txt files that I want to unpack using the UnpackContent processor.   Can I specify all three file types in the file filter?

1 ACCEPTED SOLUTION

Mentor

@Vickey 

The file filter property of the unpackContent processor takes a java regular expression and can be used when unpacking tar or zip file.

In your unpackContent processor, set the "Packaging format" to either "ZIP" or "TAR" based on what package format is used by your source file.
The set a java regular expression such as below to extract only files within that package where the filename ends with the .csv, .txt, or .xml extensions:

.*\.(txt|xml|csv)


Hope this helps,

Matt

 

View solution in original post

2 REPLIES 2

Mentor

@Vickey 

The file filter property of the unpackContent processor takes a java regular expression and can be used when unpacking tar or zip file.

In your unpackContent processor, set the "Packaging format" to either "ZIP" or "TAR" based on what package format is used by your source file.
The set a java regular expression such as below to extract only files within that package where the filename ends with the .csv, .txt, or .xml extensions:

.*\.(txt|xml|csv)


Hope this helps,

Matt

 

New Contributor
Thank you Matt! This is exactly what I needed.
Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.