Support Questions

Find answers, ask questions, and share your expertise
Announcements
Welcome to the upgraded Community! Read this blog to see What’s New!

Parsing html file in nifi

avatar
Explorer

How can I parsing html file in nifi?
I need remove the first two <table> and take data from every <th> in <tr>.
Which processors I need to use and how?
Example of a file structure on pic.

79381-parsing-html-file.jpg

1 REPLY 1

avatar

@Vladislav Shcherbakov

I think using InvokeScriptedProcessor may be a good option on this case. Please take a look here:

http://funnifi.blogspot.com/2016/02/invokescriptedprocessor-hello-world.html

Also there is a similar article that you could review here:

https://community.hortonworks.com/articles/193822/parsing-web-pages-for-images-with-apache-nifi.html

HTH

*** If you found this answer addressed your question, please take a moment to login and click the "accept" link on the answer.

Labels