Parsing an HTML file in NiFi


How can I parse an HTML file in NiFi?
I need to remove the first two <table> elements and take the data from every <th> in each <tr>.
Which processors do I need to use, and how?
An example of the file structure is shown in the picture.

79381-parsing-html-file.jpg

1 REPLY


@Vladislav Shcherbakov

I think using InvokeScriptedProcessor may be a good option in this case. Please take a look here:

http://funnifi.blogspot.com/2016/02/invokescriptedprocessor-hello-world.html

There is also a similar article that you could review here:

https://community.hortonworks.com/articles/193822/parsing-web-pages-for-images-with-apache-nifi.html
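
To give an idea of what the script itself might do, here is a rough sketch of the parsing logic only. It is not taken from either link above. It is plain Python 3 using only the standard library, and the names HeaderCellExtractor and extract_header_cells are made up for this example. It assumes that "the first two <table>" means the first two <table> start tags in document order and that every <th> and <tr> has an explicit closing tag. The script engine inside NiFi's scripted processors may use a different Python version, so treat this as something to adapt rather than paste in.

```python
from html.parser import HTMLParser


class HeaderCellExtractor(HTMLParser):
    """Collect <th> text per <tr>, ignoring the first `skip_tables` tables.

    Hypothetical helper for illustration; not part of NiFi.
    """

    def __init__(self, skip_tables=2):
        super().__init__()
        self.skip_tables = skip_tables
        self.tables_seen = 0    # number of <table> start tags seen so far
        self.table_stack = []   # one flag per open table: True if it is skipped
        self.rows = []          # result: one list of <th> texts per <tr>
        self.current_row = None
        self.current_cell = None

    def _skipping(self):
        # Ignore content outside any table or inside a skipped table.
        return not self.table_stack or any(self.table_stack)

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables_seen += 1
            self.table_stack.append(self.tables_seen <= self.skip_tables)
        elif self._skipping():
            return
        elif tag == "tr":
            self.current_row = []
        elif tag == "th" and self.current_row is not None:
            self.current_cell = []

    def handle_endtag(self, tag):
        if tag == "table":
            if self.table_stack:
                self.table_stack.pop()
        elif tag == "th" and self.current_cell is not None:
            self.current_row.append("".join(self.current_cell).strip())
            self.current_cell = None
        elif tag == "tr" and self.current_row is not None:
            if self.current_row:  # keep only rows that contained <th> cells
                self.rows.append(self.current_row)
            self.current_row = None

    def handle_data(self, data):
        # Text is collected only while inside a <th> of a kept table.
        if self.current_cell is not None:
            self.current_cell.append(data)


def extract_header_cells(html, skip_tables=2):
    """Return a list of rows, each a list of <th> texts."""
    parser = HeaderCellExtractor(skip_tables)
    parser.feed(html)
    parser.close()
    return parser.rows
```

Calling extract_header_cells(content) on the file content returns one list of strings per <tr> that contains <th> cells. How you wire this into a flow is up to you: one possibility is to run it as an external script that reads the FlowFile content from stdin (for example with ExecuteStreamCommand), another is to port the same logic into the scripted processor.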

HTH
