Created on 07-03-2018 09:17 AM - edited 08-18-2019 01:11 AM
How can I parsing html file in nifi?I need remove the first two <table> and take data from every <th> in <tr>.Which processors I need to use and how?Example of a file structure on pic.
Created 07-03-2018 11:39 AM
@Vladislav Shcherbakov
I think using InvokeScriptedProcessor may be a good option on this case. Please take a look here:
http://funnifi.blogspot.com/2016/02/invokescriptedprocessor-hello-world.html
Also there is a similar article that you could review here:
https://community.hortonworks.com/articles/193822/parsing-web-pages-for-images-with-apache-nifi.html
HTH
*** If you found this answer addressed your question, please take a moment to login and click the "accept" link on the answer.