I have a Python script that generates a file with squid content (for example, 50,000,000 entries and 5.9 GB).
I want to transfer it to HDFS while my code is running, so I use TailFile, MergeContent (Maximum Number of Entries = 10000) and PutHDFS (block.size = 64 MB).
Heap size (in bootstrap.conf) = 4096 MB.
TailFile only starts picking up the content of my file after my code has finished, but I want it to start as soon as my code starts.
I am using a single NiFi instance.
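One detail that may be relevant is output buffering on the writer side. Below is a minimal sketch of a writer that flushes as it goes, so that a tailing reader could see data before the file is closed. This is not my real script: the names `write_log` and `fake_squid_lines`, the `flush_every` parameter and the line format are hypothetical, and it is only an assumption that buffering is related to the delay.

```python
def fake_squid_lines(n):
    """Yield n placeholder log lines (hypothetical format, not real squid output)."""
    for i in range(n):
        yield f"{i} 0 10.0.0.1 TCP_MISS/200 0 GET http://example.com/{i} - DIRECT/- -"


def write_log(path, lines, flush_every=1000):
    """Append lines to path, flushing Python's buffer every flush_every lines.

    Returns the number of lines written.
    """
    count = 0
    with open(path, "a", encoding="utf-8") as f:
        for line in lines:
            f.write(line + "\n")
            count += 1
            if count % flush_every == 0:
                # Hand buffered data to the OS so other processes can read it
                # while this script is still running.
                f.flush()
    return count
```

For example, `write_log("access.log", fake_squid_lines(50_000_000))` would write the file while making new lines visible roughly every 1000 lines. A smaller `flush_every` makes data visible sooner at the cost of more write calls.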
How can I solve it?