I have a question regarding connecting a webportal which will act a data source. I need to know how can i do connect it to my cluster in cloudera.
Can you please clarify whether you want to connect the back end database of the web portal?
Or you want to consume a Web API ?
If the answer to the first question is yes then Sqoop is the tool you're looking for. In the other case, you can consume the data in real-time with Nifi and Kafka.
Bascially i am working on a project related to data exchange between two users. The users will have a account in the webpotal and will upload the files on that webportal that will be stored HDFS file system. I want to know which tool is better for this use case.