I am initiating a big data project, however, I don't how to start it. What are the tools we need? We have our own server. Should us migrate our data to amazon s3 or not? I still not clear how to link the hortonworks and amazon s3. What are the costs that I need to consider?