I come from a background as a C# middleware developer and I also have an intermediate knowledge of Java. I would like to be a Hadoop developer and would really appreciate it if you could guide me on which Hadoop components I need to learn first.
After you download the sandbox and HDF, take a look at these resources.
@John D. As Divakar pointed out, the Hortonworks Products page is a great place to start. It has step-by-step tutorials and an overview of HDP.
In general, if you would like to learn Apache Hadoop, http://hadoop.apache.org/ is a good place to start.
The above is a very good book. Hortonworks is part of the open source community and is the only distribution with 100% open source projects. Hadoop has grown in both breadth and depth. Traditionally, a lot of work was done in MapReduce, but people have now moved more towards Hive (a SQL interface) and Spark.
I would say if you are starting off as a developer, start with Spark. Also, as you come from C#, picking up Scala should not be hard, and you may enjoy working with Spark.
Understanding the MapReduce framework, HDFS, and the internals of YARN is important to being skilled in Hadoop.
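Since you already know some Java, here is a minimal sketch of the map, shuffle, and reduce phases that the MapReduce model is built around, using a word count. This is plain Java running in a single process, not the Hadoop API: the class name WordCountSketch and its methods are made up for illustration, and a real Hadoop job would distribute these phases across a cluster.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical single-process illustration of the MapReduce model.
// It does NOT use Hadoop classes; all names here are assumptions.
public class WordCountSketch {

    // Map phase: turn one input line into (word, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\W+")) {
            if (!word.isEmpty()) {
                pairs.add(new SimpleEntry<>(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group every emitted value under its key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: collapse all values for one key into a single count.
    static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int count : counts) {
            sum += count;
        }
        return sum;
    }

    // Runs the three phases in order over all input lines.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> entry : shuffle(emitted).entrySet()) {
            result.put(entry.getKey(), reduce(entry.getKey(), entry.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("Hive and Spark", "Spark on YARN");
        // prints {and=1, hive=1, on=1, spark=2, yarn=1}
        System.out.println(wordCount(lines));
    }
}
```

Once this flow makes sense, the same word count in Hive or Spark should be easier to follow, since both express the grouping and aggregation steps at a higher level.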
I would say best of luck for your journey.
This tutorial includes a collection of concepts to review and a series of labs that walk you through the process of loading some data, transforming it, querying it, and building a report.
If you want to learn Spark, I would go through these Spark tutorials on the Sandbox.
And if you have a question on a tutorial, use the new buttons at the bottom of the tutorials to post a question on HCC. =)
I appreciate your suggestions. I do have an Azure subscription and I'll definitely go ahead and install the sandbox.