Support Questions

Find answers, ask questions, and share your expertise

I am a newbie to Big Data and Hadoop.

avatar
New Contributor

I come from a background of C# middleware developer and I also have a intermediate knowledge of Java. I would lke to be a Hadoop developer and would really appreicate if you could guide me what Hadoop components I need to learn first?

1 ACCEPTED SOLUTION
10 REPLIES 10

avatar
Rising Star

Hi Musheer,

You can refer to the link - http://hortonworks.com/training/

Under developer section you will find the course objective which you can use to get started.

Regards,

Karthik Gopal

avatar

@John D.

Here is the link to download HDP Sandbox http://hortonworks.com/hdp/downloads/

avatar

@John D

@John D. As Divakar pointed out Hortonworks Products page is a great place to start. It has step by step tutorials and overall about the HDP.

In general if you would like to learn Apache Hadoop is a good place http://hadoop.apache.org/.

As for a book - http://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/1449311520

The above is a very good book. Hortonworks being part of the open source community and the only distribution with 100% open source projects. Hadoop has grown in many breadth and depths. Traditionally lot of work was done on map reduce and now people have moved more towards Hive (SQL Interface) and Spark.

I would say if you are starting of as a developer, start with Spark. Also as you come from C#, picking up Scala will not be bad and you may enjoy working with Spark.

Understanding Map Reduce framwork, HDFS and internals of Yarn is important to be skilled in Hadoop.

I would say best of luck for your journey.

avatar
New Contributor

Those are all paid sites. I am wondering if there are any free resources that I can use...

avatar
Rising Star

You can also use the link below for tutorials using sandbox mentioned by @Divakar Annapureddy

http://hortonworks.com/tutorials/

Regards,

Karthik Gopal

avatar
Master Guru

@John D. - Good to see your interest & enthusiasm to learn Hadoop! 🙂 HCC is great place to get your Hadoop questions answered. Please have a look at links provided by @Karthik Gopal & @Divakar Annapureddy

Happy Hadooping!!

avatar
Guru

@John D.: HCC provide a very good and handy documents to start with hadoop. Also you can visit to http://www.hadoopadmin.co.in/ to start with hadoop.

avatar

@John d

@John D.I would suggest downloading the Sandbox and doing the Hello to HDP tutorial.

This tutorial includes a collection of concepts to review and a series of labs that walks you through the process of loading some data, transforming it, querying and building a report.

If you want to learn spark I would go through these Spark tutorials on the Sandbox.

And if you have a question on a tutorial use the new buttons at the bottom of the tutorials to post a question on HCC. 😃

avatar
New Contributor

I appreciate your suggestions. I do have an Azure subscription and I'd go definitely go ahead and install the sandbox.