Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

What type of results that can I get using Apache SPARK

avatar
Rising Star

Hello experts,

I already have some data in HDFS, and some tables with Hive, I would like to do some data analysis/data mining. But I don't know If I use Spark or other Analytics Tool.

What type of analysis/analytics work can I do with Spark?

1 ACCEPTED SOLUTION

avatar

Hello Pedro

Spark core is a general purpose in memory analytics engine. Adding to spark core things like sparkSQL or SparkML you can do many interesting analytics or Datascience modelling, in a programatic or sql fashion. Maybe this tutorial can help you in your first steps.

http://hortonworks.com/hadoop-tutorial/hands-on-tour-of-apache-spark-in-5-minutes/

http://hortonworks.com/blog/data-science-hadoop-spark-scala-part-2/

View solution in original post

2 REPLIES 2

avatar

Hello Pedro

Spark core is a general purpose in memory analytics engine. Adding to spark core things like sparkSQL or SparkML you can do many interesting analytics or Datascience modelling, in a programatic or sql fashion. Maybe this tutorial can help you in your first steps.

http://hortonworks.com/hadoop-tutorial/hands-on-tour-of-apache-spark-in-5-minutes/

http://hortonworks.com/blog/data-science-hadoop-spark-scala-part-2/

avatar

@Pedro Rodgers

for more information ..google search on spark 1.6.1 or http://spark-1.6.1.org

it details more about dataframes,SQL,HiveQl,graphx,machine learning ,R etc. with examples.