Member since
11-10-2014
3
Posts
0
Kudos Received
0
Solutions
11-10-2014
02:18 PM
hello srowen, what I meant by analysis using Mahout algorithms on top of hadoop cluster as well MapReduce preprocessing tasks for instnstance tokenization, stemming and translation of 10 million of tweets. thank you
... View more
11-10-2014
08:41 AM
hello ZKhan , thanks for the replay, by your respose to my question you meant that the HW is sufficient conditioned to 10TB harddrive for each machine, but my whole data set is 10gb, why I need 10TB for each machine. thank you in advance,
... View more
11-10-2014
06:09 AM
Hello I am trying to Setup a Hadoop environment with commodity hardware, in short I want to perform predictive analysis and learn more about Hadoop environment. by performing the analysis on Twitter data using various algorithms such as Clustering, Topic Modeling and Sentiment Analysis, the size of my data set is 10GB with approximately 10000000 tweets. and I have the following hardware specification : One Desktop Quad Core processor 8GB ram and two Desktops with Quad Core processor and 4GB ram . My questions: Is the hardware specification sufficient for the given tasks or do I need to Upgrade the hardware ?
... View more
Labels:
- Labels:
-
Apache Hadoop