About kosmik

kosmik · ‎01-05-2018

Hi, I have some questions about the Hadoop Cluster data node failover: What happened the link is down between the name node and a data node (or between 2 data nodes) when the Hadoop cluster is processing some data? Does Hadoop cluster have any OOTB to recover this problem? What happens one data node is down when the Hadoop cluster is processing some data? Also, another question is about the Hadoop cluster hardware configuration. Let's say we will use our Hadoop cluster to process 100GB log files each day, how many data nodes do we need to set up? And for each data node hardware configuration(e.g. CPU, RAM, Harddisk)? Thank You Hari

kosmik · ‎11-25-2017

Hi, How to connect to Hadoop in a Java program. Here are few details: I am taking input from the user in HTML form, using JSP to process the form data. I want to connect to Hadoop to fetch some data based on form inputs. How can I connect to Hadoop using Java in this case? Thanks Hari

kosmik · ‎11-17-2017

Hi, This is kind of naive question but I am new to NoSQL paradigm and don't know much about it. So if somebody can help me clearly understand the difference between the HBase and Hadoop or if give some pointers which might help me understand the difference. Till now, I did some research and acc. to my understanding Hadoop provides a framework to work with a raw chunk of data(files) in HDFS and HBase is database engine above Hadoop, which basically works with structured data instead of the raw data chunk. HBase provides a logical layer of HDFS just as SQL does. Is it correct? Pls, feel free to correct me. Thanks. hari

Online	Offline
Last Visited	‎11-25-2017 05:00 AM

Member Since	‎11-21-2017 06:05 AM
Last Visited	‎11-25-2017 05:00 AM
Posts	6
Kudos received	1

Cloudera Community

Re: Hadoop cluster failover

To connect to Hadoop using Java

Difference between HBase and Hadoop/HDFS