Support Questions

Find answers, ask questions, and share your expertise

h2o hadoop nodes cluster

avatar
Master Collaborator

Hi: I am trying to up 5 h2o hadoop nodes, bu i am receiving this error, but with just 1 node is working [

[root@lnxbig01 hadoop]# hadoop jar h2odriver_hdp2.1.jar water.hadoop.h2odriver -libjars ../h2o.jar -mapperXmx 2g -nodes 2 -output /tmp/h2o
WARNING: Use "yarn jar" to launch YARN applications.
Determining driver host interface for mapper->driver callback...
    [Possible callback IP address: 10.1.246.15]
    [Possible callback IP address: 127.0.0.1]
Using mapper->driver callback IP address and port: 10.1.246.15:36832
(You can override these with -driverif and -driverport.)
Driver program compiled with MapReduce V1 (Classic)
Memory Settings:
    mapred.child.java.opts:      -Xms2g -Xmx2g
    mapred.map.child.java.opts:  -Xms2g -Xmx2g
    Extra memory percent:        10
    mapreduce.map.memory.mb:     2252
16/03/04 07:53:32 INFO impl.TimelineClientImpl: Timeline service address: http://lnxbig06.cajarural.gcr:8188/ws/v1/timeline/
16/03/04 07:53:32 INFO client.RMProxy: Connecting to ResourceManager at lnxbig05.cajarural.gcr/10.1.246.19:8050
16/03/04 07:53:34 INFO mapreduce.JobSubmitter: number of splits:2
16/03/04 07:53:34 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1456130415890_1037
16/03/04 07:53:34 INFO impl.YarnClientImpl: Submitted application application_1456130415890_1037
16/03/04 07:53:34 INFO mapreduce.Job: The url to track the job: http://lnxbig05.cajarural.gcr:8088/proxy/application_1456130415890_1037/
Job name 'H2O_64985' submitted
JobTracker job ID is 'job_1456130415890_1037'
For YARN users, logs command is 'yarn logs -applicationId application_1456130415890_1037'
Waiting for H2O cluster to come up...
H2O node 10.1.246.18:54323 requested flatfile
H2O node 10.1.246.15:54321 requested flatfile
Sending flatfiles to nodes...
    [Sending flatfile to node 10.1.246.18:54323]
    [Sending flatfile to node 10.1.246.15:54321]
H2O node 10.1.246.18:54323 reports H2O cluster size 1
H2O node 10.1.246.15:54321 reports H2O cluster size 1
H2O node 10.1.246.15:54321 on host 10.1.246.15 exited with status -1
ERROR: At least one node failed to come up during cluster formation
ERROR: H2O cluster failed to come up
Attempting to clean up hadoop job...
16/03/04 07:53:51 INFO impl.YarnClientImpl: Killed application application_1456130415890_1037
Killed.


Please any suggestions why with more than one node it fail??

Many thanks

1 ACCEPTED SOLUTION

avatar
Master Mentor

@Roberto Sancho

Can you upload the contents of Ambari Server logs found at /var/log/ambari-server/ambari-server.log Ambari Agent logs are found at /var/log/ambari-agent/ambari-agent.log .

View solution in original post

2 REPLIES 2

avatar
Master Mentor

@Roberto Sancho

Can you upload the contents of Ambari Server logs found at /var/log/ambari-server/ambari-server.log Ambari Agent logs are found at /var/log/ambari-agent/ambari-agent.log .

avatar
Master Mentor

Please post the H2O logs