Created 01-07-2016 10:07 AM
Hello all ! I have to make some hadoop services highly available. so i was wondering about the best way to achieve that, here is the services list : HBase Lily indexer, Sqoop, HiveServer2 thanks a lot
Created 01-07-2016 11:35 AM
sqoop is a client only, so you can have sqoop installed in multiple nodes behind a IP load balancer.
I don't know about Lilly indexer (part of HDP Search Connector). Documentation is here: https://doc.lucidworks.com/lucidworks-hdpsearch/2.3/Guide-Jobs.html#_hbase-indexer, but I'm not sure if it has HA out-of-the-box with Solr Cloud.
Created 01-07-2016 10:52 AM
There is a High Availability section in: http://docs.hortonworks.com (choose you version)
For lastest HDP version (2.3.4), see this:
Created 01-07-2016 11:16 AM
Hello ! first of all thanks a lot for you answer, otherwise I already checked the high availability section on the official documentation, but there is no information about how to make lilly indexer, sqoop highly available.
Created 01-07-2016 11:35 AM
sqoop is a client only, so you can have sqoop installed in multiple nodes behind a IP load balancer.
I don't know about Lilly indexer (part of HDP Search Connector). Documentation is here: https://doc.lucidworks.com/lucidworks-hdpsearch/2.3/Guide-Jobs.html#_hbase-indexer, but I'm not sure if it has HA out-of-the-box with Solr Cloud.
Created 01-07-2016 04:15 PM
Are you sur that sqoop is only a client ? i'm agree that the processing is already HA, cause it uses yarn.but what about the Sqoop metastore and job tool ?
Created on 01-07-2016 04:35 PM - edited 08-19-2019 05:18 AM
See this screenshot from sqoop in Ambari. It's only client.
There no sqoop metastore service, by default sqoop uses derby database, but if you want, you can use external mysql or postgres database for sqoop, then, you can configure this database in HA mode.
Created 01-07-2016 05:04 PM
i didn't mention that i'm using sqoop2 with the sqoop server.
Created 01-07-2016 05:13 PM
Hortonworks does not support sqoop2 now. Sqoop supported in lastest hdp 2.3.4 is:
Created 01-07-2016 02:06 PM
Lily is not an HDP product, it was contributed by Ngdata. Perhaps you can reach out to their mailing list for any HA advice? You may have to come up with your own HA for it, then you should certainly share your docs here for the best of the community! As far as HBase, make sure you run multiple master servers.
Created 01-07-2016 04:16 PM
Thanks :), i'll check for Lily.