Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Do we support Nutch, Tika (NLP)and Stanbol on HDP?

avatar
Explorer

Need Installation and configuration docs for Nutch, Tika, Stanbol if we support it.

1 ACCEPTED SOLUTION

avatar

Hortonworks supports only the products listed and linked to from this page http://hortonworks.com/hdp/

However, our PS and DS teams have delivered custom solutions using those projects as part of the overall design. It doesn't provide an official endorsement of those in the support offering, however, depends on a customer's risk appetite and any pre-existing vendor support arrangements for those tools.

View solution in original post

3 REPLIES 3

avatar

Hortonworks supports only the products listed and linked to from this page http://hortonworks.com/hdp/

However, our PS and DS teams have delivered custom solutions using those projects as part of the overall design. It doesn't provide an official endorsement of those in the support offering, however, depends on a customer's risk appetite and any pre-existing vendor support arrangements for those tools.

avatar
Explorer

Thanks Andrew. I found Tika is a library shipped with Solr. Couldnt find Nutch and Stanbol. Will convey to customer about the support as suggested.

avatar
Expert Contributor

mmadan: Nutch is a full web crawling system which uses Hadoop. It has been around for many years - and in fact could be credited with creating Hadoop.

I tried supporting Nutch for a while (Not through Hortonworks of course), but it is still very much R&D software because there are so few companies using it. There is some significant confusion about moving away from MapReduce to YARN.

Stanbol is something I am less familiar with - but since it consists of LOTS of Apache projects I think it would be as complicated as Hadoop to support.