Support Questions
Find answers, ask questions, and share your expertise

Do we support Nutch, Tika (NLP)and Stanbol on HDP?

Explorer

Need Installation and configuration docs for Nutch, Tika, Stanbol if we support it.

1 ACCEPTED SOLUTION

Accepted Solutions

Hortonworks supports only the products listed and linked to from this page http://hortonworks.com/hdp/

However, our PS and DS teams have delivered custom solutions using those projects as part of the overall design. It doesn't provide an official endorsement of those in the support offering, however, depends on a customer's risk appetite and any pre-existing vendor support arrangements for those tools.

View solution in original post

3 REPLIES 3

Hortonworks supports only the products listed and linked to from this page http://hortonworks.com/hdp/

However, our PS and DS teams have delivered custom solutions using those projects as part of the overall design. It doesn't provide an official endorsement of those in the support offering, however, depends on a customer's risk appetite and any pre-existing vendor support arrangements for those tools.

View solution in original post

Explorer

Thanks Andrew. I found Tika is a library shipped with Solr. Couldnt find Nutch and Stanbol. Will convey to customer about the support as suggested.

Contributor

mmadan: Nutch is a full web crawling system which uses Hadoop. It has been around for many years - and in fact could be credited with creating Hadoop.

I tried supporting Nutch for a while (Not through Hortonworks of course), but it is still very much R&D software because there are so few companies using it. There is some significant confusion about moving away from MapReduce to YARN.

Stanbol is something I am less familiar with - but since it consists of LOTS of Apache projects I think it would be as complicated as Hadoop to support.