Do we support Nutch, Tika (NLP), and Stanbol on HDP?

New Contributor

Need installation and configuration docs for Nutch, Tika, and Stanbol, if we support them.

Accepted Solutions

Re: Do we support Nutch, Tika (NLP), and Stanbol on HDP?

Hortonworks supports only the products listed and linked to from this page http://hortonworks.com/hdp/

However, our PS and DS teams have delivered custom solutions that use those projects as part of the overall design. That does not amount to an official endorsement of them in the support offering; whether to use them depends on the customer's risk appetite and any pre-existing vendor support arrangements for those tools.

Re: Do we support Nutch, Tika (NLP), and Stanbol on HDP?

New Contributor

Thanks, Andrew. I found that Tika is a library shipped with Solr. I couldn't find Nutch or Stanbol. I will tell the customer about the support position as suggested.

Re: Do we support Nutch, Tika (NLP), and Stanbol on HDP?

Contributor

mmadan: Nutch is a full web crawling system that runs on Hadoop. It has been around for many years, and in fact it could be credited with creating Hadoop, which began as part of the Nutch project.

I tried supporting Nutch for a while (not through Hortonworks, of course), but it is still very much R&D software because so few companies use it. There is also significant confusion about moving away from MapReduce to YARN.

Stanbol is something I am less familiar with, but since it consists of lots of Apache projects, I think it would be as complicated as Hadoop to support.
