<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: cloudera solr integrating with apache nutch 1.7 custom built. in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/cloudera-solr-integrating-with-apache-nutch-1-7-custom-built/m-p/18256#M2785</link>
    <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;I got this working by changing from nutch version 1.7 to 1.8.&lt;/P&gt;&lt;P&gt;Reason==&amp;gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This was the issue. Further details please follow up the link:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.mail-archive.com/user%40nutch.apache.org/msg12592.html"&gt;http://www.mail-archive.com/user%40nutch.apache.org/msg12592.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks and Regards,&lt;/P&gt;&lt;P&gt;Sandeep B A&lt;/P&gt;</description>
    <pubDate>Fri, 05 Sep 2014 07:45:36 GMT</pubDate>
    <dc:creator>sandeep_ba</dc:creator>
    <dc:date>2014-09-05T07:45:36Z</dc:date>
    <item>
      <title>cloudera solr integrating with apache nutch 1.7 custom built.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/cloudera-solr-integrating-with-apache-nutch-1-7-custom-built/m-p/18034#M2784</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm new to solr and nutch and i'm trying to integrate cloudera solr with apache nutch 1.7 custom built by taking source and adding mapred-site.xml,core-site.xml,hadoop-env.sh,hdfs-site.xml,yarn-site.xml.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As such normal crawling works for apache nutch. But when i try to integrate and crawl and index in solr provided by cloudera, it's failing with below exceptions. Since i'm very new to this, i'm unable to figure out how to solve this issue. Kindly can any one tell me how to proceed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;request: &lt;A href="http://bigdata-cl1-nn:8983/solr/update?wt=javabin&amp;amp;version=2" target="_blank"&gt;http://bigdata-cl1-nn:8983/solr/update?wt=javabin&amp;amp;version=2&lt;/A&gt;&lt;BR /&gt;at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)&lt;BR /&gt;at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)&lt;BR /&gt;at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)&lt;BR /&gt;at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)&lt;BR /&gt;at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)&lt;BR /&gt;at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)&lt;BR /&gt;at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:502)&lt;BR /&gt;at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:456)&lt;BR /&gt;at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:415)&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;PS: What i did was copied schema-solr4.xml to&amp;nbsp;&lt;/P&gt;&lt;P&gt;/usr/share/doc/solr-doc-4.3.0+61/example/solr/collection1/conf and added &amp;nbsp;in&amp;nbsp;351 line: &amp;lt;field name="_version_" type="long" indexed="true" stored="true"/&amp;gt;&lt;/P&gt;&lt;P&gt;and restarted solr.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CDH versions==&amp;gt;&amp;nbsp;5.1.0-1.cdh5.1.0.p0.53&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried to find the Location, i couldn't find solr, hence posting here, please redirect me if this is not the correct group.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for the suggestions.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks and Regards,&lt;/P&gt;&lt;P&gt;Sandeep B A&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:06:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/cloudera-solr-integrating-with-apache-nutch-1-7-custom-built/m-p/18034#M2784</guid>
      <dc:creator>sandeep_ba</dc:creator>
      <dc:date>2022-09-16T09:06:36Z</dc:date>
    </item>
    <item>
      <title>Re: cloudera solr integrating with apache nutch 1.7 custom built.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/cloudera-solr-integrating-with-apache-nutch-1-7-custom-built/m-p/18256#M2785</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;I got this working by changing from nutch version 1.7 to 1.8.&lt;/P&gt;&lt;P&gt;Reason==&amp;gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This was the issue. Further details please follow up the link:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.mail-archive.com/user%40nutch.apache.org/msg12592.html"&gt;http://www.mail-archive.com/user%40nutch.apache.org/msg12592.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks and Regards,&lt;/P&gt;&lt;P&gt;Sandeep B A&lt;/P&gt;</description>
      <pubDate>Fri, 05 Sep 2014 07:45:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/cloudera-solr-integrating-with-apache-nutch-1-7-custom-built/m-p/18256#M2785</guid>
      <dc:creator>sandeep_ba</dc:creator>
      <dc:date>2014-09-05T07:45:36Z</dc:date>
    </item>
  </channel>
</rss>

