<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question solr schema less in text search in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/25793#M5384</link>
    <description>&lt;P&gt;i am using solr version 4.4 CDH 5.3.1 , and was wondering if its possible to insert a log file "unstrucuted" into solr and search for specific words in this text, is it possible as i don't have a schema for the file , its just a text file ? and if yes , how that's can be done using cloudera manager to configure solr to do so ?&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 09:25:03 GMT</pubDate>
    <dc:creator>tarekabouzeid91</dc:creator>
    <dc:date>2022-09-16T09:25:03Z</dc:date>
    <item>
      <title>solr schema less in text search</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/25793#M5384</link>
      <description>&lt;P&gt;i am using solr version 4.4 CDH 5.3.1 , and was wondering if its possible to insert a log file "unstrucuted" into solr and search for specific words in this text, is it possible as i don't have a schema for the file , its just a text file ? and if yes , how that's can be done using cloudera manager to configure solr to do so ?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:25:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/25793#M5384</guid>
      <dc:creator>tarekabouzeid91</dc:creator>
      <dc:date>2022-09-16T09:25:03Z</dc:date>
    </item>
    <item>
      <title>Re: solr schema less in text search</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/25797#M5385</link>
      <description>&lt;P&gt;i found a command line which takes files in a directory and recursivly index them :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;java -classpath /opt/cloudera/parcels/CDH/lib/solr/solr-core-4.4.0-cdh5.3.1.jar -Dauto=yes -Dc=testing -Ddata=files -Drecursive=yes org.apache.solr.util.SimplePostTool mydata/&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;but i got an error :&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;SimplePostTool version 1.5&lt;BR /&gt;Posting files to base url http://localhost:8983/solr/update..&lt;BR /&gt;Entering auto mode. File endings considered are xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log&lt;BR /&gt;Entering recursive mode, max depth=999, delay=0s&lt;BR /&gt;Indexing directory mydata (9 files, depth=0)&lt;BR /&gt;POSTing file Word_Count_input - Copy (4).txt (text/plain)&lt;BR /&gt;SimplePostTool: WARNING: Solr returned an error #404 Not Found&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and it doesn't commit the changes as well so nothing is writtin in solr&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Mar 2015 11:22:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/25797#M5385</guid>
      <dc:creator>tarekabouzeid91</dc:creator>
      <dc:date>2015-03-23T11:22:16Z</dc:date>
    </item>
    <item>
      <title>Re: solr schema less in text search</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/35717#M5386</link>
      <description>&lt;P&gt;i found this URL very helpful , so if anyone is facing this problem , it will help alot&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika&amp;nbsp;" target="_blank"&gt;https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika&amp;nbsp;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;but for solr cloud , there's another good way:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1- configure data import handler in the solrconfig.xml&amp;nbsp;&lt;/P&gt;&lt;P&gt;add this part after any request handler inside the file&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;lt;requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler"&amp;gt;&lt;BR /&gt;&amp;lt;lst name="defaults"&amp;gt;&lt;BR /&gt;&amp;lt;str name="config"&amp;gt;DIHconfigfile.xml&amp;lt;/str&amp;gt;&lt;BR /&gt;&amp;lt;/lst&amp;gt;&lt;BR /&gt;&amp;lt;/requestHandler&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2- create the data import handler file "DIHconfigfile.xml" and reside it next to the solrconfig.xml&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;below you can find more about DIH&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://wiki.apache.org/solr/DataImportHandler" target="_blank"&gt;https://wiki.apache.org/solr/DataImportHandler&lt;/A&gt;&lt;/P&gt;&lt;P&gt;(check file data source part)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;3- reload the core&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;4- from solr web UI you can start indexing the file/files you specified in the DIH ..&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Happy indexing&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 31 Dec 2015 12:48:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/solr-schema-less-in-text-search/m-p/35717#M5386</guid>
      <dc:creator>tarekabouzeid91</dc:creator>
      <dc:date>2015-12-31T12:48:17Z</dc:date>
    </item>
  </channel>
</rss>

