<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: SOLR - how to use it in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/299260#M53267</link>
    <description>&lt;P&gt;Solr &lt;STRONG&gt;includes the specified file terms&lt;/STRONG&gt; in an index.&lt;/P&gt;&lt;P&gt;Indexing in Solr would be similar to creating an index at the end of a book that includes the words that appear in that book and their location, so basically we would take an inventory of the words that appear in the book and an inventory of the pages where said words appear&lt;/P&gt;&lt;P&gt;That is, by including content in the index, we make said content available for search by Solr.&lt;/P&gt;&lt;P&gt;This type of index, called an inverted index, is a way of structuring the information that will be retrieved by a search engine.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="inverted-file-index.jpg" style="width: 763px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28184iEDB8042145386E7A/image-size/large?v=v2&amp;amp;px=999" role="button" title="inverted-file-index.jpg" alt="inverted-file-index.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;You may find a longer answer of the way the information is stored and retrieved by solr in &lt;A href="https://www.solr-tutorial.com/indexing-with-solr.html" target="_blank"&gt;https://www.solr-tutorial.com/indexing-with-solr.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 07 Jul 2020 11:34:27 GMT</pubDate>
    <dc:creator>lusitez</dc:creator>
    <dc:date>2020-07-07T11:34:27Z</dc:date>
    <item>
      <title>SOLR - how to use it</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/158678#M53264</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;I understand SOLR creates a index file and makes searches faster - however I have a fundamental question -&lt;/P&gt;&lt;P&gt;Does SOLR stores the data + index - for example if I have a Table with 100 columns, and I want index on a few columns&lt;/P&gt;&lt;P&gt;Will SOLR store all the Table data so that it can show the full row on search match&lt;/P&gt;&lt;P&gt;OR&lt;/P&gt;&lt;P&gt;The full file can be in HDFS/HBASE&lt;/P&gt;&lt;P&gt;and SOLR can look it up and show the full row?&lt;/P&gt;&lt;P&gt;So can there be an approach where the Data is in HDFS and the primary/secondary indexes in SOLR - and search can find the full data in HDFS. Not only find , can also update / delete.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Avijeet&lt;/P&gt;</description>
      <pubDate>Fri, 03 Feb 2017 13:18:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/158678#M53264</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2017-02-03T13:18:30Z</dc:date>
    </item>
    <item>
      <title>Re: SOLR - how to use it</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/158679#M53265</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11016/avijeetd.html" nodeid="11016"&gt;@Avijeet Dash&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;I would recommend reading the following link:&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.solrtutorial.com/basic-solr-concepts.html" target="_blank"&gt;http://www.solrtutorial.com/basic-solr-concepts.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;First to answer your question, you cannot keep your data in HBase/HDFS and create an index in SOLR to search that data. SOLR will search its own index. Here is the concept:&lt;/P&gt;&lt;P&gt;Data stored in SOLR is called documents (an analogy from database world is that each document is a row in a table). Before you can store data in SOLR, you will have to define a schema in a file called schema.xml (similar to a table schema in a database). This is where you specify whether your field (think like a column in a database) is indexed as well as stored. I know you understand index which is what SOLR  uses to search. Bu what the hell is "stored". Well, are you only going to get back the indexed fields? Assume a document with 50 fields. May be you want to search only on 5 of the fields. And when you get the result back of your search, you probably want more than the indexed field. So you get back your stored fields. The more fields you store and index, the higher storage requirements.&lt;/P&gt;&lt;P&gt;Read that link and you'll have a good idea. And to reiterate my earlier point, no, you cannot have data in HDFS/HBase and index from SOLR. SOLR is a complete solution. SOLR can use HDFS to store and index its own data, but it's not going to create an index on your HBase file or your ORC/Text etc files on HDFS.&lt;/P&gt;</description>
      <pubDate>Sat, 04 Feb 2017 01:29:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/158679#M53265</guid>
      <dc:creator>mqureshi</dc:creator>
      <dc:date>2017-02-04T01:29:43Z</dc:date>
    </item>
    <item>
      <title>Re: SOLR - how to use it</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/158680#M53266</link>
      <description>&lt;P&gt; &lt;A rel="user" href="https://community.cloudera.com/users/10969/mqureshi.html" nodeid="10969"&gt;@mqureshi&lt;/A&gt; made great points. Also, note that you do not have to store any fields in Solr. You can choose True for either or both: stored=true/false, indexed=true/false. Of course if stored=false, you won't see the value in results but you will at a minimum, see the "uniqueKey" which would be your "id" field. You could also look at the HBase Indexer: &lt;A href="https://community.hortonworks.com/articles/1181/hbase-indexing-to-solr-with-hdp-search-in-hdp-%2023.html" target="_blank"&gt;https://community.hortonworks.com/articles/1181/hbase-indexing-to-solr-with-hdp-search-in-hdp-%2023.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 04 Feb 2017 05:05:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/158680#M53266</guid>
      <dc:creator>james_jones</dc:creator>
      <dc:date>2017-02-04T05:05:20Z</dc:date>
    </item>
    <item>
      <title>Re: SOLR - how to use it</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/299260#M53267</link>
      <description>&lt;P&gt;Solr &lt;STRONG&gt;includes the specified file terms&lt;/STRONG&gt; in an index.&lt;/P&gt;&lt;P&gt;Indexing in Solr would be similar to creating an index at the end of a book that includes the words that appear in that book and their location, so basically we would take an inventory of the words that appear in the book and an inventory of the pages where said words appear&lt;/P&gt;&lt;P&gt;That is, by including content in the index, we make said content available for search by Solr.&lt;/P&gt;&lt;P&gt;This type of index, called an inverted index, is a way of structuring the information that will be retrieved by a search engine.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="inverted-file-index.jpg" style="width: 763px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28184iEDB8042145386E7A/image-size/large?v=v2&amp;amp;px=999" role="button" title="inverted-file-index.jpg" alt="inverted-file-index.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;You may find a longer answer of the way the information is stored and retrieved by solr in &lt;A href="https://www.solr-tutorial.com/indexing-with-solr.html" target="_blank"&gt;https://www.solr-tutorial.com/indexing-with-solr.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 07 Jul 2020 11:34:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/SOLR-how-to-use-it/m-p/299260#M53267</guid>
      <dc:creator>lusitez</dc:creator>
      <dc:date>2020-07-07T11:34:27Z</dc:date>
    </item>
  </channel>
</rss>

