<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Purpose of inverted indexes in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/17486#M2690</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am just starting my education in the world of big data, Hadoop, and MapReduce.&amp;nbsp; While I understand the concept of inverted indexes, I'm not sure I understand the purpose.&amp;nbsp; What problem is being solved by creating an inverted index?&amp;nbsp; What valuable information does such an approach provide?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Kevin&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 09:05:54 GMT</pubDate>
    <dc:creator>Kevin_DElia</dc:creator>
    <dc:date>2022-09-16T09:05:54Z</dc:date>
    <item>
      <title>Purpose of inverted indexes</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/17486#M2690</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am just starting my education in the world of big data, Hadoop, and MapReduce.&amp;nbsp; While I understand the concept of inverted indexes, I'm not sure I understand the purpose.&amp;nbsp; What problem is being solved by creating an inverted index?&amp;nbsp; What valuable information does such an approach provide?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Kevin&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:05:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/17486#M2690</guid>
      <dc:creator>Kevin_DElia</dc:creator>
      <dc:date>2022-09-16T09:05:54Z</dc:date>
    </item>
    <item>
      <title>Re: Purpose of inverted indexes</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/19782#M2691</link>
      <description>&lt;P&gt;Kevin,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The Hadoop ecosystem is a lot more complex than just a simple key-value store, but a key-value store is sufficient to answer your question. Let's say you have data of the form "Key =&amp;gt; Value1", in one location, and "Key =&amp;gt; Value2" in another location. If you know one value, it's not trivial to find all related values. Unless.... you have an inverted index that allows you to look up the key for any given value, and then use that key to look up other values. For instance, say I have a database that lists the mailing address for each person.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;e.g. Kevin =&amp;gt; 1 Apple St, Sean =&amp;gt; 2 Zebra Ln&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is great if you just want to see where specific people live, but what if your question starts with having an address and needing to know all the people who live there? Instead of the key being the name and the value being the address, you create a different index that inverts this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;e.g 2 Zebra Ln =&amp;gt; Sean, 1 Apple St =&amp;gt; Kevin&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now it's easy to see everyone who shares an address because they would also share a key (which is actually not doable in some key-value stores - in which case you would modify the value field to encode a sequence of values).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I know it's been a while since you asked your question, but I hope this helps!&lt;/P&gt;</description>
      <pubDate>Mon, 06 Oct 2014 21:49:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/19782#M2691</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2014-10-06T21:49:26Z</dc:date>
    </item>
    <item>
      <title>Re: Purpose of inverted indexes</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/20100#M2692</link>
      <description>&lt;P&gt;Hi, Sean -&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, that does help.&amp;nbsp; I took the Developer course and we discussed this.&amp;nbsp; I was surprised to learn (and it's obvious to me now) that book indices are actually inverted indices!&amp;nbsp; Learning to think in Hadoop and M/R will be the first challenge to overcome as I begin my efforts in working in the Big Data arena.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for your help.&lt;/P&gt;&lt;P&gt;Kevin&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2014 13:41:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Purpose-of-inverted-indexes/m-p/20100#M2692</guid>
      <dc:creator>Kevin_DElia</dc:creator>
      <dc:date>2014-10-13T13:41:36Z</dc:date>
    </item>
  </channel>
</rss>

