<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: In-Memory Layer in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/In-Memory-Layer/m-p/138835#M35335</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/9304/tspann.html" nodeid="9304"&gt;@Timothy Spann&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It really all depends on your particular use case and requirements.  First, I'm assuming you have a custom-built application that will be querying this data store.  If so, how complex do the queries need to be?  Do you need Relational (SQL) or Key-Value store?  Also, how much latency can you afford?&lt;/P&gt;&lt;P&gt;I would first explore if HBase (or HBase + Phoenix) would be sufficient.  This will reduce the number of moving parts you have.&lt;/P&gt;&lt;P&gt;If you're set on in-memory data grids/stores then some options would be Redis, Hazelcast, Teracotta Big Memory and GridGain (Apache Ignite).  I believe the last two have connectors to Hadoop that allow writing results of MR jobs directly to the data grid (you'll need to confirm that functionality though)&lt;/P&gt;&lt;P&gt;Like I said before though, I recommend you exhaust the HBase option before moving out-of-stack.  &lt;/P&gt;</description>
    <pubDate>Tue, 16 Aug 2016 05:14:45 GMT</pubDate>
    <dc:creator>egarelnabi</dc:creator>
    <dc:date>2016-08-16T05:14:45Z</dc:date>
    <item>
      <title>In-Memory Layer</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/In-Memory-Layer/m-p/138834#M35334</link>
      <description>&lt;P&gt;I am looking for the best option for in-memory computing, fast data. The most recent data we have (current, 5 minutes, 1 hours, &amp;lt; 1 day) we need to have access to as fast as possible.&lt;/P&gt;&lt;P&gt;It's probably 500G or less.&lt;/P&gt;&lt;P&gt;Something like Pivotal's Butterfly Architecture.&lt;/P&gt;&lt;P&gt;What will work best for keeping some of this fast data?   I have been looking at Apache Geode, Apache Ignite, Alluxio, SnappyData, Redis, HDFS Ram Data Nodes, HBase In-Memory Column Families, Kafka, Spark Streaming.&lt;/P&gt;&lt;P&gt;Any baked solutions out there that work with HDP?&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 01:13:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/In-Memory-Layer/m-p/138834#M35334</guid>
      <dc:creator>TimothySpann</dc:creator>
      <dc:date>2016-07-21T01:13:51Z</dc:date>
    </item>
    <item>
      <title>Re: In-Memory Layer</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/In-Memory-Layer/m-p/138835#M35335</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/9304/tspann.html" nodeid="9304"&gt;@Timothy Spann&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It really all depends on your particular use case and requirements.  First, I'm assuming you have a custom-built application that will be querying this data store.  If so, how complex do the queries need to be?  Do you need Relational (SQL) or Key-Value store?  Also, how much latency can you afford?&lt;/P&gt;&lt;P&gt;I would first explore if HBase (or HBase + Phoenix) would be sufficient.  This will reduce the number of moving parts you have.&lt;/P&gt;&lt;P&gt;If you're set on in-memory data grids/stores then some options would be Redis, Hazelcast, Teracotta Big Memory and GridGain (Apache Ignite).  I believe the last two have connectors to Hadoop that allow writing results of MR jobs directly to the data grid (you'll need to confirm that functionality though)&lt;/P&gt;&lt;P&gt;Like I said before though, I recommend you exhaust the HBase option before moving out-of-stack.  &lt;/P&gt;</description>
      <pubDate>Tue, 16 Aug 2016 05:14:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/In-Memory-Layer/m-p/138835#M35335</guid>
      <dc:creator>egarelnabi</dc:creator>
      <dc:date>2016-08-16T05:14:45Z</dc:date>
    </item>
  </channel>
</rss>

