<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hortonworks Tools for Implementing Lambda Architecture in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168690#M37198</link>
    <description>&lt;P&gt;HAWQ is good for nothing&lt;/P&gt;</description>
    <pubDate>Fri, 10 Nov 2017 03:19:59 GMT</pubDate>
    <dc:creator>milindmore</dc:creator>
    <dc:date>2017-11-10T03:19:59Z</dc:date>
    <item>
      <title>Hortonworks Tools for Implementing Lambda Architecture</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168685#M37193</link>
      <description>&lt;P&gt;Has a consensus formed around the best tool stack for implementing the Lambda Architecture on HDP?  I'm particularly interested in the "serving" and "speed" layers.  In "Big Data: Principles and best practices of scalable real-time data systems", Nathan Marz mentions using ElephantDB for the serving layer, but I'm trying to limit myself to tools included in the HDP/HDF stacks.&lt;/P&gt;</description>
      <pubDate>Mon, 08 Aug 2016 20:46:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168685#M37193</guid>
      <dc:creator>ekbrosch</dc:creator>
      <dc:date>2016-08-08T20:46:44Z</dc:date>
    </item>
    <item>
      <title>Re: Hortonworks Tools for Implementing Lambda Architecture</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168686#M37194</link>
      <description>&lt;P&gt; &lt;A rel="user" href="https://community.cloudera.com/users/944/ekbrosch.html" nodeid="944"&gt;@Eric Brosch&lt;/A&gt; Phoenix and HDB (HAWQ) may be leveraged in a Lambda architecture.  Phoenix supports secondary index and HAWQ is a relational MPP db on HDP.  Both can serve low latency queries.  Choosing between the two? For know query patterns Phoenix will perform well.  For unknown query patterns HAWQ may be your way to go&lt;/P&gt;</description>
      <pubDate>Mon, 08 Aug 2016 21:33:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168686#M37194</guid>
      <dc:creator>sunile_manjee</dc:creator>
      <dc:date>2016-08-08T21:33:10Z</dc:date>
    </item>
    <item>
      <title>Re: Hortonworks Tools for Implementing Lambda Architecture</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168687#M37195</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/944/ekbrosch.html" nodeid="944" target="_blank"&gt;@Eric Brosch&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Your tooling selection really all depends on your particular use case.&lt;/P&gt;&lt;P&gt;For "Speed" layer, you can use &lt;STRONG&gt;Storm&lt;/STRONG&gt; or &lt;STRONG&gt;Spark Streaming&lt;/STRONG&gt;.  IMHO the main selection criteria between the two will depend on whether you're interested in ultra low latency (Storm) or high throughput (Spark Streaming).  There's other factors, but these are some of the main drivers.  &lt;/P&gt;&lt;P&gt;For the "Serving" layer, your main choice is &lt;STRONG&gt;HBase&lt;/STRONG&gt;.  Depending on how you're going to query the "Serving" layer you may want to consider putting Phoenix on top of HBase.  Since HBase is a NoSQL store, it has it's own API for making calls.  Phoenix adds an abstraction layer on top of HBase and allows you to make queries in SQL format.  Mind you, it's still in tech preview and may have some bugs here and there.  Also, it's not meant for complex SQL queries.&lt;/P&gt;&lt;P&gt;For your ingest and simple event processing you can look into &lt;STRONG&gt;HDF/Nifi&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;If you move beyond the HDP/HDF stack for the serving layer then your options increase to include other NoSQL stores as well as regular SQL DBs.&lt;/P&gt;&lt;P&gt;Below is a diagram of a sample Lambda architecture for a demo that receives sensor data from trucks and analysis them, along with driver behaviour, to determine the possibility of a driver committing a traffic violation/infraction.  It will give you a better idea of what a lambda deployment may look like.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="6463-lambda.jpg" style="width: 715px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20187i798527119C6344A3/image-size/medium?v=v2&amp;amp;px=400" role="button" title="6463-lambda.jpg" alt="6463-lambda.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 10:55:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168687#M37195</guid>
      <dc:creator>egarelnabi</dc:creator>
      <dc:date>2019-08-18T10:55:36Z</dc:date>
    </item>
    <item>
      <title>Re: Hortonworks Tools for Implementing Lambda Architecture</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168688#M37196</link>
      <description>&lt;P&gt;Thank you, &lt;A rel="user" href="https://community.cloudera.com/users/445/egarelnabi.html" nodeid="445"&gt;@Eyad Garelnabi&lt;/A&gt; and &lt;A rel="user" href="https://community.cloudera.com/users/1486/smanjee.html" nodeid="1486"&gt;@Sunile Manjee&lt;/A&gt; .  The database portion is where my primary concerns were.&lt;/P&gt;&lt;P&gt;Eyad, are you layering Phoenix on top of HBase for querying?&lt;/P&gt;</description>
      <pubDate>Mon, 08 Aug 2016 23:05:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168688#M37196</guid>
      <dc:creator>ekbrosch</dc:creator>
      <dc:date>2016-08-08T23:05:26Z</dc:date>
    </item>
    <item>
      <title>Re: Hortonworks Tools for Implementing Lambda Architecture</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168689#M37197</link>
      <description>&lt;P&gt; &lt;A rel="user" href="https://community.cloudera.com/users/944/ekbrosch.html" nodeid="944"&gt;@Eric Brosch&lt;/A&gt; Phoenix is a SQL skin on top of hbase.  Phoenix allows to create secondary index on hbase which hbase natively does not create.    Phoenix on HDP comes out of the box with hbase.&lt;/P&gt;</description>
      <pubDate>Mon, 08 Aug 2016 23:11:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168689#M37197</guid>
      <dc:creator>sunile_manjee</dc:creator>
      <dc:date>2016-08-08T23:11:07Z</dc:date>
    </item>
    <item>
      <title>Re: Hortonworks Tools for Implementing Lambda Architecture</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168690#M37198</link>
      <description>&lt;P&gt;HAWQ is good for nothing&lt;/P&gt;</description>
      <pubDate>Fri, 10 Nov 2017 03:19:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hortonworks-Tools-for-Implementing-Lambda-Architecture/m-p/168690#M37198</guid>
      <dc:creator>milindmore</dc:creator>
      <dc:date>2017-11-10T03:19:59Z</dc:date>
    </item>
  </channel>
</rss>

