<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Real Pratical Tutorial for Hadoop using HDFS, Hive, Pig and Spark in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Real-Pratical-Tutorial-for-Hadoop-using-HDFS-Hive-Pig-and/m-p/42166#M32327</link>
    <description>The QuickStart VM includes a tutorial that will walk you through a use case&lt;BR /&gt;where you:&lt;BR /&gt;&lt;BR /&gt;- ingest some data into HDFS from a relational database using Sqoop, and&lt;BR /&gt;query it with Impala&lt;BR /&gt;- ingest some data into HDFS from a batch of log files, ETL it with Hive,&lt;BR /&gt;and query it with Impala&lt;BR /&gt;- ingest some data into HDFS from a live stream of logs and index it for&lt;BR /&gt;searching with Solr&lt;BR /&gt;- perform link strength analysis on the data using Spark&lt;BR /&gt;- build a dashboard in Hue&lt;BR /&gt;- if Hue run the scripts to migrate to Cloudera Enterprise, also audit&lt;BR /&gt;access to the data and visualize it's lineage&lt;BR /&gt;&lt;BR /&gt;That sounds like it will cover most of what you're looking for.&lt;BR /&gt;</description>
    <pubDate>Mon, 20 Jun 2016 22:40:57 GMT</pubDate>
    <dc:creator>Sean</dc:creator>
    <dc:date>2016-06-20T22:40:57Z</dc:date>
    <item>
      <title>Real Pratical Tutorial for Hadoop using HDFS, Hive, Pig and Spark</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Real-Pratical-Tutorial-for-Hadoop-using-HDFS-Hive-Pig-and/m-p/42091#M32325</link>
      <description>&lt;P&gt;Hi experts,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;There exists any complete tutorial for Hadoop in Cloudera Environment that demonstrates how to use HDFS , Pig , Hive and Spark ?&lt;BR /&gt;&lt;BR /&gt;I have seen a lot of guides but do not correspond to practical cases and I have had some difficulties to develop a solution ... I am very new to Hadoop ecosystem .&lt;BR /&gt;&lt;BR /&gt;I need to deliver a prototype of a Hadoop solution at the end of July and I'm getting frightened with the constant difficulties and doubts that I have felt .&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I only want to use that components to do some data cleansing and transformation.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I already download this virtual machine to use Spark:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.cloudera.com/downloads/quickstart_vms/5-7.html" target="_blank"&gt;http://www.cloudera.com/downloads/quickstart_vms/5-7.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Can anyone help me ?&lt;BR /&gt;&lt;BR /&gt;Many thanks &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:26:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Real-Pratical-Tutorial-for-Hadoop-using-HDFS-Hive-Pig-and/m-p/42091#M32325</guid>
      <dc:creator>Stewart12586</dc:creator>
      <dc:date>2022-09-16T10:26:02Z</dc:date>
    </item>
    <item>
      <title>Re: Real Pratical Tutorial for Hadoop using HDFS, Hive, Pig and Spark</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Real-Pratical-Tutorial-for-Hadoop-using-HDFS-Hive-Pig-and/m-p/42166#M32327</link>
      <description>The QuickStart VM includes a tutorial that will walk you through a use case&lt;BR /&gt;where you:&lt;BR /&gt;&lt;BR /&gt;- ingest some data into HDFS from a relational database using Sqoop, and&lt;BR /&gt;query it with Impala&lt;BR /&gt;- ingest some data into HDFS from a batch of log files, ETL it with Hive,&lt;BR /&gt;and query it with Impala&lt;BR /&gt;- ingest some data into HDFS from a live stream of logs and index it for&lt;BR /&gt;searching with Solr&lt;BR /&gt;- perform link strength analysis on the data using Spark&lt;BR /&gt;- build a dashboard in Hue&lt;BR /&gt;- if Hue run the scripts to migrate to Cloudera Enterprise, also audit&lt;BR /&gt;access to the data and visualize it's lineage&lt;BR /&gt;&lt;BR /&gt;That sounds like it will cover most of what you're looking for.&lt;BR /&gt;</description>
      <pubDate>Mon, 20 Jun 2016 22:40:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Real-Pratical-Tutorial-for-Hadoop-using-HDFS-Hive-Pig-and/m-p/42166#M32327</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2016-06-20T22:40:57Z</dc:date>
    </item>
  </channel>
</rss>

