<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: ETL WEBSITE CONTENT IN HADOOP SANDBOX in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139422#M102049</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16415/drenushagjurka.html" nodeid="16415"&gt;@voca voca&lt;/A&gt; &lt;/P&gt;&lt;P&gt;One example is the tutorial below:&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/hadoop-tutorial/loading-data-into-the-hortonworks-sandbox/" target="_blank"&gt;https://hortonworks.com/hadoop-tutorial/loading-data-into-the-hortonworks-sandbox/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;A more adventurous option is to ingest Twitter data using NiFi, visualize it via Solr/Banana, and then do some query processing using Hive:&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/" target="_blank"&gt;https://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Full list of tutorials:&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/tutorials/" target="_blank"&gt;https://hortonworks.com/tutorials/&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 06 Mar 2017 17:01:16 GMT</pubDate>
    <dc:creator>gmartin</dc:creator>
    <dc:date>2017-03-06T17:01:16Z</dc:date>
    <item>
      <title>ETL WEBSITE CONTENT IN HADOOP SANDBOX</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139421#M102048</link>
      <description>&lt;P&gt;I am very new to the Hadoop Sandbox. I installed the HDP Sandbox on Oracle VirtualBox, along with PuTTY, last week, and I am working through this tutorial:

&lt;A href="https://hortonworks.com/hadoop-tutorial/hello-world-an-introduction-to-hadoop-hcatalog-hive-and-pig/" target="_blank"&gt;https://hortonworks.com/hadoop-tutorial/hello-world-an-introduction-to-hadoop-hcatalog-hive-and-pig/&lt;/A&gt;

Can anyone suggest a step-by-step tutorial, or offer advice, on how to get website content or Facebook content, extract it, and then analyze it (ETL)?
Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 06 Mar 2017 16:40:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139421#M102048</guid>
      <dc:creator>drenusha_gjurka</dc:creator>
      <dc:date>2017-03-06T16:40:20Z</dc:date>
    </item>
    <item>
      <title>Re: ETL WEBSITE CONTENT IN HADOOP SANDBOX</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139422#M102049</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16415/drenushagjurka.html" nodeid="16415"&gt;@voca voca&lt;/A&gt; &lt;/P&gt;&lt;P&gt;One example is the tutorial below:&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/hadoop-tutorial/loading-data-into-the-hortonworks-sandbox/" target="_blank"&gt;https://hortonworks.com/hadoop-tutorial/loading-data-into-the-hortonworks-sandbox/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;A more adventurous option is to ingest Twitter data using NiFi, visualize it via Solr/Banana, and then do some query processing using Hive:&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/" target="_blank"&gt;https://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Full list of tutorials:&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/tutorials/" target="_blank"&gt;https://hortonworks.com/tutorials/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 06 Mar 2017 17:01:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139422#M102049</guid>
      <dc:creator>gmartin</dc:creator>
      <dc:date>2017-03-06T17:01:16Z</dc:date>
    </item>
    <item>
      <title>Re: ETL WEBSITE CONTENT IN HADOOP SANDBOX</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139423#M102050</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16415/drenushagjurka.html" nodeid="16415"&gt;@voca voca&lt;/A&gt;&lt;/P&gt;&lt;P&gt;For social media content like Facebook, you can take a look at:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Analyzing Social Media and Customer Sentiment With Apache NiFi and HDP Search: &lt;/STRONG&gt;&lt;A href="https://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/" target="_blank"&gt;https://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 06 Mar 2017 17:03:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ETL-WEBSITE-CONTENT-IN-HADOOP-SANDBOX/m-p/139423#M102050</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-03-06T17:03:50Z</dc:date>
    </item>
  </channel>
</rss>

