<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Introductory Hadoop queries in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Introductory-Hadoop-queries/m-p/162597#M33308</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;As a novice, I am trying to understand the answers to the following questions:&lt;/P&gt;&lt;P&gt;1. What is the difference between defining configuration in hadoop-env.sh and defining it in hdfs-site.xml or yarn-site.xml?&lt;/P&gt;&lt;P&gt;2. My assumption is that the *-default.xml files hold the standard Apache-defined configuration values, and that any custom values for the standard properties (either vendor-specific, such as Hortonworks or Cloudera, or implementation-specific at the project level) are defined in the *-site.xml files. Is my understanding correct?&lt;/P&gt;&lt;P&gt;3. What is the difference between the /usr/hdp/current and /usr/hdp/2.4.0.0.169 folders on the Sandbox? What is the significance of each folder? Are both required on production deployments as well?&lt;/P&gt;</description>
    <pubDate>Wed, 29 Jun 2016 17:14:52 GMT</pubDate>
    <dc:creator>bigdata_superno</dc:creator>
    <dc:date>2016-06-29T17:14:52Z</dc:date>
    <item>
      <title>Introductory Hadoop queries</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Introductory-Hadoop-queries/m-p/162597#M33308</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;As a novice, I am trying to understand the answers to the following questions:&lt;/P&gt;&lt;P&gt;1. What is the difference between defining configuration in hadoop-env.sh and defining it in hdfs-site.xml or yarn-site.xml?&lt;/P&gt;&lt;P&gt;2. My assumption is that the *-default.xml files hold the standard Apache-defined configuration values, and that any custom values for the standard properties (either vendor-specific, such as Hortonworks or Cloudera, or implementation-specific at the project level) are defined in the *-site.xml files. Is my understanding correct?&lt;/P&gt;&lt;P&gt;3. What is the difference between the /usr/hdp/current and /usr/hdp/2.4.0.0.169 folders on the Sandbox? What is the significance of each folder? Are both required on production deployments as well?&lt;/P&gt;</description>
      <pubDate>Wed, 29 Jun 2016 17:14:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Introductory-Hadoop-queries/m-p/162597#M33308</guid>
      <dc:creator>bigdata_superno</dc:creator>
      <dc:date>2016-06-29T17:14:52Z</dc:date>
    </item>
    <item>
      <title>Re: Introductory Hadoop queries</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Introductory-Hadoop-queries/m-p/162598#M33309</link>
      <description>&lt;P&gt;1) hadoop-env.sh sets Linux environment variables for the Hadoop processes. Some settings have to be defined this way because they are used by the shell scripts that start the applications (RAM settings, for example). The XML files, by definition, can only take effect after the JVM has started.&lt;/P&gt;&lt;P&gt;2) That is true, although the default files do not contain everything either. Some defaults are hard-coded in the applications.&lt;/P&gt;&lt;P&gt;3) /usr/hdp/2.4.0.0.169 is the actual folder containing the distribution. If you upgrade the cluster, HDP creates a new folder, for example /usr/hdp/2.4.2.xxx, to enable rollback operations. /usr/hdp/current is a folder of symbolic links to the current distribution, i.e. the links point to the real underlying folder of the version you have selected. (The layout under current also differs a bit from the versioned folder.) Under the covers, HDP uses a utility called hdp-select that sets these symbolic links to the version you selected.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Jun 2016 18:18:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Introductory-Hadoop-queries/m-p/162598#M33309</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-06-29T18:18:33Z</dc:date>
    </item>
  </channel>
</rss>

