<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Best pratice for lacation file in cluster HDP in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167543#M45471</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I deploy my first cluster HDP since one month and it are used by all my departement. &lt;/P&gt;&lt;P&gt;So i want to store various files in the cluster. But i dont know the best pratice to do it. can store file in master node? Edge node? data node? ...&lt;/P&gt;&lt;P&gt;Examples for files i want to store are :&lt;/P&gt;&lt;P&gt;- files for proof of concept&lt;/P&gt;&lt;P&gt;- jars files for application likes spark&lt;/P&gt;&lt;P&gt;- files for teradata client&lt;/P&gt;&lt;P&gt;- ifexp files&lt;/P&gt;</description>
    <pubDate>Mon, 07 Nov 2016 21:06:09 GMT</pubDate>
    <dc:creator>amoussoubaruch1</dc:creator>
    <dc:date>2016-11-07T21:06:09Z</dc:date>
    <item>
      <title>Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167543#M45471</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I deploy my first cluster HDP since one month and it are used by all my departement. &lt;/P&gt;&lt;P&gt;So i want to store various files in the cluster. But i dont know the best pratice to do it. can store file in master node? Edge node? data node? ...&lt;/P&gt;&lt;P&gt;Examples for files i want to store are :&lt;/P&gt;&lt;P&gt;- files for proof of concept&lt;/P&gt;&lt;P&gt;- jars files for application likes spark&lt;/P&gt;&lt;P&gt;- files for teradata client&lt;/P&gt;&lt;P&gt;- ifexp files&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 21:06:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167543#M45471</guid>
      <dc:creator>amoussoubaruch1</dc:creator>
      <dc:date>2016-11-07T21:06:09Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167544#M45472</link>
      <description>&lt;P&gt;can i create a linux file system to store all file in? and where node can i use? Thanks &lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 21:07:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167544#M45472</guid>
      <dc:creator>amoussoubaruch1</dc:creator>
      <dc:date>2016-11-07T21:07:47Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167545#M45473</link>
      <description>&lt;P&gt;&lt;STRONG&gt;1. Never use master or data node local storage&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Best practice is definitely not to touch the master nodes or data nodes for local filesystem storage or command line interface (use edge node CLI or local machine via Ambari Views or integration through Knox gateway).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;2. 3rd party tools&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;3rd party tools will specify where to locate their files/jars.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;3. Edge node&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If you need files (typically jars) for client interface to cluster, place on edge node and use client there.&lt;/P&gt;&lt;P&gt;If you simply want to archive files (e.g. POC work) you can do this on the edge node local file system.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;4. HDFS&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If you are archiving files on the edge node and it does not have high availability or backup (e.g. autoreplication of mounts) and you want this, putting it into HDFS is a good idea since each is replicated 3x.&lt;/P&gt;&lt;P&gt;When putting into HDFS, from a client perspective there is no specification of name node or data node -- you interact with the namenode and it will store it on the data nodes.  The name node is your interface with the data nodes.&lt;/P&gt;&lt;P&gt;In HDFS, you could define a path like /misc and store these files there.  You can also manage read-write permissions on this folder.&lt;/P&gt;&lt;P&gt;You can manage files (make dir, put file, get file) in hdfs through the command line (edge node is good host for this) or Ambari file view.&lt;/P&gt;&lt;P&gt;See: &lt;A href="http://hortonworks.com/hadoop-tutorial/using-commandline-manage-files-hdfs/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/using-commandline-manage-files-hdfs/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/blog/best-practices-in-hdfs-authorization-with-apache-ranger/" target="_blank"&gt;http://hortonworks.com/blog/best-practices-in-hdfs-authorization-with-apache-ranger/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 21:33:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167545#M45473</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2016-11-07T21:33:57Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167546#M45474</link>
      <description>&lt;P&gt;Hello Greg,&lt;/P&gt;&lt;P&gt;Thanks for your answers; I don't talk about data files that i can store in hdfs but files like applications jars (jars for spark application) or teradata generate file. &lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 21:48:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167546#M45474</guid>
      <dc:creator>amoussoubaruch1</dc:creator>
      <dc:date>2016-11-07T21:48:29Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167547#M45475</link>
      <description>&lt;P&gt;If you do anything with the linux file system, it should be on edge node only.  See fuller answer below.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 21:51:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167547#M45475</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2016-11-07T21:51:05Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167548#M45476</link>
      <description>&lt;P&gt;As mentioned in previous comment -- you should only store files in local file system of edge node. You should never use the actual cluster (master and data nodes) for local file storage.  The fuller answer gives the benefit of HDFS if you are worried about automatic backup of files.  (I have seen edge nodes go down and everything lost; thus, either have automatic backup or go to hdfs for files you want to backup.)&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 21:53:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167548#M45476</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2016-11-07T21:53:23Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167549#M45477</link>
      <description>&lt;P&gt;ok! thanks very much. &lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 22:16:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167549#M45477</guid>
      <dc:creator>amoussoubaruch1</dc:creator>
      <dc:date>2016-11-07T22:16:59Z</dc:date>
    </item>
    <item>
      <title>Re: Best pratice for lacation file in cluster HDP</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167550#M45478</link>
      <description>&lt;P&gt;If you feel like you have everything you need, let me know by accepting the answer; else, good to wait for additional answers or follow up with additional questions.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 22:26:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-pratice-for-lacation-file-in-cluster-HDP/m-p/167550#M45478</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2016-11-07T22:26:43Z</dc:date>
    </item>
  </channel>
</rss>

