<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Best practice architecture and naming hdfs path names/hive database for dev and test on 1 cluster in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Best-practice-architecture-and-naming-hdfs-path-names-hive/m-p/58302#M49791</link>
    <description>&lt;P&gt;We are moving our Oracle "landing" data into Hadoop. In Oracle we have three environments and three Oracle databases: dwdev, dwtest, and dwprod. The goal is to have three separate "landing" zones in Hadoop that will feed into each Oracle database, respectively, i.e. Hadoop dev feeds Oracle dwdev, etc.&lt;/P&gt;&lt;P&gt;The dev and test hadoop environment will exist on a single physical hadoop cluster.&lt;/P&gt;&lt;P&gt;How do we architect this?&lt;/P&gt;&lt;P&gt;HDFS&lt;/P&gt;&lt;P&gt;/&amp;lt;env&amp;gt;/data/&amp;lt;information_area&amp;gt;/&amp;lt;table_name&amp;gt;&lt;/P&gt;&lt;P&gt;/dev/data/marketing/customer_master&lt;/P&gt;&lt;P&gt;/test/data/marketing/customer_master&lt;/P&gt;&lt;P&gt;HIVE&lt;/P&gt;&lt;P&gt;database namespace (or schema_owner) = db_marketing&lt;/P&gt;&lt;P&gt;table name = customer_master&lt;/P&gt;&lt;P&gt;In DEV select * from db_marketing.customer_master would source from /dev/data/marketing/customer_master&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In TEST select * from db_marketing.customer_master would source from /test/data/marketing/customer_master&lt;/P&gt;&lt;P&gt;Does this require multiple metastores?&lt;/P&gt;&lt;P&gt;What is best practice for multiple environments on a single Hadoop cluster?&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 12:01:55 GMT</pubDate>
    <dc:creator>kjfischer</dc:creator>
    <dc:date>2022-09-16T12:01:55Z</dc:date>
  </channel>
</rss>

