<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Oryx max-age params in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44806#M39748</link>
    <description>&lt;P&gt;So, if I understand correctly, these max-age parameters do not affect the created or updated model in any way, right? They basically affect only Oryx's storage usage. Please correct me if I am wrong.&lt;/P&gt;</description>
    <pubDate>Wed, 07 Sep 2016 07:52:05 GMT</pubDate>
    <dc:creator>cimox</dc:creator>
    <dc:date>2016-09-07T07:52:05Z</dc:date>
    <item>
      <title>Oryx max-age params</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44737#M39746</link>
      <description>&lt;P&gt;Hi guys,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could someone explain the purpose of &lt;EM&gt;max-age-data-hours&lt;/EM&gt; and &lt;EM&gt;&lt;SPAN&gt;max-age-model-hours&lt;/SPAN&gt;&lt;/EM&gt; and how they work? For example, say I set max-age-data-hours to 7 days and max-age-model-hours to 31 days. How will this behave in practice?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Matus&lt;/P&gt;</description>
      <pubDate>Tue, 06 Sep 2016 09:27:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44737#M39746</guid>
      <dc:creator>cimox</dc:creator>
      <dc:date>2016-09-06T09:27:28Z</dc:date>
    </item>
    <item>
      <title>Re: Oryx max-age params</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44740#M39747</link>
      <description>&lt;P&gt;This affects historical input data and model data stored on HDFS only. Every time the batch layer runs, it checks the stored data against these settings and deletes input data and models that are older than the given age.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This does not affect the age of data stored in Kafka topics. The input topic's retention doesn't matter much; it just needs to be long enough that the batch process still sees all data since the last batch by the time it runs. The update topic retention should also be long enough that at least one model is retained somewhere in the topic. It too should be at least as long as the batch interval. If it's too long, then the speed/serving processes will waste time sifting through old data on startup to catch up.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The effect of deleting old input data is that this data will no longer be used in building future models. There's really no effect of deleting old models, with one exception. In some cases a model is stored on HDFS but is too large to send via Kafka, in which case a reference to its HDFS location is stored. If a model is deleted from HDFS but is still referenced on the Kafka update topic, then it will be ignored. That's no big deal, but I suppose it means you shouldn't delete old models too aggressively.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;batch interval &amp;lt; topic retention times &amp;lt; max age settings is a good general rule.&lt;/P&gt;</description>
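<!--
The rule of thumb in the reply above (batch interval < topic retention times < max age settings) can be checked with a small helper. This is only a minimal sketch in Python: RetentionSettings, check_retention and the field names are hypothetical and are not part of Oryx or Kafka, and all values are assumed to have been converted to hours already.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RetentionSettings:
    # Hypothetical container, not an Oryx class; every value is in hours.
    batch_interval: float
    input_topic_retention: float
    update_topic_retention: float
    max_age_data: float
    max_age_model: float


def check_retention(settings):
    # Return a list of human-readable violations of the rule
    # batch interval < topic retention times < max age settings.
    problems = []
    smallest_max_age = min(settings.max_age_data, settings.max_age_model)
    topics = (
        ("input", settings.input_topic_retention),
        ("update", settings.update_topic_retention),
    )
    for name, retention in topics:
        if retention <= settings.batch_interval:
            problems.append(name + " topic retention should exceed the batch interval")
        if retention >= smallest_max_age:
            problems.append(name + " topic retention should be below the max age settings")
    return problems
```

With the values from the question (7 days = 168 hours for data, 31 days = 744 hours for models) plus an assumed 6 hour batch interval and 24 hour retention on both topics, check_retention(RetentionSettings(6, 24, 24, 168, 744)) returns an empty list, meaning the ordering holds.
-->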
      <pubDate>Tue, 06 Sep 2016 09:59:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44740#M39747</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2016-09-06T09:59:38Z</dc:date>
    </item>
    <item>
      <title>Re: Oryx max-age params</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44806#M39748</link>
      <description>&lt;P&gt;So, if I understand correctly, these max-age parameters do not affect the created or updated model in any way, right? They basically affect only Oryx's storage usage. Please correct me if I am wrong.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Sep 2016 07:52:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44806#M39748</guid>
      <dc:creator>cimox</dc:creator>
      <dc:date>2016-09-07T07:52:05Z</dc:date>
    </item>
    <item>
      <title>Re: Oryx max-age params</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44807#M39749</link>
      <description>&lt;P&gt;Well, the data that is stored certainly affects future models. All historical data is used to build models.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Sep 2016 07:54:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44807#M39749</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2016-09-07T07:54:59Z</dc:date>
    </item>
    <item>
      <title>Re: Oryx max-age params</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44808#M39750</link>
      <description>Okay, so for example:&lt;BR /&gt;I build the model for the first time on 1 year of historical data (I ingest 1 year of history in the first training run), but I have max-age-data-hours set to 3 months. What will happen in this case? Or, if this option is set to 3 months, will the batch layer read only 3 months of data from Kafka, based on the timestamps of the data?</description>
      <pubDate>Wed, 07 Sep 2016 07:58:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44808#M39750</guid>
      <dc:creator>cimox</dc:creator>
      <dc:date>2016-09-07T07:58:06Z</dc:date>
    </item>
    <item>
      <title>Re: Oryx max-age params</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44930#M39751</link>
      <description>Could anyone please give a detailed explanation?</description>
      <pubDate>Fri, 09 Sep 2016 09:21:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oryx-max-age-params/m-p/44930#M39751</guid>
      <dc:creator>cimox</dc:creator>
      <dc:date>2016-09-09T09:21:11Z</dc:date>
    </item>
  </channel>
</rss>

