<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: phoenix creating duplicates in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198610#M76550</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11782/dhirajsardana.html" nodeid="11782"&gt;@Dhiraj Sardana&lt;/A&gt; If this information helped, could you please accept the answer?&lt;/P&gt;</description>
    <pubDate>Thu, 19 Apr 2018 19:43:15 GMT</pubDate>
    <dc:creator>schhabra1</dc:creator>
    <dc:date>2018-04-19T19:43:15Z</dc:date>
    <item>
      <title>phoenix creating duplicates</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198606#M76546</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;We're seeing many duplicate rows in a Phoenix table that do not exist in the underlying HBase table.&lt;/P&gt;&lt;P&gt;Result: the total row count in the Phoenix table is three times that of the underlying HBase table (for example, 30 million in Phoenix versus only 10 million in HBase). We also checked specific row keys: they are duplicated in Phoenix but not in HBase.&lt;/P&gt;&lt;P&gt;More details: this table uses Phoenix's SALT_BUCKETS property and has a global index on one of its columns. The Phoenix version is 4.7.&lt;/P&gt;&lt;P&gt;We're consuming data from Kafka and writing it to Phoenix via the Storm JDBC connector.&lt;/P&gt;&lt;P&gt;The issue reproduces only under heavy concurrent load; so far we have been unable to reproduce it in our normal (dev) environment.&lt;/P&gt;&lt;P&gt;Please let us know if we're missing some configuration, or share any other pointers.&lt;/P&gt;&lt;P&gt;@Dhiraj&lt;/P&gt;</description>
      <pubDate>Wed, 28 Mar 2018 05:52:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198606#M76546</guid>
      <dc:creator>dhiraj_sardana</dc:creator>
      <dc:date>2018-03-28T05:52:57Z</dc:date>
    </item>
    <item>
      <title>Re: phoenix creating duplicates</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198607#M76547</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/11782/dhirajsardana.html" nodeid="11782"&gt;@Dhiraj Sardana&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Can you try deleting the STATS entries for the table and then check the count again?&lt;/P&gt;&lt;P&gt;delete from SYSTEM.STATS where physical_name='&amp;lt;TABLE_NAME&amp;gt;'&lt;/P&gt;&lt;P&gt;-Shubham&lt;/P&gt;</description>
      <pubDate>Fri, 30 Mar 2018 13:44:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198607#M76547</guid>
      <dc:creator>schhabra1</dc:creator>
      <dc:date>2018-03-30T13:44:17Z</dc:date>
    </item>
    <item>
      <title>Re: phoenix creating duplicates</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198608#M76548</link>
      <description>&lt;P&gt;Thanks Shubham!&lt;/P&gt;&lt;P&gt;We no longer see the duplicates, even though we haven't deleted STATS. Maybe some metadata sync-up resolved the issue (so we may see it again after some time). I have a few questions:&lt;/P&gt;&lt;P&gt;1) Is it safe to delete these tables in production? I mean, does Phoenix automatically recreate them?&lt;/P&gt;&lt;P&gt;2) I tried to find web references explaining how Phoenix updates and refers to these system tables, and whether any configuration affects these scenarios. Please share any reference that covers this specific part of Phoenix.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Apr 2018 22:00:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198608#M76548</guid>
      <dc:creator>dhiraj_sardana</dc:creator>
      <dc:date>2018-04-04T22:00:44Z</dc:date>
    </item>
    <item>
      <title>Re: phoenix creating duplicates</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198609#M76549</link>
      <description>&lt;P&gt;1) Is it safe to delete these tables in production? I mean, does Phoenix automatically recreate them?&lt;/P&gt;&lt;P&gt;Yes, you can safely remove the entry from the STATS table. STATS is updated automatically every 15 minutes or during compaction.&lt;/P&gt;&lt;P&gt;How to manually generate STATS - &lt;A href="https://phoenix.apache.org/update_statistics.html" target="_blank"&gt;https://phoenix.apache.org/update_statistics.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;2) I tried to find web references explaining how Phoenix updates and refers to these system tables, and whether any configuration affects these scenarios. Please share any reference that covers this specific part of Phoenix.&lt;/P&gt;&lt;P&gt;Phoenix uses the SYSTEM.STATS table, which contains statistics such as guideposts, to determine the number of scans.&lt;/P&gt;&lt;P&gt;Some important parameters:&lt;/P&gt;&lt;P&gt;phoenix.stats.guidepost.width - Server-side parameter that specifies the number of bytes between guideposts. A smaller value increases parallelization, but also increases the number of chunks that must be merged on the client side. The default value is 100 MB.&lt;/P&gt;&lt;P&gt;phoenix.stats.enabled - Whether STATS collection is enabled. It is enabled by default.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://phoenix.apache.org/tuning.html" target="_blank"&gt;https://phoenix.apache.org/tuning.html&lt;/A&gt; - Details of all parameters.&lt;/P&gt;&lt;P&gt;Let me know if you need more information.&lt;/P&gt;</description>
      <pubDate>Thu, 05 Apr 2018 01:49:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198609#M76549</guid>
      <dc:creator>schhabra1</dc:creator>
      <dc:date>2018-04-05T01:49:02Z</dc:date>
    </item>
    <item>
      <title>Re: phoenix creating duplicates</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198610#M76550</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11782/dhirajsardana.html" nodeid="11782"&gt;@Dhiraj Sardana&lt;/A&gt; If this information helped, could you please accept the answer?&lt;/P&gt;</description>
      <pubDate>Thu, 19 Apr 2018 19:43:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/phoenix-creating-duplicates/m-p/198610#M76550</guid>
      <dc:creator>schhabra1</dc:creator>
      <dc:date>2018-04-19T19:43:15Z</dc:date>
    </item>
  </channel>
</rss>

