<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Is there a way to do a count Approx for a dataframe (not rdd)in spark 1.6 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208199#M62828</link>
    <description>&lt;P&gt;Thanks this is I what I use but I wish there was one just for the dataframe specifically.&lt;/P&gt;</description>
    <pubDate>Wed, 19 Jul 2017 00:10:17 GMT</pubDate>
    <dc:creator>Former Member</dc:creator>
    <dc:date>2017-07-19T00:10:17Z</dc:date>
    <item>
      <title>Is there a way to do a count Approx for a dataframe (not rdd)in spark 1.6</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208197#M62826</link>
      <description>&lt;P&gt;I was wondering how to do an approx count of a dataframe without converting to an rdd in spark 1.6.&lt;/P&gt;&lt;P&gt;Is there a possible hack or not.&lt;/P&gt;&lt;P&gt;If anyone has any solutions please let me know thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 14 Jun 2017 00:09:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208197#M62826</guid>
      <dc:creator>Former Member</dc:creator>
      <dc:date>2017-06-14T00:09:17Z</dc:date>
    </item>
    <item>
      <title>Re: Is there a way to do a count Approx for a dataframe (not rdd)in spark 1.6</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208198#M62827</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16464/yanks09champs.html" nodeid="16464"&gt;@elliot gimple&lt;/A&gt; I know it's not really what you want but there's an .rdd method you can call on a DataFrame in 1.6 so you could just do `df.rdd.countApprox()` on that. I'd have to look at the DAG more closely but I think the overhead is just going to be in converting DataFrame elements to Rows and not generation of the full RDD before `countApprox` is called -- not 100% sure about that though.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jul 2017 22:59:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208198#M62827</guid>
      <dc:creator>jfrazee</dc:creator>
      <dc:date>2017-07-07T22:59:07Z</dc:date>
    </item>
    <item>
      <title>Re: Is there a way to do a count Approx for a dataframe (not rdd)in spark 1.6</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208199#M62828</link>
      <description>&lt;P&gt;Thanks this is I what I use but I wish there was one just for the dataframe specifically.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jul 2017 00:10:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-there-a-way-to-do-a-count-Approx-for-a-dataframe-not-rdd/m-p/208199#M62828</guid>
      <dc:creator>Former Member</dc:creator>
      <dc:date>2017-07-19T00:10:17Z</dc:date>
    </item>
  </channel>
</rss>

