<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: kudu scan very slow in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/kudu-scan-very-slow/m-p/85498#M11687</link>
    <description>&lt;P&gt;One thing that is clearly happening here is that Kudu is sending much more data back to Impala than necessary. You specified LIMIT 7, but Kudu doesn't support server-side limits until CDH 6.1. For such a small query, a server-side limit might make things quite a bit faster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Beyond that, I honestly don't see anything suspicious about the numbers from the Kudu side, other than that the scan took a long time given the amount of work involved. The trace shows nothing out of the ordinary, and the metrics are fine. There weren't even any cache misses, so everything came out of the cache already decoded and decompressed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;From the Impala profile, the only suspicious-looking thing is that two round trips were required. That shouldn't have been the case with LIMIT 7, since the first batch should have contained more than 7 records.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you run the scan a couple of times in a row, does it get much faster? How does the time vary with the LIMIT value?&lt;/P&gt;</description>
    <pubDate>Fri, 25 Jan 2019 23:18:05 GMT</pubDate>
    <dc:creator>wdberkeley</dc:creator>
    <dc:date>2019-01-25T23:18:05Z</dc:date>
  </channel>
</rss>

